How Do I Get Found in AI? (And Why "Good SEO" Isn't Saving You)
Traditional SEO and AI visibility are not the same game. Ranking #1 on Google does not mean ChatGPT will ever mention you.
If you're reading this, you've probably already said the sentence to yourself or to your marketing person: "I rank well in Google. I have good SEO. Shouldn't that mean I'm covered for AI search too?"
Short answer: No.
Long answer: It's not even close, and the gap is widening every single month.
I run Found For AI, where my entire job is auditing and fixing how businesses show up inside AI tools like ChatGPT, Claude, Perplexity, Gemini, and Grok. And the single biggest pattern I see, across every industry, is this: businesses with great traditional SEO are completely invisible in AI search, and they don't even know it.
You can be at the forefront of how to get found in AI, or you can play catch-up later and hope you're doing it right. This post is for the people who want to be at the forefront.
The Misconception That's Costing Businesses Customers Right Now
Traditional SEO and AI visibility are related, but they're not the same game. Ranking #1 on Google does not mean ChatGPT will ever mention you. They use different signals, different infrastructure, and increasingly, different definitions of "relevance."
Google ranks pages. AI agents retrieve, reason, and recommend answers. Those are not the same thing.
Let me show you exactly what I mean with a real client.
Case Study: Loxxie Hair Extensions
Loxxie is a premium hair extension seller and stylist in Holladay, Utah. By every traditional SEO measure, they were doing well. They had clean rankings in Google for their target terms. Organic traffic was steady. If you searched the obvious queries, they showed up.
Then I ran their AI visibility audit. I opened an incognito window and asked ChatGPT, Claude, Perplexity, and Gemini variations of: "What's a good place to get hair extensions in Salt Lake?"
Loxxie didn't show up. Not in the citations. Not in the recommendations. Not even once across any of the major AI tools.
When I audited their site, here's what I found:
- No schema markup. Nothing. No LocalBusiness, no Service, no Product, no FAQ.
- No llms.txt. AI agents had no curated map of what the business actually does.
- A robots.txt that was blocking every AI crawler. Even if they'd had perfect schema, the agents couldn't get in to read it.
Their site was great for humans. It was a locked door for robots.
Here's what we did, in roughly the order of impact:
- Unblocked the AI agents in robots.txt. This was the biggest single fix. Doesn't matter how good your house looks if you've nailed the front door shut.
- Built out a connected schema graph, including LocalBusiness, Service, Product, and a heavily-developed FAQPage schema so they'd surface for the long-tail questions real customers ask.
- Wrote a proper llms.txt that gave AI agents an instant snapshot of who Loxxie is, what they offer, and what makes their premium extensions different.
Today, Loxxie ranks #1 in incognito AI searches for their category in their market.
That's the gap. That's what nobody's talking about.
Why Your Robots.txt Is Probably Sabotaging You
Two patterns I see constantly:
Pattern 1: There's no robots.txt at all. The site never had one, nobody set one up, and AI agents are wandering in with no guidance about what to crawl or how.
Pattern 2: There's a robots.txt, but it's blocking everything. This is the more painful version, because it usually comes from a developer or agency doing what used to be the right thing.
Back in the day, "bots" were bad. Scrapers, spammers, content thieves. Blocking bots was good hygiene. Most robots.txt files were written with that mental model: keep the bad guys out.
But the rules changed. There's now a category of bot — AI agents and AI search crawlers — that you absolutely want to let in. They're how your future customers will find you. And the old "block all bots" playbook locks them out by default.
Most sites haven't caught up. Their robots.txt is still set up for 2015's internet.
Action item: Right now, before you do anything else, go to yoursite.com/robots.txt and look at it. If it's blocking AI crawlers, you have a 5-minute fix that could change everything.
What Is llms.txt, Actually?
This is the newest piece of the puzzle and most people either haven't heard of it or are getting it wrong.
The misconception: People think llms.txt is designed to tell an AI model what to "learn" during its initial training phase. They imagine it as a way to influence the model itself.
The reality: llms.txt is primarily designed for inference time — specifically for RAG (Retrieval-Augmented Generation). It helps AI agents (the ones inside ChatGPT, Claude, Perplexity, Cursor, and others) find the right documentation when answering a user's specific prompt.
Think of it as a curated map. When someone asks an AI agent about your business, the agent doesn't want to crawl your entire site, parse out the headers, footers, navigation, ads, and cookie banners just to find the answer. That wastes context window tokens. Your llms.txt hands it the most relevant pages directly.
A good llms.txt is the simplest, easiest way for AI to get a clean snapshot of your business: what you do, who you serve, what makes you different, and where the important details live.
A bad llms.txt is a copy-pasted sitemap. Don't do that.
Schema in the Age of AI: Why "Pile of Business Cards" Isn't Good Enough
If you've done traditional SEO, you've probably done some schema. Maybe LocalBusiness, maybe FAQ, maybe Product. Most people think of it as a checklist for Google's rich snippets — getting the stars, prices, or recipe cards to show up in search results.
That's fine. It's also massively underselling what schema can do.
Here's the shift: for AI agents and LLMs, a collection of disconnected JSON-LD blobs is like a pile of business cards. Each one is useful individually, but they don't tell the story of the business. They don't connect. An AI looking at them sees fragments, not an entity.
The real value lies in the graph — connecting your schema together so an AI can reason across it. And this is where almost everyone is missing the nuances.
The @id Is Not Optional
If you don't use @id in your schema, you're creating what are called blank nodes. They're floating pieces of data with no identity.
The mistake I see constantly: Defining an Organization on the homepage and a WebPage on the About page, with no link between them. To an LLM, these are two unrelated entities that happen to share a name. They're not the same business in the model's eyes.
Use @id like a primary key. Connect your Organization, WebPage, LocalBusiness, Service, and Product entities so the AI can see they all belong to one knowledge graph.
Schema Belongs Everywhere, Not Just on Search-Optimized Pages
Most people only think to add schema to pages they're trying to rank for. That's a search-era mindset.
In an agent-first world, you should have schema for every meaningful section of your site — not because Google needs it for a rich snippet, but because AI agents need it to understand you quickly. When agents are everywhere (which is closer than most people think), the businesses that have invested in a proper schema graph will get found instantly. The ones with a homepage LocalBusiness blob and nothing else will get passed up.
Content: The Terrain Beneath the Map
The technical layer (schema, llms.txt, robots.txt) is the map. But the prose on your page is the terrain. And the terrain matters enormously.
Here's why: LLMs don't "read" your site the way a human does. They vectorize it. They break your text into high-dimensional mathematical representations called embeddings. If your content is vague, flowery, or buried in clever metaphors, the signal gets lost in the noise. Your site becomes harder for the AI to retrieve as a relevant answer.
So how do you write for AI extractability without sacrificing human readability? Two principles I drill into every client:
1. Definition-First Sentence Structure
AI agents use Named Entity Recognition (NER) to identify the who, what, and where of a page. Make it easy for them.
- Human-centric fluff: "In a world where efficiency is key, our revolutionary platform seeks to change the way you think about spreadsheets."
- AI-optimized hook: "ExcelSync is a SaaS automation tool that synchronizes real-time data between Microsoft Excel and Salesforce."
The second version immediately tells the AI the subject (ExcelSync), the category (SaaS automation tool), and the function (synchronizes data). That makes it dramatically more retrievable when a user asks, "What tools sync Excel with Salesforce?"
2. High Information Density (and No Pronoun Debt)
LLMs have a limited context window. When a RAG system chunks your page into 500-word segments, you want every single segment to be self-contained. That's information density.
The trick: kill your "pronoun debt." Instead of writing "This feature allows you to...", write "The Real-Time API feature allows you to..."
If the AI only grabs one paragraph from the middle of your page, that paragraph should still tell the full story of what it's about. Vague pronouns break that. Specific named entities preserve it.
This is the kind of edit you can make to your site this week that will move the needle.
How Do You Even Measure This?
Here's where the AI visibility space gets a little uncomfortable: the measurement layer is still half-baked.
Traditional SEO has Google Search Console, GA4, rank trackers, and an entire ecosystem of tools. AI visibility doesn't yet. Most of the legacy tools haven't caught up.
So here's what we actually do:
- Incognito searches across the major AI tools. Open ChatGPT, Claude, Perplexity, Gemini, and Grok. Ask the same questions. Document the answers with screenshots, dates, and exact prompts. This becomes your before/after.
- Bing Webmaster Tools. Microsoft has rolled out a new AI performance section that actually shows citations — meaning when your site is being referenced inside AI answers. This is currently the best public-facing tool for showing actual needle-moving over time. (Yes, Microsoft is ahead of Google here. I know.)
It's not as clean as a rank tracker. But documented incognito tests plus Bing's citation data is what I use to show clients real before/after movement, and it works.
Your Actual Step-by-Step Action Plan
If you're sitting there thinking "okay, where do I start?", here's the order I'd give you:
1. Check your robots.txt. Go to yoursite.com/robots.txt right now. Either you have nothing (problem) or you're blocking AI agents (bigger problem). Fix this first. It's free and it takes five minutes.
2. Audit your current schema. We have a free audit tool at foundforai.com/tools that will scan your site and tell you what schema you have, what you're missing, and what's broken. Use it.
3. Build out your llms.txt. We also have a free llms.txt generator in the tools section. Don't just copy your sitemap into it. Write a real curated map of your business.
4. Run the incognito test. Open an incognito window and test yourself across at least three AI tools (ChatGPT, Claude, Gemini, plus Grok if you want a fourth). Ask two types of questions:
- Direct: "Where can I find [your business niche] in [your specific target location]?"
- Ambiguous: Ask it the kind of question a customer with your problem would ask, but don't mention your category directly. For Loxxie, instead of "where do I get hair extensions," it might be "I want to do something new and fun with my hair because I'm sick of how it looks." Does your business get referenced as a solution? If the answer is no, think hard about adding those types of FAQs to your site so the AI can connect the dots.
5. Rewrite your most important pages with definition-first sentences and no pronoun debt.
That's the playbook. You can do it yourself, or you can hire help. Either way, do it.
Where This Is All Going
Here's the thing most business owners aren't ready for: every customer's AI chatbot is going to serve them different results based on their personal preferences.
We're heading away from a world where one search query returns one ranked list of results that everyone sees. We're heading toward a world where the same question gets ten different answers for ten different people, based on what their AI agent knows about them: their preferences, their values, their past behavior, their budget, their priorities.
That means understanding your ideal client matters more than it ever has. You need to know exactly who they are so you can serve up the specific signals their AI agent is looking for when it tries to match them with a business. If your site isn't clearly marked up with the information that helps an agent understand who you're for, you'll get passed up. Not because you're worse than your competitor, but because your competitor's site was easier to match.
You can be at the forefront of how to get found in AI, or you can play catch-up later and hope you're doing it right.
Where to Go From Here
If this post lit something up for you, here's what I'd do next:
- Run the free tools at foundforai.com/tools, including the schema audit, the llms.txt generator, and the validator. They'll tell you exactly where you stand in about ten minutes.
- Book a call if you want help building out the full AI visibility layer for your business. We do this every day, and we'd rather you be visible now than play catch-up in twelve months.
The businesses that are doing this now are going to be the ones that own their category when AI agents are how customers find everything. Don't be the business that ranked great on Google in 2026 and was invisible on ChatGPT in 2027.
Get found.
Not sure where your site stands right now?
The free AI visibility tools at foundforai.com/tools will tell you exactly what schema you have, what's missing, and what's broken — in about ten minutes. Or book a call and we'll walk through it together.
→ Run the free toolsNo pitch. No obligation. Just clarity.