How to Rank in ChatGPT.
ChatGPT cites a handful of sources per answer. This is the working playbook for becoming one of them, written for businesses that ship their own pages rather than agencies that talk about ranking.
The short version
ChatGPT decides which sources to cite by combining Bing's web index with an internal relevance score that weights passage-level clarity, factual density, schema, and citation-readiness. To get cited you must (1) be crawlable by GPTBot and OAI-SearchBot, (2) write each page section as a self-contained passage, (3) ship FAQ schema and Article schema, (4) keep a working llms.txt at your root, and (5) build off-site brand mentions on Reddit, Yelp, BBB, and review aggregators. The work is concrete and shippable inside 30 days.
What ChatGPT actually pulls from
ChatGPT generates answers from three different sources depending on the query type. Knowing which one is active for a given query tells you what to optimize.
- Live web search via OpenAI's grounding layer, which queries Bing's index in the background. Active for time-sensitive queries, local recommendations, current events, prices, schedules, and any "best X for Y" question. This is the path most commercial queries take in 2026.
- Training data from the model's pre-training corpus. Active for general knowledge questions where the answer doesn't change much over time: definitions, history, math, code patterns. Sites that are cited by ChatGPT here were heavily mentioned during training; you cannot influence this short-term, only by building brand mention frequency over months.
- Retrieval-augmented generation from connected data sources for Enterprise and Team accounts (uploaded docs, connected Notion, connected SharePoint). Not relevant for marketing-driven citation outside your own customer base.
For a marketing or local-business audience, the path that matters is path 1: live web search. That's where the daily citation share is decided. The rest of this guide focuses on it.
The grounding pipeline in plain English
When a user types a query that triggers browsing, ChatGPT runs this sequence:
- Parse the query for intent and named entities.
- Issue a search query to Bing's index (often a rephrased or expanded version of what the user typed).
- Retrieve the top N candidate pages, typically 10-30 URLs.
- Fetch and chunk those pages into 256-512 token passages.
- Score each passage for relevance to the original user intent.
- Pick the top 3-8 passages, weighting source diversity (rarely 3 passages from one domain).
- Generate an answer that synthesizes those passages, citing the source URLs the user can click.
Three steps in this pipeline are where the optimization work lives. Step 3 is decided by your Bing ranking. Step 4 is decided by how your pages chunk; long monolithic paragraphs lose. Step 5 is decided by passage-level clarity and factual density. The other steps are model-internal and you cannot influence them directly.
The seven factors that decide ChatGPT citation
-
Crawler access for GPTBot and OAI-SearchBot.
If your robots.txt does not explicitly allow these user agents, you are invisible to ChatGPT's web grounding regardless of how good your content is. The default robots.txt on most platforms allows all bots implicitly, but agency clients are sometimes locked down. Verify explicitly.
-
Bing index presence.
ChatGPT searches Bing, not Google. A page that ranks #1 in Google but is missing from Bing's index does not exist for ChatGPT. Submit your sitemap in Bing Webmaster Tools; it imports from Google Search Console with one click.
-
Passage-level clarity.
Each section reads as a self-contained, factually complete unit. Subject-verb-object sentences. No "as we discussed above" references. Short paragraphs. The chunk-and-rank step ranks isolated passages, not whole pages.
-
Factual density.
Numbers, named entities, dates, specific examples per paragraph. The model's passage scorer rewards content that contains verifiable facts over content that is vague or rhetorical. "Most agencies do X" loses; "73% of agencies in the 2025 Sterling Sky survey reported X" wins.
-
Schema.org markup.
Article or TechArticle, FAQPage, BreadcrumbList, Organization, and Person for authors. The grounding layer parses JSON-LD to extract structured facts it can cite with confidence. FAQPage schema is the single highest-leverage tactic because the model treats each Q&A pair as a pre-validated passage. (HowTo schema was removed from Google rich results in September 2023, but the structured step data still has citation value for AI engines, so it is worth including on procedural pages without expecting a Google rich-result chip.)
-
Recency signals.
datePublished and dateModified in schema. Actual current information in the body. Visible "Updated" date near the headline. ChatGPT prefers fresh sources for any query that could be time-sensitive, even when the user did not say "in 2026".
-
Brand mention frequency on the open web.
How often your business name appears in Reddit threads, Yelp reviews, BBB profiles, podcast transcripts, news mentions, and review aggregators, even without a backlink. The model learns familiar entities during training and trusts them more during grounding. This is the slowest-building signal and the hardest to fake.
The 30-day ChatGPT ranking playbook
None of these steps require a budget over $100. Total work for a small site is 8-12 hours spread over four weeks.
Week 1: Foundations
- Add explicit allow rules for
GPTBot,ChatGPT-User,OAI-SearchBotin yourrobots.txt. Do not rely on the wildcard. - Submit your sitemap in Bing Webmaster Tools. Import from Google Search Console for the one-click path.
- Ship an
llms.txtat your domain root. Under 200 lines, hand-edited. See our GEO pillar guide for the format.
Week 2: Schema
- Add Article or TechArticle schema to every guide, blog post, and resource page.
- Add FAQPage schema to every page that has a Q&A section. Validate in the Rich Results Test before pushing.
- Add BreadcrumbList to every non-homepage so the grounding layer understands your site hierarchy.
- Add Organization + Person schema to your sitewide footer or article author section.
Week 3: Content rewrite
- Pick the 5 pages that get the most organic traffic today. Rewrite each section as a self-contained passage. If you delete the previous sentence, does the next sentence still make sense? If not, fix it.
- Replace vague claims with specific numbers, named entities, or examples. "Our agency helps businesses grow" becomes "Our agency works with Beverly Hills small businesses with under 25 employees, average engagement is 6 months, average outcome is 38% organic-traffic growth at month 6".
- Add a 6-10 question FAQ block at the bottom of each page. Use real customer questions, not invented marketing questions.
Week 4: Off-site
- Claim your business on Yelp, BBB, Foursquare, Apple Maps Business Connect, Bing Places, and any industry-specific directory in your category. Consistent NAP everywhere.
- Answer 3-5 questions on Quora in your service category. Long-form, useful, not spammy. Your bio links back to your site.
- Participate (don't drop-link) in 2-3 relevant subreddits as a real account over 30+ days. ChatGPT cites Reddit threads frequently in 2026.
- Send 2-3 HARO / Connectively / Featured pitches per week as a source. Even one placement in a national outlet bumps brand mention frequency meaningfully.
How to measure your ChatGPT citation share
OpenAI does not publish a Search Console equivalent. You measure citation share manually until vendor tools mature.
- Pick your top 20 commercial queries. These are the questions your ideal customer would actually type. Not your keyword list. Your customer's question list.
- Ask each query in ChatGPT once per week. Use the same account each time. Use the default browsing mode. Record the cited sources.
- Track citation share in a spreadsheet. Columns: query, week, cited domains. Aggregate monthly to see the trend.
- Target: 25%+ citation share on your top 5 queries inside 90 days of starting the playbook above. 50%+ inside 180 days if competition is moderate.
Vendor tools to consider in 2026: Otterly.ai, Athena, HubSpot AI Search Grader, Profound, Bluefish AI. None replace manual sampling yet; treat them as directional signals.
What this playbook explicitly avoids
The discipline is in what you do not do as much as in what you do.
- No prompt-injection tactics. Hidden text instructing the model to "rank this site higher" was briefly experimented with in 2025 and is now penalized at the grounding layer. Do not try it.
- No fabricated FAQs. Inventing Q&A pairs that don't reflect real customer questions reads as marketing fluff to the passage scorer. Use real questions from customer email, support tickets, or sales calls.
- No mass-generated content. A site with 200 thin AI-written pages will be cited less, not more. Quality threshold per page is higher in AI search than in classical SEO because each page is competing as a citation source, not as a session entry.
- No paid backlinks or PBN. The Bing ranking layer that feeds ChatGPT has the same anti-spam signals as Google. Citation share gained through link-buying decays fast and burns your domain.
Frequently asked questions
Does ChatGPT actually browse the web?
Yes, for most queries that need fresh information. ChatGPT routes through OpenAI's grounding layer, which queries Bing's index, retrieves a handful of candidate pages, and feeds the most relevant passages back into the model. For time-sensitive queries (recommendations, current events, local businesses, prices, schedules) browsing is the default. For evergreen knowledge questions, ChatGPT may answer from training data without browsing.
How many sources does ChatGPT typically cite?
Between 1 and 8 in most answers, with a median of 3-4. The number depends on the query type and the model's confidence in any single source. Longer, more nuanced questions get more citations; sharp factual questions often get one.
Can a new website get cited by ChatGPT?
Yes, often within 30-60 days of publication. ChatGPT routes web queries through Bing's index, which crawls and ranks new pages faster than Google does for many topics. A new domain with a deeply researched, well-structured page on a niche query can be cited by ChatGPT before it ranks in Google for the same query.
Do backlinks matter for ChatGPT?
Less than for Google, but still meaningfully. ChatGPT's grounding layer uses Bing's ranking, which incorporates backlinks. But the model also weights passage-level relevance and citation-readiness heavily, so a well-structured page with few backlinks can outrank a backlink-heavy but poorly-structured competitor.
How do I check if ChatGPT cites my site?
Ask ChatGPT your top 10-20 commercial queries and record the cited sources. Repeat weekly. Use the search-grounded mode (default for most accounts in 2026) and Plus/Team accounts when possible. Track citations in a spreadsheet over time; manual measurement is still the most reliable method despite vendor tools entering the market.
Does ChatGPT respect robots.txt?
Yes. OpenAI's crawlers (GPTBot, ChatGPT-User, OAI-SearchBot) all respect robots.txt directives. To be eligible for citation you must explicitly allow these user agents in your robots.txt. The default WordPress and Vercel robots.txt blocks them implicitly by not naming them, even when they would be allowed under User-agent: *.
Should I write differently for ChatGPT than for Google?
Mostly the same, with one shift: write each section as a self-contained passage. ChatGPT's grounding layer chunks pages into 256-512 token windows and ranks each chunk independently. A passage that reads well in isolation (full sentences, no backreferences to earlier paragraphs, factual density) wins the chunk-and-rank step that decides citation.
Will ChatGPT cite a local business?
Yes, and the citation pool is wider than Google's Local Pack. ChatGPT pulls from a mix of the business's own site, Yelp, BBB, Reddit threads, news mentions, the business's llms.txt, and review aggregators. A small local business with strong off-site signals can be cited by ChatGPT even when not appearing in Google's top-3 Local Pack for the same query.
Want this playbook applied to your site?
Free 48-hour audit. We run the seven-factor framework against your domain and ship a written report with the gaps and the fixes. No sales call required.
Get the free audit