How to Get Your Site Cited by ChatGPT and Perplexity (2026)
Getting cited by ChatGPT, Perplexity, and Google AI Overviews comes down to three things: let the AI crawlers in, answer the question in the first sentence of each section, and back your claims with data and schema. Do those well and you become a source the models quote rather than a page they skip.
This guide walks through exactly how to do that, in the order that matters most.
What "getting cited" actually means
AI search engines do not rank ten blue links. They read a handful of trusted, crawlable pages, extract the passages that directly answer the query, and quote or paraphrase them with a citation. Your goal is to be one of those extracted passages.
That changes the job. You are no longer optimising a whole page for one keyword. You are making each section individually quotable.
Step 1: Let the AI crawlers in
You cannot be cited by a crawler you have blocked. Before anything else, check your robots.txt for these user agents and make sure they are not disallowed:
- GPTBot and OAI-SearchBot (ChatGPT)
- ClaudeBot (Claude)
- PerplexityBot (Perplexity)
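- Google-Extended (Google's control token for Gemini model training and grounding; per Google's documentation it does not affect inclusion in Google Search, and AI Overviews draw on pages crawled by the regular Googlebot)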
Blocking one of these is the single most common reason a site is invisible to AI search. If you want the traffic, allow the bot.
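One way to run that check is with Python's standard-library robots.txt parser. This is a minimal sketch, not a full audit: the function name blocked_agents and the example.com URL are placeholders, and the sample robots.txt is made up for illustration. Python's parser applies the first matching rule, which can differ from how an individual crawler resolves conflicting rules, so treat the result as a first pass.

```python
from urllib.robotparser import RobotFileParser

# User agents named in the list above.
AI_USER_AGENTS = [
    "GPTBot",
    "OAI-SearchBot",
    "ClaudeBot",
    "PerplexityBot",
    "Google-Extended",
]


def blocked_agents(robots_txt: str, url: str = "https://example.com/") -> list[str]:
    """Return the AI user agents that this robots.txt text bars from `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [agent for agent in AI_USER_AGENTS if not parser.can_fetch(agent, url)]


# Hypothetical robots.txt that blocks GPTBot and allows everyone else.
sample = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""

print(blocked_agents(sample))  # prints ['GPTBot']
```

To test a live site, download its /robots.txt yourself and pass the text in, or use the parser's own set_url() and read() methods.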
Step 2: Write answer-first
AI engines prefer content that is well organised, easy to parse, and dense with meaning. The strongest move you can make is to lead every section with a direct, self-contained answer in the first one or two sentences, then expand.
Studies of AI citations have reported that answer-first formatting, clear headings, and bullet structure raise the chance of being quoted by around 40 percent. The reason is simple: the model can lift a clean, complete sentence without having to stitch context together from three paragraphs.
A simple test: read the first sentence under each heading on its own. Does it answer the heading as a question? If not, rewrite it so it does.
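If your drafts live in Markdown, the test above can be semi-automated. The sketch below assumes sections start with "## " headings, which is an assumption about your files and not something every CMS produces. The function name first_sentences and the sample draft are invented for illustration. It only pulls out the pairs; judging whether the sentence answers the heading is still up to you.

```python
import re


def first_sentences(markdown_text: str) -> dict[str, str]:
    """Map each '## ' heading to the first sentence of the text beneath it."""
    results: dict[str, str] = {}
    heading = None
    for line in markdown_text.splitlines():
        stripped = line.strip()
        if stripped.startswith("## "):
            heading = stripped[3:].strip()
        elif heading and stripped and heading not in results:
            # Cut at the first sentence-ending punctuation followed by whitespace.
            results[heading] = re.split(r"(?<=[.!?])\s+", stripped, maxsplit=1)[0]
    return results


# Hypothetical draft: the first section answers its heading, the second stalls.
draft = """\
## Do I need to allow AI crawlers?
To be cited, you must allow them. Check your robots.txt first.

## How often should I update a page?
In this section we will look at several ideas. Then we will pick one.
"""

for heading, sentence in first_sentences(draft).items():
    print(f"{heading}\n  -> {sentence}")
```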
Step 3: Add citations, quotes, and statistics
Content that includes citations, direct quotes from credible sources, and concrete statistics is significantly more likely to be pulled into AI answers. Numbers and named sources signal trustworthiness, and they give the model something specific to quote.
Replace vague claims ("this can really help your rankings") with specific, sourced ones ("pages whose Largest Contentful Paint is 2.5 seconds or less for at least 75 percent of visits pass the LCP threshold"). The specific version gets cited; the vague one does not.
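A rough way to find sections that still need evidence is to flag any section with no digits and no quoted passage. This is a crude heuristic, offered as a sketch: the function name sections_missing_evidence and the sample sections are hypothetical, and a section that passes may still contain an unsourced number.

```python
import re

HAS_NUMBER = re.compile(r"\d")
# A quoted passage of at least 10 characters, in straight or curly quotes.
HAS_QUOTE = re.compile(r'"[^"]{10,}"|“[^”]{10,}”')


def sections_missing_evidence(sections: dict[str, str]) -> list[str]:
    """Return headings whose body contains neither a number nor a quotation."""
    return [
        heading
        for heading, body in sections.items()
        if not HAS_NUMBER.search(body) and not HAS_QUOTE.search(body)
    ]


# Hypothetical sections: one vague, one specific.
sections = {
    "Page speed": "Faster pages can really help your rankings.",
    "LCP threshold": "Aim for an LCP of 2.5 seconds or less for 75 percent of visits.",
}

print(sections_missing_evidence(sections))  # prints ['Page speed']
```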
Step 4: Mark it up with schema
Schema markup helps AI engines understand what your content is and how the pieces relate. The schema.org types that matter most for citation are Article, FAQPage, HowTo, and Organization. A frequently asked questions block with matching FAQPage JSON-LD is especially effective, because each Q&A is already a tidy, quotable unit.
This very page uses Article and FAQPage schema for exactly that reason.
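For reference, here is a minimal sketch of generating FAQPage JSON-LD from question and answer pairs. The structure (FAQPage, mainEntity, Question, acceptedAnswer, Answer) follows the schema.org vocabulary. The function name faq_jsonld is made up for this example, and the sample pair is taken from the FAQ on this page.

```python
import json


def faq_jsonld(pairs: list[tuple[str, str]]) -> str:
    """Build a FAQPage JSON-LD document from (question, answer) pairs."""
    data = {
        "@context": "https://schema.org",
        "@type": "FAQPage",
        "mainEntity": [
            {
                "@type": "Question",
                "name": question,
                "acceptedAnswer": {"@type": "Answer", "text": answer},
            }
            for question, answer in pairs
        ],
    }
    return json.dumps(data, indent=2)


faq = [
    (
        "Do I need to block or allow AI crawlers?",
        "To be cited, you must allow them.",
    ),
]

print(faq_jsonld(faq))
```

Paste the output into a script element with type application/ld+json in the page's HTML, and keep the text identical to the visible FAQ so the markup matches what readers see.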
Step 5: Keep it fresh
Different engines weight freshness differently: Perplexity leans hard on recency, ChatGPT rewards encyclopedic authority, and Google still leans on your existing organic position. The practical rule that satisfies all three is to revisit and update important pages at least every 90 days, and to update the visible "last updated" date when you do.
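The 90-day rule is easy to track with a small script. The sketch below assumes you can export each page's last-updated date from your CMS. The paths, the dates, and the function name pages_due_for_refresh are all hypothetical.

```python
from datetime import date, timedelta

REFRESH_INTERVAL = timedelta(days=90)


def pages_due_for_refresh(last_updated: dict[str, date], today: date) -> list[str]:
    """Return pages whose last update is at least 90 days before `today`."""
    return sorted(
        path
        for path, updated in last_updated.items()
        if today - updated >= REFRESH_INTERVAL
    )


# Hypothetical export of last-updated dates.
pages = {
    "/guide": date(2026, 1, 1),
    "/pricing": date(2026, 3, 20),
}

print(pages_due_for_refresh(pages, today=date(2026, 4, 15)))  # prints ['/guide']
```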
The one-paragraph llms.txt question
You will see advice to add an llms.txt file. It is a low-cost, mildly positive signal that points some AI crawlers at your priority content, so there is little harm in adding one. But it is not a shortcut: Google has said its own AI features do not use it, and ChatGPT and Perplexity still primarily cite pages they can crawl and trust. Add it last, after the five steps above, not instead of them.
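If you do add one, the file is short enough to generate. The sketch below follows the layout described in the public llms.txt proposal as I understand it: a title line, a one-line summary in a blockquote, then sections of Markdown links. Treat that layout as an assumption and check the current proposal before publishing. The site name, URLs, and the function name build_llms_txt are placeholders.

```python
def build_llms_txt(
    site_name: str,
    summary: str,
    sections: dict[str, list[tuple[str, str, str]]],
) -> str:
    """Render an llms.txt file from section name -> (title, url, note) entries."""
    lines = [f"# {site_name}", "", f"> {summary}", ""]
    for heading, links in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for title, url, note in links:
            lines.append(f"- [{title}]({url}): {note}")
        lines.append("")
    return "\n".join(lines).rstrip() + "\n"


# Hypothetical priority content for an example site.
llms_txt = build_llms_txt(
    "Example Site",
    "Guides on getting cited by AI search engines.",
    {
        "Guides": [
            ("AI citation guide", "https://example.com/guide", "Five steps to get cited"),
        ],
    },
)

print(llms_txt)
```

Save the result as llms.txt at the root of the site.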
A 10-minute checklist
- Confirm robots.txt allows GPTBot, ClaudeBot, PerplexityBot, and Google-Extended.
- Rewrite the first sentence under each heading to answer it directly.
- Add at least one sourced statistic or quote per major section.
- Add Article and FAQPage schema.
- Set a reminder to refresh the page every 90 days.
Do the first four today and you will already be ahead of most sites competing for the same answers.
Frequently asked questions
Does adding an llms.txt file get me cited by AI?
Not on its own. llms.txt is a low-cost positive signal that some AI crawlers respect, but ChatGPT and Perplexity primarily cite pages they can crawl, parse cleanly, and trust. Google has said its own AI features do not use llms.txt at all. Treat it as one small part of a wider answer-first content and schema strategy.
How long does it take to start showing up in AI answers?
If a page already ranks on the first page of Google, it can be pulled into AI Overviews within days of being well-structured. For brand-new pages, expect the normal indexing and ranking curve first: weeks to a few months, since AI engines lean heavily on existing search authority.
Do I need to block or allow AI crawlers?
To be cited, you must allow them. Check that your robots.txt does not disallow GPTBot, OAI-SearchBot, ClaudeBot, PerplexityBot, or Google-Extended. Blocking these is the most common reason a site is invisible to AI search.