Does GEO replace SEO?

No, they reinforce each other. To be cited by ChatGPT Search or AI Overviews, you first need to be in the top 10 organic on Google and Bing (AI engines retrieve their sources via these indexes). GEO adds to traditional SEO for post-retrieval ranking in LLM synthesis.

How long to see GEO results?

Faster than classic SEO: 2-6 weeks after publishing optimized content. AI engines recrawl and re-ingest frequently, and freshness is a strong signal for Perplexity and Gemini Live. Track via Profound, Athena HQ or monthly manual tests on 50 prompts.

Should I block GPTBot and CCBot?

Strategic decision: for a media publisher negotiating a licensing deal with OpenAI, yes. For an SMB wanting to be cited as a source by ChatGPT Search and benefit from leads, no — you must allow at least OAI-SearchBot (retrieval) and ideally GPTBot too (training). Blocking today = invisible tomorrow.

How do I know if I am cited by ChatGPT or Perplexity?

Three methods: (1) monthly manual tests — type 30-50 business prompts on the engines and note citations, (2) third-party tools — Profound, Athena HQ, Peec AI, Otterly.ai (50-500 €/month), (3) GA4 referral — create a custom 'AI Search' channel with sources chatgpt.com, perplexity.ai, gemini.google.com, copilot.microsoft.com.

Is llms.txt actually used by LLMs?

Adoption ongoing in 2026. Anthropic, Mintlify, Stripe, Cloudflare, Vercel, Supabase, Perplexity have implemented or use llms.txt. Not all LLMs consume it yet, but it has become a quality signal for site LLM-friendliness. Very low implementation cost — to do systematically.

Back to blog

SEO & AEO25 April 202613 min

GEO in 2026: Getting Cited by ChatGPT, Perplexity and Google AI Overviews

TL;DR

GEO (Generative Engine Optimization) optimizes for being cited by AI engines, not just indexed by Google. Princeton 2024 study: sourced citations (+40.6%), expert quotations (+41%) and numerical statistics (+37.2%) increase extraction odds. 2026 stack: extractable structure (autonomous chunks), rich schemas (FAQPage, HowTo, Speakable), llms.txt, robots.txt allowing OAI-SearchBot/PerplexityBot, plus Wikidata + Knowledge Graph entity.

ByJulien Daniel

Founder & CTO, OptionWeb

ChatGPT, Perplexity and Gemini logos with citation extraction arrows

65% of Google searches no longer generate a click in 2026 according to SparkToro. Users get their answers directly in AI Overviews, ChatGPT Search, Perplexity, Gemini or Copilot. If your site is not cited as a source in those answers, you are invisible — regardless of your classic SEO ranking. Generative Engine Optimization (GEO) addresses this new game. Here are the concrete techniques OptionWeb applies on client sites in 2026.

1. GEO vs SEO: the fundamental difference

SEO optimizes to appear in a list of results. GEO optimizes to be the answer extracted by an LLM. Two different games with techniques that partially overlap.

Aspect	Classic SEO	GEO
Target	Top 10 Google SERP	Citation in LLM synthesis
Winning format	Full optimized page	Autonomous extractable passages
Signals	Backlinks, authority, relevance	Sourced citations, statistics, clear structure
Measurement	Position, clicks, impressions	Share of Model, citation rate, sentiment
Entity role	Important	Critical (Knowledge Graph, Wikidata)
Content format	Long-form, complete	Modular, independent chunks

2. Mapping AI engines in 2026

Knowing each engine's retrieval source allows you to target effort.

Engine	Retrieval source	Synthesis model	AI market share
Google AI Overviews	Google index	Gemini	~55-60%
ChatGPT Search	Bing + OAI-SearchBot crawl + partnerships	GPT-4o/o-series	~20-25%
Perplexity	Bing + Google + PerplexityBot crawl	Claude/GPT-4o (your choice)	~8-10%
Microsoft Copilot	Bing	GPT-4o	~5-8%
Gemini app	Google index	Gemini	~5%
Claude (web search)	Brave Search	Claude	~1-2%

Strategic conclusion: optimizing for Google + Bing = covering 95% of AI retrieval. Bing SEO (often neglected) becomes critical for ChatGPT Search and Copilot.

3. Understanding RAG to optimize

RAG (Retrieval Augmented Generation) follows a 5-step pipeline:

Query rewriting: the LLM reformulates the user query into 3-10 sub-queries
Retrieval: a search engine (Google, Bing, Brave) returns 10-20 documents
Chunk extraction: documents are split into 100-500 token passages
Similarity scoring: vector embedding to rank the most relevant chunks
Synthesis: the LLM generates a response from the top-K chunks and chooses which to cite

Concrete implications: each paragraph must be autonomous (understandable in isolation), named entities must be explicit (no ambiguous pronouns), statistics and citations reinforce the 'reliability' perceived by the LLM.

4. The 9 techniques from the Princeton study

Aggarwal et al. (NeurIPS 2024) tested 9 optimizations on GPT-4 under real RAG conditions. Results on the Subjective Impression metric (weight in the generated response):

Technique	Extraction lift	Implementation
Expert quotations	+41.0%	Citations with name, title, organization
Sourced citations	+40.6%	Inline academic or authoritative refs
Numerical statistics	+37.2%	3+ stats per article, sources linked
Stylistic authority	+13.8%	Expert tone, precise technical vocabulary
Technical terms	+9.1%	Domain language
Ease of reading	+7.9%	Clear structure, short paragraphs
Unique words	-1.2%	Diverse vocabulary — little effect
Keyword stuffing	-10.3%	Penalizing — avoid
Fluency optimization	-1.8%	Marketing-speak — penalizing

5. Extractable content structure

Chunk-friendly atomic pattern:

H2 or H3 as a question/factual statement — Match user queries. E.g.: 'What is Consent Mode v2?' rather than 'Consent Mode v2'.
First sentence = autonomous answer — Self-contained, no pronouns, with named entity. Understandable outside the document context.
2-4 evidence sentences — Statistics, citations, concrete examples. Reinforces reliability.
Optional details after — The LLM extracts the first sentences. Details are for the human reader.

Anti-patterns to avoid: river sentences >40 words, pronouns without nearby antecedent ("it", "this", "these"), key info hidden in paragraph 5, content in images without rich alt, broken heading hierarchy (H2 → H4 without H3).

6. llms.txt and robots.txt for AI

llms.txt is an emerging standard (Jeremy Howard, fast.ai, September 2024). Markdown file at the root of public/ summarizing the curated structure of the site for LLMs.

public/llms.txtmarkdown

# OptionWeb

> Agence web belge depuis 2014. Spécialisée Next.js, hébergement cloud EU, SEO technique pour PME européennes.

## Services
- [Création de sites web](https://optionweb.dev/fr/creation-sites-web): sites Next.js 100/100 SEO
- [Hébergement cloud](https://optionweb.dev/fr/hebergement-cloud): infrastructure managed EU
- [SEO Marketing](https://optionweb.dev/fr/seo-marketing): SEO technique + AEO + GEO

## Blog
- [Hébergement Belgique 2026](https://optionweb.dev/fr/blog/hebergement-belgique-2026/)
- [Next.js vs WordPress](https://optionweb.dev/fr/blog/nextjs-vs-wordpress/)

For robots.txt, balanced 2026 configuration for an SMB wanting to maximize AI visibility:

public/robots.txttxt

# Autoriser tous les bots de retrieval IA
User-agent: OAI-SearchBot
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: ChatGPT-User
Allow: /

User-agent: Perplexity-User
Allow: /

User-agent: Google-Extended
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: anthropic-ai
Allow: /

User-agent: GPTBot
Allow: /

# Default
User-agent: *
Allow: /
Sitemap: https://optionweb.dev/sitemap.xml

7. Entity SEO: Wikidata and Knowledge Graph

LLMs reason about entities, not keywords. An entity well-defined in Wikidata + Google Knowledge Graph = massive reliability signal for LLMs.

Action plan:

Create a Wikidata item (Q-number) for your organization. More accessible than Wikipedia (notability criteria less strict).
Add serious external identifiers: P856 (official site), P3608 (EU VAT), P3376 (Belgian BCE), P4264 (LinkedIn), P2671 (Google Knowledge Graph ID if you have it).
Emit Schema.org Organization on the site with sameAs to Wikidata, LinkedIn, Crunchbase, official social profiles.
For authors: Schema Person + sameAs to verifiable profiles (LinkedIn, ORCID, Google Scholar, etc.).

How to measure your AI visibility

Three approaches depending on budget:

Method	Cost	Accuracy
Manual tests (50 monthly prompts)	0 €	Good but subjective
SaaS tools (Profound, Athena, Otterly)	100-500 €/month	Excellent, automated
GA4 custom 'AI Search' channel	0 €	Real traffic but lagging indicator

Core KPIs to track:

Share of Model (SoM) — % of target prompts where your brand is mentioned in the response
Citation Rate — % of prompts where you are linked as a clickable source
Sentiment — Tone (neutral/positive/negative) in responses that mention you
Position in synthesis — 1st cited source = more clicks than the following ones
GA4 AI Search referral — Real traffic measurable via UTMs or channel attribution

Tags#geo#aeo#ai-overviews#chatgpt-search#perplexity#llms-txt

SEO & AEO13 min

Technical SEO for a static Next.js site: complete 2026 checklist

Everything to configure technically for a Next.js static export site to reach 100/100 on SEO: metadata, JSON-LD, sitemap, hreflang, robots, Speakable, advanced schemas.

10 April 2026

Globe with arrows linking 11 country flags for multilingual SEO

SEO & AEO14 min

Multilingual SEO and hreflang in practice: 11-language experience guide

Complete guide to multilingual SEO: URL architecture, reciprocal hreflang, x-default, sub-sitemaps, language detection, content strategy, and typical pitfalls after 18 months in production.

1 May 2026

Maintenance13 min

Web Accessibility and the European Accessibility Act 2025: WCAG 2.2 Guide

The European Accessibility Act came into force on 28 June 2025. Which businesses are concerned, what WCAG 2.2 and 2.4 obligations apply, and how to audit and fix an SME website in 2026.

27 April 2026