GEO Analyser: Analyse Your Content for Generative Engine Optimisation

September 30, 2025
Written By Richard Baxter

I work on the messy middle between data, content, and automation - pipelines, APIs, retrieval systems, and building workflows for task efficiency. 

I’ve been watching AI search traffic for the past year, and it’s become impossible to ignore. Not because traditional Google is dead – it’s not – but because ChatGPT referrals jumped from 14,500 visits per month in early 2024 to over 120,000 per month by May according to the Previsible AI Traffic Report.


The problem? You’ve no idea whether these AI engines are actually citing your content, and if they are, whether they’re representing it accurately.

Google’s global search market share fell below 90% in late 2024 for the first time since 2015, hovering around 88% through most of 2024. Meanwhile, 58% of marketers report declining clicks and web traffic from traditional search, specifically attributing traffic declines to Google’s AI Overviews launch in May.

That’s where GEO Analyser comes in. It’s an open source, free, self-hosted tool that analyses your content specifically for AI search engine optimisation. The clever bit? v2.0 runs entirely on your laptop – no Cloudflare Workers, no external services, no deployment complexity.

Just add one config block to Claude Desktop and start analysing!

What Changed in v2.0

GEO Analyser v2.0 is a complete architectural reimagining (fair enough – v1.x setup was too complex). I’ve eliminated external service dependencies and rebuilt the tool to run entirely locally, powered by Claude Sonnet 4.5. Also, I noticed a small handful of individuals lifted the original cloudlfare worker and deployed single page apps with ads on. No citation – not cool. Those people are now on their own as their code is now unsupported. Sorry, not sorry.

Anywa- I’ve simplief things. The result? Setup drops from 7 steps to 2! Analysis quality improves with better semantic understanding (becuase I’ve imporved it based on learnings from the Query Fan Out Analyser). And you’re no longer dependent on external infrastructure.

Key Changes:

  • No Cloudflare Workers deployment needed
  • No Jina API key required (the new version has its own scraper and htmlcleaner)
  • Better semantic analysis (Sonnet 4.5)
  • Text analysis mode (paste content directly)
  • Cleaner content extraction
  • Simpler configuration

One API key. One config block. Better results.

I know what I’m doing, take me to the repo

What is the GEO Analyser?

GEO Analyser is an MCP server that evaluates content for AI search engine optimisation using methodology based on MIT research. It analyses URLs or pasted text to identify what makes content more likely to be cited by AI systems like ChatGPT, Perplexity, and Google’s AI Overviews.

AI engines don’t just look at keywords – they extract semantic triples (subject-predicate-object relationships), identify named entities, and evaluate content structure. Traditional SEO metrics barely correlate with AI citation rates.

In my opinion, there are a set range of parameters you can work to that improves your findability in AI search, without screwing up the article and making it sound weird.

Why This Matters

I’ve been tracking the shift in how people search for information – to be perfectly frank, I’m finding myself using legacy Google search less and less, opting for deep research with Gemini (check out my MCP for Gemini with search grounding).

Look at this result for “all inclusive holidays November 2025”:

Mercury Holidays dominates this query – that’s your competition.

I’ve never heard of Mercury Holidays – maybe they’re huge – but something tells me this wouldn’t be a difficult SEO play to beat. That 40% improvement in citation rates with GEO isn’t just a number, it’s a massive opportunity! It represents the difference between being invisible to AI search engines and your site becoming the primary source.

I’ve seen blog posts with excellent traditional SEO that never get cited by AI engines because the information isn’t structured for extraction. Horrendous waste of good content.

How It Works (The Cool Bit)


v2.0 Architecture:

Claude Desktop (MCP)
    ↓
GEO Analyzer (local)
    ├─ Content Fetcher (cheerio)
    ├─ Pattern Analyzer (MIT research)
    ├─ Semantic Analyzer (Sonnet 4.5)
    └─ Report Formatter


The analysis happens in three layers:

1. Pattern Analysis (MIT Research-Based)

This layer evaluates structural patterns that correlate with AI citation rates:

  • Sentence length: 15-20 words optimal (too short = choppy, too long = complex)
  • Claim density: 4+ claims per 100 words target
  • Date markers: Temporal specificity (“January 2025” vs “recently”)
  • Entity diversity: Named people, organisations, products
  • Structure analysis: Headers, lists, clear organisation

These metrics come from MIT research (PDF URL: https://arxiv.org/pdf/2311.09735) on what AI models extract most reliably. I think this pattern layer is what makes the tool powerful rather than just another content analyser.

2. Semantic Analysis (Sonnet 4.5)

Claude analyses the content to extract:

  • Semantic triples: Subject-predicate-object relationships
  • Named entities: PERSON, ORG, PRODUCT, LOCATION
  • Entity relationships: How entities connect
  • Diversity metrics: Breadth of coverage

This was Workers AI (Llama 3.3) in v1.x. Sonnet 4.5 provides significantly better accuracy in entity recognition and triple extraction so, the difference is night and day when you’re working with technical content (which I do constantly).

3. GEO Scoring (0-10 Scale)

The system combines both analyses into a single score:

  • 0-3: Poor AI extractability (generic content, no structure)
  • 4-6: Moderate extractability (some good elements)
  • 7-8: Good extractability (well-structured, specific)
  • 9-10: Excellent extractability (AI citation likely)

Plus actionable recommendations for improvement ranked by priority.

Recomendations from the latest GEO Analysis

What You Need to Get Started

Requirements:

  • Claude Desktop installed
  • Anthropic API key (you already have one I’m sure)
  • npx/npm (comes with Node.js)


That’s it. No Cloudflare account, no Wrangler CLI, no Worker deployment, no Jina API key. Seriously – I’m done with complex deployment processes for otherwise simple, free mcp tools. I still use Claoudflare Workers on big projects, though.

Step-by-Step Setup

Step 1: Add to Claude Desktop Config

Open your Claude Desktop config:

  • Mac: ~/Library/Application Support/Claude/claude_desktop_config.json
  • Windows: %APPDATA%\Claude\claude_desktop_config.json

Add this block to your config (I use this exact setup on my main machine):

{
  "mcpServers": {
    "geo-analyzer": {
      "command": "npx",
      "args": ["-y", "@houtini/geo-analyzer@latest"],
      "env": {
        "ANTHROPIC_API_KEY": "sk-ant-your-key-here"
      }
    }
  }
}

Replace sk-ant-your-key-here with your actual Anthropic API key.

Step 2: Restart Claude Desktop

Close and reopen Claude Desktop. The GEO Analyzer tools will now be available.

That’s it. Start analysing immediately.

Using It in Your Workflow

Analyse a Published URL

Analyze https://yoursite.com/article for "content marketing"

The query parameter provides context for relevance scoring (optional but helpful in my testing).

Analyse Pasted Text (New in v2.0)

Analyze this text for "SEO optimisation":

[Paste your content here - minimum 500 characters]


Perfect for draft content before publishing. I use this constantly when writing articles to check GEO scores before they go live.

Output Formats

Detailed (default):

  • Full pattern analysis
  • Complete semantic triples
  • Entity breakdown
  • Actionable recommendations


Summary:

  • GEO score
  • Top 3 priorities
  • Quick wins

Specify with: analyze_url with output_format=summary

Example Analysis Output

GEO Score: 6.2/10

PATTERN ANALYSIS:
- Sentence length: 18.3 avg (✓ optimal 15-20)
- Claim density: 2.1 per 100 words (✗ target 4+)
- Date markers: 3 found (✓ good temporal specificity)
- Entity diversity: 0.42 (✓ good variety)

SEMANTIC ANALYSIS:
- Triples extracted: 47
- Named entities: 12 (8 PERSON, 3 ORG, 1 PRODUCT)
- Entity relationships: Strong connections

RECOMMENDATIONS:
1. HIGH: Increase claim density (add 15+ specific claims)
2. MEDIUM: Add more product/tool mentions
3. LOW: Improve header structure (2-3 levels)

When I run my articles through this, I target 8+ scores before publishing (for professional work anyway).

Who This Is For

Content creators optimising for AI search visibility
SEO professionals adapting to AI-first search
Marketing teams measuring AI extractability
Developers building content analysis pipelines

If you’re publishing content and want to know if AI engines can extract value from it, this tool provides that insight. I built it because I needed this for my own content workflow (and couldn’t find anything importnat that did the job properly).

Related Posts

Swapping the Engine: How to Run Claude Code on Local Silicon for Zero Pennies

Claude Code’s real power isn’t the Anthropic model sitting behind it, it’s the agentic : the file-system access, the tool use, the way it chains tasks together without you babysitting every step. I figured this out the expensive way. I ran a batch of log-parsing scripts through the API for a client project last month … <a title="GEO Analyser: Analyse Your Content for Generative Engine Optimisation" class="read-more" href="https://houtini.com/geo-analyser-analyse-your-content-for-generative-engine-optimisation/" aria-label="Read more about GEO Analyser: Analyse Your Content for Generative Engine Optimisation">Read more</a>

Claude Desktop System Requirements: Windows & macOS

Have you found yourself becoming a heavy AI user? For Claude Desktop, what hardware matters, what doesn’t, and where do Anthropic’s official specs look a bit optimistic? In this article:  Official Requirements | Windows vs macOS | What Actually Matters | RAM | MCP Servers | Minimum vs Comfortable | Mistakes Official Requirements Anthropic doesn’t … <a title="GEO Analyser: Analyse Your Content for Generative Engine Optimisation" class="read-more" href="https://houtini.com/geo-analyser-analyse-your-content-for-generative-engine-optimisation/" aria-label="Read more about GEO Analyser: Analyse Your Content for Generative Engine Optimisation">Read more</a>

Best GPUs for Running Local LLMs: Buyer’s Guide 2026

I’ve been running various LLMs on my own hardware for a while now and, without fail, the question I see asked the most (especially on Reddit) is “what GPU should I buy?” The rules for buying a GPU for AI are nothing like the rules for buying one for gaming – CUDA cores barely matter, … <a title="GEO Analyser: Analyse Your Content for Generative Engine Optimisation" class="read-more" href="https://houtini.com/geo-analyser-analyse-your-content-for-generative-engine-optimisation/" aria-label="Read more about GEO Analyser: Analyse Your Content for Generative Engine Optimisation">Read more</a>

A Beginner’s Guide to Claude Computer Use

I’ve been letting Claude control my mouse and keyboard on and off to test this feature for a little while, and the honest answer is that it’s simultaneously the most impressive and most frustrating AI feature I’ve used. It can navigate software it’s never seen before just by looking at the screen – but it … <a title="GEO Analyser: Analyse Your Content for Generative Engine Optimisation" class="read-more" href="https://houtini.com/geo-analyser-analyse-your-content-for-generative-engine-optimisation/" aria-label="Read more about GEO Analyser: Analyse Your Content for Generative Engine Optimisation">Read more</a>

A Beginner’s Guide to AI Mini PCs – Do You Need a DGX Spark?

I’ve been running a local LLM on a variety of bootstrapped bit of hardward, water-cooled 3090’s and an LLM server I call hopper full of older Ada spec GPUs. When NVIDIA, Corsair, et al. all started shipping these tiny purpose-built AI boxes – the DGX Spark, the AI Workstation 300, the Framework Desktop – I … <a title="GEO Analyser: Analyse Your Content for Generative Engine Optimisation" class="read-more" href="https://houtini.com/geo-analyser-analyse-your-content-for-generative-engine-optimisation/" aria-label="Read more about GEO Analyser: Analyse Your Content for Generative Engine Optimisation">Read more</a>

Content Marketing Ideas: What It Is, How I Built It, and Why I Use It Every Day

Content Marketing Ideas is the tool I’ve built to relcaim the massive amount of time I have to spend monitoring my sources for announcementsm ,ew products, release – whatever. The Problem with Content Research in 2026 Most front line content marketing workflow follows the same loop. You read a lot, you notice patterns, you get … <a title="GEO Analyser: Analyse Your Content for Generative Engine Optimisation" class="read-more" href="https://houtini.com/geo-analyser-analyse-your-content-for-generative-engine-optimisation/" aria-label="Read more about GEO Analyser: Analyse Your Content for Generative Engine Optimisation">Read more</a>

Receive the latest articles in your inbox

Join the Houtini Newsletter

Practical AI tools, local LLM updates, and MCP workflows straight to your inbox.