automatelab-ai-seo
Vendor-agnostic MCP server that audits, scores, and rewrites web pages for AI-citation eligibility. No API keys. No registration. Works in Claude Desktop, Cursor, Cline, Windsurf, VS Code (Copilot / Continue), and any client that speaks the Model Context Protocol.
README
@automatelab/ai-seo-mcp
Vendor-agnostic MCP server that audits, scores, and rewrites web pages for AI-citation eligibility. No API keys. No registration.
Works in Claude Desktop, Cursor, Cline, Windsurf, VS Code (Copilot / Continue), and any client that speaks the Model Context Protocol.
Why it's important
AI assistants are eating search. ChatGPT, Perplexity, Google AI Overviews, and Claude with web access answer a fast-growing share of "what is X" and "how do I Y" queries - and each answer cites a handful of pages. Being one of those cited sources is binary: you are in the answer, or you are invisible.
The signals that decide citation eligibility are not what classic SEO tools measure. Lighthouse will not flag missing FAQPage schema. Search Console will not tell you that GPTBot is allowed but OAI-SearchBot is blocked. Ahrefs does not score citation worthiness. Screaming Frog does not validate llms.txt.
A page that no AI assistant cites is a page that, for a growing share of users, does not exist.
This MCP gives you a short list of no-brainer fixes that dramatically increase the odds of getting cited by an AI assistant versus unprepared text. It audits the specific signals AI assistants use - schema completeness, FAQ structure, AI-crawler allowlists, llms.txt, entity density, freshness, authority - and for each gap it returns the exact change to make. No opaque scores, no guesswork. Most of the fixes are one-time edits to a template or a robots.txt and pay off on every page you publish from then on.
What it does
Modern search increasingly happens inside AI assistants. ChatGPT Search, Perplexity, Google AI Overviews, Claude with web access, and Microsoft Copilot all cite pages they consider authoritative, well-structured, and machine-readable. The signals that drive those citations overlap with classic SEO but are not the same.
This MCP gives any AI client a toolkit to inspect a URL and answer questions like:
- Is this page set up to be cited by AI Overviews and Perplexity?
- What schema is missing, malformed, or deprecated?
- Are GPTBot, ClaudeBot, OAI-SearchBot, and PerplexityBot allowed by
robots.txt? - Does the page have a chance at the Answer Engine slot for a given query?
- Rewrite this passage into a citation-ready answer block.
13 tools. Deterministic where possible. Rule-based scoring with explicit rubrics, not black-box numbers.
Install
npx -y @automatelab/ai-seo-mcp
Requires Node 20 or later. No global install needed.
Claude Desktop
Add to %APPDATA%\Claude\claude_desktop_config.json on Windows, or ~/Library/Application Support/Claude/claude_desktop_config.json on macOS:
{
"mcpServers": {
"ai-seo": {
"command": "npx",
"args": ["-y", "@automatelab/ai-seo-mcp"]
}
}
}
Restart Claude Desktop. The 13 tools appear in the tool tray.
Cursor
Add to .cursor/mcp.json in the project root or ~/.cursor/mcp.json globally:
{
"mcpServers": {
"ai-seo": {
"command": "npx",
"args": ["-y", "@automatelab/ai-seo-mcp"]
}
}
}
Cline
Add to VS Code settings or .cline/mcp_settings.json:
{
"mcpServers": {
"ai-seo": {
"command": "npx",
"args": ["-y", "@automatelab/ai-seo-mcp"],
"disabled": false,
"autoApprove": []
}
}
}
Windsurf, Continue, VS Code Copilot
Use the same command / args pattern. Any MCP client that supports stdio transport works.
Quickstart
After adding the config snippet above and restarting your client, try these prompts:
- "Run an AI-SEO audit on
https://example.comand tell me the top three things to fix." - callsaudit_pageand returns a scored report with prioritized findings. - "Check which AI crawlers my site allows in
robots.txtforhttps://yourdomain.com." - callscheck_robotsand lists per-crawler allow/disallow. - "Score how citation-worthy this blog post is for Perplexity and ChatGPT:
https://...." - callsscore_citation_worthiness. - "Generate an
llms.txtforhttps://yourdomain.comfrom its sitemap." - callsgenerate_llms_txt. - "Rewrite this passage for Answer Engine Optimization: [paste passage]." - calls
rewrite_for_aeo(uses MCP sampling if your client supports it).
No API keys. No accounts. The first call may take a few seconds while npx downloads the package; subsequent calls are instant.
Tools
| Tool | Purpose |
|---|---|
audit_page |
Composite AI-SEO audit with 8-dimension scoring (schema, technical, structure, robots, freshness, authority, entity density, sitemap). |
audit_schema |
Validate JSON-LD against Schema.org rules and AI-citation best practice. Flags deprecated patterns. |
audit_canonical |
Canonical link integrity, trailing-slash hygiene, og:url consistency. |
check_robots |
Parse robots.txt and report per-crawler allow/disallow for all known AI crawlers. Surfaces the GPTBot-blocked-but-OAI-SearchBot-allowed trap. |
check_sitemap |
Validate XML sitemaps: presence, URL count, lastmod freshness, image/video extensions. |
check_technical |
HEAD tag audit: canonical, OpenGraph, Twitter Card, hreflang, HTTPS, noindex, title hygiene. |
score_ai_overview_eligibility |
Score a page's probability of appearing in Google AI Overviews using current correlation factors. |
generate_llms_txt |
Generate llms.txt and optionally llms-full.txt from a domain's sitemap. |
validate_llms_txt |
Lint an existing llms.txt for spec compliance and broken links. |
score_citation_worthiness |
Score how citable a page or text block is for Perplexity, ChatGPT, Google AI Overviews, and Claude. |
rewrite_for_aeo |
Rewrite content for Answer Engine Optimization (BLUF structure, FAQ format, schema additions). |
rewrite_for_geo |
Rewrite content for Generative Engine Optimization (entity definitions, comparison tables, synthesis-ready structure). |
extract_entities |
Extract named entities, sameAs links, and citation-density score from a page's content and structured data. |
Example
In Claude Desktop, after wiring the server above:
You: Run an AI-SEO audit on
https://automatelab.tech/how-to-connect-zapier-to-notion.
Claude calls audit_page. Result (truncated):
{
"url": "https://automatelab.tech/how-to-connect-zapier-to-notion",
"fetched_at": "2026-05-15T10:32:00Z",
"score": 61,
"grade": "C",
"dimension_scores": {
"schema": 45, "technical": 80, "structure": 40,
"robots": 90, "freshness": 85, "authority": 40,
"entity_density": 21, "sitemap": 100
},
"findings": [
{
"severity": "critical",
"category": "structure",
"where": "<body>",
"message": "No FAQ structure found (no FAQPage schema or H3 question headings).",
"fix": "Add FAQ H3 headings ending in '?' with answer paragraphs, and a FAQPage JSON-LD block.",
"estimated_impact": "high"
},
{
"severity": "warning",
"category": "authority",
"where": "page-level",
"message": "Low authority signals - missing Organization or author Person schema.",
"fix": "Add Organization JSON-LD and Article.author as a Person node with sameAs links.",
"estimated_impact": "high"
}
]
}
Claude then summarizes the findings and proposes fixes. For rewrite tools, the host model (Claude / GPT / etc.) does the rewriting via MCP sampling.
Environment variables
All variables are optional. Set them in the MCP client config under "env".
| Variable | Default | Description |
|---|---|---|
USER_AGENT |
automatelab-ai-seo-mcp/0.1.0 (+https://github.com/AutomateLab-tech/ai-seo) |
HTTP User-Agent on all fetches. |
FETCH_TIMEOUT_MS |
15000 |
Per-request timeout in milliseconds. |
MAX_BYTES |
5242880 |
Maximum response body size in bytes (5 MB). |
RESPECT_ROBOTS |
true |
Global default for robots.txt compliance. Set "false" to disable. |
INTER_REQUEST_DELAY_MS |
1500 |
Minimum delay between requests to the same host within a tool call. |
Polite by default
Every tool that hits the network goes through one fetch path that:
- Respects
robots.txtfor the configured User-Agent (override withRESPECT_ROBOTS=false). - Identifies itself honestly via
User-Agent(no spoofing as a browser). - Sleeps
INTER_REQUEST_DELAY_MSbetween requests to the same host within a tool call. - Times out at
FETCH_TIMEOUT_MSper request. - Caps response body at
MAX_BYTESand refuses non-HTML / non-XML payloads where inappropriate.
This MCP is not a scraper. It audits public pages the way Lighthouse or Screaming Frog do, one URL at a time.
What it does not do
- No JavaScript-rendered scraping. Only static HTML is fetched. SPAs that render content client-side will produce incomplete audits.
- No paid-API integrations (Search Console, Ahrefs, Semrush). Bring those signals manually if you need them.
- No site-wide crawling. Tools that need multiple URLs read from the public sitemap.
- No live SERP checks. AI Overview eligibility is a deterministic heuristic, not a guarantee.
- English-language tuning. Technical checks (schema, robots, canonical) are language-agnostic; content scoring assumes English.
- Entity extraction is regex-based (title-case phrases, acronyms, CamelCase). False positives and misses are expected on v0.1.
Contributing
Bug reports, feature ideas, and PRs welcome. See CONTRIBUTING.md.
Security
To report a vulnerability, see SECURITY.md.
License
MIT - see LICENSE.
Acknowledgements
Built by automatelab.tech
Schema.org, robots.txt, and llms.txt are open standards. Crawler user-agent data is sourced from Anthropic, OpenAI, Google, Perplexity, and Cloudflare documentation. The MCP itself is built on the Model Context Protocol by Anthropic.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.