goodreads-mcp
A read-only MCP server for Goodreads that enables LLMs to search for books, retrieve detailed book info with ratings and reviews, and explore recommendations, series, and author bibliographies using public data sources without requiring authentication.
README
📚 goodreads-mcp
A read-only MCP server for Goodreads — built without the Goodreads API, because there hasn't been one since December 2020. Lets an LLM find and research books, ratings, and reviews. Tools ride on RSS feeds, the JSON autocomplete endpoint, and the __NEXT_DATA__ blob embedded in book pages. No login, no cookies, no writes — public data only.
tools
| tool | stability |
|---|---|
search_books |
stable (JSON endpoint) |
get_book |
stable (__NEXT_DATA__ via .xml path) — details, ratings histogram, series, review-language breakdown |
get_reviews |
GraphQL — paginated reader reviews (text, rating, likes, date, spoiler flag, permalink) with server-side min_rating / max_rating and exclude_spoilers; limit up to 100 |
similar_books |
GraphQL — "readers also enjoyed" recommendations |
author_books |
GraphQL — an author's bibliography (from any of their books) |
series_books |
GraphQL — books in a series with reading-order placement |
get_editions |
GraphQL — published editions (format, ISBN, publisher, date) |
book_lists |
GraphQL — Listopia lists a book appears on (title, votes, size) |
popular_books |
GraphQL — most popular books by release year (or year+month), ranked |
get_shelf |
stable (RSS) — public shelves |
list_shelves |
best effort (HTML) — public profiles |
The discovery tools all take a book_id and return results carrying book_id/title/author/rating/url, so an agent can chain them — e.g. similar_books → get_reviews on a recommendation. This is the structured book graph a general web search can't assemble.
WAF note: Goodreads book HTML pages now sit behind an AWS WAF JavaScript challenge (HTTP 202) that plain HTTP clients can't solve.
get_bookroutes around it via the.xml-suffixed page, so it still works without a browser. If Goodreads ever extends the WAF to a path we depend on, the client raisesWAFChallengewith a clear message instead of a confusing parse error.
install
cd goodreads-mcp
python3.10 -m venv .venv && .venv/bin/pip install -e .
Requires Python ≥ 3.10.
config (optional)
No login or cookies — everything is public data. The only setting is your numeric user_id, the default for the shelf tools. It's the number in goodreads.com/user/show/<ID>-yourname; you can also pass user_id to each shelf tool per call.
mkdir -p ~/.config/goodreads-mcp
cat > ~/.config/goodreads-mcp/config.json << 'EOF'
{ "user_id": "12345678" }
EOF
Env var GOODREADS_USER_ID overrides the file.
Claude Desktop config
~/Library/Application Support/Claude/claude_desktop_config.json:
{
"mcpServers": {
"goodreads": {
"command": "/path/to/goodreads-mcp/.venv/bin/goodreads-mcp"
}
}
}
Or for development, mcp dev goodreads_mcp/server.py gives you the Inspector UI to poke each tool.
first-run verification
The endpoints are unofficial, so verify in this order:
search_books("project hail mary")— should just workget_book("54493401")— confirms the.xml/WAF workaround; check the histogram is populatedget_reviews("54493401")— should return real review textget_shelf("to-read")— checks youruser_id+ RSSlist_shelves()— best-effort shelf-name scrape
tests
.venv/bin/pip install -e ".[test]"
.venv/bin/pytest # offline parser/unit tests
GOODREADS_LIVE=1 .venv/bin/pytest # + live network smoke tests
design notes
- Request-first, no browser automation. Everything is
httpxagainst JSON/RSS/embedded-JSON/GraphQL surfaces; the only HTML regex is inlist_shelvesand the GraphQL config discovery. - GraphQL backbone (reviews).
get_reviewscalls Goodreads' AppSync GraphQL endpoint — the same backend the website uses. The web app ships a public read-only API key in its JS bundle; the client scrapes the endpoint + key from that bundle at runtime and caches them, so a key rotation self-heals (client.graphql_config). A hardcoded pair is kept as a fallback. This is what enables real pagination (past the ~30 reviews a page embeds) and server-side rating filters. GraphQL partial-success is respected: a deleted review's sub-resource just comes backnullrather than failing the call. - WAF-aware. Book pages sit behind an AWS WAF JS challenge;
get_bookuses the.xmlpath that isn't gated, and the client raisesWAFChallengeif it ever gets a challenge body so failures are loud, not silent. (The GraphQL endpoint is a separate AppSync host and isn't WAF-gated.) - Polite client. Single persistent session, browser-faithful headers, exponential backoff on 429/503;
get_reviewscaps paging at 100 reviews. - Caveats: all of this is unofficial and depends on markup/endpoints/keys that can drift.
shipped since v0.1
- richer book data —
get_booknow includes the ratings histogram, series/position, and review-language breakdown;series_booksandsimilar_bookscover series and recommendations;get_reviewsreturns paginated, filterable reader reviews. - author bibliography —
author_booksreturns an author's works (ranked by popularity) plus a link to their author page (author_url).
ideas for v2
- author page detail (bio, photo, follower count) — not currently exposed cleanly: the author page is legacy server-rendered HTML with no structured JSON, and there's no discoverable GraphQL contributor-detail query, so this would require brittle DOM scraping.
author_bookslinks to the page instead. compare_books([ids])— fan-out helper to rank several titles by rating/histogram- caching layer for repeated lookups
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.