Brand Knowledge MCP Server
Provides structured dealer brand data including inventory, promotions, reviews, and dealer profile via MCP tools, enabling LLMs to answer accurate brand-related queries.
README
Brand Knowledge Agent — Architecture Document
Phase 1: Knowledge Feed Layer Author: Julianna Lacaden Date: June 22, 2026 Status: Planning — Pending supervisor confirmation of scope
⚠️ Working Assumption
This document was drafted proactively during an unassigned period. The scope assumes the company's directive — "strengthen brand to feed to LLMs" — maps to Phase 1: a structured, LLM-queryable knowledge layer for dealer brand data. This must be confirmed with the supervisor before development begins.
1. Problem Statement
Car dealer clients are underrepresented or inaccurately represented in LLM-generated answers. When a user asks an LLM about a specific dealer — their inventory, promotions, or reputation — the model either hallucinates, falls back to generic information, or omits the dealer entirely.
The goal of this agent is to build a structured, MCP-accessible knowledge layer that feeds accurate, up-to-date dealer brand data to an LLM at query time, so responses about that dealer are grounded in real information.
This is Phase 1. Phase 2 (not scoped here) will build a monitoring agent on top of this layer to measure how accurately LLMs represent the brand without the knowledge feed — establishing a before/after baseline.
2. Architecture Overview
Dealer Brand Data (mock)
│
▼
┌─────────────────────────┐
│ MCP Server │ ← FastMCP (Python), stdio transport
│ Tools: │
│ - get_inventory │
│ - get_promotions │
│ - get_reviews │
│ - get_dealer_profile │
│ │
│ Resource: │
│ - dealer://{id}/profile│ ← requires raw MCP SDK (see §4.1)
└───────────┬─────────────┘
│ stdio
▼
┌─────────────────────────┐
│ Orchestration Agent │ ← LangGraph (Python)
│ Model: qwen2.5 (Ollama)│
│ Routing: conditional │
└───────────┬─────────────┘
│
▼
┌─────────────────────────┐
│ Test Interface │ ← CLI or lightweight HTML
│ "What inventory does │
│ Dealer X have?" │
└─────────────────────────┘
- - - - - - - - - - - - - - - - (Phase 2, not built here)
┌─────────────────────────┐
│ Monitoring Agent │ ← Queries multiple LLMs, scores brand
│ (future) │ accuracy against MCP server as ground
└─────────────────────────┘ truth
3. Scope
In scope (Phase 1)
- MCP server with four tools and one resource exposing mock dealer data
- LangGraph orchestration agent that queries MCP tools to answer brand questions
- Type-coercion helpers on the MCP server side for model compatibility
- Mock dealer data schema (no real client data required in Phase 1)
- CLI test interface or simple HTML page
- Documentation of all architectural decisions and known limitations
Out of scope (Phase 1)
- Real dealer data integration (pending client onboarding)
- Phase 2 monitoring/scoring agent
- Multi-dealer support (single mock dealer is sufficient for Phase 1 validation)
- Frontend UI beyond a minimal test interface
- Deployment or hosting
4. Technical Design
4.1 MCP Server
Framework: FastMCP (Python) Transport: stdio
stdio is selected over HTTP for two reasons grounded in prior debugging:
- HTTP (Streamable HTTP) introduced "Connection closed" errors requiring hours of debugging in an earlier build (June 11, 2026).
genkitx-mcpSSE transport produced 400 errors against FastMCP's HTTP endpoint (June 19, 2026). stdio has been validated as stable across both CrewAI and Genkit integrations.
Tools (four):
| Tool | Input | Output | Notes |
|---|---|---|---|
get_inventory |
dealer_id: str |
List of vehicle listings | Core tool, validated first |
get_promotions |
dealer_id: str |
List of active promos | |
get_reviews |
dealer_id: str, limit: int |
List of customer reviews | |
get_dealer_profile |
dealer_id: str |
Dealer name, location, USPs | Duplicates resource content for Tools-only adapters |
Resource (one):
dealer://{id}/profile
Returns structured JSON with dealer name, location, contact info, and USPs.
Limitation note: If the orchestration layer uses CrewAI's
MCPServerAdapter, this resource will not be exposed — the adapter only surfaces Tools. Theget_dealer_profiletool is included as a fallback so the resource content remains accessible regardless of the adapter used. If Resources are required, use raw MCP SDK instead of the adapter.
Type coercion helpers (mandatory):
qwen2.5 and other local models occasionally pass integer arguments as strings (e.g. "3" instead of 3) and list arguments as comma-separated strings or JSON-encoded strings (e.g. "[1,2]" instead of [1, 2]). These helpers must be included in the MCP server from the start:
def _to_int(val) -> int:
if isinstance(val, int):
return val
return int(str(val).strip())
def _to_int_list(val) -> list[int]:
if isinstance(val, list):
return [int(v) for v in val]
val = str(val).strip().strip("[]")
return [int(v.strip()) for v in val.split(",") if v.strip()]
This pattern was validated on June 19, 2026. Do not defer this to debugging — apply it at build time.
4.2 Orchestration Layer
Framework: LangGraph (Python)
LangGraph is selected over CrewAI for the following reasons:
| Consideration | LangGraph | CrewAI |
|---|---|---|
| MCP Resources/Prompts | Accessible via raw MCP SDK | Blocked — MCPServerAdapter exposes Tools only |
| Conditional routing | First-class (conditional edges) | Workaround via task chaining |
| Loop-back support | Built-in (validated June 18) | Not native |
| Phase 2 fit | Credibility scoring = conditional edges (already prototyped) | Would require rebuilding orchestration layer |
LangGraph's conditional edges and loop-back patterns were validated in the June 18 multi-node research pipeline. Phase 2's monitoring agent (score → loop if accuracy below threshold) reuses the same pattern. Choosing CrewAI now means rebuilding the orchestration layer when Phase 2 is scoped.
Model: qwen2.5 (via Ollama, local)
qwen2.5 is selected over llama3.2. This is a closed finding from two independent validation sessions (June 11 CrewAI integration, June 19 Genkit integration): llama3.2 sends tool arguments as strings instead of typed values, breaking schema validation. qwen2.5 handles function calling correctly.
Graph structure (Phase 1):
[START]
│
▼
[router_node] ─── decides which MCP tool to call based on query
│
▼
[tool_call_node] ─── calls MCP server via stdio, gets result
│
▼
[response_node] ─── formats LLM answer grounded in tool output
│
▼
[END]
Phase 2 will insert a [scoring_node] with a conditional edge between [response_node] and [END] — the same pattern used in the June 18 credibility-scoring pipeline.
4.3 Mock Data Schema
Phase 1 does not require real client data. The following mock schema is sufficient to validate all four tools and the resource:
{
"dealer_id": "toyota-metro-01",
"name": "Toyota Metro",
"location": "Quezon City, Metro Manila",
"contact": "+63 2 8123 4567",
"usps": [
"10-year powertrain warranty",
"Same-day financing approval",
"Free first year PMS"
],
"inventory": [
{ "model": "Toyota Vios", "year": 2025, "price_php": 798000, "stock": 4 },
{ "model": "Toyota Fortuner", "year": 2025, "price_php": 1950000, "stock": 2 }
],
"promotions": [
{ "title": "Zero interest June promo", "valid_until": "2026-06-30", "details": "0% interest for 24 months on Vios" }
],
"reviews": [
{ "author": "Juan D.", "rating": 5, "text": "Fast processing and friendly staff." },
{ "author": "Maria S.", "rating": 4, "text": "Good experience but waiting area could be improved." }
]
}
Flag in any handoff: this is mock data. Real dealer data integration is a Phase 2 dependency, not a Phase 1 blocker.
4.4 Test Interface
A minimal CLI interface is sufficient for Phase 1 validation:
python test_agent.py --query "What promotions does Toyota Metro have this month?"
Optional: a simple HTML page (similar to the Genkit test page built June 16–17) that sends a query and displays the agent's response. This is not a deliverable — it's a test harness.
5. Known Limitations and Risks
| Limitation | Impact | Mitigation |
|---|---|---|
| Mock data only | Cannot validate against real dealer content | Flag clearly; Phase 2 requires real data |
MCPServerAdapter exposes Tools only |
Resource dealer://{id}/profile inaccessible via CrewAI adapter |
get_dealer_profile tool included as fallback; raw MCP SDK as alternative |
| qwen2.5 inconsistency on ambiguous inputs | Observed in June 18 credibility scoring: same input produced scores of 67 and 20 on separate runs | Document as known limitation; do not build scoring logic on top of qwen2.5 without a deterministic fallback |
| Single-dealer scope | Phase 1 validates architecture, not scale | Multi-dealer support deferred to post-Phase-1 |
| Supervisor scope unconfirmed | This doc may be planning against the wrong target | Send scope confirmation question before development begins |
6. Build Sequence
This is the order in which to build, not a daily schedule. Each step must be validated before the next begins.
- Define and validate mock data — JSON file, no code yet
- Build
get_inventorytool only — smallest working MCP tool, test with MCP Inspector - Wire one LangGraph node to one MCP tool — thin vertical slice proving the pipeline
- Add remaining three tools —
get_promotions,get_reviews,get_dealer_profile - Add MCP Resource (
dealer://{id}/profile) — only if raw MCP SDK is used; skip if adapter is final choice - Add router node — conditional routing based on query type
- Add type coercion helpers — do not skip; apply before any model testing
- End-to-end test — three sample queries, log results
- Document findings — bugs, model behavior, transport observations
7. Phase 2 Sketch (not scoped, for reference only)
Phase 2 builds a monitoring agent on top of this architecture. Its job: take the same dealer query, send it to multiple LLMs without the MCP knowledge feed, then compare their answers against the MCP server's ground truth and score accuracy.
This reuses:
- The same MCP server (Phase 1 output becomes the ground truth source)
- LangGraph conditional edges (scoring node → loop-back if score below threshold)
- The credibility-scoring pattern already validated June 18
Phase 2 is not blocked on Phase 1 being complete — it can be planned in parallel once Phase 1's architecture is confirmed. It is blocked on having a real answer to "what does accurate brand representation actually look like?" — which requires real dealer data.
8. Open Questions (requires supervisor input)
- Scope confirmation: Is Phase 1 correct — build a knowledge feed first, monitoring second?
- Real data timeline: When will actual dealer data be available, and in what format?
- Model preference: Is there a company preference for cloud models (Gemini, OpenAI) over local Ollama for the orchestration layer?
- Deployment target: Is this a local tool, a server-side service, or an API endpoint?
- Multi-dealer requirement: Does Phase 1 need to handle more than one dealer, or is single-dealer sufficient for validation?
This document is a living planning artifact. It should be updated after each supervisor sync and after each major build milestone.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.