career-scout-mcp
A production-grade MCP server demonstrating the wrapping pattern for AI-augmented data pipelines, specifically a job-search scoring pipeline.
README
career-scout-mcp
A production-grade Model Context Protocol (MCP) server demonstrating the wrapping pattern for AI-augmented data pipelines. Built as a standalone artifact: one LXC container, one Cloudflare Tunnel, one repo. Self-hosted via Ollama + LiteLLM SDK.
This server demonstrates the pattern I would apply to wrap Career Scout — my private job-search scoring pipeline. Synthetic data committed here for portability and reproducibility.
Documentation
Full architecture and design decisions: career-scout-mcp.stojadinovic.at
Stack
- Python 3.13 (mypy strict)
- MCP SDK with decorator-based primitive registration
- LiteLLM SDK — provider-agnostic LLM routing, model-swappable via env
- Ollama + Qwen 2.5 3B (default) — self-hosted, biomedical-research-portable
- Pydantic for config + tool schemas
- loguru structured JSON logging with secret redaction
- Debian 13 LXC, cloudflared edge termination, nginx static docs
Prerequisites
- Python 3.13 (uv manages this automatically)
- uv — dependency and environment management
- Ollama — default local LLM provider for
qwen2.5:3b
Memory: Ollama's headroom calc for qwen2.5:3b requires ~6 GiB of available memory (it counts buff/cache as unavailable). A 4 GiB system may fail to load the model even though it's 1.9 GB on disk.
Debian 13
sudo apt-get update && sudo apt-get install -y curl ca-certificates zstd
curl -LsSf https://astral.sh/uv/install.sh | sh
curl -fsSL https://ollama.com/install.sh | sh
ollama pull qwen2.5:3b
Note:
zstdis required by the Ollama installer for archive extraction on minimal Debian; not all base images include it.
macOS
brew install uv ollama
ollama serve &
ollama pull qwen2.5:3b
Windows
uv installer · Ollama installer, then ollama pull qwen2.5:3b.
Quick start (local stdio)
uv sync
uv run python -m career_scout_mcp
The server exposes 4 tools, 5 resources (6 URIs), and 2 prompts via stdio. Connect from Claude Desktop, Claude Code, or OpenCode by pointing them at this binary.
Try it out
The fastest way to exercise the server is via MCP Inspector:
npx @modelcontextprotocol/inspector uv run python -m career_scout_mcp
Opens a browser UI at localhost:6274 where you can list resources, render prompts, and invoke tools end-to-end against your local Ollama.
Development
Dev workflow uses OpenCode + standard Python tooling. See CONTRIBUTING.md.
Security
See SECURITY.md for reporting. Key posture:
- All SQL parameterized (never f-string)
- Pydantic input validation on every tool entry
- Path traversal prevention on resource URIs
- systemd hardening (non-root, ProtectSystem=strict, etc.)
- MCP server NEVER publicly exposed (stdio default, HTTP bound 127.0.0.1 only)
- TLS via Cloudflare edge — no local cert management surface
- Docs deploy via manual
scripts/deploy_docs.sh. MCP server is never publicly exposed — stdio default; HTTP transport loopback-only behind Bearer auth (hmac.compare_digest).
License
MIT — see LICENSE.
Built by Stefan Stojadinovic, Vienna. Contact: stefan@stojadinovic.at
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.