MCP Servers

SIEMen

Enables AI agents to read, write, and search security findings, engagement memory, and semantic triage via MCP tools on Cloudflare Workers.

README

SIEMen

Security-focused AI data layer on Cloudflare Workers. Purpose-built for AI security agents — combines fast KV scratch cache, semantic vector search over findings, per-engagement agent memory, and a similarity-based LLM triage cache. Exposes both a REST API and a built-in MCP server so Claude/AI agents can read, write, and search security data directly.

Stack

Layer	Technology	Details
Runtime	Cloudflare Workers	Edge-native, globally distributed, zero cold starts
Framework	Hono v4 + TypeScript	Lightweight routing, CORS, middleware
Database	Cloudflare D1 (SQLite)	Engagements, findings, memory, semantic cache log
Vector Search	Cloudflare Vectorize	768-dim BGE embeddings, cosine similarity, namespace-isolated per engagement
KV Cache	Workers KV	Fast scratch store + semantic cache values
Embeddings	Workers AI (BGE-768)	Primary — OpenRouter fallback on failure
Rate Limiting	Workers Rate Limiting API	60 req/min per caller (cross-isolate, durable)
Auth	Web Crypto SHA-256	Timing-safe Bearer token verification
MCP	JSON-RPC 2.0	9 tools on `POST /mcp` — same handlers as REST

What It Does

Fast KV Scratch Cache

Per-engagement key/value store for in-flight agent notes, intermediate results, and shared state across agent steps. Namespaced to prevent cross-engagement leakage. Optional TTL (max 24h).

Semantic Vector Search

Embed and store security findings, CVEs, and controls with finding_store. Retrieve the closest matches by semantic similarity with finding_search. Namespace-scoped to engagement — an agent searching for "SQL injection" can only surface findings from its own engagement.

Per-Engagement Agent Memory

Agents store notes with engagement_remember and recall them by recency or vector similarity with engagement_recall. Supports sub-scoping by session_id for multi-step workflows.

Semantic LLM Triage Cache

semantic_triage routes security prompts through a two-level cache before hitting an LLM:

Exact-hash short-circuit — identical prompts reuse prior analysis instantly
Cosine similarity check — prompts above the configured threshold (default 0.92) return a cached response without an LLM call
Miss — calls the configured LLM gateway, writes result to KV + Vectorize for future hits

Cuts LLM spend significantly when agents analyze structurally similar findings across engagements.

MCP Tools

Tool	Description
`engagement_open`	Create or register a new pentest engagement
`sec_cache_get`	Read from the fast KV scratch store
`sec_cache_set`	Write to the fast KV scratch store (optional TTL)
`finding_store`	Embed and store a finding, CVE, or control
`finding_search`	Semantic RAG over findings scoped to an engagement
`engagement_remember`	Store agent memory for an engagement
`engagement_recall`	Recall memories by recency or vector similarity
`semantic_triage`	Semantic-cache-backed LLM triage
`cache_stats`	Get cache hit/miss stats for an engagement

REST Routes

All /v1/* routes require Authorization: Bearer <SIEMEN_API_KEY>.

Method	Path	Description
GET	`/health`	Health check (unauthenticated)
POST	`/v1/engagements`	Create/upsert an engagement
POST	`/v1/findings`	Embed + store a finding/CVE/control
POST	`/v1/findings/search`	Semantic search over findings
GET	`/v1/stats?engagement_id=`	Cache hit/miss stats
GET	`/v1/kv/:ns/:key`	KV scratch read
PUT	`/v1/kv/:ns/:key`	KV scratch write
POST	`/v1/memory`	Store agent memory
GET	`/v1/memory?engagement_id=`	Recall agent memories
POST	`/v1/triage`	Semantic triage with LLM cache
POST	`/mcp`	MCP JSON-RPC endpoint (all 9 tools)

D1 Schema

Apply migrations in order with wrangler d1 execute siemen-db --remote --file migrations/<file>.sql:

engagements — engagement registry (id, name, client, status, timestamps)
memory — agent memory per engagement (content, tags, vector_id, session_id)
findings — security findings/CVEs/controls (kind, title, body, severity, asset, external_id, vector_id)
semantic_cache_log — cache outcome log (prompt_hash, outcome, similarity, model, tokens, created_at)

Deploy

# 1. Create infrastructure
wrangler vectorize create siemen-vectors --dimensions=768 --metric=cosine
wrangler d1 create siemen-db
wrangler kv namespace create siemen-sc-cache
wrangler kv namespace create siemen-kv

# 2. Update wrangler.toml with the returned IDs

# 3. Apply D1 migrations (run in order)
wrangler d1 execute siemen-db --remote --file migrations/migration-001-engagements.sql
wrangler d1 execute siemen-db --remote --file migrations/migration-002-memory.sql
wrangler d1 execute siemen-db --remote --file migrations/migration-003-findings.sql
wrangler d1 execute siemen-db --remote --file migrations/migration-004-semantic-cache-log.sql

# 4. Set secrets
wrangler secret put SIEMEN_API_KEY
wrangler secret put OPENROUTER_API_KEY
wrangler secret put LLM_GATEWAY_KEY

# 5. Deploy
npm run deploy

MCP Configuration

Add to your Claude Desktop / Claude Code MCP config:

{
  "mcpServers": {
    "siemen": {
      "url": "https://your-worker.workers.dev/mcp",
      "headers": {
        "Authorization": "Bearer ${SIEMEN_API_KEY}"
      },
      "alwaysAllow": [
        "engagement_open",
        "sec_cache_get",
        "finding_search",
        "engagement_recall",
        "cache_stats"
      ]
    }
  }
}

Set SIEMEN_API_KEY in your shell environment. Write tools (finding_store, sec_cache_set, engagement_remember, semantic_triage) require explicit approval by default.

Security

All /v1/* and /mcp routes require a Bearer token verified with Web Crypto SHA-256 (no Node.js crypto — edge-compatible)
Findings are namespace-scoped to engagement_id in Vectorize — cross-engagement leakage is impossible at the query layer
Secrets via wrangler secret put only — never in source or wrangler.toml
Rate limited at 60 req/min per token identity (not per IP)

See SECURITY.md for vulnerability reporting.

License

MIT

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured