MCP Servers

SIMPA

SIMPA is a Model Context Protocol (MCP) service that learns from every interaction to continuously improve prompt quality. It remembers what worked, refines what didn't, and automatically selects the best prompts for any situation.

README

SIMPA - Self-Improving Meta Prompt Agent

🚀 Transform your AI agents with self-optimizing prompt intelligence

🌟 Why SIMPA?

Every agent you deploy faces the same challenge: getting the prompt right. SIMPA solves this by:

📊 Learning from feedback - Automatically improves based on execution scores
🔍 Semantic search - Finds similar successful prompts using vector similarity
🧠 Smart selection - Chooses between refinement and reuse based on proven performance
🔗 MCP Native - Seamlessly integrates with any MCP-compatible agent controller

🏗️ Architecture

flowchart TB
    subgraph Controller["Agent Controller"]
        A[Agent Request]
    end
    
    subgraph SIMPA["SIMPA MCP Service"]
        direction TB
        R[Refiner] --> S[Selector]
        S --> V[Vector Store]
        S --> L[LLM Service]
        L --> E[Embedding Service]
    end
    
    subgraph Storage["Knowledge Base"]
        direction TB
        P[(PostgreSQL + pgvector)]
        H[Prompt History]
    end
    
    A -->|original_prompt| R
    V -->|similar_prompts| S
    S -->|refined_prompt| A
    S -->|store & learn| P
    P -->|usage_stats| S
    H -->|feedback_loop| S
    
    style Controller fill:#e1f5fe
    style SIMPA fill:#fff3e0
    style Storage fill:#e8f5e9

🔄 Prompt Lifecycle

SIMPA sits between the Agent Orchestrator and Implementation Agents, continuously learning from each interaction:

flowchart LR
    AO[Agent Orchestrator] -->|prompt| SR[SIMPA Prompt<br/>Refinement]
    SR -->|refined-prompt| IA[Implementation<br/>Agent]
    IA -->|Actions, Results<br/>& Products| RA[Reviewing Agent]
    RA -->|refined-prompt-score| SR2[SIMPA]
    SR2 -->|learn & improve| SR
    
    style AO fill:#e1f5fe,color:#000000
    style SR fill:#fff3e0,color:#000000
    style IA fill:#fce4ec,color:#000000
    style RA fill:#f3e5f5,color:#000000
    style SR2 fill:#fff3e0,color:#000000

The Flow:

Agent Orchestrator → Sends raw prompt to SIMPA
SIMPA → Returns refined-prompt (structured with ROLE, GOAL, REQUIREMENTS)
Implementation Agent → Executes actions using refined prompt, produces results/products
Reviewing Agent → Evaluates outcomes, generates refined-prompt-score
SIMPA → Receives score, learns what works, improves future refinements

This closed feedback loop ensures prompts get better with every execution.

✨ Features

Feature	Description
🤖 MCP Protocol	Native Model Context Protocol support for universal agent integration
🔎 Vector Search	pgvector-powered similarity search for prompt retrieval
📈 Self-Improvement	Sigmoid-based probability for intelligent refinement vs reuse
🎯 Multi-Provider	OpenAI, Anthropic, and Ollama support for embeddings and LLM
📊 Observability	Structured logging with structlog and comprehensive metrics
🛡️ Security	PII detection and input validation built-in
🧪 Tested	274 automated tests with 100% pass rate

📋 Prerequisites

Before installing SIMPA, ensure you have the following:

Required

Component	Version	Purpose
Python	3.10+	Runtime environment
PostgreSQL	14+	Database with pgvector extension
Docker	Latest	Required for running tests with TestContainers
Git	Latest	Clone repository

Note: PostgreSQL and Ollama are expected to be installed and running separately (not via Docker) for normal operation. Docker is only required for the automated test suite.

For Ollama (Local Models - Recommended)

Component	Purpose
Ollama	Local LLM & embedding inference
nomic-embed-text	Embedding model (pull via `ollama pull nomic-embed-text`)
llama3.2	LLM for prompt refinement (pull via `ollama pull llama3.2`)

For Cloud Providers (Optional)

🔐 Security Best Practice: Provider API keys (OpenAI, Anthropic, Google, Azure) should be kept in your user home directory at ~/.env rather than in the project .env file. This prevents accidental commits of sensitive credentials to version control.

Create ~/.env with your provider keys:
# ~/.env - User-level secrets (not committed)
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
GOOGLE_API_KEY=...
AZURE_OPENAI_KEY=...
SIMPA will automatically load keys from ~/.env if available.

OpenAI API Key - Get from platform.openai.com
Anthropic API Key - Get from console.anthropic.com
Azure OpenAI - Azure subscription with OpenAI service
Google Gemini - Get from Google AI Studio

System Requirements

Resource	Minimum	Recommended
RAM	4 GB	8 GB+
Disk	2 GB free	10 GB+
CPU	2 cores	4 cores+

Note: For local Ollama models, CPU is sufficient but GPU acceleration significantly improves performance.

🚀 Quick Start

Option 1: Docker Compose (Recommended for Development/Testing)

This option runs PostgreSQL and Ollama in Docker containers for easy development and testing:

# Clone and setup
git clone https://github.com/yourusername/simpa-mcp.git
cd simpa-mcp
cp .env.example .env

# Start all services (PostgreSQL + Ollama in Docker)
make dev-setup

# Download models (one-time)
make pull-models

# Run migrations
make migrate

# Run tests
make test

For Production Use: Install PostgreSQL and Ollama directly on your system instead of using Docker. See the Manual Setup section below.

Option 2: Manual Setup (Production/Existing Services)

Use this if you already have PostgreSQL and Ollama installed locally.

Prerequisites:

PostgreSQL 14+ with pgvector extension installed
Ollama running locally (with nomic-embed-text and llama3.2 pulled)

# Install dependencies
pip install -e ".[dev]"

# Configure environment
cp .env.example .env
# Edit .env to match your PostgreSQL and Ollama settings

# Run migrations
alembic upgrade head

# Start MCP server
python -m src.main

Quick PostgreSQL setup with Docker (if needed):

# Only if you don't have PostgreSQL installed locally
docker run -d --name simpa-db \
  -e POSTGRES_USER=simpa \
  -e POSTGRES_PASSWORD=simpa \
  -e POSTGRES_DB=simpa \
  -p 5432:5432 \
  pgvector/pgvector:pg16

🔌 Adding SIMPA to Your MCP Configuration

SIMPA works with any MCP-compatible client (Cursor, Claude Desktop, Windsurf, etc.).

Step 1: Install SIMPA Server

Option A: Global Installation (Easiest)

# Clone the repository
git clone https://github.com/dsidlo/simpa-mcp.git
cd simpa-mcp

# Create virtual environment
python -m venv .venv

# Activate virtual environment
# On macOS/Linux:
source .venv/bin/activate
# On Windows:
# .venv\Scripts\activate

# Install in editable mode
pip install -e .

# Install MCP dependencies
pip install fastmcp asyncpg pgvector sqlalchemy

# Setup environment
cp .env.example .env
# Edit .env with your configuration (see Configuration section below)

# Run database migrations
alembic upgrade head

Option B: Docker (Recommended for Production)

# Build the MCP server image
docker build --target production -t simpa-mcp:latest .

# Or use docker compose (includes PostgreSQL + pgvector)
docker-compose up -d

Step 2: Configure Your MCP Client

Add SIMPA to your MCP client's configuration file:

Cursor (`~/.cursor/mcp.json`)

{
  "mcpServers": {
    "simpa-mcp": {
      "command": "uv",
      "args": [
        "--directory",
        "/absolute/path/to/simpa-mcp",
        "run",
        "--env",
        "/absolute/path/to/simpa-mcp/.env",
        "python",
        "-m",
        "src.main"
      ],
      "env": {
        "PYTHONPATH": "/absolute/path/to/simpa-mcp/src"
      }
    }
  }
}

Claude Desktop (`~/Library/Application Support/Claude/claude_desktop_config.json`)

{
  "mcpServers": {
    "simpa-mcp": {
      "command": "/absolute/path/to/simpa-mcp/.venv/bin/python",
      "args": [
        "-m",
        "src.main"
      ],
      "env": {
        "DATABASE_URL": "postgresql://simpa:simpa@localhost:5432/simpa",
        "EMBEDDING_PROVIDER": "ollama",
        "EMBEDDING_MODEL": "nomic-embed-text",
        "OLLAMA_BASE_URL": "http://localhost:11434",
        "LLM_MODEL": "ollama/llama3.2",
        "PYTHONPATH": "/absolute/path/to/simpa-mcp/src"
      }
    }
  }
}

Generic MCP Configuration

{
  "mcpServers": {
    "simpa-mcp": {
      "name": "SIMPA Prompt Refinement",
      "description": "Self-improving prompt optimization service",
      "command": "python",
      "args": [
        "-m",
        "src.main",
        "--mcp",
        "stdio"
      ],
      "workingDirectory": "/absolute/path/to/simpa-mcp",
      "envFile": "/absolute/path/to/simpa-mcp/.env"
    }
  }
}

Using uv (Recommended)

This configuration ensures the server runs from the source directory and uses uv for dependency management:

{
  "mcpServers": {
    "simpa-mcp": {
      "command": "/bin/bash",
      "args": [
        "-c",
        "cd /path/to/simpa-mcp && uv run python src/main.py --log-level debug --log-file /tmp/simpa-mcp.log"
      ]
    }
  }
}

Note: Replace /path/to/simpa-mcp with your actual installation path. Using bash -c with cd ensures the server runs from the project root where pyproject.toml and .env are located.

Step 3: Install MCP Inspector (Optional, for Testing)

# Install MCP Inspector globally
npm install -g @anthropics/mcp-inspector

# Test your SIMPA server
mcp-inspector --server "uv --directory /path/to/simpa-mcp run python -m src.main"

Step 4: Verify Installation

In your MCP client (Cursor/Claude Desktop), you should see:

✅ Available Tools: refine_prompt, update_prompt_results
✅ Server Status: Connected
✅ Capabilities: Prompt refinement enabled

🛠️ Troubleshooting

"Command not found: uv"

Install uv first:

curl -LsSf https://astral.sh/uv/install.sh | sh

"ModuleNotFoundError: No module named 'src'"

Ensure PYTHONPATH includes the src directory:

export PYTHONPATH="/absolute/path/to/simpa-mcp/src:$PYTHONPATH"

Database Connection Errors

Verify PostgreSQL is running with pgvector:

# Check if pgvector extension is available
psql -d simpa -c "CREATE EXTENSION IF NOT EXISTS vector;"

MCP Server Not Responding

Test manually:

cd /path/to/simpa-mcp
source .venv/bin/activate
python -m src.main --help

🔧 Configuration

SIMPA can be configured via environment variables and command-line arguments.

Environment Variables

How configuration works: SIMPA uses Pydantic Settings to automatically load environment variables from .env files. When you set an environment variable, it automatically becomes available via settings.VARIABLE_NAME in the code—no explicit os.getenv() calls needed. Environment variables are case-insensitive (EMBEDDING_MODEL and embedding_model work the same).

⚡ Critical Parameters (Required)

These parameters must be configured to bring up the MCP service:

Variable	Description	Why Required
`DATABASE_URL`	PostgreSQL connection URL	Stores prompt knowledge base
`OPENAI_API_KEY`	OpenAI API key	Required only if using OpenAI models. Other providers need their respective keys.

All other parameters can be left undefined — they default to known, usable values suitable for most deployments.

Minimal Configuration Example

The simplest working .env file (using local Ollama models):

# Only REQUIRED parameter - everything else defaults automatically
DATABASE_URL=postgresql://user@localhost:5432/simpa

For OpenAI instead of Ollama, just add the API key:

# Required
DATABASE_URL=postgresql://user@localhost:5432/simpa
OPENAI_API_KEY=sk-your-key-here
# LLM_MODEL defaults to ollama/llama3.2, but you can override:
# LLM_MODEL=openai/gpt-4

Optional Parameters (With Working Defaults)

All sections below have sensible defaults. You only need to change them if you have specific requirements:

Database Connection Details

Variable	Description	Default
`DATABASE_URL`	PostgreSQL connection URL	`postgresql://dsidlo@localhost:5432/simpa`

Embedding Service

Variable	Description	Default
`EMBEDDING_PROVIDER`	Embedding provider (`ollama` or `openai`)	`ollama`
`EMBEDDING_MODEL`	Embedding model name	`nomic-embed-text`
`EMBEDDING_DIMENSIONS`	Vector dimensions (768 for nomic-embed-text)	`768`
`OLLAMA_BASE_URL`	Ollama API base URL	`http://localhost:11434`

LLM Service

Variable	Description	Default
`LLM_MODEL`	LLM model (LiteLLM format: `provider/model`)	`ollama/llama3.2`
`LLM_TEMPERATURE`	Sampling temperature (0.0 - 2.0)	`0.7`

Supported Models (via LiteLLM):

ollama/llama3.2 - Local Ollama models
gpt-4, gpt-3.5-turbo - OpenAI
claude-3-opus-20240229, claude-3-sonnet-20240229 - Anthropic
gemini/gemini-pro, gemini/gemini-ultra - Google
azure/<deployment-name> - Azure OpenAI

API Keys (Only if using cloud LLM providers)

Only needed if you use cloud-based LLMs instead of local Ollama models. These are loaded automatically by LiteLLM based on model prefix:

Variable	Description
`OPENAI_API_KEY`	OpenAI API key
`ANTHROPIC_API_KEY`	Anthropic API key
`GEMINI_API_KEY`	Google Gemini API key
`AZURE_API_KEY`	Azure OpenAI API key
`AZURE_API_BASE`	Azure OpenAI endpoint base URL
`COHERE_API_KEY`	Cohere API key

Embedding Cache

Variable	Description	Default
`EMBEDDING_CACHE_ENABLED`	Enable LRU cache for embeddings	`true`
`EMBEDDING_CACHE_MAX_SIZE`	Maximum cache entries (100-10000)	`1000`
`EMBEDDING_CACHE_MAX_TEXT_LENGTH`	Maximum text length to cache	`10000`

LLM Cache

Variable	Description	Default
`LLM_CACHE_ENABLED`	Enable LLM response caching	`true`
`LLM_CACHE_TTL_SECONDS`	Cache TTL in seconds (60-86400)	`3600`
`LLM_CACHE_MAX_ENTRIES`	Maximum cache entries (100-100000)	`10000`
`LLM_CACHE_DB_PATH`	Path to cache SQLite database	`./llm_cache.db`

Fast-Path Hash Match

Variable	Description	Default
`HASH_FAST_PATH_ENABLED`	Enable hash-based exact match lookup	`true`
`HASH_FAST_PATH_MIN_SCORE`	Minimum score for hash reuse (1.0-5.0)	`4.0`

Refinement Strategy

Variable	Description	Default
`SIMILARITY_BYPASS_THRESHOLD`	Cosine similarity threshold for bypass (0.9-1.0)	`0.95`
`SIMILARITY_BYPASS_MIN_SCORE`	Minimum score for high-similarity bypass (1.0-5.0)	`4.5`
`SIGMOID_K`	Sigmoid steepness parameter	`1.5`
`SIGMOID_MU`	Sigmoid midpoint (50% threshold)	`3.0`
`MIN_REFINEMENT_PROBABILITY`	Minimum refinement probability (0.0-1.0)	`0.05`

Vector Search

Variable	Description	Default
`VECTOR_SEARCH_LIMIT`	Number of similar prompts to retrieve (1-50)	`5`
`VECTOR_SIMILARITY_THRESHOLD`	Minimum similarity score (0.0-1.0)	`0.7`

BM25 Hybrid Search

Variable	Description	Default
`BM25_SEARCH_ENABLED`	Enable BM25 keyword search	`true`
`BM25_K1`	BM25 term saturation parameter (0.1-3.0)	`1.2`
`BM25_B`	BM25 document length normalization (0.0-1.0)	`0.75`
`BM25_LIMIT`	Number of BM25 results (1-20)	`5`
`BM25_VECTOR_LIMIT`	Number of vector results in hybrid (1-20)	`5`
`HYBRID_SEARCH_ENABLED`	Enable hybrid search combining vector + BM25	`true`
`LLM_RERANK_ENABLED`	Enable LLM re-ranking of results	`true`
`LLM_RERANK_CANDIDATES`	Number of candidates for re-ranking (2-20)	`10`

MCP Server

Variable	Description	Default
`MCP_TRANSPORT`	Transport protocol (`stdio` or `sse`)	`stdio`
`MCP_PORT`	Server port for SSE transport (1024-65535)	`8000`

Logging

Variable	Description	Default
`LOG_LEVEL`	Logging level (`TRACE`, `DEBUG`, `INFO`, `WARNING`, `ERROR`, `CRITICAL`)	`INFO`
`JSON_LOGGING`	Enable structured JSON logging	`true`

Security

Variable	Description	Default
`MAX_PROMPT_LENGTH`	Maximum prompt text length (100-100000)	`10000`
`ENABLE_PII_DETECTION`	Enable basic PII detection	`true`

Project Association

Variable	Description	Default
`REQUIRE_PROJECT_ID`	Require project_id for all refinements	`false`

Diff Saliency

Variable	Description	Default
`DIFF_SALIENCY_ENABLED`	Enable diff saliency filtering	`true`
`DIFF_SALIENCY_THRESHOLD`	Minimum saliency score (0.0-1.0)	`0.6`
`DIFF_MAX_STORED_PER_REQUEST`	Maximum diffs per request (1-50)	`10`

Complete Example `.env` File

# Database (Required)
DATABASE_URL=postgresql://simpa:simpa@localhost:5432/simpa

# Embedding Service
EMBEDDING_PROVIDER=ollama
EMBEDDING_MODEL=nomic-embed-text
EMBEDDING_DIMENSIONS=768
OLLAMA_BASE_URL=http://localhost:11434

# LLM Service
LLM_MODEL=ollama/llama3.2
LLM_TEMPERATURE=0.7

# API Keys (if using cloud providers)
# OPENAI_API_KEY=sk-...
# ANTHROPIC_API_KEY=sk-ant-...
# GEMINI_API_KEY=...

# MCP Server
MCP_TRANSPORT=stdio
MCP_PORT=8000

# Caching (optional, defaults are reasonable)
EMBEDDING_CACHE_ENABLED=true
LLM_CACHE_ENABLED=true

# Refinement behavior (optional)
SIMILARITY_BYPASS_THRESHOLD=0.95
SIGMOID_K=1.5
SIGMOID_MU=3.0

# Logging
LOG_LEVEL=INFO
JSON_LOGGING=true

Command Line Options

SIMPA supports several command line flags for runtime configuration:

# Show all available options
python -m src.main --help

Option	Description	Default
`--init-db`	Initialize the database schema and exit	-
`--transport {stdio,sse}`	MCP transport protocol	`stdio`
`--log-level {trace,debug,info,warn,error,fatal}`	Logging level	`info`
`--log-file PATH`	Path to log file	`/tmp/simpa-mcp.log`
`--log-console`	Also log to console (stderr) ⚠️ Not recommended for MCP stdio mode	-
`--env PATH`	Path to `.env` file	`~/.env`
`--project-id-required`	Require `project_id` for all refinement requests	-

Examples:

# Initialize database
python -m src.main --init-db

# Run with SSE transport on custom port (also set MCP_PORT in .env)
python -m src.main --transport sse

# Debug logging to custom file
python -m src.main --log-level debug --log-file /var/log/simpa.log

# Use custom env file
python -m src.main --env ~/my-project/.env

# Require project_id for all prompts
python -m src.main --project-id-required

# Combination of options
python -m src.main --env ./.env.local --log-level debug --transport sse

Environment File (`--env`)

By default, SIMPA loads environment variables from your home directory at ~/.env. You can specify a custom .env file using the --env option:

# Use a custom env file
python -m src.main --env ~/my-project/.env

# Or use a project-specific .env
python -m src.main --env ./.env.local

Loading order:

If --env is specified and the file exists, it is loaded first
If --env is not specified, ~/.env is loaded if it exists
The project ./.env in the current directory is loaded last (overrides previous values)

This allows you to keep sensitive credentials (API keys) in ~/.env while keeping project-specific settings in the project .env.

Project-Associated Prompt Development (`--project-id-required`)

Enable strict project association mode to enforce that all prompts must be linked to a project:

# Require project_id for all prompt refinements
python -m src.main --project-id-required

When enabled, calling refine_prompt without a project_id returns a helpful response guiding the agent to:

List existing projects - View available projects to find a suitable match
Create a new project - Use create_project if no suitable project exists
Resubmit with project_id - Retry the refinement with the chosen project

Why use project association?

Cross-project learning: Prompts refined for one Python web project can benefit similar Flask/Django projects
Knowledge clustering: Projects with similar tech stacks (React+Node, Python+PostgreSQL) share prompt patterns
Relevance scoring: Prompt selection considers project context for better matches
Team organization: Different teams/projects have distinct prompt preferences and patterns

Example workflow:

# Start server with strict project mode
python -m src.main --project-id-required

# Agent workflow:
# 1. First call without project_id → returns list of existing projects
# 2. Agent picks or creates project → gets project_id
# 3. Resubmit with project_id → prompt is refined and associated with project

🛠️ MCP Tools

`refine_prompt`

Intelligently refine prompts before agent execution.

# Request
{
  "original_prompt": "Write a function to sort a list",
  "agent_type": "developer",
  "main_language": "python"
}

# Response
{
  "refined_prompt": "Write a Python function that takes a list of integers...",
  "prompt_key": "uuid-v4",
  "action": "refine|new|reuse",
  "confidence_score": 0.95,
  "similar_prompts_found": 3
}

`update_prompt_results`

Provide feedback to improve future prompts.

# Request
{
  "prompt_key": "uuid-v4",
  "action_score": 4.5,
  "test_passed": true,
  "files_modified": ["main.py"],
  "lint_score": 0.95
}

# Response
{
  "success": true,
  "usage_count": 5,
  "average_score": 4.25
}

📝 Prompt Refinement Examples

SIMPA transforms vague user requests into structured, actionable specifications.

Example 1: Developer Agent

Original Prompt:

Build a REST API for managing tasks.

Refined Prompt:

ROLE: Senior Backend Developer
GOAL: Build a REST API for managing tasks.
CONSTRAINTS: Your output will be only a descriptive overview of what the API will do.
REQUIREMENTS:
- Define all REST endpoints (GET, POST, PUT, DELETE) with their URLs and purposes
- Explicitly specify request/response JSON formats for each endpoint
- Include pagination, filtering, and sorting capabilities for task listing
- Describe authentication mechanism (JWT or API key based)
- Define error response formats and standard HTTP status codes
- Outline rate limiting strategy if applicable
OUTPUT: Structured API specification document.
SUCCESS: All endpoints are well-defined, formats are validated, and specification is ready to implement.
AUTONOMY: You may choose implementation technology and framework.
FALLBACK: If a task requirement is unclear, ask for clarification or document your assumptions.

Example 2: Architect Agent

Original Prompt:

Design a caching strategy for the product catalog.

Refined Prompt:

ROLE: Software Architect
GOAL: Design a caching strategy for the product catalog.
CONSTRAINTS: Output will be descriptive only; no actual implementation code.
REQUIREMENTS:
- Identify data access patterns (read-heavy, write-heavy, or mixed)
- Design cache-invalidation strategy (time-based, event-driven, or hybrid)
- Define cache key structure and naming conventions
- Specify cache-aside, write-through, or write-behind patterns
- Address cache consistency, eviction policies, and TTL configuration
- Consider multi-tier caching (in-memory + distributed)
- Document potential issues: stale data, thundering herd, cache penetration
OUTPUT: Architecture document detailing caching strategy.
SUCCESS: Design addresses scalability, performance, and data consistency.
AUTONOMY: Choose appropriate caching technology (Redis, Memcached, Caffeine).
FALLBACK: If data volume or access patterns are unclear, document assumptions.

Example 3: Tester Agent

Original Prompt:

Write tests for the login module.

Refined Prompt:

ROLE: QA Engineer
GOAL: Write tests for the login module.
CONSTRAINTS: Provide complete test scenarios and detailed expected outcomes, no code.
TEST CATEGORIES:
- Unit Tests: Each function/method tested in isolation with mocks
- Integration Tests: Service interactions (DB, external auth providers)
- Edge Cases: Empty input, SQL injection attempts, XSS payloads
- Boundary Tests: Max username length, password complexity rules
- Error Paths: Invalid credentials, account locked, service unavailable
- Load Tests: Concurrent login attempts
REQUIREMENTS:
- Use GIVEN/WHEN/THEN format for each scenario
- Define specific test data (usernames, passwords)
- Specify expected status codes and error messages
- Identify test dependencies and setup requirements
OUTPUT: Comprehensive test specification document.
SUCCESS: All happy paths and edge cases are covered.
AUTONOMY: Choose testing frameworks and assertion styles.
FALLBACK: If authentication flow is unclear, document assumptions.

Example 4: Reviewer Agent (with scope constraints)

Original Prompt:

Review error handling in the codebase.

Scope Context:

{
  "target_dirs": ["src/services/", "src/utils/"],
  "target_files": ["src/services/user_service.py"],
  "scope": "error handling patterns only",
  "focus": ["security", "logging", "user-experience"]
}

Refined Prompt:

ROLE: Security-Focused Code Reviewer
GOAL: Review error handling patterns in the codebase.
CONSTRAINTS: Review only within assigned scope and files:
  - target_dirs: src/services/, src/utils/
  - target_files: src/services/user_service.py
  - focus: security, logging, user-experience
  - scope: error handling patterns only
CONTEXT: Production code review process
OUTPUT: Line-by-line comments and summary report
SUCCESS: Critical issues identified, recommendations actionable
AUTONOMY: Can use static analysis tools within scope
FALLBACK: Ask if scope unclear

Review Checklist:
- Security: Exception leaks sensitive data, proper sanitization
- Logging: Appropriate log levels, no PII exposure
- User Experience: Helpful error messages, graceful degradation
- Code Quality: Consistent patterns, avoid catch-all exceptions
- Documentation: Error scenarios documented, recovery paths clear

Note: When scope context is provided (target_dirs, target_files, scope, focus), SIMPA injects these constraints into the refined prompt above the CONSTRAINTS section, limiting the agent's work to the specified boundaries.

🧠 Self-Improvement Algorithm

SIMPA uses a sigmoid function to intelligently balance exploration (refinement) vs exploitation (reuse):

p_refine(S) = 1 / (1 + exp(k * (S - mu)))

Where:

S = Average score (1.0 - 5.0)
k = Steepness (default: 1.5)
mu = Midpoint (default: 3.0)

Refinement Probability:

Score	Probability
⭐ 1.0	~95% 🔄 Refine heavily
⭐⭐ 2.0	~82% 🔄 Likely refine
⭐⭐⭐ 3.0	~50% ⚖️ Balance point
⭐⭐⭐⭐ 4.0	~18% ✅ Start reusing
⭐⭐⭐⭐⭐ 5.0	~5% ✅ Reuse proven

📊 Database Schema

`refined_prompts` - The Prompt Knowledge Base

Column	Type	Purpose
`id`	UUID	Primary key
`prompt_key`	UUID	Public identifier for MCP tools
`created_at`	TIMESTAMP	When prompt was first refined
`updated_at`	TIMESTAMP	Last modification time
`last_used_at`	TIMESTAMP	Last time this prompt was executed
`embedding`	vector(768)	Semantic embedding for similarity search
`agent_type`	VARCHAR(100)	Agent specialization (e.g., "developer")
`refinement_type`	VARCHAR(20)	Strategy used (default: "sigmoid")
`main_language`	VARCHAR(50)	Primary programming language
`other_languages`	JSON	Additional languages used
`domain`	VARCHAR(100)	Domain/topic classification
`tags`	JSON	Array of descriptive tags
`original_prompt_hash`	VARCHAR(64)	Hash for fast exact-match lookup
`original_prompt`	TEXT	Raw input prompt
`refined_prompt`	TEXT	Optimized/expanded version
`refinement_version`	INTEGER	Version number for iterative refinements
`prior_refinement_id`	UUID	Self-reference for refinement chains
`project_id`	UUID	FK to projects (optional context)
`usage_count`	INTEGER	Total times used
`average_score`	FLOAT	Running average of action scores (1.0-5.0)
`score_weighted`	FLOAT	Bayesian-weighted score for ranking
`context`	JSON	Scope context (focus, target_dirs, etc.)
`is_active`	BOOLEAN	Soft delete flag

`projects` - Project Context

Column	Type	Purpose
`id`	UUID	Primary key
`project_name`	VARCHAR(255)	Unique project name
`description`	TEXT	Project description
`main_language`	VARCHAR(50)	Primary language for this project
`other_languages`	JSON	Other languages used
`library_dependencies`	JSON	Frameworks/libraries (e.g., ["react", "django"])
`project_structure`	JSON	Directory structure hints (src_dirs, test_dirs, etc.)
`created_at`	TIMESTAMP	Project creation time
`updated_at`	TIMESTAMP	Last update time
`is_active`	BOOLEAN	Soft delete flag

`prompt_history` - Learning Data

Column	Type	Purpose
`id`	UUID	Primary key
`project_id`	UUID	FK to projects (optional context)
`prompt_id`	UUID	FK to refined_prompts
`created_at`	TIMESTAMP	When this record was created
`request_id`	UUID	Optional trace/request ID
`executed_by_agent`	VARCHAR(100)	Which agent executed this prompt
`executed_at`	TIMESTAMP	Execution timestamp
`action_score`	FLOAT	Quality score for this execution (1.0-5.0)
`test_passed`	BOOLEAN	Whether tests passed
`lint_score`	FLOAT	Code quality score
`security_scan_passed`	BOOLEAN	Security check results
`files_modified`	JSON	List of modified files
`files_added`	JSON	List of new files created
`files_deleted`	JSON	List of deleted files
`diffs`	JSON	Code diffs organized by language
`execution_duration_ms`	INTEGER	Time taken to execute (milliseconds)
`agent_output_summary`	TEXT	Summary of agent output
`validation_results`	JSON	Test/lint/validation details
`saliency_metadata`	JSON	Diff saliency analysis data

Relationships

projects ||--o{ refined_prompts : "has many"
projects ||--o{ prompt_history : "has many"
refined_prompts ||--o{ prompt_history : "has many"
refined_prompts ||--o{ refined_prompts : "refinement chain"

projects → refined_prompts: One-to-many (a project has multiple prompts)
projects → prompt_history: One-to-many (a project has multiple history entries)
refined_prompts → prompt_history: One-to-many (a prompt has multiple execution records)
refined_prompts → refined_prompts: Self-referential (refinement chains via prior_refinement_id)

Indexes

Performance-optimized indexes on frequently queried columns:

Table	Column(s)	Purpose
`refined_prompts`	`prompt_key`	Unique lookup by public key
`refined_prompts`	`agent_type`	Filter by agent specialization
`refined_prompts`	`main_language`	Filter by language
`refined_prompts`	`domain`	Filter by domain/topic
`refined_prompts`	`project_id`	Join with projects table
`refined_prompts`	`embedding`	Vector similarity search (pgvector HNSW)
`projects`	`project_name`	Unique project name lookup
`projects`	`main_language`	Filter by language
`prompt_history`	`prompt_id`	Join with refined_prompts
`prompt_history`	`project_id`	Join with projects table

🧪 Development

Running Tests

# All tests (requires Docker)
pytest

# Integration tests only
pytest tests/integration -v

# With coverage
pytest --cov=src --cov-report=html

Current Status: 274 tests passing ✅

Database Migrations

# Create new migration after model changes
alembic revision --autogenerate -m "description"

# Apply migrations
alembic upgrade head

# Rollback
alembic downgrade -1

🐳 Docker

Note: Docker is primarily used for testing SIMPA in an isolated environment. It can also be used as an alternative to installing PostgreSQL directly on your machine during development.

For production deployments, you may prefer running SIMPA directly with your existing PostgreSQL instance rather than containerizing both services.

Quick Start with Docker Compose (Testing)

The easiest way to test SIMPA without installing PostgreSQL locally:

# Start PostgreSQL with pgvector in Docker
docker-compose up -d postgres

# Initialize the database
python -m src.main --init-db

# Run the MCP server
python -m src.main

This uses the docker-compose.test.yml which only starts the PostgreSQL service—SAMPA runs natively on your machine using the containerized database.

Production Deployment

# Build optimized image
docker build --target production -t simpa-mcp:latest .

# Run with environment
docker run -d \
  --name simpa-mcp \
  -e DATABASE_URL=postgresql://... \
  -e OPENAI_API_KEY=sk-... \
  simpa-mcp:latest

Multi-stage Targets

Target	Purpose	Size
`builder`	Compile dependencies	Base
`development`	Live code mounting	~2GB
`production`	Optimized runtime	~700MB

📚 Documentation

Document	Description
SIMPA Process Architecture	System architecture, data flow, and component design
Test Suite Development	Comprehensive testing guide and test development
API Reference - MCP tool documentation
Architecture Decisions - ADRs and design patterns

📈 What's Next?

[ ] Multi-agent prompt coordination
[ ] Prompt lineage tracking
[ ] A/B testing framework
[ ] Prompt security scanning
[ ] Custom embedding models

🤝 Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch
Make your changes
Add tests (we have 274 as examples!)
Submit a pull request

📄 License

MIT License - see LICENSE for details

SIMPA - Making AI prompts smarter, one interaction at a time. <a href="https://github.com/yourusername/simpa-mcp">GitHub</a> • <a href="https://example.com/docs">Documentation</a> • <a href="https://example.com/discord">Discord</a>

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured

SIMPA

README

SIMPA - Self-Improving Meta Prompt Agent

🌟 Why SIMPA?

🏗️ Architecture

🔄 Prompt Lifecycle

✨ Features

📋 Prerequisites

Required

For Ollama (Local Models - Recommended)

For Cloud Providers (Optional)

System Requirements

🚀 Quick Start

Option 1: Docker Compose (Recommended for Development/Testing)

Option 2: Manual Setup (Production/Existing Services)

🔌 Adding SIMPA to Your MCP Configuration

Step 1: Install SIMPA Server

Option A: Global Installation (Easiest)

Option B: Docker (Recommended for Production)

Step 2: Configure Your MCP Client

Cursor (~/.cursor/mcp.json)

Claude Desktop (~/Library/Application Support/Claude/claude_desktop_config.json)

Generic MCP Configuration

Using uv (Recommended)

Step 3: Install MCP Inspector (Optional, for Testing)

Step 4: Verify Installation

🛠️ Troubleshooting

"Command not found: uv"

"ModuleNotFoundError: No module named 'src'"

Database Connection Errors

MCP Server Not Responding

🔧 Configuration

Environment Variables

⚡ Critical Parameters (Required)

Minimal Configuration Example

Optional Parameters (With Working Defaults)

Database Connection Details

Embedding Service

LLM Service

API Keys (Only if using cloud LLM providers)

Embedding Cache

LLM Cache

Fast-Path Hash Match

Refinement Strategy

Vector Search

BM25 Hybrid Search

MCP Server

Logging

Security

Project Association

Diff Saliency

Complete Example .env File

Command Line Options

Environment File (--env)

Project-Associated Prompt Development (--project-id-required)

🛠️ MCP Tools

refine_prompt

update_prompt_results

📝 Prompt Refinement Examples

Example 1: Developer Agent

Example 2: Architect Agent

Example 3: Tester Agent

Example 4: Reviewer Agent (with scope constraints)

🧠 Self-Improvement Algorithm

📊 Database Schema

refined_prompts - The Prompt Knowledge Base

projects - Project Context

prompt_history - Learning Data

Relationships

Indexes

🧪 Development

Running Tests

Database Migrations

🐳 Docker

Quick Start with Docker Compose (Testing)

Production Deployment

Multi-stage Targets

📚 Documentation

📈 What's Next?

🤝 Contributing

Cursor (`~/.cursor/mcp.json`)

Claude Desktop (`~/Library/Application Support/Claude/claude_desktop_config.json`)

Complete Example `.env` File

Environment File (`--env`)

Project-Associated Prompt Development (`--project-id-required`)

`refine_prompt`

`update_prompt_results`

`refined_prompts` - The Prompt Knowledge Base

`projects` - Project Context

`prompt_history` - Learning Data