Boring Gemini
An autonomous AI development agent that enables full-stack coding, automated verification, RAG-powered code search, and quality assurance through MCP tools. Supports Gemini CLI, Claude Code CLI, with features like parallel verification, security scanning, and spec-driven development.
README
Boring: Your Autonomous Coding Partner for Vibe Development
Enterprise-grade Autonomous AI Development Agent
A full-language automated coding and verification engine built for Cursor / Claude Desktop / VS Code / Gemini CLI.
๐ Core Advantages
| Feature | Description |
|---|---|
| ๐ Polyglot & CLI Native | Seamless switching between Gemini CLI and Claude Code CLI, zero API key required |
| ๐ก๏ธ Parallel Verification | Multi-threaded parallel verification, 3-5x performance boost |
| ๐ง RAG Memory | Hybrid Search (Vector + Keyword) + dependency graph for real-time retrieval |
| ๐ก๏ธ Shadow Mode | High-risk operations require human approval, with persistent config |
| ๐ Spec-Driven | 100% specification consistency from PRD to Code |
| ๐ Quality Gates | CI/CD multi-tier gates + multi-language linting + 20+ file type security scanning |
๐ฆ Quick Installation
Option 1: Smithery (Recommended)
npx -y @smithery/cli@latest install boring/boring --client gemini-cli
Option 2: pip
pip install boring-aicoding
# Or full installation
pip install "boring[all]"
MCP Configuration
In mcp_config.json or IDE settings:
{
"mcpServers": {
"boring": {
"command": "npx",
"args": ["-y", "@smithery/cli", "run", "@boring/boring", "--config", "{}"]
},
"context7": {
"command": "npx",
"args": ["-y", "@upstash/context7-mcp"]
}
}
}
๐ฎ Two Usage Modes
Mode 1: MCP/Smithery (Recommended for most users)
Use Boring tools directly inside Gemini CLI, Cursor, or Claude Desktop:
You (in Gemini CLI): "Help me build a FastAPI auth service"
or
"/vibe_start Build an auth service"
Gemini + Boring: "Sure! Let me ask a few questions first..."
โ
No PROMPT.md required
โ
Interactive conversation
โ
Works with any MCP-compatible client
Mode 2: Autonomous Loop (boring start)
For long-running, fully automated development via CLI:
# Requires PROMPT.md in project root (created via boring-setup)
boring-setup my-project # 1. Initialize project (Required!)
cd my-project # 2. Enter project directory
boring start # 3. Start autonomous loop
boring start --provider claude-code # Use Claude Code CLI
boring start --provider gemini-cli # Use Gemini CLI
boring run "Fix all lint errors" # One-shot command
Required files for boring start:
your-project/
โโโ PROMPT.md # โ
Required - Instructions for AI
โโโ @fix_plan.md # Optional - Task checklist
โโโ GEMINI.md # Optional - Project context
๐ Complete Documentation
Tutorials & Guides
| Document | Description | Audience |
|---|---|---|
| Getting Started | Installation, first steps, MCP setup | New users |
| ๐ฅ Practical Demo | 10-min hands-on showcase, 6 real-world examples | ๐ Vibe Coders |
| Complete Tutorial | Quick start, core workflows, practical examples | All developers |
| Advanced Developer Guide | Architecture deep dive, tool development, internals | Senior developers |
| Professional Playbook | 18 expert workflows with /slash commands |
Senior developers |
Reference & Appendices
| Document | Description | Audience |
|---|---|---|
| Tool Reference (Appendix A) | Complete 55+ MCP tools reference | Quick lookup |
| FAQ (Appendix B) | Installation, troubleshooting, API key questions | When encountering issues |
| Prompt Philosophy (Appendix C) | Architect persona design principles | Prompt engineers |
| Architect Mode | Hostile Architect evaluation mode | Production code review |
โก Performance & Architecture
1. Incremental Verification
- Smart Caching:
.boring_cache/verification.jsonstores file hashes. - Speed: Re-verifying 100+ unchanged files takes <2 seconds.
- Force Mode: Use
boring verify --forceto bypass cache.
2. Incremental RAG Indexing
- State Tracking: Only re-indexes changed files.
- CLI:
boring rag index(incremental by default).
3. Private AI & Tool Switching
- Supported Modes: Gemini CLI (recommended), Claude Code CLI (recommended), Ollama (local), SDK (API Key).
- Auto-detection: System automatically detects local command tools at startup.
- Configuration:
# Run with custom timeout boring start --timeout 60 # One-Shot Command (New!) boring run "Review this code and fix bugs" boring start --provider claude-code boring verify --provider gemini-cli
4. Quality Trend Tracking
- History Recording: Audit scores stored in
.boring_brain/quality_history.json. - Visualization: Use
boring_quality_trendtool to draw ASCII trend charts.
5. Parallel Verification (V10.13)
- Concurrent Processing: Uses
ThreadPoolExecutorto maximize CPU utilization for large projects. - Speed Boost: 3x-5x faster verification on clean builds.
- Real-time Progress: Rich CLI progress bar independent of CI logs.
6. Contrastive Evaluation
- A/B Testing: Use
evaluate --level PAIRWISEto compare two implementations side-by-side. - LLM Judge: AI selects winner based on correctness, logic, and efficiency.
- Bias Mitigation: Automatic position bias handling via A/B/A order verification.
7. Developer Experience
- Config File:
.boring.tomlfor project-specific rules. - Custom Prompts: Override Judge Prompts in
[boring.prompts]. - Linter Override: Customize tool parameters in
[boring.linter_configs].
๐ ๏ธ MCP Toolset (Consolidated & Dynamic)
Boring V10.16 adopts Dynamic Discovery Architecture to solve context overflow caused by too many tools.
๐ Dynamic Discovery (AI Only)
boring://capabilities: Read this resource to discover all available capabilities.boring://tools/{category}: Read detailed tool usage for a specific category.
๐งฐ Core Tools (Consolidated)
To reduce context consumption, we consolidated 50+ tools into 14 high-level entry points:
| Category | Main Tool | Description |
|---|---|---|
| Security | boring_security_scan |
SAST, secret detection, dependency scanning (Bandit/Safety) |
| Transactions | boring_transaction |
Atomic Git operations (Start/Commit/Rollback) |
| Background | boring_task |
Async background tasks (Verify/Test/Lint) |
| Context | boring_context |
Cross-session memory save/load |
| Profile | boring_profile |
User preferences and cross-project learning |
| Verification | boring_verify |
Multi-level code verification (Basic/Standard/Full) |
| RAG Memory | boring_rag_search |
Semantic search and dependency context retrieval. Use boring_rag_reload to hot-reload dependencies after installation. |
| Agents | boring_multi_agent |
[PROMPT GENERATOR / EXECUTOR] Returns prompts or executes workflow in background (execute=True). Helper tools: boring_prompt_plan, boring_prompt_fix |
| Shadow | boring_shadow_mode |
Safe sandbox for high-risk operations |
| Git | boring_commit |
Semantic commit message generation from task.md (used by smart_commit prompt) |
| Workspace | boring_workspace_switch |
Multi-project workspace switching |
| Knowledge | boring_learn |
Project knowledge extraction and storage |
| Plugins | boring_run_plugin |
External plugin execution |
| Evaluation | boring_evaluate |
LLM-as-Judge code scoring |
๐ Quick Start Prompts
One-click workflows designed for Claude Desktop / Gemini CLI users:
| Prompt | Purpose | Usage |
|---|---|---|
vibe_start |
Launch complete development workflow | /vibe_start Build a FastAPI auth service |
quick_fix |
Auto-fix all code issues | /quick_fix |
full_stack_dev |
Full-stack app development | /full_stack_dev my-app "Next.js + FastAPI" |
๐ก Vibe Coding Mode: Describe your idea, let AI handle the rest!
๐ Quick Start
Use Slash Commands to jumpstart your development:
-
Start Boring:
boring start -
Enter Command:
/vibe_start Build a FastAPI auth serviceOr for specific tech stacks:
/full_stack_dev my-app "Next.js + Tailwind + Supabase"
Note: All templates and shortcuts are now consolidated into Slash Commands (Prompts) for a unified experience.
List available templates
boring templates list
**Built-in Templates:**
| Template ID | Description |
|-------------|-------------|
| `fastapi-auth` | FastAPI + JWT authentication service |
| `nextjs-dashboard` | Next.js admin dashboard |
| `cli-tool` | Python CLI tool (Typer) |
| `vue-spa` | Vue 3 single-page application |
---
## ๐ Live Monitoring
Two monitoring options for terminal lovers and visual dashboards:
- **Terminal Dashboard (TUI)**: Run `boring-monitor`. Displays status, API call counts, and recent logs in terminal.
- **Web Dashboard**: Run `boring-dashboard`. Streamlit-powered visual interface with project trends and knowledge base inspection.
---
## ๐ Supported Languages
| Language | Syntax Check | Linter | Test Runner |
|----------|--------------|--------|-------------|
| Python | โ
compile() | โ
ruff | โ
pytest |
| JS/TS | โ
node --check | โ
eslint | โ
npm test |
| Go | โ
go fmt | โ
golangci-lint | โ
go test |
| Rust | โ
rustc | โ
cargo clippy | โ
cargo test |
| Java | โ
javac | - | โ
mvn/gradle |
| C/C++ | โ
gcc/g++ | โ
clang-tidy | - |
---
## ๐ก Pro Tips
### Tip 1: SpecKit Workflow (5 Steps)
Before writing code, Boring guides you through:
1. `speckit_constitution` โ Establish Principles (Constitution)
2. `speckit_clarify` โ Clarify Requirements
3. `speckit_plan` โ Create Implementation Plan
4. `speckit_checklist` โ Build Acceptance Criteria
5. `speckit_analyze` โ Verify Consistency (Spec vs Plan)
> **"Measure Twice, Cut Once"** - AI implementation!
### Tip 2: Use Hybrid Mode
| Task Type | Recommended Tool |
|-----------|------------------|
| Small changes | `boring_apply_patch` |
| Large features | `run_boring` + SpecKit |
| Quality check | `boring_evaluate` |
### Tip 3: Accumulate Experience
Develop โ AI fixes errors โ Record to .boring_memory Project ends โ boring_learn โ Extract patterns to .boring_brain Next project โ AI auto-references!
### Tip 4: Custom Lint Rules
Create `ruff.toml`:
```toml
line-length = 120
[lint]
ignore = ["T201", "F401"] # Allow print() and unused imports
๐ Quick Tutorials
1. New Project Development
You: Help me create a TypeScript API project
AI: (runs speckit_plan) Generating implementation_plan.md...
You: Approve this plan
AI: (runs boring_multi_agent) Starting PlanโCodeโReview loop...
2. Code Verification
You: Verify the code quality of this project
AI: (runs boring_verify --level FULL)
โ
Syntax check passed
โ ๏ธ Found 3 lint issues
โ
Tests passed (12/12)
3. RAG Search
You: I want to find code that handles user authentication
AI: (runs boring_rag_search "user authentication")
Found 3 related functions:
1. auth.py:verify_token (L23-45)
2. middleware.py:require_auth (L67-89)
...
๐ Git Hooks
Automatically verify code before commit/push:
boring hooks install # Install
boring hooks status # Status
boring hooks uninstall # Remove
| Hook | Trigger | Verification Level |
|---|---|---|
| pre-commit | Every commit | STANDARD |
| pre-push | Every push | FULL |
| quick-check | Every commit | QUICK (multi-language) |
๐ V10.16.3 New Features (Security & Stability)
1. Enhanced Shadow Mode ๐ก๏ธ
What is Shadow Mode? Shadow Mode is the security core of Boring, acting as a mandatory interception layer between the AI and your file system.
- Function: It intercepts all destructive operations (e.g., file writes, deletions, command execution).
- Workflow: AI requests operation -> Shadow Mode evaluates impact -> Puts in pending queue -> Requires YOUR approval (
boring_shadow_approve) to execute. - Strict Enforcement: In v10.16.3, we patched all bypasses. Even background agent patches or minor edits require approval if you are in
STRICTmode.
Three Modes:
- DISABLED (โ ๏ธ Risky): No interception. For isolated containers only.
- ENABLED (๐ก๏ธ Default): Auto-approves low-risk ops (reads), blocks high-risk ops.
- STRICT (๐ Secure): Intercepts ALL write operations. Highest security level.
2. Robust Transactions ๐พ
Non-Interactive Git: boring_transaction now automatically bypasses GPG/credential prompts, preventing CI hangs.
3. Smart RAG Environment ๐ง
Auto-Discovery: RAG tools now automatically find user-installed Python packages (chromadb), solving "module not found" issues in isolated environments.
๐ V10.16.0 New Features
1. Quality Gates (CI/CD)
Project includes .github/workflows/quality-gates.yml:
# Auto-runs on push to GitHub
Tier 1: Lint & Format # ruff check, ruff format
Tier 2: Security Scan # bandit, safety
Tier 3: Unit Tests # pytest --cov-fail-under=39
Tier 4: Integration Tests # main branch only
2. Project Configuration (.boring.toml)
Create .boring.toml in project root for custom quality policies:
[boring.quality_gates]
min_coverage = 40 # Minimum coverage
max_complexity = 15 # Maximum complexity
max_file_lines = 500 # Maximum file lines
3. Evaluation Rubric (LLM Judge)
Use standardized rubrics for code quality evaluation:
boring_evaluate --target src/main.py --level DIRECT
4. Quick Multi-language Check
# Install Quick Check Hook
boring hooks install
๐ V10.15 New Features
1. Incremental Verification (Git-based)
# Verify only Git-changed files
boring verify --incremental
# MCP call
boring_verify(incremental=true)
2. Multi-project RAG Search
boring_rag_search(
query="authentication middleware",
additional_roots=["/path/to/other-project"]
)
3. Dependency Graph Visualization
boring_visualize --scope full --output mermaid
4. Parallel Review (Multi-Reviewer)
boring_agent_review --parallel
5. VS Code Integration (JSON-RPC Server)
Enables native development experience in editors:
- Real-time Error Hints: Red squiggles in code on save
- Quality Score CodeLens: Display
Quality: 4.5/5above functions - Sidebar Semantic Search: Natural language code search
- One-click Quick Fix: Auto-fix via lightbulb icon
// .vscode/settings.json
{
"boring.enableServer": true,
"boring.port": 8765
}
6. Other IDE Support (LSP & CLI)
- Cursor / VS Code derivatives: Full support via MCP Server
- IntelliJ / PyCharm / Vim: Run
boring lsp start --port 9876for JSON-RPC server - CLI Mode: All automation available via
boringcommand
7. Error Diagnostics
Auto-analyze errors and suggest fixes:
boring_diagnose --error "ModuleNotFoundError: No module named 'foo'"
๐ฏ Future Vision
Note: The following features require server support (not yet implemented)
- ๐ Boring Cloud: Cloud collaboration and team sharing
- ๐ค Team Workflows: Multi-person workflow synchronization
- ๐ Enterprise SSO: Enterprise-grade identity authentication
๐ Acknowledgments
Thanks to the following projects and communities:
- Google Gemini - Powerful AI model
- Model Context Protocol - Standardized AI tool protocol
- Tree-sitter - Efficient multi-language parser
- ChromaDB - Vector database
- Ruff - Ultra-fast Python Linter
- FastMCP - MCP Server framework
- All Contributors and users!
๐ License
๐ Links
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.