codetree

codetree

Gives coding agents structured code understanding via tree-sitter with 23 tools and 10 languages, enabling precise queries instead of reading entire files.

Category
Visit Server

README

codetree

Tests PyPI Python License: MIT

Stop feeding entire files to your AI agent.

codetree is an MCP server that gives coding agents structured code understanding via tree-sitter — so they ask precise questions instead of reading thousands of lines. 23 tools, 10 languages, ~1 second startup. No vector DB, no embedding model, no config.

Quick Start

Prerequisite: Install uv if you don't have it (curl -LsSf https://astral.sh/uv/install.sh | sh).

Then cd into any project and run:

claude mcp add codetree -- uvx --from mcp-server-codetree codetree --root .

That's it. The . means "this project." Your agent now has structured code understanding.

Not using Claude Code? See Editor Setup for Cursor, VS Code, Windsurf, and Claude Desktop.

Before / After

Before codetree — agent reads the raw file:

$ cat calculator.py
import math
from typing import Optional

class Calculator:
    """A scientific calculator with memory."""

    def __init__(self):
        self.memory = 0
        self.history = []

    def add(self, a: float, b: float) -> float:
        """Add two numbers."""
        result = a + b
        self.history.append(('add', a, b, result))
        return result

    def divide(self, a: float, b: float) -> Optional[float]:
        """Divide a by b, returns None on zero division."""
        if b == 0:
            return None
        result = a / b
        self.history.append(('divide', a, b, result))
        return result

    # ... 200 more lines of methods ...

Tokens consumed: ~2,000+ for the full file

After codetree — agent asks for the skeleton:

class Calculator → line 4
  "A scientific calculator with memory."
  def __init__(self) (in Calculator) → line 7
  def add(self, a: float, b: float) (in Calculator) → line 11
    "Add two numbers."
  def divide(self, a: float, b: float) (in Calculator) → line 17
    "Divide a by b, returns None on zero division."
  def sqrt(self, x: float) (in Calculator) → line 24
    "Square root using math.sqrt."

Tokens consumed: ~80. That's a 25x reduction.

The agent sees every class, method, and docstring — with line numbers — without reading a single function body. When it needs the full source of divide, it calls get_symbol("calculator.py", "divide") and gets just those 6 lines.

23 Tools

Understand Structure

Tool Purpose
get_file_skeleton(file_path) Classes, functions, methods with line numbers + doc comments
get_symbol(file_path, symbol_name) Full source of a function or class
get_skeletons(file_paths) Batch skeletons for multiple files
get_symbols(symbols) Batch source for multiple symbols
get_imports(file_path) Import statements with line numbers

Navigate Relationships

Tool Purpose
find_references(symbol_name) All usages of a symbol across the repo
get_call_graph(file_path, function_name) What a function calls + what calls it
get_blast_radius(file_path, symbol_name) Transitive impact — what breaks if you change this

Analyze Quality

Tool Purpose
get_complexity(file_path, function_name) Cyclomatic complexity breakdown
find_dead_code(file_path?) Symbols defined but never referenced
detect_clones(file_path?, min_lines?) Duplicate / near-duplicate functions

Inspect & Search

Tool Purpose
search_symbols(query?, type?, parent?) Flexible symbol search with filters
find_tests(file_path, symbol_name) Find test functions for a symbol

Onboarding & Graph

Tool Purpose
index_status() Graph index freshness and stats
get_repository_map(max_items?) Compact repo overview: languages, entry points, hotspots
resolve_symbol(query, kind?, path_hint?) Disambiguate short name into ranked qualified matches
search_graph(query?, kind?, file_pattern?) Graph search with degree filters and pagination

Change & Dataflow

Tool Purpose
get_change_impact(symbol_query?, diff_scope?) Impact analysis via symbol or git diff, with risk levels
analyze_dataflow(file_path, function_name, mode?) Variable dataflow, taint analysis, or cross-function taint tracing

Visualization & History

Tool Purpose
find_hot_paths(top_n?) High-complexity × high-call-count optimization targets
get_dependency_graph(file_path?, format?) File-level dependency graph as Mermaid or list
git_history(mode?, file_path?, top_n?) Git blame, file churn, or change coupling analysis
suggest_docs(file_path?, symbol_name?) Find undocumented functions with context for doc generation

get_file_skeleton, get_skeletons, and search_symbols accept format="compact" for even fewer tokens.

Supported Languages

Language Extensions
Python .py
JavaScript .js, .jsx
TypeScript .ts
TSX .tsx
Go .go
Rust .rs
Java .java
C .c, .h
C++ .cpp, .cc, .cxx, .hpp, .hh
Ruby .rb

Editor Setup

The --root flag tells codetree which project to analyze. Use . for the current directory, or a full path.

Claude Code

cd into your project, then:

claude mcp add codetree -- uvx --from mcp-server-codetree codetree --root .

Cursor

Add to .cursor/mcp.json in your project:

{
  "mcpServers": {
    "codetree": {
      "command": "uvx",
      "args": ["--from", "mcp-server-codetree", "codetree", "--root", "${workspaceFolder}"]
    }
  }
}

VS Code (Copilot)

Add to .vscode/mcp.json in your project:

{
  "servers": {
    "codetree": {
      "command": "uvx",
      "args": ["--from", "mcp-server-codetree", "codetree", "--root", "${workspaceFolder}"]
    }
  }
}

Windsurf

Add to ~/.codeium/windsurf/mcp_config.json:

{
  "mcpServers": {
    "codetree": {
      "command": "uvx",
      "args": ["--from", "mcp-server-codetree", "codetree", "--root", "${workspaceFolder}"]
    }
  }
}

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows):

{
  "mcpServers": {
    "codetree": {
      "command": "uvx",
      "args": ["--from", "mcp-server-codetree", "codetree", "--root", "/path/to/your/project"]
    }
  }
}

Claude Desktop doesn't support ${workspaceFolder}, so use a full path here.

Why codetree?

Alternative Limitation codetree
Reading files directly Burns tokens, no structure, no relationships 25x token reduction, structured output
grep / ripgrep Text only, no AST awareness, no call graphs Understands code structure, not just text
LSP servers Heavyweight, stateful, language-specific setup One command, 10 languages, stateless MCP
SCIP / LSIF indexers Slow builds, complex setup, huge indexes ~1s startup, JSON cache, zero config
AST-only tools Raw trees are verbose and hard for agents Pre-structured output designed for agents

Architecture

Agent (Claude, Copilot, Cursor, etc.)
    │ MCP (stdio)
    ▼
codetree server (FastMCP)
    │
    ├── Indexer → LanguagePlugin → tree-sitter → structured results
    │   Cache (.codetree/index.json, mtime-based)
    │
    └── Graph Layer → SQLite (.codetree/graph.db)
        Persistent symbols + edges, incremental updates
        Change impact, dataflow, taint analysis
Module Responsibility
server.py FastMCP server — defines all 23 tools
indexer.py File discovery, plugin dispatch, definition index
cache.py Skeleton cache with mtime invalidation
registry.py Maps file extensions to language plugins
languages/ One plugin per language (Python, JS, TS, Go, Rust, Java, C, C++, Ruby)
graph/store.py SQLite persistence for symbols and edges
graph/builder.py Incremental graph builder (sha256 change detection)
graph/queries.py Repository map, symbol resolution, change impact, hot paths, dependency graph, doc suggestions
graph/dataflow.py Intra- and cross-function dataflow and taint analysis
graph/git_analysis.py Git blame, churn, change coupling analysis

Adding a Language

  1. pip install tree-sitter-LANG and add to pyproject.toml
  2. Copy src/codetree/languages/_template.py to languages/yourlang.py
  3. Implement the abstract methods
  4. Register extensions in registry.py
  5. Add tests

Development

git clone https://github.com/ThinkyMiner/codeTree.git
cd codeTree
python -m venv .venv
source .venv/bin/activate
pip install -e .
pip install pytest

# Run all tests (~1058 tests, ~35s)
pytest

# Run a single test file
pytest tests/languages/test_python.py -v

Contributing

Contributions welcome! See CONTRIBUTING.md for setup instructions and guidelines.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured