Better Finder MCP

Better Finder MCP

Enables intelligent file search with Git-like staging and indexing, offering semantic and hybrid search for documents, and integrates with Claude Desktop via MCP.

Category
Visit Server

README

🔍 Better Finder MCP

Git-like workflow for intelligent file search and Claude MCP integration

Better Finder transforms file discovery with semantic search, hybrid matching, and a familiar Git-style workflow. Index your documents, search with natural language, and integrate seamlessly with Claude Desktop.

🚀 Quick Start

Installation

# Clone and install
git clone https://github.com/GitHamza0206/better-finder-mcp.git
cd better-finder-mcp
uv venv
source .venv/bin/activate
pip install -e .

# Install it globally with uv
uv tool install -e .

Basic Workflow

# 1. Stage files for indexing (like git add)
better-finder add ~/Documents

# 2. Check what's staged
better-finder status

# 3. Index staged files
better-finder index

# 4. Search your files
better-finder search "quarterly financial reports"

📋 Commands

File Staging

  • better-finder add <path> - Stage files or Documents for indexing
  • better-finder rm <path> - Remove files from staging
  • better-finder status - Show staged files
  • better-finder index - Index staged files

Search & Management

  • better-finder search <query> - Search indexed files
  • better-finder stats - Show index statistics
  • better-finder clear-index - Clear all indexed data
  • better-finder server - Start MCP server for Claude

Utilities

  • better-finder show <file> - Display file content
  • better-finder remove-file <file> - Remove file from index

🎯 Key Features

Hybrid Search

Combines semantic search, keyword matching, and fuzzy filename search for comprehensive results.

Git-like Workflow

Familiar staging process gives you precise control over what gets indexed.

.betterfinderignore Support

Use ignore patterns to exclude sensitive files:

# .betterfinderignore
secrets/
*.key
temp-*.pdf
node_modules/

Supported File Types

  • Documents: PDF, DOC, DOCX, TXT, MD, RTF, ODT
  • Spreadsheets: XLSX, XLS, CSV, ODS
  • Presentations: PPTX, PPT
  • Data: JSON, XML

Claude MCP Integration

Start the MCP server to use Better Finder directly within Claude Desktop:

better-finder server

Add to your Claude Desktop config (~/.cursor/mcp.json):

{
  "mcpServers": {
    "better-finder": {
      "command": "better-finder",
      "args": ["server"]
    }
  }
}

🔧 Configuration

Better Finder uses smart defaults:

  • Default scan paths: ~/Documents, ~/Desktop, ~/Downloads
  • Ignored directories: .git, node_modules, .venv, cache folders
  • Chunk size: 2000 characters for better context
  • Search threshold: 0.4 for comprehensive results

💡 Examples

Document Discovery

# Find presentations about sales
better-finder search "sales presentation Q4"

# Look for specific file types
better-finder search "budget" --type excel

# Get more results
better-finder search "meeting notes" --max 20

Staging Workflow

# Stage entire project documentation
better-finder add ./docs

# Remove sensitive files
better-finder rm ./docs/secrets/

# Check what will be indexed
better-finder status

# Index everything staged
better-finder index

MCP Integration

Once the server is running, ask Claude:

  • "Search my documents for budget reports"
  • "Find presentations about project timelines"
  • "Show me files related to client proposals"

🏗️ Architecture

  • File Processors: Extract content from different formats
  • FAISS Vector Store: Fast similarity search with sentence transformers
  • SQLite Metadata: Efficient file metadata and chunk storage
  • Staging System: Git-like file management with JSON persistence
  • MCP Server: Model Context Protocol integration for Claude

📊 Performance

  • Indexing: ~100-500 files per minute
  • Search: Sub-second results
  • Memory: ~200-500 MB for typical collections
  • Storage: ~1-5 MB per 1000 documents

📝 License

MIT License - see LICENSE file for details.


Made for developers who want intelligent file discovery with familiar Git-like controls.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured