Files Vector Store

Files Vector Store

A very simple vector store that provides capability to watch a list of directories, and automatically index all the markdown, html and text files in the directory to a vector store to enhance context.

lishenxydlgzs

Programming Docs Access
Visit Server

README

@lishenxydlgzs/simple-files-vectorstore

A Model Context Protocol (MCP) server that provides semantic search capabilities across files. This server watches specified directories and creates vector embeddings of file contents, enabling semantic search across your documents.

Installation & Usage

Add to your MCP settings file:

{
  "mcpServers": {
    "files-vectorstore": {
      "command": "npx",
      "args": [
        "-y",
        "@lishenxydlgzs/simple-files-vectorstore"
      ],
      "env": {
        "WATCH_DIRECTORIES": "/path/to/your/directories"
      },
      "disabled": false,
      "autoApprove": []
    }
  }
}

MCP settings file locations:

  • VSCode Cline Extension: ~/Library/Application Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json
  • Claude Desktop App: ~/Library/Application Support/Claude/claude_desktop_config.json

Configuration

The server requires configuration through environment variables:

Required Environment Variables

You must specify directories to watch using ONE of the following methods:

  • WATCH_DIRECTORIES: Comma-separated list of directories to watch
  • WATCH_CONFIG_FILE: Path to a JSON configuration file with a watchList array

Example using WATCH_DIRECTORIES:

{
  "mcpServers": {
    "files-vectorstore": {
      "command": "npx",
      "args": [
        "-y",
        "@lishenxydlgzs/simple-files-vectorstore"
      ],
      "env": {
        "WATCH_DIRECTORIES": "/path/to/dir1,/path/to/dir2"
      },
      "disabled": false,
      "autoApprove": []
    }
  }
}

Example using WATCH_CONFIG_FILE:

{
  "mcpServers": {
    "files-vectorstore": {
      "command": "npx",
      "args": [
        "-y",
        "@lishenxydlgzs/simple-files-vectorstore"
      ],
      "env": {
        "WATCH_CONFIG_FILE": "/path/to/watch-config.json"
      },
      "disabled": false,
      "autoApprove": []
    }
  }
}

The watch config file should have the following structure:

{
  "watchList": [
    "/path/to/dir1",
    "/path/to/dir2",
    "/path/to/specific/file.txt"
  ]
}

Optional Environment Variables

  • CHUNK_SIZE: Size of text chunks for processing (default: 1000)
  • CHUNK_OVERLAP: Overlap between chunks (default: 200)
  • IGNORE_FILE: Path to a .gitignore style file to exclude files/directories based on patterns

Example with all optional parameters:

  {
    "mcpServers": {
      "files-vectorstore": {
        "command": "npx",
        "args": [
          "-y",
          "@lishenxydlgzs/simple-files-vectorstore"
        ],
        "env": {
          "WATCH_DIRECTORIES": "/path/to/dir1,/path/to/dir2",
          "CHUNK_SIZE": "2000",
          "CHUNK_OVERLAP": "500",
          "IGNORE_FILE": "/path/to/.gitignore"
        },
        "disabled": false,
        "autoApprove": []
      }
    }
  }

MCP Tools

This server provides the following MCP tools:

1. search

Perform semantic search across indexed files.

Parameters:

  • query (required): The search query string
  • limit (optional): Maximum number of results to return (default: 5, max: 20)

Example response:

[
  {
    "content": "matched text content",
    "source": "/path/to/file",
    "fileType": "markdown",
    "score": 0.85
  }
]

2. get_stats

Get statistics about indexed files.

Parameters: None

Example response:

{
  "totalDocuments": 42,
  "watchedDirectories": ["/path/to/docs"],
  "processingFiles": []
}

Features

  • Real-time file watching and indexing
  • Semantic search using vector embeddings
  • Support for multiple file types
  • Configurable chunk size and overlap
  • Background processing of files
  • Automatic handling of file changes and deletions

Repository

GitHub Repository

Recommended Servers

E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Exa MCP

Exa MCP

A Model Context Protocol server that enables AI assistants like Claude to perform real-time web searches using the Exa AI Search API in a safe and controlled manner.

Featured
Perplexity Chat MCP Server

Perplexity Chat MCP Server

MCP Server for the Perplexity API.

Featured
Web Research Server

Web Research Server

A Model Context Protocol server that enables Claude to perform web research by integrating Google search, extracting webpage content, and capturing screenshots.

Featured
PubMedSearch

PubMedSearch

A Model Content Protocol server that provides tools to search and retrieve academic papers from PubMed database.

Featured
Aindreyway Codex Keeper

Aindreyway Codex Keeper

Serves as a guardian of development knowledge, providing AI assistants with curated access to latest documentation and best practices.

Featured
Perplexity Deep Research

Perplexity Deep Research

A server that allows AI assistants to perform web searches using Perplexity's sonar-deep-research model with citation support.

Featured
Docx Document Processing Service

Docx Document Processing Service

A powerful Word document processing service based on FastMCP, enabling AI assistants to create, edit, and manage docx files with full formatting support. Preserves original styles when editing content.

Featured
Web Research Server

Web Research Server

MCP web research server (give Claude real-time info from the web) - oneshot-engineering/mcp-webresearch

Featured