MCP Servers

SEC EDGAR Filings MCP Server

Enables AI agents to download, convert, and analyze SEC EDGAR filings with tools for downloading filings, converting HTML to PDF, and transforming PDF to Markdown.

README

SEC EDGAR Filings MCP Server

A Model Context Protocol (MCP) server that enables AI agents to download, convert, and analyze SEC EDGAR filings. Built with FastMCP, this server provides tools for downloading SEC filings, converting HTML to PDF, and transforming PDF documents to Markdown for LLM processing.

Features

Core MCP Tools

read_as_markdown - Convert PDF files to Markdown using Docling <br> 1-2. read_markdown_file - List files, read large markdown files in chunks
html_to_pdf - Convert HTML/iXBRL files to PDF using Playwright
download_sec_filing - Download SEC EDGAR filings by CIK, year, and filing type
Set the three MCP Tools that you created to work with Cloud Desktop.

Key Capabilities

SEC EDGAR Integration: Direct download from SEC's official API
Document Processing: Complete pipeline from HTML → PDF → Markdown
Docker Ready: One-command setup with volume mounting
Claude Desktop Compatible: Pre-configured for immediate use
Rate Limiting: Complies with SEC's 10 requests/second limit
Image Extraction: Automatically extracts and references images from documents

Requirements

Docker (recommended)
Claude Desktop for MCP client testing

Start with Docker

1. Build the Docker Image

docker build -t sec-mcp .

2. Configure Claude Desktop

Update your Claude Desktop configuration file.

Replace {AbsolutePath} with your actual project path.

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json

Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "sec-edgar": {
      "command": "docker",
      "args": [
        "run",
        "--rm",
        "-i",
        "--volume",
        "/{AbsolutePath}/app/pdf:/app/pdf",
        "--volume",
        "/{AbsolutePath}/app/html:/app/html",
        "--volume",
        "/{AbsolutePath}/app/markdown:/app/markdown",
        "--volume",
        "/{AbsolutePath}/app/extracted_images:/app/extracted_images",
        "sec-mcp"
      ],
      "env": {}
    }
  }
}

3. Restart Claude Desktop

Close and reopen Claude Desktop to load the MCP server.

4. Example Usage Workflow

Here's a complete example prompt for Claude Desktop:

From downloading SEC EDGAR public files to Markdown conversion, please do the following in order. If the contents are in the folder when creating files, do not create them.

1. First, download the latest public HTML file of CIK 1018724, Year 2024, Form DEF 14A type from SEC EDGAR.

2. Converts downloaded HTML files to PDFs.

3. Convert the converted PDF file to Markdown.
Check the PDF size and wait for the Markdown text return if the size is small.
If the size is large, the Markdown text will not output immediately, and the conversion will take a long time, so wait and see the file in /markdown.

4. All steps run sequentially, and if the file is small, please wait for the Markdown conversion to complete. Avoid additional commands before completing.

API Documentation

1. read_as_markdown

Converts PDF files to Markdown using Docling.

read_as_markdown(
    input_file_path: str,    # PDF file path in pdf/ folder
)

Example:

read_as_markdown("amzn_2024_8k.pdf")

Features:

Extracts images with proper references
Handles large files by saving to file
Returns content directly for small files
Creates structured markdown with tables and formatting

1-2. read_markdown_file

read_markdown_file(
    markdown_filename: str, # Filename in markdown/ folder
    start_char: int = 0,    # Starting character position
    length: int = 50000     # Number of characters to read
)

2. html_to_pdf

Converts HTML/iXBRL files to PDF using Playwright.

html_to_pdf(
    input_file_path: str,  # Relative path in html/ folder
    output_file_path: str  # Output path in pdf/ folder
)

Example:

html_to_pdf("amzn_2024_8k/amzn-20241031.htm", "amzn_2024_8k.pdf")

3. download_sec_filing

Downloads SEC EDGAR filings for a specific company.

download_sec_filing(
    cik: str,           # Company's CIK number (e.g., "1018724" for Amazon)
    year: int,          # Filing year (2021-2025)
    filing_type: str,   # "8-K" | "10-Q" | "10-K" | "DEF 14A"
    output_dir_path: str # Output directory path (e.g., "amzn_2024_8k")
)

Example:

download_sec_filing("1018724", 2024, "8-K", "amzn_2024_8k")

Returns: Path to the main HTML filing (e.g., html/amzn_2024_8k/amzn-20241031.htm)

Project Structure

sec_mcp/
├── main.py                     # MCP server implementation
├── requirements.txt            # Python dependencies
├── Dockerfile                  # Docker configuration
├── claude_desktop_config.json  # Claude Desktop MCP config
├── README.md                   # This file
└── app/                        # Data directories (mounted as volumes)
    ├── pdf/                    # PDF files
    ├── html/                   # HTML/iXBRL files
    ├── markdown/               # Generated Markdown files
    └── extracted_images/       # Extracted images from documents

Testing

Test Files

The project has been tested with these SEC filings:

8-K: Amazon.com Inc. - Form 8-K. 2024-05-14.pdf
10-Q: Amazon.com Inc. - Form 10-Q. For the Fiscal Quarter Ended 2025-03-31.pdf
10-K: Amazon.com Inc. - Form 10-K. For the Fiscal Year Ended 2024-12-31.pdf
DEF 14A: Amazon.com Inc. - Form DEF 14A. Definitive Proxy Statement.pdf

Sample Test Commands

Download a filing:

download_sec_filing("1018724", 2024, "8-K", "test_download")

Convert to PDF:

html_to_pdf("test_download/main_file.htm", "test_output.pdf")

Convert to Markdown:
```
read_as_markdown("test_output.pdf")
```

Configuration

SEC API Compliance

Implements 10 requests/second rate limiting
Uses proper User-Agent headers
Follows SEC EDGAR access guidelines

Troubleshooting

Limitations

Image Reprocessing: When converting HTML to PDF and then to Markdown, images are regenerated rather than reused from the original HTML download. Failed to extract the original file name of the image downloaded to HTML. When PDF→Markdown is converted, the image is recreated and stored in extracted_images/ directory.
Table of Contents Recognition: Some Table of Contents sections are incorrectly recognized as images during PDF processing.

Support

For issues and questions:

Check the troubleshooting section
Review Docker and volume mount configurations
Verify Claude Desktop MCP setup
Test individual MCP tools

Built with FastMCP for seamless AI agent integration

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured