Orion Vision MCP Server
Enables AI systems to analyze documents and extract form data through Azure Form Recognizer/Document Intelligence, supporting various document types including receipts, invoices, and ID documents.
README
Orion Vision MCP Server 🚀
🔌 Compatible with Cline, Cursor, Claude Desktop, and any other MCP Clients!
Orion Vision MCP is also compatible with any MCP client
The Model Context Protocol (MCP) is an open standard that enables AI systems to interact seamlessly with various data sources and tools, facilitating secure, two-way connections.
The Orion Vision MCP server provides:
- Seamless integration with Azure Form Recognizer / Document Intelligence
- Document analysis and form data extraction capabilities
- Support for various document types (receipts, invoices, ID documents, etc.)
- Type-safe operations with TypeScript
Prerequisites 🔧
Before you begin, ensure you have:
- Azure Form Recognizer / Document Intelligence endpoint and key
- Claude Desktop or Cursor
- Node.js (v20 or higher)
- Git installed (only needed if using Git installation method)
Orion Vision MCP server installation ⚡
Running with NPX
npx -y orion-vision-mcp@latest
Installing via Smithery
To install Orion Vision MCP Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @orion-vision/mcp --client claude
Configuring MCP Clients ⚙️
Configuring Cline 🤖
The easiest way to set up the Orion Vision MCP server in Cline is through the marketplace with a single click:
- Open Cline in VS Code
- Click on the Cline icon in the sidebar
- Navigate to the "MCP Servers" tab (4 squares)
- Search "Orion Vision" and click "install"
- When prompted, enter your Azure Form Recognizer credentials
Alternatively, you can manually set up the Orion Vision MCP server in Cline:
- Open the Cline MCP settings file:
# For macOS:
code ~/Library/Application\ Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json
# For Windows:
code %APPDATA%\Code\User\globalStorage\saoudrizwan.claude-dev\settings\cline_mcp_settings.json
- Add the Orion Vision server configuration to the file:
{
"mcpServers": {
"orion-vision-mcp": {
"command": "npx",
"args": ["-y", "orion-vision-mcp@latest"],
"env": {
"AZURE_FORM_RECOGNIZER_ENDPOINT": "your-endpoint-here",
"AZURE_FORM_RECOGNIZER_KEY": "your-key-here"
},
"disabled": false,
"autoApprove": []
}
}
}
- Save the file and restart Cline if it's already running.
Configuring Cursor 🖥️
Note: Requires Cursor version 0.45.6 or higher
To set up the Orion Vision MCP server in Cursor:
- Open Cursor Settings
- Navigate to Features > MCP Servers
- Click on the "+ Add New MCP Server" button
- Fill out the following information:
- Name: Enter a nickname for the server (e.g., "orion-vision-mcp")
- Type: Select "command" as the type
- Command: Enter the command to run the server:
env AZURE_FORM_RECOGNIZER_ENDPOINT=your-endpoint AZURE_FORM_RECOGNIZER_KEY=your-key npx -y orion-vision-mcp@latestImportant: Replace
your-endpointandyour-keywith your Azure Form Recognizer credentials
Configuring the Claude Desktop app 🖥️
For macOS:
# Create the config file if it doesn't exist
touch "$HOME/Library/Application Support/Claude/claude_desktop_config.json"
# Opens the config file in TextEdit
open -e "$HOME/Library/Application Support/Claude/claude_desktop_config.json"
For Windows:
code %APPDATA%\Claude\claude_desktop_config.json
Add the Orion Vision server configuration:
{
"mcpServers": {
"orion-vision-mcp": {
"command": "npx",
"args": ["-y", "orion-vision-mcp@latest"],
"env": {
"AZURE_FORM_RECOGNIZER_ENDPOINT": "your-endpoint-here",
"AZURE_FORM_RECOGNIZER_KEY": "your-key-here"
}
}
}
}
Usage in Claude Desktop App 🎯
Once the installation is complete, and the Claude desktop app is configured, you must completely close and re-open the Claude desktop app to see the orion-vision-mcp server. You should see a hammer icon in the bottom left of the app, indicating available MCP tools.
Orion Vision Examples
- Analyze a Document:
Analyze the document at "https://example.com/document.pdf" using Azure Form Recognizer.
- Extract Form Data:
Extract data from the invoice at "https://example.com/invoice.pdf".
- Process ID Document:
Process the ID document at "https://example.com/id.pdf" and extract relevant information.
Troubleshooting 🛠️
Common Issues
-
Server Not Found
- Verify the npm installation by running
npm --version - Check Claude Desktop configuration syntax
- Ensure Node.js is properly installed by running
node --version
- Verify the npm installation by running
-
Azure Form Recognizer Credentials Issues
- Confirm your Azure Form Recognizer endpoint and key are valid
- Check the credentials are correctly set in the config
- Verify no spaces or quotes around the credentials
-
Document Processing Issues
- Verify the document URL is accessible
- Check the document format is supported
- Ensure the document is not corrupted or password-protected
Acknowledgments ✨
- Model Context Protocol for the MCP specification
- Anthropic for Claude Desktop
- Microsoft Azure for Form Recognizer / Document Intelligence
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.