roshan-alefba-mcp

roshan-alefba-mcp

A self-hostable MCP server that exposes Roshan AI's Alefba OCR service as tools, enabling document reading, page extraction, status polling, and export to PDF/Word/Excel for Persian, Arabic, and English text.

Category
Visit Server

README

<div align="center">

<img src="assets/banner.svg" alt="roshan-alefba-mcp" width="100%"/>

<br/><br/>

<img src="assets/icons/alefba.svg" height="200" alt="Alefba icon"/>

roshan-alefba-mcp

A self-hostable Model Context Protocol server for Roshan AI's OCR service Alefba (الفبا).

<sub>Unofficial, community-built. Wraps the public API documented at <a href="https://docs.roshan-ai.ir">docs.roshan-ai.ir</a>.</sub>

</div>


What is this?

Alefba (الفبا) is Roshan AI's high-accuracy OCR / document-understanding service for Persian (fa), Arabic (ar) and English (en). Give it an image or PDF and it returns the text split into pages, paragraphs and lines — each with its bounding box, direction and recognition confidence — and it can tell tables, images and text apart. It also exports the analyzed document as a searchable PDF, Word or Excel file.

This server exposes Alefba to any MCP client (Claude or otherwise) as a set of first-class tools. Alefba is commonly self-hosted, and organizations often run many independent instances (per data-center, tenant, or environment), so named instances are a core concept here: every tool takes an optional instance argument selecting which deployment to talk to.

Features

  • All Alefba endpoints as tools, prefixed alefba_ (read, pages, status, download to PDF/Word/Excel, delete, callback).
  • URL input and local-file upload (multipart).
  • Async OCR: queue with wait=false, then poll with alefba_get_result.
  • Many named, self-hosted instances; pick per call via instance.
  • Guardrails: http(s) URL validation, enum/number clamping, list-size limits, and token redaction — tokens are never logged or returned.
  • Transports: stdio (default), sse, streamable-http.

Install

Requires Python 3.10+.

git clone <this-repo>
cd roshan-alefba-mcp
python -m venv .venv && . .venv/bin/activate
pip install -e ".[dev]"      # runtime + test deps (omit [dev] for runtime only)

Run it:

python -m roshan_alefba_mcp --help
python -m roshan_alefba_mcp --transport stdio          # default
python -m roshan_alefba_mcp --transport streamable-http

Configuration

Configuration is read from environment variables. The simplest setup uses the shorthand form, which synthesizes a single instance named default:

Variable Description Default
ROSHAN_ALEFBA_BASE_URL Base URL of the default Alefba instance https://alefba.roshan-ai.ir
ROSHAN_ALEFBA_TOKEN API token for the default instance (sent as Authorization: Token <token>) (none)

For multiple instances, use the nested form (one base_url + token per instance):

Variable Description
ROSHAN_ALEFBA__INSTANCES__<NAME>__BASE_URL Base URL for instance <NAME>
ROSHAN_ALEFBA__INSTANCES__<NAME>__TOKEN Token for instance <NAME>
ROSHAN_ALEFBA__INSTANCES__<NAME>__VERIFY_SSL Verify TLS for <NAME> (default true)
ROSHAN_ALEFBA__INSTANCES__<NAME>__TIMEOUT Request timeout in seconds (default 60)
ROSHAN_ALEFBA__DEFAULT_INSTANCE Instance used when a call omits instance (default default)
ROSHAN_ALEFBA__LOG_LEVEL Log level (default INFO)

Example (two instances, default = dc1):

export ROSHAN_ALEFBA__INSTANCES__DC1__BASE_URL="https://alefba-dc1.example.ir"
export ROSHAN_ALEFBA__INSTANCES__DC1__TOKEN="token-1"
export ROSHAN_ALEFBA__INSTANCES__DC2__BASE_URL="https://alefba-dc2.example.ir"
export ROSHAN_ALEFBA__INSTANCES__DC2__TOKEN="token-2"
export ROSHAN_ALEFBA__DEFAULT_INSTANCE="dc1"

Call list_instances at any time to see configured instance names and base URLs (never tokens).

Use with an MCP client

Add the server to your client config (example for stdio):

{
  "mcpServers": {
    "roshan-alefba": {
      "command": "python",
      "args": ["-m", "roshan_alefba_mcp", "--transport", "stdio"],
      "env": {
        "ROSHAN_ALEFBA_BASE_URL": "https://alefba.roshan-ai.ir",
        "ROSHAN_ALEFBA_TOKEN": "your-token"
      }
    }
  }
}

Tool reference

All tools accept an optional instance (except list_instances and roshan_alefba_docs). OCR tools also share type (general | ID-card | excel), fix_orientation, word_positions, wait, and priority (1–4).

Tool Endpoint Purpose
alefba_read_document POST /api/read_document/ Read (OCR) a document/image from a URL; sync (wait=true) or async (task_ids).
alefba_read_document_upload POST /api/read_document/ (multipart) Upload a local file_path and read it.
alefba_get_result POST /api/read_document/ Fetch/poll an async result by task_id.
alefba_read_pages POST /api/read_pages/ Read specific pages given as URLs with @page=N.
alefba_document_status POST /api/document_status/ Per-document progress (analyzed, processed_pages, all_pages).
alefba_document_pages POST /api/document_pages/ List a document's page URLs.
alefba_download_word POST /api/download_word/ Download as Word (.docx); optional save_path.
alefba_download_excel POST /api/download_excel/ Download as Excel (.xlsx); requires type=excel; optional save_path.
alefba_download_pdf POST /api/download_pdf/ Download as searchable PDF (quality 0–100, color); optional save_path.
alefba_delete_document POST /api/delete_document/ Delete a document and its results.
alefba_read_document_callback POST /api/read_document/ Process and receive the result via a webhook callback_url.
healthcheck GET /api/healthcheck/ Check an instance is up and ready.
list_instances (local) List configured instance names + base URLs (no tokens).
roshan_alefba_docs (local) Documentation about Alefba and these tools.

Download tools return the download URL and request payload by default; pass save_path to download the bytes and save them locally (the saved path is returned). box values in OCR results are "left top width height" in pixels.

Architecture

Architecture

The MCP client calls tools registered by build_server(); the ocr, common and docs tool modules validate input (guardrails.py), resolve the target deployment (config.py) and talk to Alefba over an authenticated async HTTP client (client.py).

Self-hosting & scaling

One process can route to many self-hosted Alefba deployments, selected per call by instance:

Self-hosting multi-instance

The server is stateless, so scale it horizontally (more replicas behind a load balancer, or enable the Helm/Kubernetes HPA). Each replica reads the same instance configuration.

Request flow (async OCR + PDF export)

Request flow

  1. alefba_read_document(wait=false) queues the job; Alefba returns {state, task_ids}.
  2. Poll alefba_get_result(task_id) until the full {document_url, pages[...]} result is ready.
  3. alefba_download_pdf(document_url, save_path) exports a searchable PDF.

The diagrams above are generated with the diagrams library. Regenerate them with python assets/diagrams/generate_diagrams.py (requires pip install diagrams and the Graphviz dot binary).

Deployment

Manifests and modules live in deploy/:

See deploy/README.md for details.

Testing

make test        # pytest, HTTP mocked with respx (live tests skipped)
make smoke       # offline: build server, list tools, assert invariants
python examples/inspect_server.py

Live tests against a real Alefba instance are skipped unless ROSHAN_ALEFBA_LIVE=1 and credentials are set.

License

MIT. "Roshan", the Roshan logo, and "Alefba" are trademarks of their respective owner and are used only to identify the upstream service this tool integrates with.

<div align="center"> <img src="assets/icons/roshan.svg" height="64" alt="Alefba icon"/> </div>

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured