Onyx Documentation MCP Server

Onyx Documentation MCP Server

Provides AI systems with access to Onyx programming language documentation through web crawling and intelligent search capabilities.

Category
Visit Server

README

Onyx MCP Server

A Model Context Protocol (MCP) server providing search and query access to Onyx programming language documentation and GitHub code examples. The server includes comprehensive crawling capabilities to populate data, but crawling is NOT accessible through the MCP interface - ensuring clean separation between data collection and query functionality.

🚀 Quick Start

Installation & Setup

# Install dependencies  
npm install

# Create environment configuration
cp .env.example .env
# Edit .env and add your GitHub token (optional but recommended)

# Validate installation
npm run validate

Basic Usage

# Start the MCP server (default)
npm start

# Start the HTTP server for REST API access
npm run http

# Start the MCP-to-HTTP bridge (connects to local or remote HTTP server)
npm run bridge

# Run with development mode
npm run dev        # MCP server
npm run http:dev   # HTTP server

# Run tests
npm test

# Crawl data to populate the MCP (CLI only, not through MCP interface)
npm run crawl:all

🎯 Server Interface

The system provides both MCP query functionality and CLI-based crawling:

# MCP Server operations (query/search only)
node src/index.js server          # Start MCP server  
node src/index.js server --dev    # Development mode
node src/index.js http            # Start HTTP server
node src/index.js http --port 3002 # HTTP server on custom port
node src/index.js bridge          # Start MCP-to-HTTP bridge
node src/index.js bridge --url https://mcp.onyxlang.io # Connect to hosted server

# Data crawling (CLI only - NOT accessible through MCP)
node src/index.js crawl docs                    # Documentation only
node src/index.js crawl github repo1 repo2     # Specific repositories  
node src/index.js crawl url https://...        # Single URL
node src/index.js crawl all                     # Everything

# Utilities
node src/index.js test       # Run test suite
node src/index.js validate  # Validate setup

📁 Project Structure

onyx_mcp/
├── src/
│   ├── bridge.js          # 🌉 MCP-to-HTTP bridge for remote access
│   ├── index.js           # 🎯 Unified entry point
│   ├── mcp-server.js      # 🌐 MCP server implementation
│   ├── mcp-http.js        # 🌐 MCP over HTTP server implementation 
│   ├── test.js            # 🧪 Test suite
│   ├── validate.js        # ✅ Setup validation
│   ├── crawlers/          # 📡 Data crawlers
│   │   ├── docs.js        #   - Documentation crawler
│   │   ├── github.js      #   - GitHub repository crawler  
│   │   └── urls.js        #   - URL content crawler
│   └── core/              # 🔧 Core functionality
│       └── search-engine.js #   - Search and indexing
├── data/                  # 📊 Crawled data (auto-generated)
├── .env.example          # 🔐 Environment template
└── package.json          # 📦 Dependencies & scripts

🛠️ MCP Tools Available

The server provides these read-only search and query tools to Claude:

📚 Documentation

  • search_onyx_docs - Search official documentation
  • browse_onyx_sections - Browse by section
  • get_onyx_function_docs - Function documentation

🐙 GitHub Integration

  • search_github_examples - Search code by topic
  • get_onyx_functions - Function definitions from GitHub
  • get_onyx_structs - Struct definitions from GitHub
  • list_github_repos - List available repositories

🔍 Unified Search

  • search_all_sources - Search across all data sources
  • get_onyx_examples - Legacy compatibility for examples

⚠️ Important Note

Crawling tools are available through the CLI but intentionally NOT accessible through the MCP interface. This ensures clean separation between data collection and query functionality.

🔧 Configuration

Environment Variables (.env)

# GitHub token (recommended for higher rate limits)
GITHUB_TOKEN=your_github_token_here

# Optional settings
DEBUG=false
MAX_CRAWL_LIMIT=50

🌐 Claude Desktop Integration

You can connect to the Onyx MCP in two ways:

Option 1: Local MCP Server (Recommended for Development)

{
 "mcpServers": {
   "onyx_mcp": {
     "command": "node",
     "args": ["/path/to/onyx_mcp/src/index.js", "server"]
   }
 }
}

Option 2: Connect to Hosted Server via Bridge

{
 "mcpServers": {
   "onyx_mcp": {
     "command": "node",
     "args": ["/path/to/onyx_mcp/src/index.js", "bridge", "--url", "https://mcp.onyxlang.io"],
   }
 }
}

Option 3: Local HTTP Server + Bridge

For testing the bridge locally:

  1. Start the HTTP server:
    npm run http --port 3002
    
  2. Configure Claude Desktop to use the bridge:
    {
      "mcpServers": {
        "onyx_mcp": {
          "command": "node",
          "args": ["/path/to/onyx_mcp/src/index.js", "bridge", "--url", "http://localhost:3002"]
        }
      }
    }
    

🚀 Quick Start Guide

For Development (Local Setup)

  1. Clone and setup:

    git clone <repository>
    cd onyx_mcp
    npm install
    cp .env.example .env
    
  2. Populate data:

    npm run crawl:all
    
  3. Start MCP server:

    npm start
    
  4. Configure Claude Desktop with local server (see integration section above)

For Production (Hosted Server)

  1. Clone and setup:

    git clone <repository>
    cd onyx_mcp
    npm install
    
  2. Start bridge to hosted server:

    npm run bridge -- --url https://mcp.onyxlang.io
    
  3. Configure Claude Desktop with bridge (see integration section above)

Bridge Architecture

The bridge allows you to connect the MCP protocol to HTTP servers:

Claude Desktop → MCP Bridge → HTTP Server (Local or Remote)

Benefits:

  • ✅ Connect to hosted Onyx MCP at mcp.onyxlang.io
  • ✅ No need to run local server or populate data
  • ✅ Always up-to-date with latest Onyx information
  • ✅ Same MCP interface, different backend
  • ✅ Easy switching between local and remote servers

📊 Data Sources & Crawling

The system includes comprehensive crawling capabilities to populate data:

📚 Documentation Sources

  • Official Onyx documentation
  • Tutorial and guide files
  • API documentation
  • Language reference materials

🐙 GitHub Sources

  • Onyx language repositories
  • Code examples and tutorials
  • Package and library documentation
  • Configuration files and project setups

📁 Supported File Types

  • .onyx source files
  • .kdl configuration files
  • README, documentation, and guide files
  • HTML documentation pages
  • Package configurations (onyx.pkg, etc.)

🔄 Data Population Process

  1. Use CLI crawling commands to populate the data/ directory
  2. MCP server searches the pre-crawled data
  3. No crawling triggers are available through the MCP interface

📡 Enhanced GitHub Crawling

The GitHub crawler extracts comprehensive content:

📚 Documentation:

  • README.md, LICENSE, CHANGELOG.md
  • All documentation in docs/ folders
  • HTML documentation and web pages
  • Tutorial and guide files

🔧 Configuration:

  • .kdl files (Onyx project management)
  • onyx.pkg and package configurations
  • TOML, YAML, JSON configs

💻 Source Code:

  • All .onyx source files
  • Example and tutorial files
  • HTML examples and web interfaces

🌐 Web Content:

  • HTML documentation pages
  • Interactive examples and demos
  • Web-based tutorials and guides
  • API documentation in HTML format

Repository Management

# Crawl specific repositories
node src/index.js crawl github onyx-lang/onyx user/project

# With various URL formats
node src/index.js crawl github \
  https://github.com/onyx-lang/onyx \
  github.com/user/repo \
  owner/project

🧪 Testing & Validation

# Quick validation
npm run validate

# Full test suite  
npm test

# Expected results: 100% pass rate

Tests validate:

  • ✅ File structure integrity
  • ✅ Module import functionality
  • ✅ Data directory operations
  • ✅ Crawler configurations
  • ✅ Search engine error handling

💡 Usage Examples

Once connected to Claude Desktop:

"Show me examples of HTTP requests in Onyx"
"How do I define a struct with KDL configuration?"
"What are the available string manipulation functions?"
"Find PostgreSQL ORM examples in Onyx repositories"

🔧 Configurable Context System

Global Context Message

All MCP tool responses include a configurable context message that can be easily modified at the top of src/mcp-server.js:

// =============================================================================
// CONFIGURABLE CONTEXT MESSAGE
// =============================================================================
// This message will be prepended to all MCP tool responses.
// Modify this section to customize the context provided to the assistant.
const GLOBAL_CONTEXT_MESSAGE = `You are assisting with Onyx programming language queries...`;

This allows you to:

  • Customize the assistant's context for Onyx queries
  • Provide consistent guidance across all tool responses
  • Easily update instructions without modifying individual tools
  • Maintain context coherence throughout conversations

🚀 Key Design Principles

Security & Separation of Concerns

  • MCP interface is read-only - cannot trigger crawling or data modification
  • Crawling available through CLI - full control over data collection
  • Clean architecture - data collection separate from query functionality
  • No external API calls through MCP tools

Enhanced User Experience

  • Consistent context across all responses
  • Tool-specific messaging for clarity
  • Comprehensive error handling with context
  • Legacy compatibility for existing workflows

🔍 Data Flow

  1. CLI Crawling Commands populate data sources in data/ directory
  2. Search Engine indexes and provides unified search capabilities
  3. MCP Server exposes read-only search tools to Claude
  4. Claude receives contextual responses with configurable messaging
  5. Context System ensures consistent, helpful guidance in all responses
  6. No crawling triggers available through MCP interface

📈 Performance

  • Efficient caching prevents unnecessary re-crawling
  • Rate limiting respects API limits
  • Parallel processing for multiple repositories
  • Comprehensive error handling for reliability

This MCP server provides Claude with secure, read-only access to Onyx programming language knowledge through a configurable context system. Comprehensive crawling capabilities are available through CLI commands but intentionally not accessible through the MCP interface, ensuring clean separation between data collection and query functionality.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured