databricks-sql-mcp

databricks-sql-mcp

Enables AI assistants to execute SQL queries and explore databases, tables, and catalogs on Databricks using Unity Catalog.

Category
Visit Server

README

Databricks SQL MCP Server

License: MIT Docker Python 3.11+

A Model Context Protocol (MCP) server that lets AI assistants like Claude execute SQL queries on Databricks. Claude launches the server as a Docker container and communicates over stdin/stdout using the MCP stdio transport.

Features

Core Tools:

  • execute_sql - Run any SQL query against a Databricks SQL Warehouse
  • list_databases - List databases/schemas in the default catalog
  • list_tables - List tables in a specific database
  • describe_table - Show column names, types, and comments for a table

Unity Catalog Tools:

  • list_catalogs - List all catalogs in the Unity Catalog metastore
  • list_schemas - List schemas within a specific catalog
  • list_tables_full - List tables using full 3-part naming (catalog.schema.table)
  • describe_table_full - Describe a table using its full catalog path

Unity Catalog Hierarchy

CATALOG (e.g., analytics)
  └── SCHEMA/DATABASE (e.g., silver)
      └── TABLE (e.g., customers)

Fully qualified name: catalog.schema.table (e.g., analytics.silver.customers)

Prerequisites

  • Docker installed and running
  • Databricks workspace access with a SQL Warehouse
  • Databricks personal access token
  • Claude Desktop or Claude Code (CLI)

Getting Your Databricks Credentials

1. Databricks Host URL

Your workspace URL. The format depends on your cloud provider:

Cloud Example URL
Azure https://adb-1234567890.azuredatabricks.net
AWS https://my-workspace.cloud.databricks.com
GCP https://my-workspace.gcp.databricks.com

2. Personal Access Token

  1. Open your Databricks workspace
  2. Click your username (top-right)
  3. Go to User Settings > Developer > Access tokens
  4. Click Generate new token
  5. Copy the token (starts with dapi...)

3. SQL Warehouse ID

  1. Go to SQL Warehouses in Databricks
  2. Click on a warehouse
  3. Copy the warehouse ID from the URL: /sql/warehouses/<this-id>

Quick Install (Recommended)

The install scripts pull the Docker image, prompt for your Databricks credentials, and register the MCP server with Claude using the Claude Code CLI.

Mac / Linux

curl -fsSL https://raw.githubusercontent.com/benguy1000/databricks-sql-mcp/master/install.sh | bash

Or clone the repo first:

git clone https://github.com/benguy1000/databricks-sql-mcp.git
cd databricks-sql-mcp
chmod +x install.sh
./install.sh

Windows (PowerShell)

git clone https://github.com/benguy1000/databricks-sql-mcp.git
cd databricks-sql-mcp
.\install.bat

After running the installer, restart Claude Desktop (or start a new Claude Code session) and you're ready to go.


Manual Installation

If you prefer to set things up manually, or if the quick install doesn't work for your setup, follow the steps below.

Running with Docker

Pull the Image from Docker Hub

docker pull bkeeleygib/databricks-sql-mcp:latest

Or build it yourself:

docker build -t bkeeleygib/databricks-sql-mcp .

Run with Environment Variables

The -i flag is required -- MCP uses stdin/stdout for communication between Claude and the server.

docker run -i --rm \
  -e DATABRICKS_HOST="https://your-workspace.azuredatabricks.net" \
  -e DATABRICKS_TOKEN="dapi1234567890abcdef" \
  -e DATABRICKS_WAREHOUSE_ID="abc123def456" \
  bkeeleygib/databricks-sql-mcp:latest

Run with .env File

Create a .env file with your credentials (see .env.example):

DATABRICKS_HOST=https://your-workspace.azuredatabricks.net
DATABRICKS_TOKEN=dapi1234567890abcdef
DATABRICKS_WAREHOUSE_ID=abc123def456

Then run:

docker run -i --rm --env-file .env bkeeleygib/databricks-sql-mcp:latest

Configuring Claude Desktop

Add the server to your Claude Desktop config file:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "databricks-sql": {
      "command": "docker",
      "args": [
        "run",
        "-i",
        "--rm",
        "-e", "DATABRICKS_HOST=https://your-workspace.azuredatabricks.net",
        "-e", "DATABRICKS_TOKEN=dapi1234567890abcdef",
        "-e", "DATABRICKS_WAREHOUSE_ID=abc123def456",
        "bkeeleygib/databricks-sql-mcp:latest"
      ]
    }
  }
}

Important: Replace the placeholder values with your actual credentials, then restart Claude Desktop.

Configuring Claude Code (CLI)

Use the Claude Code CLI to register the server directly:

claude mcp add-json databricks-sql '{
  "command": "docker",
  "args": [
    "run", "-i", "--rm",
    "-e", "DATABRICKS_HOST=https://your-workspace.azuredatabricks.net",
    "-e", "DATABRICKS_TOKEN=dapi1234567890abcdef",
    "-e", "DATABRICKS_WAREHOUSE_ID=abc123def456",
    "bkeeleygib/databricks-sql-mcp:latest"
  ]
}'

Example Queries

Once connected, you can ask Claude:

Browsing:

  • "List all catalogs in my Unity Catalog"
  • "Show me the schemas in the analytics catalog"
  • "What tables are in analytics.silver?"
  • "Describe the schema of analytics.silver.customers"

Querying:

  • "Run: SELECT * FROM analytics.silver.customers LIMIT 10"
  • "Show me the top 5 most expensive items"
  • "What's the total revenue by category?"

Security Notes

Never commit your .env file or credentials to Git!

  • The .env file is listed in .gitignore
  • Each user needs their own Databricks personal access token
  • The Docker image does not contain any credentials
  • Credentials are passed at runtime via environment variables

Development

Local Development (without Docker)

Requires Python 3.11+.

# Install dependencies
pip install -r requirements.txt

# Create .env file with your credentials
cp .env.example .env
# Edit .env with your values

# Run the server
python server.py

Project Structure

server.py          # MCP server implementation (FastMCP + Databricks SDK)
requirements.txt   # Python dependencies (pinned to major versions)
Dockerfile         # Docker image definition
install.sh         # Quick installer for Mac/Linux
install.bat        # Quick installer for Windows
.env.example       # Template for environment variables

Troubleshooting

"Error: DATABRICKS_WAREHOUSE_ID not set"

  • Make sure you passed all three environment variables
  • Check that your .env file has values for DATABRICKS_HOST, DATABRICKS_TOKEN, and DATABRICKS_WAREHOUSE_ID

"Query failed: ..."

  • Verify your credentials are correct
  • Check that the SQL Warehouse is running (it may be auto-suspended)
  • Ensure you have permissions to access the requested data

"Server disconnected"

  • Restart Claude Desktop or start a new Claude Code session
  • Verify Docker is running: docker info
  • Check that the container starts successfully: docker run -i --rm --env-file .env bkeeleygib/databricks-sql-mcp:latest

"Tables show as false"

  • This bug has been fixed in the latest version
  • Pull the latest image: docker pull bkeeleygib/databricks-sql-mcp:latest

License

MIT

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured