Databricks-cursor-mcp
Enables managing Databricks directly from Cursor IDE with 30 tools for notebooks, clusters, jobs, DBFS, SQL, and workspace operations.
README
Databricks MCP Server for Cursor
30 powerful tools to manage Databricks directly from Cursor IDE with AI assistance.
š Quick Setup
Prerequisites
- Node.js v14+ (
node --version) ā if missing, runbrew install node - Databricks CLI (
databricks --version) ā if missing, runbrew install databricks - Databricks account with OAuth access
Installation
# 1. Clone or download this directory
git clone https://github.com/nayantarasundarraj-hue/Databricks-cursor-mcp.git ~/databricks-mcp-server
cd ~/databricks-mcp-server
# 2. Install dependencies
npm install
# 3. Verify installation
npm test
# 4. Authenticate with Databricks
databricks auth login --profile dev --host https://YOUR-WORKSPACE.cloud.databricks.com
# 5. Verify your paths (note these down for the config below)
which node
which databricks
# 6. Configure Cursor
Edit ~/.cursor/mcp.json:
{
"mcpServers": {
"databricks": {
"command": "node",
"args": [
"/Users/YOUR_USERNAME/databricks-mcp-server/server.js"
],
"env": {
"PATH": "/opt/homebrew/bin:/usr/local/bin:/usr/bin:/bin:/usr/sbin:/sbin",
"DATABRICKS_PROFILE": "dev",
"DATABRICKS_WORKSPACE_HOST": "YOUR-WORKSPACE.cloud.databricks.com"
}
}
}
}
Important:
- Replace
YOUR_USERNAMEwith your macOS username andYOUR-WORKSPACEwith your Databricks workspace hostname. DATABRICKS_PROFILEmust match the profile name you used duringdatabricks auth login. Common values aredev,affirm-prod, orDEFAULT. Check~/.databrickscfgto see your profiles.- If you get
node: command not found, runwhich nodeand replace"node"with the full path (e.g./opt/homebrew/bin/node).
6. Restart Cursor
Quit completely (Command + Q / Alt + F4) and reopen. Wait 10-15 seconds.
ā Verify It Works
In Cursor, ask:
- "List my Databricks clusters"
- "List notebooks in /Users/"
- "Show me available Databricks tools"
š ļø What You Get - 30 Tools
š Browse & Read (5)
list_notebooks- Browse foldersget_notebook- Read contentlist_clusters- Show clusterslist_jobs- Show jobsget_job_details- Job info
ā” Execute & Monitor (5)
run_notebook- Execute with parametersget_run- Check statusget_run_output- Get resultscancel_run- Stop executionlist_runs- View history
āļø Edit Notebooks (3) - with Auto-Backup!
export_notebook- Downloadimport_notebook- Upload with backupget_notebook_revisions- View history
š„ļø Manage Clusters (4)
start_cluster- Wake upstop_cluster- Save costsget_cluster_status- Check statusrestart_cluster- Restart
š Job Operations (2)
run_job- Trigger jobsget_job_runs- View runs
š DBFS Operations (4)
list_dbfs- Browse filesystemread_dbfs_file- Read filesupload_to_dbfs- Uploaddownload_from_dbfs- Download
š SQL Queries (2)
execute_sql- Run querieslist_sql_warehouses- List warehouses
š Workspace Management (3)
create_notebook- Create newdelete_notebook- Removemove_notebook- Organize
š¦ Library Management (2)
list_cluster_libraries- Show installedinstall_library- Add packages
š” Usage Examples
Edit code safely:
"Add import pandas as pd to /Users/me/notebook"
Save money:
"Stop all idle clusters"
Query data:
"Run this SQL on warehouse abc123: SELECT * FROM users LIMIT 10"
Automate workflows:
"Run /Users/me/ETL on cluster X with params date=2024-01-20"
š Safety Features
- ā Path validation ā all local file operations are restricted to your home directory and /tmp; sensitive paths (.ssh, .aws, .env, .kube, etc.) are blocked
- ā
Shell injection prevention ā uses
execFileSync(notexecSync), no shell interpretation - ā
Human-in-the-loop (HIL) ā 8 operations require explicit user confirmation:
run_notebook,run_job,delete_notebook,move_notebook,stop_cluster,restart_cluster,upload_to_dbfs,install_library - ā SQL blocklist ā DDL/DML/Databricks-specific statements (DROP, DELETE, INSERT, COPY INTO, CALL, VACUUM, etc.) require confirmation; read-only SQL runs immediately
- ā Error sanitization ā OAuth tokens, URLs, and internal paths are stripped from error messages
- ā Language validation ā only PYTHON, SQL, SCALA, R accepted for notebook creation
- ā Pinned dependencies ā SDK version locked to prevent supply chain attacks
- ā Automatic local backup on notebook edits (timestamped)
- ā New Databricks revision on import (UI rollback)
ā ļø Agentic Deployment Warning
This MCP exposes both read and write tools in a single server. In an agentic context where the AI processes untrusted content (web pages, Slack messages, RAG documents, emails), a prompt injection could chain read operations (e.g., get_notebook, read_dbfs_file) with write/exec operations (e.g., execute_sql, upload_to_dbfs).
Recommendations:
- Do NOT co-install this MCP in an agent that also has tools reading untrusted external content (web fetch, Slack search, RAG over external docs, mailbox readers) without understanding this risk.
- Always review HIL confirmation prompts carefully ā they show the full operation details including file paths and SQL queries.
- For high-security environments, consider disabling write tools via
disabledToolsin your Cursor MCP config and only enabling the read-only subset.
š Troubleshooting
"node: command not found"
Cursor doesn't load your shell config (.zshrc), so it may not find node. Run which node in your terminal and use that full path as the command value in your mcp.json.
"databricks not found" on tool calls
Same issue ā Cursor can't find the Databricks CLI. Make sure the PATH in your config env includes the directory from which databricks (usually /opt/homebrew/bin or /usr/local/bin).
"npm: command not found"
You need Node.js installed:
brew install node
Then retry npm install.
"Cannot find module @modelcontextprotocol/sdk"
cd ~/databricks-mcp-server
npm install
"Authentication failed" or "Token expired"
databricks auth login --profile YOUR_PROFILE --host https://YOUR-WORKSPACE.cloud.databricks.com
# Then restart Cursor (Cmd+Q, reopen)
"Profile not found"
Make sure DATABRICKS_PROFILE in your mcp.json matches a profile in ~/.databrickscfg. Run cat ~/.databrickscfg to check.
Tools not showing in Cursor
- Check the absolute path in
mcp.jsonis correct - Quit Cursor completely (Cmd+Q) and reopen
- Wait 10-15 seconds
- Check Settings ā MCP for connection status
- Check logs: View ā Output ā "Model Context Protocol"
š Project Structure
databricks-mcp-server/
āāā server.js # Main MCP server (1,380 lines)
āāā package.json # Dependencies
āāā node_modules/ # Installed packages (auto-created)
āāā README.md # This file
š¤ Contributing
To add new tools:
- Add method in
server.js:
async myNewTool(args) {
const result = await this.executeCLI(['my-command', 'subcommand', args.param, '--output', 'json']);
return JSON.parse(result);
}
- Register in
setupHandlers():
case 'my_new_tool':
const result = await this.myNewTool(args);
return { content: [{ type: 'text', text: JSON.stringify(result) }] };
-
Add tool definition in
ListToolsRequestSchemahandler -
Test and commit!
š License
MIT - Feel free to modify and share
š¬ Support
Issues? Check troubleshooting above or ask the team!
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.