Cloudera Data Visualization MCP Server
Enables AI agents to manage Cloudera Data Visualization resources including groups, users, roles, segments, workspaces, datasets, visuals, connections, and migrations via natural language.
README
Cloudera Data Visualization MCP Server
A Model Context Protocol server that exposes the full Cloudera Data Visualization (CDV) REST API to AI agents. This lets LLMs list, create, update, and delete CDV resources—groups, users, roles, segments, filter associations, workspaces, datasets, visuals, and data connections—as well as run jobs, query the data API, import/export migrations, and perform operational debugging tasks.
Tools
Groups
| Tool | Description |
|---|---|
list_groups |
List all groups |
get_group(object_id) |
Get a group by ID |
create_group(body) |
Create a new group |
update_group(object_id, body) |
Update a group by ID |
delete_group(object_id) |
Delete a group by ID |
Users
| Tool | Description |
|---|---|
list_users |
List all users |
get_user(object_id) |
Get a user by ID |
update_user(object_id, body) |
Update a user by ID |
edit_user_profile(username, body) |
Edit a user's profile by username |
Roles
| Tool | Description |
|---|---|
list_roles |
List all roles |
get_role(object_id) |
Get a role by ID |
create_role(body) |
Create a new role |
update_role(object_id, body) |
Update a role by ID |
delete_role(object_id) |
Delete a role by ID |
Segments
| Tool | Description |
|---|---|
list_segments |
List all segments |
get_segment(object_id) |
Get a segment by ID |
create_segment(body) |
Create a new segment |
update_segment(object_id, body) |
Update a segment by ID |
delete_segment(object_id) |
Delete a segment by ID |
Filter Associations
| Tool | Description |
|---|---|
list_filter_associations |
List all filter associations |
get_filter_association(object_id) |
Get a filter association by ID |
create_filter_association(body) |
Create a new filter association |
update_filter_association(object_id, body) |
Update a filter association by ID |
delete_filter_association(object_id) |
Delete a filter association by ID |
Workspaces
| Tool | Description |
|---|---|
list_workspaces |
List all workspaces |
get_workspace(object_id) |
Get a workspace by ID |
create_workspace(body) |
Create a new workspace |
update_workspace(object_id, body) |
Update a workspace by ID |
delete_workspace(object_id) |
Delete a workspace by ID |
Datasets
| Tool | Description |
|---|---|
list_datasets |
List all datasets |
get_dataset(object_id) |
Get a dataset by ID |
create_dataset(body) |
Create a new dataset |
update_dataset(object_id, body) |
Update a dataset by ID |
delete_dataset(object_id) |
Delete a dataset by ID |
Visuals
| Tool | Description |
|---|---|
list_visuals |
List all visuals (charts/dashboards) |
get_visual(object_id) |
Get a visual by ID |
create_visual(body) |
Create a new visual (raw API body) |
update_visual(object_id, body) |
Update a visual by ID |
delete_visual(object_id) |
Delete a visual by ID |
create_smart_visual(dataset_id, visual_type, title, columns, workspace_id?) |
Create a chart visual — only exposes patterns confirmed to work via the API |
Supported visual_type values for create_smart_visual:
| Type | Use case | Notes |
|---|---|---|
trellis-bars |
Bar chart — one measure vs. one dimension | Measure on x, dimension on y |
trellis-groupedbars |
Grouped bars — one SUM measure, split by color dimension | Use "shelf": "color_shelf" for grouping |
pie |
Pie chart — SUM measure by dimension | Dimension auto-placed on color_shelf |
All other CDV chart types require configuration in CDV's interactive builder and are blocked here to prevent broken visuals.
What is blocked and why:
| Pattern | Error returned | Alternative |
|---|---|---|
count aggregate |
CDV generates count([col]) that Impala rejects |
Use sum on a numeric column |
trellis-lines, scatter, etc. |
Unsupported visual type | Use CDV's builder; use query_dataapi for the data |
Columns starting with avg_, sum_, etc. as measure targets |
CDV tokenizer bug produces avg([avg_col]) |
Choose a different column |
| Date/timestamp columns as dimensions | CDV bracket conversion fails for TIMESTAMP | Use CDV's builder for time-series |
filters parameter |
WHERE ([col]) bracket SQL Impala rejects |
Use query_dataapi for filtered queries; add filters in CDV's builder |
| Deleting dashboards | Cascade-deletes ALL linked chart visuals | Record visual IDs first; use delete_visual carefully |
Connections
| Tool | Description |
|---|---|
list_connections |
List all data connections |
get_connection(object_id) |
Get a connection by ID |
create_connection(body) |
Create a new connection |
update_connection(object_id, body) |
Update a connection by ID |
delete_connection(object_id) |
Delete a connection by ID |
export_connection(object_id) |
Export a connection definition by ID |
Migrations
| Tool | Description |
|---|---|
export_migration |
Export all CDV artifacts as a migration bundle |
import_migration(body) |
Import a CDV migration bundle |
Data API
| Tool | Description |
|---|---|
query_dataapi_get(params) |
Query /apps/dataapi via GET with optional parameters |
query_dataapi_post(body) |
Query /apps/dataapi via POST with a JSON body |
query_data_get(params) |
Query /api/data via GET with optional parameters |
query_data_post(body) |
Query /api/data via POST with a JSON body |
Jobs
| Tool | Description |
|---|---|
run_job(body) |
Trigger a job run |
run_extract(body) |
Run an existing extract job |
create_extract(body) |
Create a new extract job definition |
Debugging / Operations
| Tool | Description |
|---|---|
get_gc_monitor |
Retrieve GC monitor statistics |
post_gc_monitor(body) |
Trigger a GC monitor action |
get_gc_stats |
Retrieve GC statistics |
post_gc_stats(body) |
Trigger a GC stats action |
get_log_levels |
Get log levels for all loggers |
set_log_level(body) |
Set the default log level |
get_logger_level(logger_name) |
Get log level for a specific logger |
set_logger_level(logger_name, body) |
Set log level for a specific logger |
get_toggle_cprofile |
Get current cProfile tracing state |
toggle_cprofile(body) |
Toggle cProfile tracing on/off |
reset_dataconnection_cache(connection_id) |
Reset the cache for a data connection |
reset_dataset_cache(dataset_id) |
Reset the cache for a dataset |
Examples
Cloudera Agent Studio — AI Assisted Supply Chain
The screenshots below show the CDV MCP Server being used inside Cloudera Agent Studio as part of an AI-powered supply chain workflow. The user asked a natural language question and the agent automatically:
- Discovered the available connections and datasets
- Explored the
procurement_transactionstable schema - Planned the right visualization type and column mapping
- Called
create_smart_visualto build a stacked bar chart in CDV - Called
create_dashboardto make it visible in the Logistics MCP Demo workspace
Prompt: "Which suppliers are driving the most urgent spend? Build a stacked bar chart in Data viz."
Step 1 — Agent plans the visualization
The agent reasons through the dataset, identifies supplier_name as the dimension, priority_code as the color grouping, and total_price (sum) as the measure, then maps these to a trellis-groupedbars visual type.

Step 2 — Visual created and linked in CDV
The agent confirms the visualization details, creates the chart via create_smart_visual, wraps it in a dashboard with create_dashboard, and returns a direct link to the new artifact in the Logistics MCP Demo workspace.

Agent Workflow Guidance
CDV Data Hierarchy
CDV organizes data in three layers, each building on the one below:
Connection (links to an external database, e.g. Impala, Hive, Spark SQL)
└── Dataset (points to a specific table or query within that connection)
└── Visual / Dashboard (chart or dashboard built on a dataset)
Agents must respect this hierarchy when exploring or creating resources.
Discovery — Always Check Before Creating
Before creating anything, always discover what already exists at each level.
Step 1 — Connections
Call list_connections to see what data sources are registered in CDV.
Present the list to the user and identify which connection holds their data.
Only offer create_connection if no suitable connection exists and the user explicitly asks for one.
Step 2 — Datasets
Call list_datasets to see what datasets already exist (these are built on top of connections).
Present the results and ask the user which dataset to use.
Only offer create_dataset if no suitable dataset exists on the right connection and the user explicitly confirms they want a new one. Note that create_dataset requires a dc_id — the connection ID from Step 1.
Step 3 — Visuals / Dashboards
Call list_visuals to see what charts/dashboards already exist before building new ones.
Only call create_smart_visual or create_visual once the dataset has been confirmed.
Never assume anything needs to be created. CDV instances typically already have connections, datasets, and even visuals the user can reuse.
Correct flow:
User: "Build me a bar chart of shipment costs"
Agent: list_connections() ← discover data sources
Agent: "I found these connections: ..."
Agent: list_datasets() ← discover existing datasets
Agent: "Which dataset should I use?"
User: "Use supplier_shipping_performance"
Agent: list_workspaces() ← identify target workspace
Agent: create_smart_visual(...) ← now build the visual
Incorrect flow (avoid):
User: "Build me a bar chart of shipment costs"
Agent: create_connection(...) ← ❌ never skip discovery
Agent: create_dataset(...) ← ❌ never create without confirming first
Agent: create_smart_visual(...)
Workspace Discovery — Same Rule Applies
Always call list_workspaces before creating a visual or dashboard. Identify the correct workspace ID from the list; only call create_workspace if the user explicitly requests a new one.
Visual Creation Workflow
When create_smart_visual is used:
- Call
list_connections→ understand available data sources. - Call
list_datasets→ confirm dataset with user (note whichdc_idit belongs to). - Call
list_workspaces→ identify the target workspace ID. - Call
create_smart_visualwithdataset_id,workspace_id, and column specs.- If the Smart Visual API returns 404 (not available on all CDV instances), the tool automatically falls back to the standard admin API using the provided
workspace_id.
- If the Smart Visual API returns 404 (not available on all CDV instances), the tool automatically falls back to the standard admin API using the provided
Environment Variables
| Variable | Required | Description |
|---|---|---|
CDV_BASE_URL |
Yes | Base URL of your CDV deployment (e.g. https://my-cdv.example.com) |
CDV_API_KEY |
Yes | CDV API key — sent as Authorization: apikey <key> |
MCP_TRANSPORT |
No | Transport protocol: stdio (default), http, or sse |
Usage with Claude Desktop
Add the following to the mcpServers section of your claude_desktop_config.json:
Option 1: Direct installation from GitHub (Recommended)
{
"mcpServers": {
"cdv-mcp-server": {
"command": "uvx",
"args": [
"--from",
"git+https://github.com/kevintalbert/cdv-mcp-server@main",
"run-server"
],
"env": {
"CDV_BASE_URL": "https://my-cdv-instance.example.com",
"CDV_API_KEY": "your-api-key-here",
"CDV_USERNAME": "vizapps_admin",
"CDV_PASSWORD": "vizapps_admin"
}
}
}
}
Option 2: Local installation (after cloning the repository)
{
"mcpServers": {
"cdv-mcp-server": {
"command": "uv",
"args": [
"--directory",
"/path/to/CDV-MCP-Server",
"run",
"src/cdv_mcp_server/server.py"
],
"env": {
"CDV_BASE_URL": "https://my-cdv-instance.example.com",
"CDV_API_KEY": "your-api-key-here",
"CDV_USERNAME": "vizapps_admin",
"CDV_PASSWORD": "vizapps_admin"
}
}
}
}
For Option 2, replace /path/to/CDV-MCP-Server with your local path.
Local Development
# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh
# Install dependencies
uv sync
# Run the server (stdio transport, default)
uv run run-server
# Run with HTTP transport
MCP_TRANSPORT=http uv run run-server
You can also create a .env file in the project root with your credentials:
CDV_BASE_URL=https://my-cdv-instance.example.com
CDV_API_KEY=your-api-key-here
CDV_USERNAME=vizapps_admin
CDV_PASSWORD=your-cdv-password
Transport
The MCP server's transport protocol is configurable via the MCP_TRANSPORT environment variable:
stdio(default) — communicate over standard input/output. Useful for local tools, CLI scripts, and integrations like Claude Desktop.http— expose an HTTP server. Useful for web-based deployments and microservices.sse— use Server-Sent Events (SSE) transport. Useful for existing web-based deployments that rely on SSE.
Authentication
The server uses two separate authentication mechanisms with different capability levels:
| Credential | Env Var | Required | Unlocks |
|---|---|---|---|
| CDV Admin API Key | CDV_API_KEY |
Always | Data exploration, admin CRUD, connections, datasets, workspaces, users, groups, migrations, jobs |
| CDV Username + Password | CDV_USERNAME / CDV_PASSWORD |
Optional | create_smart_visual, create_dashboard, create_visual, update_visual, delete_visual |
Without CDV_USERNAME / CDV_PASSWORD
When session credentials are absent, chart and dashboard creation tools are not registered in the MCP server at all — they will not appear in the agent's tool list. The server operates in a read/explore-only mode for visuals:
- ✅
query_dataapi— run SQL queries, explore data - ✅
list_connections,list_datasets,list_workspaces— discover resources - ✅
list_visuals,get_visual— inspect existing dashboards - ✅ All admin tools (users, groups, roles, datasets, etc.)
- ❌
create_smart_visual— not available - ❌
create_dashboard— not available - ❌
create_visual/update_visual/delete_visual— not available
Why two credentials?
CDV's admin API (arc/adminapi/v1/visuals) creates visual metadata (title, type, dataset) but does not persist shelf configurations (which columns appear on which axes). Shelf data is stored through CDV's UI API (arc/reports/report/{id}), which requires a browser-style session login. Without CDV_USERNAME/CDV_PASSWORD, visuals would be created as empty skeletons that render blank in CDV's frontend.
The session is cached in memory and reused across tool calls within a server process.
Copyright (c) 2025 - Cloudera, Inc. All rights reserved.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.