Duma

Duma

Enables AI agents to ask questions to users asynchronously via a local macOS app, allowing users to respond by text or voice.

Category
Visit Server

README

<div align="center">

<img src="icon.png" width="128" alt="Duma icon">

Duma - Async questions from AI agents to you, via MCP<br>Answer by text or voice

"Duma" - Hungarian for "talking" </div>

Duma is a local macOS app. Agents send questions through MCP, you answer in a native window whenever you get to it, and agents poll back for the response.

Demo

Duma demo

Prerequisites

Setup

1. Install dependencies

brew install python           # 3.12 or newer
brew install portaudio        # required for audio recording
brew install uv               # Python package manager

2. Clone and set up the project

git clone https://github.com/zoltanpetrik/duma.git
cd duma
uv venv                 # creates an isolated Python environment for Duma
uv sync                  # installs all dependencies into that environment

Voice transcription requires an OpenAI API key - see Configuration below. The app works without it; recordings just won't be transcribed.

Running

source .venv/bin/activate
python -m duma

With debug mode (enables developer tools and verbose logging):

python -m duma --debug

When running from the command line, the menu bar icon and microphone TCC prompt will show Python instead of Duma. Use the distribution build for full macOS integration.

Try it out

The app opens empty. To see it in action without setting up MCP, start the app in debug mode (python -m duma --debug) and create a test question from another terminal:

curl -X POST http://localhost:31299/api/questions \
  -H "Content-Type: application/json" \
  -d '{
    "agent_id": "test-agent",
    "short_description": "Pasta or rice?",
    "question_body": "Should we make pasta or rice tonight?",
    "proposed_answers": ["Pasta", "Rice", "Something else"],
    "high_priority": false
  }'

A notification pops up, the question appears in the sidebar, and you can answer it. This endpoint is only available in debug mode - in production, questions come through MCP.

CLI options

Option Default Description
--port 31299 HTTP server port
--debug off Enable pywebview devtools (right-click → Inspect Element) and verbose logging

MCP Configuration

Make sure Duma is running before you connect any MCP client.

Claude Code

Add Duma as a global MCP server so it is available in every project:

claude mcp add --transport http --scope user duma http://localhost:31299/mcp

Then verify that it was added:

claude mcp list

Encouraging Claude to use Duma

Claude will not proactively use Duma unless you tell it to. Add this to your project's CLAUDE.md (or ~/.claude/CLAUDE.md to apply it globally):

## Duma (async questions)

When you encounter a question or decision that doesn't need an immediate answer
(approvals, preferences, non-blocking clarifications), use the Duma MCP tools
instead of asking in the chat. This lets me answer on my own time while you
continue working.

Use agent_id "claude-code" for all Duma calls. Before asking a new question,
call list_my_pending_questions to check if you already have unanswered questions
on the same topic. After asking, continue with other work and poll get_response
later - don't wait for an answer.

Other MCP clients

Add Duma to your MCP client config:

{
  "mcpServers": {
    "duma": {
      "url": "http://localhost:31299/mcp"
    }
  }
}

Distribution build

For proper macOS integration (Dock name, notification identity, microphone permission prompt all showing "Duma" instead of "Python"), build the standalone app:

./build.sh
open dist/Duma.app

This runs PyInstaller and bundles the Python interpreter, dependencies, application code, and static files into a self-contained app at dist/Duma.app. The script handles code signing automatically. The app icon lives at static/icon.icns (generate one at icon.kitchen if you need to replace it).

The output can also be copied to another Mac and launched directly - no Python, venv, or Homebrew needed on the target machine.

Configuration

To enable voice transcription, create a config file with your OpenAI API key:

mkdir -p ~/.duma
cat > ~/.duma/config.json << 'EOF'
{
    "OPENAI_API_KEY": "your-api-key-here"
}
EOF

The app works without this - voice recordings are saved but not transcribed. Transcription language (Hungarian/English) can be switched in the app via the toggle next to the record button.

Development

See DOCS.md and SPECS.md for the full picture.

Tests

uv sync --group test
.venv/bin/python -m pytest

66 tests covering the database, service, REST API, and transcription layers. macOS-native integration (menu bar, notifications, audio recording) is tested manually - see the QA checklist in SPECS.md.

Data

All Duma data stays local:

  • Configuration: ~/.duma/config.json
  • Database: ~/.duma/duma.db
  • Audio recordings: ~/.duma/audio/
  • Logs: ~/.duma/duma.log

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured