
Whispera
AI-powered voice transcription app for macOS using WhisperKit
README
Whispera
A native macOS app that replaces the built-in dictation with OpenAI's Whisper for superior transcription accuracy. Transcribe speech, local files, YouTube videos, and network streams - all processed locally on your Neural Engine. <div align="center">
⬇️ Download Latest Release
</div>
Demos
<table> <tr> <th>Speech to Text Field</th> <th>File/URL Transcription with Timestamps</th> </tr> <tr> <td width="50%"> <video src="https://github.com/user-attachments/assets/1da72bbb-a1cf-46ee-a997-893f1939e626" controls> Your browser does not support the video tag. </video> </td> <td width="50%"> <video src="https://github.com/user-attachments/assets/d573bef4-a3b2-49ac-a1fd-3c6735648fdc" controls> Your browser does not support the video tag. </video> </td> </tr> </table>
Features
- Live transcription (beta)
- Speech-to-text - Replaces macOS native dictation with WhisperKit (OpenAI's Whisper model on Neural Engine) for better accuracy
- File transcription - Audio and video files
- Network media transcription - Stream video/music URLs
- YouTube transcription
All processing runs locally. Internet required only for initial model download.
Roadmap
- [x] Multi-language support beyond English
- PR: https://github.com/sapoepsilon/Whispera/pull/2
- Release: https://github.com/sapoepsilon/Whispera/releases/tag/v1.0.3
- [x] Real-time translation capabilities
- PR: https://github.com/sapoepsilon/Whispera/pull/17
- Release: https://github.com/sapoepsilon/Whispera/releases/tag/v1.0.18
- [ ] Additional customization options
Usage
Simply use your configured global shortcut to start transcribing with Whisper instead of the default macOS dictation.
Known Issues
- The app does not work with Intel mac(see Issue 15
- Auto install does not work, after an app has been downloaded, please manually drag and drop the app to you
/Application
folder
Requirements
- macOS 13.0 or later
- Apple Silicon
- We are working on support for Intel Mac
Credits
Built with:
- WhisperKit - On-device Whisper transcription for Apple Silicon
- YouTubeKit - YouTube content extraction
- swift-markdown-ui
Thanks to these projects for making privacy-focused, local transcription a reality.
License
MIT License
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.