Audio MCP Server
by GongRzhe
Audio recording, playback, and text-to-speech through AI assistants
ai-ml Python Intermediate Self-hostable No API key
β 8 stars π
Updated: 10mo ago
Description
An MCP server for audio recording, playback, and text-to-speech capabilities. Provides 5 tools for listing audio devices, recording from microphone with configurable parameters (duration, sample rate, channels, device), playing back recordings, playing audio files, and text-to-speech. Useful for voice-based AI workflows and audio processing tasks.
β Best for
AI workflows that need local audio recording and playback capabilities
βοΈ Skip if
You need cloud-based speech-to-text or production-grade TTS
π‘ Use cases
- Recording audio through AI assistant commands
- Playing back recordings and audio files
- Listing and selecting audio input/output devices
- Voice note capture during AI-assisted workflows
π Pros
- β No API key required β fully local audio processing
- β Configurable recording parameters (duration, sample rate, channels)
- β Device enumeration for input/output selection
- β Cross-platform support
π Cons
- β Text-to-speech is planned but not yet fully implemented
- β Requires audio hardware (microphone/speakers)
- β Audio library dependencies may need system-level installation
- β Small community (8 stars)
π§ Exposed tools (5 tools)
| Tool | Category | Description |
|---|---|---|
| list_audio_devices | device-management | List all available audio input and output devices |
| play_latest_recording | playback | Play back the most recently recorded audio |
| play_audio | playback | Text-to-speech with configurable voice parameters |
| play_audio_file | playback | Play an audio file through speakers |
| record_audio | recording | Capture microphone input with configurable duration and parameters |
π‘ Tips & tricks
Use list_audio_devices first to identify available devices, then specify the device index in record_audio for the correct input.
Quick info
- Author
- GongRzhe
- License
- MIT
- Runtime
- Python 3.10+
- Transport
- stdio
- Category
- ai-ml
- Difficulty
- Intermediate
- Self-hostable
- β
- Auth
- β
- Docker
- β
- Version
- 1.0.0
- Updated
- May 17, 2025
Client compatibility
- β Claude Code
- β Cursor
- β VS Code Copilot
- β Gemini CLI
- β Windsurf
- β Cline
- β JetBrains AI
- β Warp
Platforms
π macOS π§ Linux πͺ Windows