Audio MCP Server

Name: Audio MCP Server
Author: GongRzhe

by GongRzhe

Audio recording, playback, and text-to-speech through AI assistants

ai-ml Python Intermediate Self-hostable No API key

⭐ 8 stars 📅 Updated: 10mo ago

View on GitHub View package

Description

An MCP server for audio recording, playback, and text-to-speech capabilities. Provides 5 tools for listing audio devices, recording from microphone with configurable parameters (duration, sample rate, channels, device), playing back recordings, playing audio files, and text-to-speech. Useful for voice-based AI workflows and audio processing tasks.

✅ Best for

AI workflows that need local audio recording and playback capabilities

⏭️ Skip if

You need cloud-based speech-to-text or production-grade TTS

💡 Use cases

Recording audio through AI assistant commands
Playing back recordings and audio files
Listing and selecting audio input/output devices
Voice note capture during AI-assisted workflows

👍 Pros

✓ No API key required — fully local audio processing
✓ Configurable recording parameters (duration, sample rate, channels)
✓ Device enumeration for input/output selection
✓ Cross-platform support

👎 Cons

✗ Text-to-speech is planned but not yet fully implemented
✗ Requires audio hardware (microphone/speakers)
✗ Audio library dependencies may need system-level installation
✗ Small community (8 stars)

🔧 Exposed tools (5 tools)

Tool	Category	Description
list_audio_devices	device-management	List all available audio input and output devices
play_latest_recording	playback	Play back the most recently recorded audio
play_audio	playback	Text-to-speech with configurable voice parameters
play_audio_file	playback	Play an audio file through speakers
record_audio	recording	Capture microphone input with configurable duration and parameters

💡 Tips & tricks

Use list_audio_devices first to identify available devices, then specify the device index in record_audio for the correct input.

Quick info

Author: GongRzhe
License: MIT
Runtime: Python 3.10+
Transport: stdio
Category: ai-ml
Difficulty: Intermediate
Self-hostable: ✅
Auth: —
Docker: —
Version: 1.0.0
Updated: May 17, 2025

Client compatibility

❓ Claude Code
❓ Cursor
❓ VS Code Copilot
❓ Gemini CLI
❓ Windsurf
❓ Cline
❓ JetBrains AI
❓ Warp

Platforms

🍎 macOS 🐧 Linux 🪟 Windows