MCPdir β€” MCP Server Directory
πŸ‡¬πŸ‡§ πŸ‡ͺπŸ‡Έ

Audio MCP Server

by GongRzhe

Audio recording, playback, and text-to-speech through AI assistants

ai-ml Python Intermediate Self-hostable No API key
⭐ 8 stars πŸ“… Updated: 10mo ago

Description

An MCP server for audio recording, playback, and text-to-speech capabilities. Provides 5 tools for listing audio devices, recording from microphone with configurable parameters (duration, sample rate, channels, device), playing back recordings, playing audio files, and text-to-speech. Useful for voice-based AI workflows and audio processing tasks.

βœ… Best for

AI workflows that need local audio recording and playback capabilities

⏭️ Skip if

You need cloud-based speech-to-text or production-grade TTS

πŸ’‘ Use cases

  • Recording audio through AI assistant commands
  • Playing back recordings and audio files
  • Listing and selecting audio input/output devices
  • Voice note capture during AI-assisted workflows

πŸ‘ Pros

  • βœ“ No API key required β€” fully local audio processing
  • βœ“ Configurable recording parameters (duration, sample rate, channels)
  • βœ“ Device enumeration for input/output selection
  • βœ“ Cross-platform support

πŸ‘Ž Cons

  • βœ— Text-to-speech is planned but not yet fully implemented
  • βœ— Requires audio hardware (microphone/speakers)
  • βœ— Audio library dependencies may need system-level installation
  • βœ— Small community (8 stars)

πŸ”§ Exposed tools (5 tools)

ToolCategoryDescription
list_audio_devicesdevice-managementList all available audio input and output devices
play_latest_recordingplaybackPlay back the most recently recorded audio
play_audioplaybackText-to-speech with configurable voice parameters
play_audio_fileplaybackPlay an audio file through speakers
record_audiorecordingCapture microphone input with configurable duration and parameters

πŸ’‘ Tips & tricks

Use list_audio_devices first to identify available devices, then specify the device index in record_audio for the correct input.

Quick info

Author
GongRzhe
License
MIT
Runtime
Python 3.10+
Transport
stdio
Category
ai-ml
Difficulty
Intermediate
Self-hostable
βœ…
Auth
β€”
Docker
β€”
Version
1.0.0
Updated
May 17, 2025

Client compatibility

  • ❓ Claude Code
  • ❓ Cursor
  • ❓ VS Code Copilot
  • ❓ Gemini CLI
  • ❓ Windsurf
  • ❓ Cline
  • ❓ JetBrains AI
  • ❓ Warp

Platforms

🍎 macOS 🐧 Linux πŸͺŸ Windows