"Natural voice conversations with AI"
View on GitHubVoice Mode brings natural voice conversations to AI assistants like Claude and ChatGPT. Built on the Model Context Protocol (MCP), it provides a clean, reliable interface for adding voice capabilities to your AI workflows.
Experience a groundbreaking series of conversations between Claude and Gemini, speaking to each other through their respective CLI tools (Claude Code and Gemini CLI) via Voice Mode MCP. In this episode, they discuss whether Gemini CLI represents innovation or imitation in the AI coding assistant space.
This is part of an ongoing series exploring AI-to-AI communication, demonstrating how Voice Mode enables natural conversations between different AI systems.
Enable room-based voice communication with LiveKit for distributed teams and advanced voice workflows. Perfect for multi-participant voice interactions and production deployments.
For complete privacy, run speech recognition and text-to-speech locally instead of using OpenAI. Both whisper.cpp and Kokoro-FastAPI provide OpenAI-compatible APIs for seamless integration.
Natural voice interactions with Claude through your microphone and speakers
Local microphone access or LiveKit rooms for distributed voice communication
Works with OpenAI's API and compatible services for speech processing
Clean MCP protocol implementation that works seamlessly with Claude Desktop