The Ultimate Open Source Solution.
Break language barriers. You speak Chinese; the AI speaks English. Low latency, high quality.
Powerful tech stack for seamless cross-language communication
Powered by Faster-Whisper (large-v3) for accurate speech recognition in multiple languages.
Integrates with OpenAI, Gemini, or Groq for real-time translation with under 1.5 s of latency.
The Microsoft VibeVoice engine delivers professional, human-like English speech output.
Routes the translated speech to Zoom, Teams, or Google Meet through the VB-CABLE virtual audio device.
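The four stages above (speech recognition, translation, speech synthesis, audio output) can be pictured as a simple chain. The sketch below is illustrative only and is not taken from the project's code: `run_pipeline`, `PipelineResult`, and the stub stages are hypothetical names, the stubs stand in for Faster-Whisper, an LLM translation API, and a TTS engine, and applying the 1.5 s figure to the translation stage alone is an assumption about how the latency claim is measured.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict

# Assumption: the "< 1.5 s" figure is checked against the translation stage.
TRANSLATION_BUDGET_S = 1.5

# Hypothetical stage signatures. Real stages would wrap Faster-Whisper,
# an LLM translation API, and a TTS engine respectively.
Transcribe = Callable[[bytes], str]   # Chinese audio -> Chinese text
Translate = Callable[[str], str]      # Chinese text  -> English text
Synthesize = Callable[[str], bytes]   # English text  -> English audio


@dataclass
class PipelineResult:
    audio: bytes
    source_text: str
    translated_text: str
    timings: Dict[str, float] = field(default_factory=dict)

    @property
    def translation_within_budget(self) -> bool:
        return self.timings["translate"] < TRANSLATION_BUDGET_S


def run_pipeline(
    chunk: bytes,
    transcribe: Transcribe,
    translate: Translate,
    synthesize: Synthesize,
    clock: Callable[[], float] = time.perf_counter,
) -> PipelineResult:
    """Run one audio chunk through the three stages and time each one."""
    t0 = clock()
    source_text = transcribe(chunk)
    t1 = clock()
    translated_text = translate(source_text)
    t2 = clock()
    audio = synthesize(translated_text)
    t3 = clock()
    timings = {
        "transcribe": t1 - t0,
        "translate": t2 - t1,
        "synthesize": t3 - t2,
    }
    return PipelineResult(audio, source_text, translated_text, timings)


# Stub stages for demonstration; they perform no real recognition,
# translation, or synthesis.
def stub_transcribe(chunk: bytes) -> str:
    return "你好"


def stub_translate(text: str) -> str:
    return {"你好": "Hello"}.get(text, text)


def stub_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


result = run_pipeline(b"\x00\x01", stub_transcribe, stub_translate, stub_synthesize)
print(result.translated_text, result.timings)
```

In a real setup the bytes returned by the synthesis stage would be written to the virtual audio device that the meeting application uses as its microphone. Timing each stage separately makes it easier to see which component dominates the delay.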
Witness the disappearance of language barriers
Edge-TTS, Coqui, GPT-SoVITS
Speak foreign languages with your own voice
macOS, Linux, Docker Support
Support for translation between multiple languages