Realtime Voice Web
Live room audio (WebRTC) + voice assistant turn flow (live transcript, response, interrupt).
Start Mic
Pause Mic
Start Voice Assistant
Push-to-talk
PTT(ms)
100
200
300
400
500
600
700
800
900
1000
2000
3000
4000
5000
6000
7000
8000
9000
10000
Lang
STT
local/faster-whisper-small
local/faster-whisper-medium
local/faster-whisper-large-v3
openai/gpt-4o-mini-transcribe
openai/gpt-4o-transcribe
openai/whisper-1
openai/gpt-4o-mini-realtime-preview
openai/gpt-4o-realtime-preview
gemini/gemini-2.5-flash
gemini/gemini-2.5-pro
gemini/gemini-live-2.5-flash-preview
gemini/gemini-live-2.5-pro-preview
gemini/gemini-2.0-flash
TTS
local/edge-tts
openai/gpt-4o-mini-tts
openai/gpt-4o-tts
openai/gpt-4o-audio-preview
openai/gpt-4o-mini-realtime-preview
openai/gpt-4o-realtime-preview
gemini/gemini-2.5-flash-preview-tts
gemini/gemini-2.5-pro-preview-tts
gemini/gemini-live-2.5-flash-preview
gemini/gemini-live-2.5-pro-preview
gemini/gemini-2.5-flash-native-audio-preview-09-2025
Active models
Live transcript
Log