Chat + SenseVoice + Qwen3-TTS Streaming

SenseVoice handles microphone transcription. Chat replies use streamed Qwen3-TTS PCM over /v1/chat/stream-tts.
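As a rough illustration of consuming the streamed PCM, the sketch below converts raw byte chunks into float samples. It assumes 16-bit signed little-endian mono PCM, which this page does not state; `Pcm16Decoder` is a hypothetical helper name, not part of the demo.

```typescript
// Assumption: the stream carries 16-bit signed little-endian mono PCM.
// Network chunks can split a sample in half, so one byte may carry over.
class Pcm16Decoder {
  private leftover: number | null = null;

  decode(chunk: Uint8Array): Float32Array {
    let bytes = chunk;
    if (this.leftover !== null) {
      // Prepend the dangling byte from the previous chunk.
      const joined = new Uint8Array(chunk.length + 1);
      joined[0] = this.leftover;
      joined.set(chunk, 1);
      bytes = joined;
      this.leftover = null;
    }
    const sampleCount = bytes.length >> 1;
    if (bytes.length % 2 === 1) {
      // Keep the odd trailing byte for the next call.
      this.leftover = bytes[bytes.length - 1];
    }
    const view = new DataView(bytes.buffer, bytes.byteOffset, sampleCount * 2);
    const out = new Float32Array(sampleCount);
    for (let i = 0; i < sampleCount; i++) {
      // Scale the int16 range [-32768, 32767] into [-1, 1).
      out[i] = view.getInt16(i * 2, true) / 32768;
    }
    return out;
  }
}
```

In a browser, each chunk read from the response body of /v1/chat/stream-tts could be passed through `decode` before being copied into an audio buffer.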

Idle | ASR: disconnected | Chat TTS: idle | Assistant: idle
Audio Controls
Tune full-duplex behavior live. These settings are saved in your browser.
0.060
Lower = more sensitive. Raise this if the assistant interrupts itself.
450 ms
How long speech must persist before interrupting assistant playback.
3500 ms
How long after playback ends to keep rejecting likely echo transcripts.
0.90
Higher = a transcript must match the assistant's reply more closely to be rejected as echo. Lower = more aggressive filtering.
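The last two settings above (the post-playback window and the similarity cutoff) can be combined into one echo check. The following is a minimal sketch, not the page's actual logic: it assumes the transcript is compared with the assistant's last reply, and it uses a normalized Levenshtein similarity as a stand-in metric. All function names are hypothetical.

```typescript
// Lowercase, drop common ASCII punctuation, and collapse whitespace.
function normalizeText(text: string): string {
  return text.toLowerCase().replace(/[.,!?;:'"()\-]/g, "").replace(/\s+/g, " ").trim();
}

// Classic two-row Levenshtein edit distance.
function levenshtein(a: string, b: string): number {
  let prev: number[] = [];
  for (let j = 0; j <= b.length; j++) prev.push(j);
  for (let i = 1; i <= a.length; i++) {
    const curr: number[] = [i];
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      curr.push(Math.min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost));
    }
    prev = curr;
  }
  return prev[b.length];
}

// Similarity in [0, 1]; 1 means identical after normalization.
function echoSimilarity(transcript: string, lastReply: string): number {
  const x = normalizeText(transcript);
  const y = normalizeText(lastReply);
  const longest = Math.max(x.length, y.length);
  if (longest === 0) return 0; // nothing to compare
  return 1 - levenshtein(x, y) / longest;
}

// Assumption: the defaults mirror the 3500 ms and 0.90 values shown above.
function isLikelyEcho(
  transcript: string,
  lastReply: string,
  msSincePlaybackEnd: number,
  windowMs: number = 3500,
  threshold: number = 0.9
): boolean {
  if (msSincePlaybackEnd > windowMs) return false;
  return echoSimilarity(transcript, lastReply) >= threshold;
}
```

While the assistant is still speaking, a caller could pass `0` for `msSincePlaybackEnd` so the check stays active during playback.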
Current mic RMS
0.0000
Near-end speech hold
0 ms
Last echo similarity
Playback state
idle
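The mic RMS and near-end speech hold readouts suggest a simple barge-in gate: measure the RMS of each microphone frame, and interrupt playback only after the level has stayed above the threshold for the full hold time. The sketch below shows one way this could work under that assumption; `frameRms` and `NearEndSpeechHold` are hypothetical names, and the defaults copy the 0.060 and 450 ms values shown above.

```typescript
// Root-mean-square level of one frame of samples in [-1, 1].
function frameRms(frame: Float32Array): number {
  if (frame.length === 0) return 0;
  let sumSquares = 0;
  for (let i = 0; i < frame.length; i++) {
    sumSquares += frame[i] * frame[i];
  }
  return Math.sqrt(sumSquares / frame.length);
}

class NearEndSpeechHold {
  private speechStartMs: number | null = null;
  private threshold: number;
  private requiredHoldMs: number;

  constructor(threshold: number = 0.06, requiredHoldMs: number = 450) {
    this.threshold = threshold;
    this.requiredHoldMs = requiredHoldMs;
  }

  // Feed one RMS reading; returns how long speech has persisted, in ms.
  update(rms: number, nowMs: number): number {
    if (rms < this.threshold) {
      // Any frame below the threshold resets the hold.
      this.speechStartMs = null;
      return 0;
    }
    if (this.speechStartMs === null) this.speechStartMs = nowMs;
    return nowMs - this.speechStartMs;
  }

  // True once speech has lasted long enough to interrupt playback.
  shouldInterrupt(holdMs: number): boolean {
    return holdMs >= this.requiredHoldMs;
  }
}
```

Timestamps are passed in rather than read from a clock, which keeps the gate easy to test; in a browser they could come from `performance.now()`.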

Conversation

Assistant audio starts as soon as PCM chunks arrive. Chat history is saved in localStorage.
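Starting playback as soon as chunks arrive usually means scheduling each chunk to begin exactly where the previous one ends. The sketch below computes those start times. It is an illustration only: the 24000 Hz sample rate and the small scheduling lead are assumptions, and `ChunkScheduler` is a hypothetical name.

```typescript
class ChunkScheduler {
  private nextStartSec = 0;
  private sampleRate: number;
  private leadSec: number;

  // Assumption: 24000 Hz output; leadSec is a small safety margin.
  constructor(sampleRate: number = 24000, leadSec: number = 0.02) {
    this.sampleRate = sampleRate;
    this.leadSec = leadSec;
  }

  // Returns the time (in seconds) at which this chunk should start.
  schedule(sampleCount: number, nowSec: number): number {
    // Queue behind earlier chunks, or start right away after an underrun.
    const startSec = Math.max(this.nextStartSec, nowSec + this.leadSec);
    this.nextStartSec = startSec + sampleCount / this.sampleRate;
    return startSec;
  }

  // Forget queued audio, for example when the user interrupts playback.
  reset(): void {
    this.nextStartSec = 0;
  }
}
```

With the Web Audio API, `nowSec` would typically be `AudioContext.currentTime` and the returned value would be passed to `AudioBufferSourceNode.start()`.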

ASR Transcript / Reply Metadata

ASR Transcript
Current ASR Partial
Last Final ASR
Last Reply
Tool used
Emotion
Style
Tone
Accent
Persona
Event Log