Barge-In

Barge-in is the capability that allows callers to interrupt an AI voice agent while it is speaking. When enabled, the system detects the interruption, stops its current output, and begins listening to the caller’s input.

How does barge-in work?

The system continuously monitors for incoming speech even while generating audio output. When voice activity is detected, it signals the text-to-speech engine to stop and switches to listening mode. The interruption is processed as new input, and the conversation continues from there.

Why does barge-in matter?

Without barge-in, callers must wait for the agent to finish speaking before they can respond. This creates frustrating experiences, especially when callers already know what they want or when the agent provides unnecessary information. Barge-in makes conversations feel natural and respectful of the caller’s time.

Barge-in in practice

A caller phones a restaurant reservation line. The AI agent begins explaining hours and location, but the caller interrupts with “I just need to change my reservation for tonight.” The agent immediately stops, acknowledges the request, and proceeds to the modification flow without forcing the caller to listen to irrelevant information.