Call Orchestration

Call orchestration is the coordination of multiple systems and processes during a voice interaction to deliver seamless experiences. It manages the interplay between speech processing, AI reasoning, business logic, and external integrations in real time.

How does call orchestration work?

The orchestration layer acts as a conductor, sequencing operations and managing state throughout the call. When a caller speaks, orchestration coordinates ASR transcription, LLM processing, tool calls to external systems, and TTS response generation. It handles timing, error recovery, and ensures each component receives the context it needs.

Why does call orchestration matter?

Individual components like speech recognition and language models are powerful but insufficient alone. Orchestration turns them into a coherent system that maintains conversation flow, executes actions at the right moments, and recovers gracefully when things go wrong. Poor orchestration creates disjointed experiences even with excellent individual components.

Call orchestration in practice

A caller asks to reschedule an appointment. Orchestration coordinates: ASR captures the request, the LLM identifies reschedule intent, a calendar API retrieves available slots, the LLM formulates options, TTS speaks them, ASR captures the selection, the calendar API books the new time, and TTS confirms. Each handoff happens seamlessly within a natural conversation flow.

For workflow configuration, see the workflow automation documentation.