From Building Voice Agents with the OpenAI Realtime API
Table of Contents
4 of 21 chapters available ยท Premium unlocks the rest
- 1
Legal Notices
- 2
About This Book
- 3
Part I: Foundations
- 4
Chapter 1: Realtime API Orientation
- 5
Chapter 2: Python Environment and Project Setup
- 6
Chapter 3: Realtime Sessions, Event Flow, and History Strategy
- 7
Part II: Core Voice Agents
- 8
Chapter 4: Building the First WebSocket Voice Agent with Live Audio
- 9
Chapter 5: Audio Chunking, Turn Detection, Manual Turns, and Interruption Handling
- 10
Chapter 6: Function Calling, Tool Authorization, and Voice-Safe Inputs
- 11
Part III: Specialized Realtime Models
- 12
Chapter 7: Streaming Transcription with gpt-realtime-whisper
- 13
Chapter 8: Live Translation with gpt-realtime-translate
- 14
Chapter 9: Telephony Integration with SIP Attach
- 15
Part IV: Production Architecture
- 16
Chapter 10: Browser WebRTC Boundaries and Server Sideband Control
- 17
Chapter 11: Observability, Evaluation, Cost Management, and Data Safety
- 18
Chapter 12: Deployment, Scaling, Reconnection, Operational Hardening, and End-to-End Case Study
- 19
Next Steps
- 20
Part V: Review Questions
- 21
Answer Key