How to Build an AI Voice Receptionist That Books Meetings
Architecture for a 24/7 voice agent: Vapi, intent routing, Cal.com booking, and n8n logging — without sounding like a phone tree.
After-hours calls are pure margin leak: prospects call, no one answers, they book with whoever picks up first. An AI voice receptionist fixes that if you design it like an operator, not a gimmick.
Core architecture
- Telephony — Twilio or carrier SIP into Vapi (or similar).
- Listen — VAD + streaming STT with barge-in so callers can interrupt.
- Intent — LLM classifies: book, FAQ, escalate, wrong number.
- Knowledge — vector KB for pricing, hours, services.
- Act — Cal.com availability check + booking, or warm transfer.
- Log — n8n workflow posts transcript + outcome to Slack and CRM.
The AI Voice Receptionist build on my site runs at ~61% booking rate with sub-2-minute average handle time when the KB is tight.
What makes it sound human
- Short responses; one idea per turn.
- Confirm spelling for email and name.
- Explicit escalation path: "Let me get a human on this."
- Latency budget: target < 800ms to first token after caller stops.
Compliance and trust
- Disclose AI at the start where required.
- Record retention policy in your privacy page.
- Never invent availability — always hit the live calendar API.
When to hire vs. build
DIY works for single-location services and founder-led sales. Multi-location, HIPAA, or complex routing needs custom evals and monitoring. Book a call if you want this wired to your stack.