Headline Impact
90%
Reduction in Screening Cost vs. $20/hr Human Tele-Callers
HR Tech
Voice AI
Autonomous Agents
90%
Cost reduction vs. $20/hr human tele-callers
<1.5s
Response latency for natural conversational flow
Autonomous
End-to-end screening interviews without human involvement
The Client
Jiseki — US-based Job Placement Platform
A US-based startup building a job placement platform for immigration services, needing to conduct telephonic screening interviews at scale. At $20/hour for human tele-callers, manual operations couldn't scale.
The Challenge
$20/hr Tele-Callers Made Scaling Prohibitively Expensive
Human tele-callers at $20/hour made scaling prohibitively expensive. The platform needed to screen candidates across multiple roles, gathering work history, qualifications, and stress tolerance — but the AI had to feel conversational, not robotic.
Key technical challenges: handling user interruptions mid-sentence, keeping response latency under 2 seconds, and managing varying speaking speeds.
What We Built
Autonomous Voice AI Screening Agent
We built an end-to-end voice AI agent that conducts fully autonomous screening interviews over the phone — with natural conversational flow and sub-1.5-second response latency.
1. Speech-to-Speech Pipeline
End-to-end voice AI: Deepgram for speech-to-text, GPT-4o mini for reasoning, Amazon Polly for text-to-speech, all orchestrated through Twilio telephony.
2. Latency Optimization
Chunked sentence processing, hardcoded responses for common patterns, OpenAI socket streaming, and intelligent pause detection — bringing response time below 1.5 seconds.
3. Interruption Handling
Natural conversation flow that gracefully handles when candidates interrupt, speak over the AI, or pause unexpectedly.
4. Cost Optimization
Lighter models and decision-tree audio responses for predictable conversation segments, minimizing API costs per call.
5. Monitoring & Analytics
Real-time call monitoring via alan.app, tracking conversation quality, completion rates, and candidate scoring accuracy.
Technology
Powered By
Deepgram STT
GPT-4o Mini
Amazon Polly TTS
Twilio
Socket Streaming
Intelligent Pause Detection
The Results
90% Cost Reduction with Fully Autonomous Screening
Reduced screening costs by approximately 90% compared to human tele-callers. The AI agent conducts fully autonomous screening interviews, evaluating work history and qualifications with sub-1.5-second response latency that feels natural and conversational.
"Tele-calling at $20 per hour was proving prohibitively expensive to expand operations. The AI agent gave us scale we couldn't have achieved otherwise."
— Jiseki
Ready to Transform Your Operations?
We've delivered $100M+ in business impact across IT services, healthcare, HR tech, and fintech.
Book a Scoping Call