The Short Answer
Vapi advertises a $0.05 per minute platform fee. The real per-minute cost of running a Vapi agent in production is typically $0.13 to $0.28 per minute once you add LLM inference, speech-to-text, text-to-speech, telephony carriage, and the operational tax of retries and tool calls.
Whether that is "expensive" depends on what you are comparing it to. Compared to a human receptionist at $15 to $25 per hour, it is dramatically cheaper. Compared to a managed AI receptionist with flat-rate pricing (like OnCallClerk at $29 to $99 per month for unlimited calls), it usually costs more once volume passes a few hundred minutes per month, and it transfers the operational burden onto you.
This is a Vapi-specific breakdown. For broader voice AI economics, see why voice AI costs more than expected and the cheapest way to run a voice AI agent.
Vapi's Pricing Model: What You Are Actually Paying For
Vapi is a developer platform for building AI voice agents. Unlike a managed service that bundles everything into one bill, Vapi is a thin orchestration layer between your phone calls and the underlying AI providers. You pay Vapi, and you also pay every model provider underneath.
Their published pricing breaks into five separate components:
| Component | What It Covers | Typical Per-Minute Cost |
|---|---|---|
| Vapi platform fee | Orchestration, dashboard, call routing | $0.05 |
| LLM (Large Language Model) | The "brain" generating responses | $0.03 to $0.12 |
| STT (Speech-to-Text) | Transcribing the caller in real time | $0.01 to $0.04 |
| TTS (Text-to-Speech) | Speaking the AI's response aloud | $0.02 to $0.10 |
| Telephony | Carrier minutes via Twilio or Vonage | $0.013 to $0.05 |
These add up. Vapi exposes the line items honestly, but builders evaluating the platform on the $0.05 headline number routinely under-budget by 3x to 5x.
A Real Per-Minute Cost Example
Here is what a typical inbound customer service Vapi agent actually costs per minute of conversation in 2026. Numbers come from published Vapi documentation and provider price lists.
Source: Published Vapi, OpenAI, Deepgram, ElevenLabs, and Twilio rate cards
Translating that chart into dollars: a one-minute call typically costs $0.155 to $0.18 in raw provider charges before any margin or retry overhead. A five-minute call costs $0.78 to $0.90. A 30-minute call (uncommon but not rare for support use cases) costs $4.65 to $5.40.
For 10,000 minutes per month, that is $1,550 to $1,800 in provider costs before any operational work, monitoring, or engineering time.
What Pushes the Real Cost Higher
The clean per-minute math above assumes a perfect world: one LLM call per turn, zero retries, no tool calls, and a modest TTS provider. Production agents are messier.
Tool calls and function execution
Every time the agent looks up a customer in your CRM, checks the calendar for availability, or queries a knowledge base, that is an additional LLM call. A 3-minute booking conversation might trigger 4 to 8 tool calls, each consuming tokens for both the request and the response. Builders routinely see LLM bills 50% to 100% higher than the per-minute estimates when tool use is heavy.
Premium TTS voices
ElevenLabs Turbo v2 lands around $0.06 to $0.10 per minute of speech. Their flagship voices (used for emotional range and natural prosody) cost more. Many builders start with a budget TTS like Cartesia or PlayHT, hate the audio quality, and switch to ElevenLabs anyway, doubling that line item.
Background noise and interruption handling
Real callers cough, talk over the agent, and call from cars with bad cell signal. Vapi exposes endpointing controls and interruption handling, but the more aggressive your VAD (voice activity detection) settings, the more ASR cycles you trigger. This is invisible until your provider bills arrive.
Failed calls still bill
A call that disconnects after 8 seconds because the caller could not hear the agent still incurs the platform fee, the connection fee, and however many ASR cycles ran. At scale this represents 3% to 8% of total spend on calls that produced no business value.
Twilio surge pricing
Carrier rates are not flat. Toll-free inbound, international caller-ID handling, and certain destination numbers carry surcharges. Vapi passes these through. Operators see line items they did not anticipate, especially when scaling beyond US/Canada coverage.
A Vapi agent that costs $0.16 per minute on paper often delivers a real bill closer to $0.24 to $0.32 per minute once tool calls, retries, premium voices, and carrier surcharges are included. Plan for the upper end.
When Vapi Is the Right Cost Profile
Vapi is a strong economic choice when:
- You have specialized requirements that justify building from primitives (custom data integrations, proprietary models, regulated industry compliance).
- Your call volume is large enough that bundled-pricing services would charge per-minute overage rates above your raw provider costs.
- You have engineering capacity to monitor, optimize, and maintain the stack. The hidden cost of Vapi is engineering time, not the bill.
- You need full control over the LLM, the prompts, the voice, the ASR provider, and how each one is configured.
Companies running thousands of minutes per day of structured outbound dialing, large support deployments, or specialized vertical agents (legal intake, healthcare triage with proper compliance) tend to land cheaper on Vapi than on a managed service.
When Vapi Is the Wrong Cost Profile
Vapi is usually the wrong choice when:
- You are a solo operator or small business answering inbound calls. The engineering overhead of running a custom voice agent will dwarf the Vapi bill itself.
- You want predictable monthly costs. Per-minute pricing makes budgeting volatile, especially during seasonal call surges.
- Your call volume is under 1,000 minutes per month. At low volumes, managed services with flat fees beat per-minute pricing every time.
- You do not have an engineer dedicated to maintaining the agent. Voice AI breaks in subtle ways: latency drift, prompt regression, ASR provider outages, model deprecations. Someone has to own it.
For most plumbers, electricians, junk removal companies, dental offices, and law firms (the majority of small-business buyers of AI phone answering), Vapi is the wrong tool for the job. It is a brilliant developer platform aimed at a different buyer.
Vapi vs OnCallClerk vs Building from Scratch
| Vapi | OnCallClerk | DIY (Twilio + OpenAI) | |
|---|---|---|---|
| Setup time | Days to weeks | 10 minutes | Months |
| Engineering required | Yes (one full-time minimum at scale) | None | Yes (full team) |
| Per-minute cost | $0.13 to $0.28 | $0 (flat plan) | $0.10 to $0.20 |
| Monthly minimum | $0 (pay as you go) | $29 | $0 |
| 1,000 min/mo total | $130 to $280 | $29 to $99 | $100 to $200 + engineering time |
| 10,000 min/mo total | $1,300 to $2,800 | $99 to $499 (custom) | $1,000 to $2,000 + engineering time |
| Best for | Developers building products | Small businesses answering inbound calls | Companies with platform requirements |
For a deeper feature comparison, see the OnCallClerk vs Vapi alternative page.
What Vapi Does Well
To be clear about strengths: Vapi is genuinely good at what it does.
- Latency. End-to-end latency on a well-tuned Vapi agent is among the lowest available, often under 800ms first-token-to-speech. See how to build low-latency AI phone agents.
- Provider flexibility. You can swap LLMs (OpenAI, Anthropic, Google, open-source), ASR (Deepgram, AssemblyAI, OpenAI Whisper), and TTS (ElevenLabs, Cartesia, PlayHT) without rewriting your application.
- Developer experience. The dashboard, logs, and call recording tooling are best-in-class.
- Honest pricing. Vapi exposes the underlying provider costs rather than hiding them. That transparency is rare in this space.
The "is Vapi expensive" question is not really about whether Vapi is well-priced, it is about whether the cost structure fits your use case.
Frequently Asked Questions
What is the cheapest way to run a Vapi agent?
Use a budget LLM (GPT-4o-mini, Claude Haiku, or open-source via Together AI), a budget TTS (Cartesia or PlayHT), and Deepgram Nova for ASR. Tune your prompts to minimize token usage and avoid unnecessary tool calls. This stack lands closer to $0.10 per minute. Quality drops noticeably compared to a GPT-4o + ElevenLabs setup, but for high-volume routine calls it is workable. See the cheapest way to run a voice AI agent for a deeper breakdown.
Does Vapi have hidden fees?
No, Vapi is unusually transparent about its pricing model. The $0.05 platform fee plus pass-through provider costs is honest. The "hidden" costs are not Vapi's, they are the underlying providers (OpenAI tokens, ElevenLabs character costs, Twilio carrier minutes). Most builders are surprised because they did not budget for the full stack, not because Vapi misrepresented anything.
How does Vapi compare to Retell AI on price?
Retell AI uses similar pass-through pricing for provider costs but bundles its platform fee differently. In practice, Retell and Vapi land within about 10% of each other on raw per-minute cost for comparable configurations. The bigger difference is in developer experience and feature set, not price.
Is Vapi cheaper than Twilio Voice + OpenAI directly?
Not usually. A direct Twilio + OpenAI stack can be 20% to 40% cheaper per minute, but you absorb the engineering cost of building Vapi's orchestration layer yourself: barge-in handling, endpointing, function calling, transcript management, recording, and dashboards. For small teams the time cost easily exceeds the platform fee savings.
What does a typical monthly bill look like for a small business on Vapi?
For an inbound receptionist running 500 calls per month at an average of 3 minutes each (1,500 minutes total), expect $200 to $420 in monthly provider costs, plus 10 to 20 hours of engineering or operations time keeping the agent tuned. Compared to a flat $29 to $99 managed service, this is rarely the right economic choice for an SMB. See the OnCallClerk vs Vapi alternative comparison for a side-by-side.
Does the cost go down with volume?
Provider costs are largely flat per minute regardless of volume. You can negotiate enterprise rates with OpenAI, ElevenLabs, and Twilio at high volume, which can compress 20% to 30% of the bill. Vapi's own platform fee does not scale down meaningfully. For most operators, the volume break-even point versus a managed service is around 1,500 to 3,000 minutes per month, after which Vapi's flexibility becomes worth the operational tax.
Keep Reading
- Why Voice AI Costs More Than Expected - The general cost decomposition
- The Cheapest Way to Run a Voice AI Agent - Optimization strategies
- How to Build a Low-Latency AI Phone Agent - Engineering deep dive
- OnCallClerk vs Vapi - Direct comparison
- How AI Voice Agents Understand Humans - The technology underneath
