How much does an AI agent cost? Real numbers from real builds
The headline numbers
What we charge, by engagement type:
| Engagement | Duration | Investment |
|---|---|---|
| Discovery sprint (workflow map, spec, evals, scope) | 1–2 weeks | €4,000–8,000 |
| Working prototype on real data | 2–3 weeks | €12,000–25,000 |
| Production agent (single-purpose) | 6–10 weeks | €25,000–50,000 |
| Production agent (multi-channel / multi-purpose) | 10–16 weeks | €50,000–120,000 |
| Multi-agent platform | 16–24 weeks | €120,000–250,000 |
| Ongoing retainer | Monthly | from €1,500/month |
These are real ranges from real engagements, not pitch-deck numbers. Where you land in each range depends on integration complexity, document/data variety, and how clean your existing systems are.
What drives cost
Five factors, ranked by impact:
1. Integration count and complexity
The single biggest cost driver. An agent that talks to one system (your own database) is cheap. An agent that talks to NetSuite + Salesforce + Microsoft 365 + a custom data warehouse + your CRM is expensive — because each integration needs auth, schema validation, retry, idempotency, observability.
Budget rule of thumb: each meaningful external integration adds €5,000–€15,000 to the build, depending on whether it's a well-documented modern API or a legacy system.
2. Data variety
A document agent processing invoices from 3 vendors with similar layouts is cheap. The same agent processing invoices from 200 vendors with wildly varied layouts costs more — because the eval suite, the schema, and the reviewer queue all scale with variety.
Budget rule: add ~20% to the document agent build if vendor / format variety is high.
3. Compliance and auditability
Regulated industries (healthcare, finance, legal, EU public sector) add cost — data residency requirements, audit trail granularity, approval gates, encrypted-at-rest, role-based access. None of these are technically hard; they're each an engineering tax.
Budget rule: add 15–30% for heavily regulated environments.
4. Channel count
Text-only agents are cheaper than agents that also do voice. Add a voice surface and you add Twilio integration, realtime model usage, voice-specific evals, recording compliance. Add a separate mobile surface and you add another platform's build effort.
Budget rule: each new channel adds €15,000–€30,000.
5. Production-grade rigor
The difference between "it works in the demo" and "it's been running for 6 months without a Sunday-evening outage." Production rigor includes: real evals, real observability, real on-call runbooks, real load testing, real failover paths.
We always build production-grade. Some shops sell "MVPs" that skip this. Their MVPs cost less; their MVPs also break.
Per-call infrastructure cost
The other half of the cost story is what each agent invocation costs in production. Honest numbers from our deployments:
| Agent type | Per-invocation cost |
|---|---|
| Conversational agent (single query) | €0.001–€0.01 |
| Document processing (per invoice) | €0.01–€0.05 |
| Voice agent (per minute) | €0.10–€0.15 |
| Voice agent (per call, 90 sec average) | €0.12–€0.40 |
| Research / synthesis (per dossier) | €0.50–€3.00 |
| Workflow orchestrator (per run) | €0.001–€0.10 |
Plus the operational overhead — observability tooling (€100–€500/month at typical volumes), hosting (€50–€500/month), and minor fixed costs.
The retainer question
After the build, you have three options:
- In-house: take the code and the docs, run it yourselves. Many clients prefer this. We hand over runbooks, eval suites, and CI/CD configured to your team.
- Light retainer: we handle evals, prompt tuning, model upgrades, occasional firefighting. €1,500–€2,500/month.
- Full retainer: we operate it. Includes on-call, ongoing improvements, monthly business review. €3,500–€5,000/month and up.
Roughly 60% of our clients take option 2 or 3 because operating production AI agents is its own job. The rest take it in-house.
What's NOT included
A few things we explicitly don't bundle into the build price:
- LLM API costs (Anthropic, OpenAI, Google). These pass through directly. We help you set spend caps and surface costs on the dashboard.
- Twilio / Telnyx voice charges. Per-minute pass-through.
- Vector store hosting (if you use a managed one like Pinecone). Usually trivial; sometimes €50–€500/month at scale.
- Custom model training. We rarely recommend it; if you need it, separate engagement.
- Hardware / on-prem infrastructure. We design for cloud; if you need on-prem, scope changes.
Common pricing traps
"Cheap" agencies that don't include evals
Some shops sell €15,000 "AI agent MVPs" that ship without an eval suite, observability, or proper guardrails. The agent works in the demo and quietly degrades in production. Three months later you pay someone else 2× the price to do it properly.
Fixed-price quotes before discovery
Anyone quoting a fixed price for a production AI agent from a one-hour intro call is either over-quoting (to cover the unknown) or under-quoting (and will discover scope mid-build). Insist on a paid discovery sprint as the first engagement.
"Per-agent" SaaS pricing for unique workflows
Some platforms charge per agent or per AI workflow on a SaaS basis. Fine for generic use cases. Expensive for unique ones. Math out the 24-month cost vs custom-build before committing to a multi-year SaaS contract.
Hidden vendor lock-in
The agent works great with Vendor X's tools. Migrating off costs €40,000. This shows up most often with low-code platforms (Power Automate at extreme scale, Microsoft AI Studio with proprietary connectors). Worth verifying portability up front.
The buyer checklist
Before you sign anything, confirm:
- Scope was defined in a paid discovery sprint, not a free pitch call.
- Eval suite is part of the build, not an add-on.
- Observability is part of the build, not an add-on.
- Code is yours, in your git repo, on day one.
- Provider abstraction exists — you can swap LLM providers without a rewrite.
- Approval gates exist on every irreversible action.
- Cost ceilings are configurable per agent.
- Handover is built into the engagement — runbooks, eval suites, CI/CD configured for your team.
If any of these are missing, ask why.
Where to go next
For the broader hiring-an-agency checklist, see 12 questions to ask when hiring an AI agency. For our internal playbook on how we ship efficiently, see The AI Development playbook.
If you have a project in mind and want a fast cost take, drop us a note. One paragraph is enough.
Frequently asked questions
Keep reading
Hiring an AI development agency: 12 questions to ask
Twelve questions that separate serious AI dev shops from demoware vendors. Asks about evals, observability, code ownership, provider lock-in, references, and what they'll refuse to do. If a vendor can't answer cleanly, walk.
AI agents vs automation: which one do you actually need?
Use plain automation when the rules are deterministic — same inputs, same outputs, no judgment required. Use AI agents when inputs are unstructured (PDFs, emails, voice) or each instance needs a decision. Most production systems mix both: automation moves the predictable steps, an agent handles the messy ones.
What is an AI agent? The full breakdown
An AI agent is a system that turns a goal into a sequence of tool calls. Where a chatbot answers questions, an agent completes jobs. It plans steps, picks tools, executes them, recovers from failures, and either finishes the task or hands off to a human. The defining ingredients are a goal, retrieval, tools, guardrails, evals, and observability.
AI Agents Development
Custom agents that read documents, hold conversations, take phone calls, and execute multi-step workflows — wired into the systems you already run.
Want this delivered in your stack?
If the article describes a workflow you'd like to ship, drop us a note. We reply within one business day.