Gradient Labs: what it is, how Otto works, and who it's for

Written by

Alicia Kirana Utomo

Reviewed by

Katelin Teen

Last edited June 25, 2026

Expert Verified

Gradient Labs hero banner, AI customer operations for financial services

TL;DR

Gradient Labs builds an AI customer operations agent called Otto, purpose-built for regulated financial services: banks, fintechs, lenders, and insurers. It is unusually deep for this category. Instead of deflecting tickets to a help center, it resolves whole workflows end to end, disputes, KYC, collections, lending, and customer service, and it reports an 80 to 90 percent peak resolution rate with CSAT that sometimes beats human teams.

The catch is in the positioning. Gradient Labs is for financial services and almost nothing else, and there is no public price; everything runs through a sales conversation. If you run a bank or a lender and you care about FCA Consumer Duty and chargeback filing, that focus is exactly what you want.

If you run ecommerce, SaaS, or general support, you are not its target customer, and you will get further with a horizontal agent that drops onto your existing helpdesk, shows you a price, and lets you test it before you commit. That is where something like eesel fits: it trains on your past tickets and help center, runs on Zendesk or Freshdesk in minutes, and bills a flat fee per interaction. More on both below.

I have spent the last few years putting AI agents on live support queues, so I went through Gradient Labs the way I would evaluate it for a real rollout. Here is what it actually is, how Otto works under the hood, and who should pay attention.

What Gradient Labs is

Most AI support tools are horizontal. They sit on any helpdesk, answer questions from a knowledge base, and aim to deflect the easy stuff. Gradient Labs went the other way. It is vertical and deep: AI-native customer operations for financial services, with a tagline that tells you the whole strategy, "we handle what others hand off."

The company was founded in 2023 in London by three ex-Monzo people: Dimitri Masin, who led data and ML there, alongside Neal Lathia and chief scientist Danai Antoniou. That origin matters. They spent years trying to automate bank support from the inside, and built Gradient Labs to do the thing they could not quite reach at Monzo. As Masin put it in an Unite.AI interview, "we weren't building co-pilots, we were building end-to-end automation of customer support."

Investors have backed that thesis. The company raised a $26M Series A, which started as $13M led by Redpoint in mid-2025 and then doubled with another $13M led by Octopus Ventures and CommerzVentures around June 2026. Gradient Labs says its agents now reach more than 32 million end users, with revenue up roughly 10x year over year.

Gradient Labs' procedure builder and live call view, as shown in its OpenAI case study

That screenshot is a good summary of the product's personality. On the left is a written procedure ("fraud impersonation callback") with numbered steps the agent must follow; on the right is a live call where the agent verifies identity and sends a verification code. This is not a chatbot answering FAQs. It is an agent executing a regulated process.

How Otto actually works

This is the part I find genuinely interesting, because it is where most "AI agent" products are thin and Gradient Labs is not.

Under the hood, Otto is built as a state machine: a central reasoning agent orchestrates a set of specialized skills, and a case can move across workflows without losing context. Some states are deterministic (do exactly this) and some are agentic (reason about what to do next). It is a hybrid, multi-model system that uses larger models for the hard reasoning steps and smaller, faster models for the deterministic ones, drawing on the latest from OpenAI, Anthropic, and Google rather than betting on a single provider.

How one Gradient Labs case flows from a customer message through the reasoning agent to an action

Two design choices stand out.

First, it is trained on your SOPs and your best agents, not just your help center. Gradient Labs makes the argument, correctly in my experience, that your knowledge base is not enough on its own; the real know-how lives in how your best people handle hard conversations. You author procedures in plain language, which is about as close to "support automation as writing a document" as anyone has gotten.

A natural-language procedure in Gradient Labs, with resources, channels, and step actions, as taken from Gradient Labs

Second, it reasons and takes actions, it does not just answer. When a customer gets in touch, Otto remembers past conversations and does the real work to resolve the issue: freezing a lost card, tracking down a missing payment, filing a chargeback with the card scheme. That is the difference between deflection and resolution, and it is the line Gradient Labs keeps drawing. If you want the background on why this distinction matters, we wrote about the gap between an AI agent vs a rule-based chatbot separately.

There is a real trade-off Masin is refreshingly honest about: the agent is slower because it thinks more. "Our median response time might be 15-20 seconds," he told Unite.AI, "but for financial institutions, that's a fair trade." For a bank, a correct answer in 20 seconds beats a wrong one in two. For an impatient ecommerce shopper, maybe not.

The specialist agents

Rather than one generalist bot, Gradient Labs ships a suite of agents, each tuned to a finance workflow and reporting its own live stats.

Gradient Labs' specialist agents grid showing resolution and goal-completion rates, as taken from Gradient Labs

The lineup covers the moments that actually cost a finance team money and risk:

Customer service resolves issues across chat, email, and voice.
Disputes gathers and verifies evidence, decides an outcome, escalates to a human for sign-off, and files the chargeback.
KYC and KYB check documents against policy, request what is missing, and verify a customer or business before onboarding.
Collections contacts borrowers in arrears at the right time, personalizes from account history, and secures a promise to pay, including over AI voice.
The newer lending agent automates the whole borrower journey, from application through collections.

Each one is doing real back-office work, not just first-line chat. That is rare, and it is the strongest argument for the product.

Guardrails and compliance

If you are automating support in finance, this section is the one that decides the purchase, and it is where Gradient Labs has clearly spent its effort.

Otto runs 20+ financial-services guardrails on every single turn of a conversation. They check for things like financial-advice detection, customer vulnerability signals, complaints, prompt injection, and attempts to bypass identity verification, and they keep the conversation inside the defined procedure.

Gradient Labs' guardrail panel showing customer and agent guardrails passing on a conversation, as taken from Gradient Labs

On the compliance side it is pre-configured for FCA Consumer Duty, CONC, Reg E, Reg Z, PSD2, GDPR, and the EU AI Act, with automated QA on conversations. It is SOC 2 Type 2 certified with SSO, audit logs, and role-based permissions.

Chief scientist Danai Antoniou framed the stakes well in the OpenAI case study: "you have to architect from the ground up for no hallucinations. That needs to be the guiding principle as you're building." In finance, she added, the gap between a resolved call and a missed signal "is the difference between resolving a call and creating a compliance incident." The fear of an AI confidently giving a wrong answer is the single biggest objection I hear from regulated teams, and controlling hallucinations in support really is the whole game here.

Ask a Human keeps a person in the loop

Gradient Labs' answer to "what about the cases the AI shouldn't decide alone" is a feature called Ask a Human. The agent keeps chatting with the customer while a human reviewer works the sensitive part behind the scenes, so there is no queue, no second thread, and no asking the customer to repeat themselves.

A nice example: for a disputed transaction over a set threshold (say £1,000), policy requires human review. The agent verifies the account, confirms the charge, asks why it is disputed, then routes a summary to a reviewer over Slack, tells the customer a colleague is looking into it, and keeps the conversation going until the human approves. This is a smarter design than the usual "AI gives up and dumps you in a queue" escalation, and the results back it up.

Zego testimonial: Ask a Human cut human handling time from 12 minutes to just over 3 while lifting CSAT, as taken from Gradient Labs

UK insurer Zego, with around 150,000 active customers, reports that Ask a Human cut human handling time from 12 minutes to just over 3, a drop of more than 70 percent, while CSAT went up.

Voice and channels

One thing that genuinely sets Gradient Labs apart is running AI voice in production at scale, not just text. Otto handles chat, email, and voice as one agent across every channel.

Gradient Labs handles chat, voice, and email through one agent, as taken from Gradient Labs

The proof point here is SteadyPay, which runs roughly 33,000 AI voice calls a month for outbound collections, with about a 60 percent success rate among engaged customers. Voice at that volume in a regulated workflow is hard, and most "AI agent" vendors do not attempt it.

Pricing: outcomes-based, but you have to call

Here is the honest version: Gradient Labs publishes no price at all. The pricing page is a lead form. What it does publish is the model, and the model is interesting.

From the company's own FAQ: you have "a simple outcomes based pricing model without platform fees, where you pay only for successful query resolutions." No per-seat charge, no per-conversation charge, no base subscription. You pay when Otto actually resolves something.

Three ways AI support gets priced: per seat, per conversation, or per resolution

Masin has been pointed about why, telling The Register: "if we don't resolve the issue and you still need to get your human team involved, then you don't need to pay us." He has also taken a direct shot at per-conversation pricing, arguing that charging a flat fee per chat "doesn't create any incentive" for the vendor to make the agent better. I think he is right about the incentive, and outcomes-based pricing is genuinely buyer-friendly when the unit of value is clear.

Here is what is and is not public:

Pricing question	Gradient Labs
Public rate card	None; quote only
Billing unit	Per successful resolution
Platform / base fee	None stated
Per-seat fee	None
Free trial	None advertised
Self-serve signup	No, sales conversation required
Day-one start	20 to 50 percent of easy queries on Zendesk or Freshdesk with no integration
Deeper rollout	Requires defining procedures and integrating data, run as a joint project
API access	Yes, via sales

The thing to weigh is the total cost of ownership versus the speed of getting started. Outcomes-based pricing is attractive, but "talk to sales, scope a project, integrate data" is a months-long motion, not a same-afternoon one. If you want to understand the broader AI agent vs human agent cost math before any sales call, that is worth reading first.

Is Gradient Labs right for you?

The fit question here is unusually binary, so I built a quick way to check where you land. Pick the row that sounds like you.

What real customers say

I want to be straight about the evidence here. There is no real third-party review surface for Gradient Labs yet: no G2 or Capterra scores, no Reddit pile-ons. The Capterra listing exists but carries no reviews. So the numbers below are vendor-stated or from partner case studies, and you should treat them as such, strong, but not independently audited.

That said, the first-party proof is specific, which I respect more than round marketing numbers:

Plum customer result: 98.6% QA score and 80% CSAT, as taken from Gradient Labs

Pockit reports a 70% resolution rate and 80% CSAT, with a stated goal of 100% automation.
Plum hit a 98.6% QA score and set up in about 30 minutes with no engineering effort.
Morse (Sling Money) saw 50% day-one resolution during a growth spike.

"Otto, Gradient Labs' AI agent, has been a game-changer for us. With a 98% CSAT, it delivers superb customer experiences. We especially value how closely Otto matches our tone of voice."
MC Glover, VP of Strategy & Operations, via Gradient Labs

The end-user reactions Gradient Labs quotes are the real tell. A Plum customer wrote, "are you virtual? I thought you were a human," and a Morse customer said, "I did actually think you were a person for the whole conversation." When customers cannot tell, that is the bar.

Where Gradient Labs fits, and where it doesn't

Let me be clear, because I think Gradient Labs is a genuinely good product that most teams should still not buy.

If you run customer operations at a bank, a lender, a neobank, or an insurer, this is one of the most serious tools in the market. The depth on disputes, KYC, collections, and lending, the per-turn guardrails, the SOC 2 and FCA coverage, and real voice at scale add up to something a general-purpose agent cannot match. The outcomes-based pricing is buyer-friendly, and the team has clearly lived this problem, which makes the usual build vs buy question easy to answer in their favor.

If you are anyone else, the same focus works against you. The compliance machinery is overhead you will not use, there is no public price, there is no free trial, and getting to the good resolution rates means a scoped project with the vendor. That is a fine motion for an enterprise finance buyer and a slow one for a 12-person SaaS or ecommerce team that just wants tier-1 deflection handled this quarter.

Two ways to buy AI for support: vertical and enterprise, or horizontal and self-serve

The deeper point is that there are two ways to buy AI for support right now. One is vertical and enterprise: pick a tool built for your exact industry, talk to sales, and run a project. The other is horizontal and self-serve: drop an agent onto the helpdesk you already use, train it on your own tickets, and turn it on. Gradient Labs is the best-known example of the first. The second is where most teams actually live.

Try eesel for everyone outside finance

If you read the fit checker above and landed on "not finance," this is the part for you. eesel is the horizontal version of the idea Gradient Labs proved in banking: an AI agent that resolves real tickets, only built to drop onto the helpdesk you already run.

The differences are the ones that matter when you are not a bank. eesel works like a new hire that plugs into Zendesk or Freshdesk in a few minutes, trains on your past tickets and help center so it sounds like your team from day one, and lets you simulate it against thousands of historical tickets before a single reply goes to a customer. Pricing is public and usage-based, a flat fee per AI interaction with no per-resolution surcharge and no per-seat fee, so you can do the cost math yourself before you ever book a call. And because the biggest worry is always an AI replying when it should not, you control exactly which tickets it handles and route the rest to a person, which is the trust-and-control lever most teams ask for first.

The proof shows up fast precisely because there is no scoped project to wait through. As Kim Simpson at Gridwise put it, "in the first month, eesel is resolving 73% of our tier 1 requests," after seeing results during a 7-day trial. You can try eesel free and run that same simulation on your own tickets today.

Frequently asked questions

What is Gradient Labs?

Gradient Labs is an AI customer operations company whose agent, Otto, resolves support and back-office work for regulated financial services: banks, fintechs, lenders, and insurers. It covers customer service, disputes, KYC, collections, and lending end to end rather than just deflecting tickets. If you are not in financial services, a horizontal AI for customer service tool is usually a closer fit.

How much does Gradient Labs cost?

Gradient Labs does not publish a public price. It uses an outcomes-based model where you pay only for successful resolutions, with no platform fees and no per-seat charges, and the actual rate is set in a sales conversation. If you want to see a rate and start without talking to sales, a usage-priced option like eesel bills a flat fee per AI interaction with no per-resolution surcharge.

Is Gradient Labs only for financial services?

Yes, in practice. Its agents, guardrails, and compliance work (FCA, Reg E, PSD2, the EU AI Act) are all built around regulated finance. For ecommerce, SaaS, or general support, you will get more out of a tool aimed at those teams, such as one of these AI support agents.

What resolution rate does Gradient Labs achieve?

Gradient Labs reports 40 to 60 percent of queries resolved out of the box and 80 to 90 percent after a few months of tuning, with most deployments starting above 50 percent on day one (these are vendor-stated figures). Whatever tool you pick, it helps to understand how resolution rate is actually measured before you compare numbers.

How does Gradient Labs handle accuracy and compliance?

It runs 20+ financial-services guardrails on every turn (financial-advice detection, vulnerability signals, complaint detection, verification-bypass attempts) and is SOC 2 Type 2 certified. Controlling hallucinations and routing low-confidence cases to a human is the core safety problem for any AI support agent in a regulated space.

What are the best Gradient Labs alternatives?

If you are in regulated finance and want full back-office automation, Gradient Labs is hard to beat on focus. If you want an agent that works on your existing helpdesk, shows transparent pricing, and lets you test it yourself first, look at horizontal options in this list of AI support agents, including eesel.

Hire your AI teammate

Set up in minutes. No credit card required.

Try for free Book a demo

Share this article

Article by

Alicia Kirana Utomo

Kira is a writer at eesel AI with a Computer Science background and over a year of hands-on experience evaluating AI-powered customer service tools. She focuses on breaking down how helpdesk platforms and AI agents actually work so that support teams can make better buying decisions.