
Let’s be honest, artificial intelligence is popping up everywhere in the financial world. It promises to do everything from analyzing markets at lightning speed to running customer support that’s always on. But in finance, the stakes are just plain higher. One wrong answer isn't just a minor hiccup; it can turn into a compliance headache, a security threat, or a mistake that costs real money.
This is where AI testing, or benchmarking, is supposed to help. The big problem? Most AI benchmarks test general knowledge. They’re like a high school pop quiz, checking if an AI knows historical facts or can write a poem. That’s neat, but it tells you absolutely nothing about whether it can handle the dense jargon, numerical reasoning, and strict rules that define the financial industry.
This guide is here to clear up the confusion around Fin AI Benchmarking. We'll break down what it really is, walk through the major frameworks everyone’s talking about, and show you how to look past the shiny theoretical scores to find an AI that actually gets the job done for your business.
What is Fin AI Benchmarking?
Fin AI Benchmarking is just a formal way of saying you’re systematically testing AI models on finance-specific jobs to see how they perform. It’s about creating a standardized report card to compare how different AI systems measure up.
But there’s a key difference you need to get your head around, because it completely changes how you should be thinking about choosing an AI tool:
- Foundational Model Benchmarking: Think of this as an academic exam for the AI model itself. Researchers use standard financial datasets to test the raw intelligence of large language models (LLMs) like GPT-4 or Llama 3. The scores tell you which model is "smarter" in a sterile lab environment.
- Applied Agent Benchmarking: This is the real-world driving test. It checks how a fully integrated AI application, like an AI agent sitting inside your helpdesk, performs on the business metrics you actually care about. We’re talking resolution rates, accuracy on your company’s documents, and whether customers are happy.
So, why does this matter? A model that aces a theoretical finance exam won't have a clue how to handle a customer asking about your company’s unique refund policy. Those foundational scores are a decent starting point, but the only test that truly counts is how an AI performs in your world, using your knowledge, and plugged into your workflows.
The landscape of Fin AI Benchmarking frameworks
A few big projects are trying to standardize how the industry measures AI performance. They’re a mix of open-source academic efforts and pricey enterprise solutions, and each has a different goal. Knowing what they are helps you see where things are headed, but it also highlights their limitations for your day-to-day business needs.
FinBen: The open-source academic benchmark
FinBen is a massive benchmark put together by a group of researchers known as The Fin AI. It’s built to test LLMs on dozens of financial tasks, from analyzing the sentiment of news articles to predicting market trends. It's incredibly detailed and completely transparent.
So, who is this really for? Mostly AI researchers and developers who want to compare the raw brainpower of different foundational models on financial data. The catch for your business is that it’s highly academic. A high score on FinBen means a model is good at sifting through generic financial documents, but that says nothing about how it’ll fare as a support agent trying to answer a question about a specific invoice.
S&P AI Benchmarks by Kensho: The proprietary industry standard
Coming from one of the biggest names in finance, S&P AI Benchmarks by Kensho is a commercial product that ranks LLMs on their math skills and financial smarts. It’s designed to see if an AI can perform at the level of a human financial analyst.
This is a great fit for huge financial institutions that need a trusted, third-party stamp of approval on a model before using it for high-stakes analysis. The downside for most businesses is its focus. It's all about complex market analysis, not the practical, high-volume work of customer service or internal IT support that most of us are actually trying to automate.
Vals.ai Finance Agent: The agent-focused evaluator
Vals.ai does things a bit differently. Instead of just testing the model, it tests AI agents: systems that can use tools to get things done. Their benchmark looks at how well an agent can do the job of an entry-level analyst, like digging through SEC filings to find a specific piece of information.
This is aimed at teams at hedge funds or banks building or buying AI agents for complicated, multi-step research. But once again, it’s geared toward sophisticated financial analysis. The tasks it measures (like parsing a 10-K report) are a world away from the everyday support questions that most companies deal with.
FINOS: The collaborative compliance framework
The Fintech Open Source Foundation (FINOS) isn't really a benchmark. It’s more of a group project to build a shared framework for handling AI risk, trust, and compliance. It’s all about creating the guardrails to make sure AI is adopted safely in the industry.
This is perfect for the compliance, risk, and legal folks at financial institutions who need to set up internal rules for using AI responsibly. The limitation for your business is that FINOS gives you principles and categories, not a tool you can plug in to measure your AI chatbot's resolution rate today. It’s about the rules of the game, not the score.
Here’s a quick rundown of how they stack up:
| Framework | Primary Focus | Best For | Type | Key Limitation for Support Teams |
|---|---|---|---|---|
| FinBen | Foundational LLM capabilities | AI Researchers | Open-Source | Too academic, doesn't reflect real-world agent performance. |
| S&P Kensho | Quantitative reasoning | Financial Analysts | Proprietary | Focused on market analysis, not customer support workflows. |
| Vals.ai | Agentic research tasks | Hedge Funds, Banks | Proprietary | Geared towards complex analyst tasks, not high-volume support. |
| FINOS | Risk & Compliance Standards | Compliance Officers | Open-Source | A framework of principles, not a performance testing tool. |
Key metrics for Fin AI Benchmarking: What do financial benchmarks actually measure?
These frameworks don't just give you a single "AI smartness score." They test a handful of specific skills that are essential for financial tasks. The good news is, these are the same underlying skills an AI needs to be useful in a support or internal help desk role.
Information extraction and structuring
At its core, this is about the AI's ability to accurately find and pull specific bits of information, like names, dates, revenue figures, or policy numbers, from messy, unstructured text. This is the bread and butter of an AI support agent. It’s what lets it find an order number in a customer’s email, grab a specific clause from a knowledge base article, or spot a product name in a chat log.
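To make that concrete, here's a deliberately tiny, illustrative sketch of the task being measured: pulling an order number and a dollar amount out of a free-text message with regular expressions. Real extraction is far more robust than this, and the ticket text is made up, but the skill under test is exactly this kind of "find the needle in the prose" work.

```python
import re

# Illustrative only: a toy extractor pulling an order number and a dollar
# amount out of a free-text support message. Real benchmarks (and real
# agents) use far more robust extraction, but the task being measured is
# the same: find the specific fields buried in messy text.
ticket = "Hi, I was charged $49.99 twice on order #A-10293 last Friday. Can you help?"

order_match = re.search(r"order\s*#?([A-Z]-\d+)", ticket, re.IGNORECASE)
amount_match = re.search(r"\$(\d+(?:\.\d{2})?)", ticket)

extracted = {
    "order_id": order_match.group(1) if order_match else None,
    "amount": float(amount_match.group(1)) if amount_match else None,
}
print(extracted)  # {'order_id': 'A-10293', 'amount': 49.99}
```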
Quantitative and numerical reasoning
This tests whether the AI can actually do math, compare numbers, and understand what they mean in context. For example, it needs to know that a 5% increase is better than a 2% increase, or be able to calculate a total from a list of items. You absolutely need this for any support ticket that involves numbers. Whether it's calculating a prorated refund, confirming a tiered pricing plan, or checking a discount code, a bot that gets numbers wrong is a huge liability.
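For a feel of what "getting the numbers right" means in practice, here's a small, hypothetical prorated-refund calculation, the kind of arithmetic a support AI is expected to nail every time. The plan price, dates, and 30-day cycle are invented for the example.

```python
from datetime import date

# Illustrative sketch: the kind of arithmetic a support AI gets tested on.
# A customer cancels mid-cycle, so they are owed a refund for the unused days.
# The plan price, dates, and 30-day cycle are made up for this example.
def prorated_refund(monthly_price: float, cycle_start: date, cancel_date: date, cycle_days: int = 30) -> float:
    days_used = (cancel_date - cycle_start).days
    unused_days = max(cycle_days - days_used, 0)
    return round(monthly_price * unused_days / cycle_days, 2)

# Cancelled 12 days into a $60/month plan -> 18 unused days -> $36.00 back.
print(prorated_refund(60.00, date(2024, 5, 1), date(2024, 5, 13)))  # 36.0
```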
Domain-specific knowledge and question answering
This is all about how well the AI can answer tricky questions by reading dense, specialized documents. In finance, that might be an annual report or a regulatory filing. For you, this is the heart and soul of any knowledge-based AI. A high score here is a good sign, but what really counts is how well the AI can answer questions based on your internal documents: your help center articles, your company policies, your product specs. An AI trained on a generic financial library won't know the first thing about your business.
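To make the idea of "answers grounded in your documents" concrete, here's a deliberately naive retrieval sketch that scores a few made-up knowledge base articles by keyword overlap with a question. Production systems use embeddings and retrieval-augmented generation rather than word counting, but the principle is the same: the answer has to come from your content, not a generic corpus.

```python
# A deliberately naive sketch of "answer from *your* documents": score each
# internal article by keyword overlap with the question and return the best
# match. The articles below are invented for the example.
knowledge_base = {
    "refund-policy": "Refunds are prorated for annual plans cancelled within 60 days.",
    "sso-setup": "Enterprise customers can enable SAML SSO from the security settings page.",
    "invoice-faq": "Invoices are issued on the first business day of each month.",
}

def best_article(question: str) -> str:
    q_words = set(question.lower().split())
    scores = {
        doc_id: len(q_words & set(text.lower().split()))
        for doc_id, text in knowledge_base.items()
    }
    return max(scores, key=scores.get)

print(best_article("How do I set up SSO for my enterprise account?"))  # sso-setup
```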
Beyond the leaderboard: How to apply Fin AI Benchmarking for your team
This brings us to the most important point of all: your goal isn't to pick the model with the highest academic score. It’s to find the AI platform that works best in your messy, real-world environment.
The challenge with Fin AI Benchmarking: From theoretical scores to real-world results
Here’s the gap: an AI can get a perfect score on a standardized test but completely face-plant when it runs into your company's internal slang, unique customer problems, or multi-step escalation rules. The move from theoretical scores to real-world results is a critical step.
The "real" benchmarks, the ones that actually affect your bottom line, are things like:
- Resolution Rate: What percentage of questions does the AI actually solve on its own?
- Customer Satisfaction (CSAT): Do people walk away feeling good after talking to the AI?
- First-Response Time: How fast does the AI jump in and give a helpful answer?
- Cost Savings: How much time and money are you saving by having it handle tasks?
These are the numbers that matter, and you won't find them on any public leaderboard. You have to measure them yourself.
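If you have a helpdesk export, these numbers are straightforward to compute yourself. Here's a minimal, hypothetical sketch; the field names and the per-minute agent cost are invented for illustration, so swap in whatever your own data actually contains.

```python
from statistics import mean, median

# Hypothetical ticket records from a helpdesk export. Field names are invented
# for this example -- map them to whatever your own helpdesk actually exports.
tickets = [
    {"resolved_by_ai": True,  "csat": 5, "first_response_sec": 4,   "agent_minutes_saved": 9},
    {"resolved_by_ai": True,  "csat": 4, "first_response_sec": 6,   "agent_minutes_saved": 7},
    {"resolved_by_ai": False, "csat": 3, "first_response_sec": 480, "agent_minutes_saved": 0},
    {"resolved_by_ai": True,  "csat": 5, "first_response_sec": 5,   "agent_minutes_saved": 12},
]

resolution_rate = sum(t["resolved_by_ai"] for t in tickets) / len(tickets)
avg_csat = mean(t["csat"] for t in tickets)
median_frt = median(t["first_response_sec"] for t in tickets)
cost_per_agent_minute = 0.75  # assumed fully loaded agent cost; purely illustrative
savings = sum(t["agent_minutes_saved"] for t in tickets) * cost_per_agent_minute

print(f"Resolution rate:     {resolution_rate:.0%}")
print(f"Average CSAT:        {avg_csat:.2f} / 5")
print(f"Median first reply:  {median_frt:.1f}s")
print(f"Estimated savings:   ${savings:.2f}")
```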
Introducing a practical approach with eesel AI
This is where a platform like eesel AI fits in. It’s designed to let you run practical, risk-free benchmarks that are tailored to your business, and you can do it all yourself without having to sit through a sales call.
Simulate with confidence
Instead of just guessing how an AI might do, you can find out for sure. eesel AI has a powerful simulation mode that lets you connect your helpdesk and run the AI on thousands of your past tickets in a safe, sandboxed environment. It gives you a precise, data-backed forecast of how it will perform, including projected resolution rates and cost savings, before it ever interacts with a live customer. This lets you create your own personal, super-relevant benchmark based on your actual data.
A screenshot of the eesel AI simulation mode, which allows for practical Fin AI Benchmarking on your own historical data.
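Conceptually, a simulation like this is just a back-test over historical tickets. The sketch below shows the general shape of that loop in plain Python; it is not eesel AI's API, and `candidate_agent` and `grade_answer` are stand-ins for whatever agent and grading step you plug in.

```python
# The general shape of a simulation back-test, sketched in plain Python. This
# is NOT eesel AI's API -- `candidate_agent` and `grade_answer` are stand-ins.
# The idea: replay historical tickets in a sandbox, compare the AI's draft to
# what actually resolved each ticket, and count how many it could have handled.
def simulate(historical_tickets, candidate_agent, grade_answer):
    would_resolve = 0
    for ticket in historical_tickets:
        draft = candidate_agent(ticket["question"])          # AI answers in a sandbox
        if grade_answer(draft, ticket["accepted_answer"]):   # compare to the real resolution
            would_resolve += 1
    return would_resolve / len(historical_tickets)

# e.g. projected_rate = simulate(last_quarter_tickets, my_agent, looks_equivalent)
```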
Train on your reality
Generic models are tested on generic data. eesel AI works differently. It connects to all of your company’s knowledge, from past tickets in Zendesk or Freshdesk to internal wikis in Confluence or Google Docs and even conversations in Slack, to build an AI that genuinely understands your business. That's what leads to real-world accuracy, not some abstract score on a test.
The eesel AI platform showing how to train the AI on your company's reality for more accurate Fin AI Benchmarking.
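One way to picture this is as a declared set of knowledge sources the agent is allowed to draw on. The structure below is purely illustrative and does not mirror eesel AI's actual configuration format; it just shows the kind of breadth involved.

```python
# Purely illustrative: one way to describe the knowledge sources an agent should
# draw on. The keys, providers, and values are invented for this example and do
# not represent eesel AI's configuration format.
knowledge_sources = {
    "helpdesk_history": {"provider": "zendesk", "include_closed_tickets": True},
    "wiki": {"provider": "confluence", "spaces": ["SUPPORT", "BILLING"]},
    "docs": {"provider": "google_docs", "folders": ["Policies", "Product specs"]},
    "chat": {"provider": "slack", "channels": ["#support-escalations"]},
}
```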
Control the test
Benchmarking isn't something you do once and forget about. It's an ongoing process. With eesel AI’s gradual rollout and selective automation features, you're always in the driver's seat. You can start by benchmarking the AI on a small handful of simple, low-risk tickets. Then, you can use the reports to see how it did, tweak its persona or knowledge sources, and expand its role as you get more comfortable. It’s a controlled, step-by-step evaluation that you manage from a simple dashboard.
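Stripped down to its essence, selective automation is a routing rule: a narrow allow-list of low-risk topics plus a confidence bar, with everything else going to a human. The sketch below is a hypothetical illustration of that logic, not how any particular platform implements it.

```python
# A sketch of "selective automation" as plain rules: start with a narrow
# allow-list of low-risk topics and hand everything else to a human. The
# topic names and threshold are invented for the example.
AUTOMATED_TOPICS = {"password_reset", "invoice_copy", "plan_features"}
CONFIDENCE_THRESHOLD = 0.85

def route(ticket_topic: str, ai_confidence: float) -> str:
    if ticket_topic in AUTOMATED_TOPICS and ai_confidence >= CONFIDENCE_THRESHOLD:
        return "ai_resolves"
    return "escalate_to_human"

print(route("invoice_copy", 0.92))        # ai_resolves
print(route("chargeback_dispute", 0.97))  # escalate_to_human
```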
Comparing Fin AI platform pricing and implementation
When you're looking at AI platforms, the cost model is a pretty big piece of the puzzle. Academic frameworks like FinBen and FINOS are open initiatives, so there’s no price tag. But for the AI agents you'd actually use, the story is very different.
Some platforms, like Intercom's Fin, use a per-resolution pricing model. They charge you for every ticket the AI resolves, often something like "$0.99 per resolution." That might sound fair at first, but it creates unpredictable costs that go up as your support volume grows. If you have a busy month and the AI does a great job, you end up with a bigger bill. You're basically penalized for success.
eesel AI uses a more straightforward and predictable approach. Our plans are based on a flat monthly fee that includes a generous allowance of AI interactions (an interaction is a single answer or action). You know exactly what you’re paying each month, which makes budgeting easy and avoids any surprise charges. Plus, with flexible month-to-month plans, you can get started without getting stuck in a long-term contract.
A view of eesel AI's pricing page, showing a predictable cost model which is a key factor in Fin AI Benchmarking.
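A quick back-of-the-envelope comparison shows why the pricing shape matters as volume grows. The $0.99 figure is the per-resolution example quoted above; the flat monthly fee below is a placeholder, not an actual eesel AI quote.

```python
# Back-of-the-envelope comparison of the two pricing shapes discussed above.
# The $0.99 per-resolution figure is the example quoted in the text; the flat
# monthly fee is a placeholder, not an actual eesel AI price.
PER_RESOLUTION_PRICE = 0.99
FLAT_MONTHLY_FEE = 799.00  # hypothetical

for resolutions_per_month in (500, 1_000, 2_500, 5_000):
    usage_based = resolutions_per_month * PER_RESOLUTION_PRICE
    print(f"{resolutions_per_month:>5} resolutions -> per-resolution: ${usage_based:,.2f} | flat fee: ${FLAT_MONTHLY_FEE:,.2f}")
```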
Making Fin AI Benchmarking work for you
The world of Fin AI Benchmarking is clearly changing. It’s moving away from purely academic leaderboards and toward practical tools that help businesses check for risks, measure performance, and get real value.
While the power of the underlying LLM is important, the true test of an AI agent is how it performs with your data, inside your workflows. The goal isn't just to find the "smartest" AI on paper. It's to find a platform that gives you the tools to roll out, test, and control your automation safely and effectively. A modern AI platform shouldn't just hand you an AI; it should give you the power to run your own benchmarks with confidence.
Ready to see how an AI agent performs on your real support tickets? Start your free trial with eesel AI and run a simulation on your historical data in minutes. No sales call needed.
Frequently asked questions
What is Fin AI Benchmarking and why does it matter?
Fin AI Benchmarking is the systematic testing of AI models on finance-specific tasks to measure their performance. It's crucial because the high stakes in finance mean even minor AI errors can lead to compliance issues, security threats, or significant financial losses.
What's the difference between foundational model benchmarking and applied agent benchmarking?
Foundational Model Fin AI Benchmarking tests the raw intelligence of an LLM using standard financial datasets in an academic setting. Applied Agent Fin AI Benchmarking, however, assesses a fully integrated AI application's performance on real-world business metrics like resolution rates and accuracy with your company's unique data.
What skills do Fin AI benchmarks actually measure?
Fin AI Benchmarking commonly measures information extraction and structuring, assessing an AI's ability to accurately pull specific data from text. It also evaluates quantitative and numerical reasoning, and the AI's domain-specific knowledge and question-answering capabilities based on specialized financial documents.
What are the limitations of existing Fin AI Benchmarking frameworks for most businesses?
Many current Fin AI Benchmarking frameworks are either too academic, focused on complex market analysis, or designed for niche research tasks. They often don't reflect an AI's real-world performance on a company's specific documents, internal slang, or high-volume customer service workflows.
How can companies move beyond theoretical benchmark scores?
Companies should move beyond theoretical scores by conducting practical Fin AI Benchmarking with their own data. Platforms like eesel AI allow you to simulate AI performance on past tickets in a sandboxed environment, providing data-backed forecasts of resolution rates and cost savings specific to your business.
Which real-world metrics matter most for customer support teams?
For customer support, crucial real-world metrics for Fin AI Benchmarking include resolution rate, customer satisfaction (CSAT), first-response time, and cost savings. These directly impact your bottom line and reflect how effectively the AI handles your specific customer interactions and problems.
How does eesel AI's pricing compare to per-resolution models?
Unlike some platforms that use unpredictable per-resolution pricing, eesel AI offers a flat monthly fee for its benchmarking and agent capabilities. This predictable cost model includes a generous allowance of AI interactions, making budgeting straightforward and avoiding surprise charges in months when the AI resolves more tickets.