Pega Eliminates ‘AI Token Tax’ With More Efficient Way to Build and Run Agentic Workflows
Pega Infinity 26 provides predictable agent outcomes with predictable cost and no metered token charges
Market Context: The bill comes due on AI experimentation
Token bills are arriving, and they are shocking enterprise leaders. As organizations look to scale agent experiments to production, LLM providers are converting flat-rate subscriptions to more expensive token-metered pricing – while quietly running up expensive reasoning tokens behind the scenes. The more complex the request, the more reasoning steps are required – and the more likely it generates an inadequate and inconsistent answer.
A Closer Look: The Pega AI architectural difference
Pega applies AI reasoning at design time, when its creative power delivers the most value for reimagining outdated processes and systems. With Pega Blueprint AI™ and the new
Once the workflows are designed and deployed, Pega shifts to a lighter weight semantic mode of AI better suited for runtime, when agents are called on to process millions of user requests efficiently and consistently. Instead of re-reasoning each new workflow, agents use a lightweight AI query to understand the user intent, find the best Pega workflow for the job, and then follow it step-by-step to complete the work. If a specific step needs deeper LLM use (e.g., to parse a document or summarize previous interactions), the step provides specific and bounded instructions to ensure predictability.
This approach delivers two critical benefits:
- Predictable outcomes: Re-reasoning each workflow leads to inconsistent and unpredictable outcomes. Instead, agents connected to Pega follow pre-approved workflows consistently, which is critical for regulated industries - and smart for everybody.
- Predictable costs: Pega’s approach uses AI reasoning once at design time, rather than inefficiently re-reasoning repeatedly at runtime, making it dramatically more efficient and affordable for agents to drive the processes that do the most work in a business.
Interactive Token Calculator: How much are you wasting on inefficient AI agents?
To help enterprises quantify the benefits of this approach, Pega introduced the AI Token Cost Calculator. The interactive tool estimates possible savings by comparing Pega AI with token-metered alternatives based on users’ workflow volumes. Many clients can realize a savings of more than 20x depending on workflow complexity and scale. Visit www.pega.com/ai-token-calculator.
Availability: Pay for the work being done, not thinking about what to do
Pega’s outcomes-based approach charges per completed “case” – a task executed from start to finish – not per seat or per token. For example, when a customer uses an AI agent to change an existing order, that completed interaction is recorded as a single case.
Available in Q3 this year, Pega Infinity 26 clients pay a single, flat price per completed case, regardless of how much Pega AI is used behind the scenes. This aligns cost directly to business value.
For more information, visit www.pega.com/dontpayfortokens
Quotes & Commentary
“Enterprises are quickly waking up to the fact that tokenmaxxing is ridiculous: it can only lead to unsustainable costs and unpredictable results,” said
“We have hit a point in the AI hype cycle where value and accountability have entered the conversation in meaningful ways,“ said
About Pega
Pega delivers the platform to reimagine, run, and evolve the processes and decisions an enterprise can't afford to get wrong. We combine AI with proven architecture to keep mission-critical operations governed, scalable, and continuously adaptable. Since 1983, the world's largest organizations have trusted Pega to turn transformation ambition into durable results. Learn more at pega.com.
View source version on businesswire.com: https://www.businesswire.com/news/home/20260608778439/en/
Press Contact
Sean Audet
Pegasystems Inc.
sean.audet@pega.com
Source: Pegasystems Inc.