FuriosaAI Partners with Broadcom to Build Next-generation Inference Platform for the Agentic Era
Moving beyond a traditional ASIC partnership, Furiosa and
RNGD: A proven foundation for production inference
The partnership builds on the commercial maturity of RNGD, FuriosaAI’s data center inference chip. Currently in mass production and fabricated at TSMC’s 5nm advanced process, RNGD is a 180W, PCIe-based accelerator designed for high-performance LLM and agentic AI workloads.
RNGD has been validated in production environments by global leaders including Samsung SDS and
"Inference performance is no longer defined solely by raw compute. It is increasingly a function of data reuse and communication efficiency across servers and racks," said
Scalable chiplet architecture and advanced networking
Furiosa’s third-generation chip will feature a 2nm compute die, a dedicated IO die for scale-up networking, and HBM4/4E memory, utilizing Broadcom’s advanced packaging capabilities to integrate multiple silicon dies into a high-performance AI inference accelerator. By incorporating Broadcom’s scale-up Ethernet technologies, the system delivers low-latency, high-bandwidth all-to-all interconnect across hundreds of chips at rack scale.
“Bringing together Broadcom’s infrastructure capabilities and Furiosa’s Tensor Contraction Processor architecture and its industry-defining software stack allows us to move beyond the chip level and deliver a comprehensive solution for the token factory era,” said Furiosa Cofounder and CEO
The design is optimized for demanding real-world AI workloads, including intensive post-training sampling. By focusing on high-bandwidth data movement rather than the thread management required by GPUs, the chip will deliver higher performance-per-watt and greater token density than state-of-the-art GPUs.
Software designed for flexibility and developer velocity
Furiosa’s hardware is supported by a software stack that enables developers to deploy quickly, meet demanding throughput and latency requirements, and easily switch to new frontier models and new optimization techniques.
While legacy platforms require extensive hand-tuning of kernels for every new model, Furiosa’s SDK leverages a general compiler that automatically maps high-level PyTorch code to silicon. For developers requiring more granular control, Furiosa’s Virtual ISA offers a declarative programming model that provides hardware control without the non-deterministic complexity of traditional GPU programming.
Roadmap and availability
Sampling is scheduled to begin in the first half of 2028 to support the next decade of AI data center deployments.
About FuriosaAI
FuriosaAI builds high-performance, high-efficiency AI compute for the Inference Era. Founded in 2017 by veteran engineers from AMD, Qualcomm, and Samsung, Furiosa operates globally with offices in
For more information, visit furiosa.ai.
View source version on businesswire.com: https://www.businesswire.com/news/home/20260527775773/en/
Media contact: press@furiosa.ai
Source: FuriosaAI