Penguin Solutions Releases ICE ClusterWare Management Software 13.0 for Optimizing AI Infrastructure
New cluster management software capabilities deliver sustained peak performance and network-isolated resource segmentation for AI and HPC applications
When an organization’s AI deployments progress from isolated pilot projects to enterprise-wide production environments, operational demands on infrastructure intensify immediately. Penguin’s ICE ClusterWare 13.0 addresses this with built-in anomaly detection and auto-remediation, along with network-isolated multi-tenancy—delivering the operational excellence required to support AI as a core business function.
“With the launch of our ICE ClusterWare software 13.0, we’re delivering pivotal advancements to help organizations manage the growing complexity of modern AI and HPC environments,” said
The patent-pending anomaly detection and auto-remediation technology maintains peak cluster performance and resource availability by continuously monitoring for hidden performance degradation that traditional diagnostic tools miss. Upon detection, the system automatically isolates underperforming nodes and initiates remediation in real time, so that workloads are scheduled on validated, high-performing nodes. This proactive approach reduces administrative burden, prevents unplanned downtime, and maximizes the cluster’s usable capacity. As a result, the new capability significantly shortens model training time by reducing restarts and lost work.
The new optional network-isolated multi-tenancy feature enables organizations to securely and efficiently share high-value GPU clusters, creating dedicated subclusters to support different departments, projects, or GPU-as-a-Service (GPUaaS) customers. This capability provides isolated environments, giving tenants the autonomy to select their own workload manager, govern users, and run workloads with confidence that data and operations remain segregated and secure.
"The pace and quality of biomedical research are directly tied to the technology that supports it," said Assistant Dean for Information Technology
By reducing the security and resource-utilization conflicts that previously forced organizations to build separate clusters, network-isolated multi-tenancy drastically improves time to value. This capability is essential for cloud service providers and hyperscalers providing GPUaaS, enterprises and research institutes delivering AI computing to internal business groups, and federal or government agencies that require the highest level of security and resource isolation.
General availability for ICE ClusterWare software 13.0 is scheduled for
ICE ClusterWare is a trademark or registered trademark of
About
The most exciting technological advancements are also the most challenging for companies to adopt. At
For more information, visit https://www.penguinsolutions.com.
View source version on businesswire.com: https://www.businesswire.com/news/home/20251117135297/en/
Media Contact
Maureen O’Leary,
602-330-6846
pr@penguinsolutions.com
Source: