Broadcom has officially announced the shipment of the Tomahawk Ultra Ethernet Switch, a product set to redefine high-performance computing (HPC) and artificial intelligence (AI) networking. Designed for ultra-low latency, high throughput, and lossless operation, Tomahawk Ultra establishes a new standard for Ethernet switching in demanding technical environments.
Ram Velaga, Senior Vice President and General Manager of Broadcom’s Core Switching Group, emphasized that Tomahawk Ultra is the result of a multi-year engineering effort involving hundreds of specialists. This launch underscores Broadcom’s ongoing commitment to advancing Ethernet technology for the next generation of high-performance and AI-driven workloads.
Historically, Ethernet has been viewed as a high-latency, lossy technology, unsuitable for the most demanding compute clusters. Tomahawk Ultra challenges this perception by delivering:
Tomahawk Ultra is optimized for the low-latency, high-bandwidth communication patterns found in HPC systems and AI clusters. Its architecture is designed to deliver predictable, high-efficiency performance for large-scale simulations, scientific computing, and synchronized AI model training and inference.
When deployed with Scale-Up Ethernet (SUE), Tomahawk Ultra achieves sub-400ns XPU-to-XPU communication latency, including switch transit time, setting a new standard for tightly synchronized AI compute at scale.
The reduction of Ethernet header overhead from 46 bytes to 10 bytes, while maintaining compliance, significantly enhances network efficiency. This streamlined, adaptable header provides both flexibility and performance improvements across a range of HPC and AI workloads.
Tomahawk Ultra’s lossless fabric technology is engineered to prevent packet drops during high-volume data transfers. Using LLR, the switch detects link errors with Forward Error Correction (FEC) and automatically retransmits packets, avoiding physical-level drops. CBFC further prevents buffer overflows, a common cause of packet loss. Together, these mechanisms create a truly lossless Ethernet fabric, delivering the reliability required by today’s most data-intensive applications.
A significant bottleneck in AI and machine learning workloads is the overhead associated with collective operations, such as AllReduce, Broadcast, and AllGather. Tomahawk Ultra addresses this by performing these operations directly within the switch chip, reducing job completion times and maximizing the utilization of expensive compute resources. Notably, this feature operates independently of endpoints, allowing for rapid integration across diverse system architectures and vendor ecosystems.
Tomahawk Ultra is designed with advanced, topology-aware routing to support HPC topologies such as Dragonfly, Mesh, and Torus. The switch complies with the UEC standard and leverages the openness and rich ecosystem of Ethernet networking, ensuring broad compatibility and future-proofing for evolving data center architectures.
As part of Broadcom’s Ethernet-forward strategy for AI scaling, the company has introduced SUE-Lite, an optimized version of the SUE specification. SUE-Lite is tailored for power- and area-sensitive accelerator applications, retaining the core low-latency and lossless features of full SUE while further reducing the silicon footprint and power consumption of Ethernet interfaces on AI XPUs and CPUs. This lightweight approach simplifies the integration of standards-compliant Ethernet fabrics into AI platforms, promoting broader adoption of Ethernet as the preferred interconnect for scale-up architectures.
Together with the 102.4 Tb/s Tomahawk 6, Tomahawk Ultra forms the backbone of a unified Ethernet architecture, enabling both scalable AI training clusters and expansive HPC and distributed workloads.
The switch is currently shipping for use in rack-scale AI training clusters and supercomputing environments.
Rackspace Technology and AMD have signed a memorandum of understanding establishing a framework for a multi-year strategic partnership focused on…
Lenovo is expanding its business PC portfolio with new ThinkPad laptops and a ThinkStation desktop workstation aimed at organizations that…
HPE has announced general availability of the HPE Compute Scale-up Server 3250, a scale-up platform engineered for in-memory databases and…
Dell Technologies has announced two major updates to its Dell AI Platform with AMD, targeting organizations scaling from pilot AI…
NVIDIA and IREN Limited have announced a strategic partnership to accelerate the deployment of next-generation AI infrastructure, with plans to…
Anthropic’s new compute agreement with SpaceX gives the AI company access to all compute capacity at SpaceX’s Colossus 1 data…