GIGABYTE Solutions for NVIDIA Rubin Platform

Built on the NVIDIA Rubin platform, GIGABYTE solutions usher in a new era of agentic AI, accelerating intelligent reasoning, autonomous decision-making, and scalable AI infrastructure.

The Expert in Agentic AI and AI Reasoning

Agentic AI, long discussed as a future possibility, is now within reach. With the NVIDIA Rubin platform, purpose-built for agentic AI and reasoning, GIGABYTE aims to deliver highly efficient solutions across industries. The platform is far more than a GPU upgrade: it introduces six new chips (CPU, GPU, NVLink switch, NIC, DPU, and Ethernet switch) that together accelerate every aspect of AI computation.

The Six New Chips

Vera CPU

88 custom-designed Olympus cores with improved bandwidth and next-generation low-latency data movement.

Rubin GPU

Built for the future of AI with increased compute density, memory bandwidth, and rack-scale communication.

NVLink 6 Switch

The scale-up fabric that tightly couples accelerators with uniform latency and sustained bandwidth.

ConnectX-9 SuperNIC

Delivers predictable scale-out performance while enforcing traffic isolation and secure operation.

BlueField-4 DPU

The software-defined control plane that enforces security, isolation, and operational determinism.

Spectrum-6 Switch

Purpose-built Ethernet fabric engineered for highly synchronized, bursty AI traffic.

The Five Generational Breakthroughs

6th Gen NVLink & NVLink Switch

3.6 TB/s bandwidth per GPU for bandwidth-intensive applications like AI inference, featuring NVIDIA® SHARP™.

Vera CPU

Combines 88 NVIDIA-designed cores, up to 1.2 TB/s of LPDDR5X memory bandwidth, and Scalable Coherency Fabric.

3rd Gen Transformer Engine

Enables up to 50 PFLOPS of NVFP4 inference performance with new hardware-accelerated adaptive compression.

3rd Gen Confidential Computing

The world's first rack-scale confidential computing across CPU, GPU, and NVLink™ domains.

2nd Gen RAS Engine

Enables continuous in-system health monitoring, self-testing, and SRAM repair.

NVIDIA Vera Rubin NVL72

Unmatched Dense Performance

Delivers extreme GPU density in a single rack, enabling massive performance for trillion-parameter AI models and large-scale training workloads.

Designed for Next-Generation AI Factories

Built to improve AI training efficiency and reduce inference cost. Tuned for large models and high throughput, it delivers maximum efficiency where every millisecond counts.

Integrated, Fully Engineered System

Comes as a cohesively engineered rack system, including custom cooling, power distribution, and networking, enabling rapid, cable-free deployment at scale.

Specifications¹

NVFP4 Inference	3,600 PFLOPS
NVFP4 Training²	2,520 PFLOPS
FP8 / FP6 Training²	1,260 PFLOPS
INT8²	18 POPS
FP16 / BF16²	288 PFLOPS
TF32²	144 PFLOPS
FP32	9,360 TFLOPS
FP64	2,400 TFLOPS
FP32 SGEMM³	28,800 TFLOPS
FP64 DGEMM³	14,400 TFLOPS
GPU Memory	20.7 TB HBM4
GPU Memory Bandwidth	1,580 TB/s
NVLink Bandwidth	260 TB/s
NVLink-C2C Bandwidth	65 TB/s

¹ All values are "up to" figures and are subject to change.
² Dense specification.
³ Peak performance using Tensor Core-based emulation algorithms.
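
The rack-level figures above can be related to the per-GPU figures quoted earlier with simple division. The following is a minimal sketch, not a vendor tool: it assumes that the "72" in NVL72 is the number of Rubin GPUs in the rack and that the rack totals are split evenly across them. The names `RACK_SPECS`, `GPUS_PER_RACK`, and `per_gpu` are hypothetical and exist only for this illustration.

```python
# Rack-level "up to" figures copied from the specification table above.
RACK_SPECS = {
    "nvfp4_inference_pflops": 3600.0,  # NVFP4 Inference, PFLOPS
    "gpu_memory_tb": 20.7,             # GPU Memory, TB of HBM4
    "nvlink_bandwidth_tb_s": 260.0,    # NVLink Bandwidth, TB/s
}

# Assumption: the "72" in NVL72 is the number of Rubin GPUs per rack.
GPUS_PER_RACK = 72


def per_gpu(rack_value: float, gpus: int = GPUS_PER_RACK) -> float:
    """Divide a rack-level figure evenly across the GPUs in the rack."""
    if gpus <= 0:
        raise ValueError("gpus must be positive")
    return rack_value / gpus


if __name__ == "__main__":
    for name, value in RACK_SPECS.items():
        print(f"{name}: {per_gpu(value):.2f} per GPU")
```

Under these assumptions, the NVFP4 inference figure works out to 50 PFLOPS per GPU and the NVLink figure to roughly 3.6 TB/s per GPU, which agrees with the Transformer Engine and NVLink numbers listed under the generational breakthroughs.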

NVIDIA Rubin NVL8

Flexible Scaling for Any Deployment

Scale from single-node to multi-node GPU clusters without committing to a full rack-scale architecture. Ideal for phased expansion and mixed AI workloads.

Lower Infrastructure Requirements

Fits into more standard server and rack environments, reducing the need for specialized power, cooling, and facility redesign.

Broader Platform Compatibility

Supports a wider range of configurations, networking choices, and workload types. A good fit for enterprises running workloads from AI training to inference and HPC.

GPU	8x NVIDIA Rubin GPUs
Total GPU Memory | Bandwidth	2.3 TB | 160 TB/s
CPU	2x Intel® Xeon® 6 processors
NVIDIA NVLink Switch System	4x
NVIDIA NVLink Bandwidth	28.8 TB/s total bandwidth
Networking	8x OSFP ports serving 8x single-port NVIDIA ConnectX®-9 VPI (up to 800 Gb/s NVIDIA InfiniBand and Ethernet); 2x 400G QSFP112 NVIDIA BlueField®-4 DPUs (up to 800 Gb/s NVIDIA InfiniBand and Ethernet)
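
To put the NVL8 table in perspective, the totals can be broken down per GPU and the scale-up (NVLink) bandwidth compared with the scale-out (ConnectX-9) bandwidth. This is a rough sketch under stated assumptions: it treats every value as its "up to" maximum, assumes one ConnectX-9 port per GPU because both counts are 8, uses decimal units, and leaves the BlueField-4 DPUs out of the scale-out total. The function and constant names are hypothetical.

```python
# Figures copied from the NVL8 table above; all are "up to" values.
GPU_COUNT = 8
TOTAL_GPU_MEMORY_TB = 2.3      # terabytes
TOTAL_NVLINK_TBYTES_S = 28.8   # terabytes per second, total
NIC_COUNT = 8                  # single-port ConnectX-9 adapters
NIC_RATE_GBITS_S = 800         # gigabits per second per adapter


def gbits_to_tbytes(gbits_per_s: float) -> float:
    """Convert gigabits per second to terabytes per second (decimal units)."""
    return gbits_per_s / 8 / 1000


def summarize() -> dict:
    """Return per-GPU and aggregate figures derived from the table values."""
    nvlink_per_gpu = TOTAL_NVLINK_TBYTES_S / GPU_COUNT
    scale_out_total = gbits_to_tbytes(NIC_COUNT * NIC_RATE_GBITS_S)
    # Assumption: one ConnectX-9 port per GPU, since both counts are 8.
    scale_out_per_gpu = scale_out_total / GPU_COUNT
    return {
        "memory_per_gpu_gb": TOTAL_GPU_MEMORY_TB * 1000 / GPU_COUNT,
        "nvlink_per_gpu_tbytes_s": nvlink_per_gpu,
        "scale_out_total_tbytes_s": scale_out_total,
        "nvlink_to_scale_out_ratio": nvlink_per_gpu / scale_out_per_gpu,
    }


if __name__ == "__main__":
    for key, value in summarize().items():
        print(f"{key}: {value:.2f}")
```

Under these assumptions, each GPU gets about 3.6 TB/s of NVLink bandwidth against about 0.1 TB/s of scale-out bandwidth, a ratio of roughly 36 to 1. That gap is one reason to keep bandwidth-heavy communication inside the NVLink domain where possible.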

Why GIGABYTE?

Short Time to Market (TTM) for Agile Deployment

Flexible Scalability for Diverse Scenarios

One-Stop Deployment for Zero Hassle

Unified Management for Easy Maintenance

Strong Partnership for All-Round Support

Extensive Global Experience for Maximum Flexibility

GIGAPOD - One-Stop Scalable Solutions

At GIGABYTE, we offer GIGAPOD, a solution that scales from a single rack to POD scale and containerized data centers, with power, cabling, cooling, and all supporting infrastructure carefully designed and evaluated. It provides a simple, one-step, pain-free path to adopting AI data centers.