Solutions
May. 13, 2026

GIGABYTE Solutions for NVIDIA Rubin Platform

GIGABYTE solutions usher in a new era of Agentic AI, built on the NVIDIA Rubin platform to accelerate intelligent reasoning, autonomous decision-making, and scalable AI infrastructure.

The Expert in Agentic AI and AI Reasoning

Agentic AI, long discussed as a future prospect, is now within reach. With the NVIDIA Rubin platform, purpose-built for agentic AI and reasoning, GIGABYTE aims to deliver efficient solutions across industries. The NVIDIA Rubin platform is more than a GPU upgrade: it introduces six new chips (CPU, GPU, NVLink Switch, DPU, NIC, and Ethernet switch) that together cover compute, interconnect, and networking.

The Six New Chips

Vera CPU
88 custom-designed Olympus cores with improved bandwidth and next-generation low-latency data movement.
Rubin GPU
Built for the future of AI with increased compute density, memory bandwidth, and rack-scale communication.
NVLink 6 Switch
The scale-up fabric that tightly couples accelerators with uniform latency and sustained bandwidth.
ConnectX-9 SuperNIC
Delivers predictable scale-out performance while enforcing traffic isolation and secure operation.
BlueField-4 DPU
The software-defined control plane that enforces security, isolation, and operational determinism.
Spectrum-6 Switch
Purpose-built Ethernet fabric engineered for highly synchronized, bursty AI traffic.

The Five Generational Breakthroughs

6th Gen NVLink & NVLink Switch

3.6 TB/s bandwidth per GPU for bandwidth-intensive applications like AI inference, featuring NVIDIA® SHARP™.

Vera CPU

Combines 88 NVIDIA-designed cores, up to 1.2 TB/s of LPDDR5X memory bandwidth, and Scalable Coherency Fabric.

3rd Gen Transformer Engine

Enables up to 50 PetaFLOPS NVFP4 for inference with new hardware-accelerated adaptive compression.

3rd Gen Confidential Computing

The world's first rack-scale confidential computing across CPU, GPU, and NVLink™ domains.

2nd Gen RAS Engine

Enables continuous in-system health monitoring, self-testing, and SRAM repair.

NVIDIA Vera Rubin NVL72

Unmatched Dense Performance

Delivers extreme GPU density in a single rack, enabling massive performance for trillion-parameter AI models and large-scale training workloads.

Designed for Next-Generation AI Factories

Built specifically to improve AI training efficiency and reduce inference cost. Tuned for large models and high throughput in workloads where every millisecond counts.

Integrated, Fully Engineered System

Comes as a cohesively engineered rack system, including custom cooling, power distribution, and networking, enabling rapid, cable-free deployment at scale.

Specifications [1]

NVFP4 Inference: 3,600 PFLOPS
NVFP4 Training [2]: 2,520 PFLOPS
FP8 / FP6 Training [2]: 1,260 PFLOPS
INT8 [2]: 18 POPS
FP16 / BF16 [2]: 288 PFLOPS
TF32 [2]: 144 PFLOPS
FP32: 9,360 TFLOPS
FP64: 2,400 TFLOPS
FP32 SGEMM [3]: 28,800 TFLOPS
FP64 DGEMM [3]: 14,400 TFLOPS
GPU Memory: 20.7 TB HBM4
GPU Memory Bandwidth: 1,580 TB/s
NVLink Bandwidth: 260 TB/s
NVLink-C2C Bandwidth: 65 TB/s

[1] All values are "up to" figures and are subject to change.
[2] Dense specification.
[3] Peak performance using Tensor Core-based emulation algorithms.
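
As a rough cross-check, the rack-level figures above can be divided back down to a per-GPU share and compared with the per-GPU numbers quoted earlier on this page (up to 50 PFLOPS NVFP4 inference and 3.6 TB/s NVLink bandwidth per GPU). The sketch below is illustrative only. It assumes the rack holds 72 GPUs, which is inferred from the "NVL72" name rather than stated in the specification list, and the dictionary keys and the `per_gpu` helper are hypothetical names chosen for this example.

```python
# Rack-level "up to" figures copied from the specification list above.
NVL72_SPECS = {
    "nvfp4_inference_pflops": 3600,
    "nvlink_tb_per_s": 260,
    "hbm4_capacity_tb": 20.7,
    "hbm4_bandwidth_tb_per_s": 1580,
}

# Assumption: 72 GPUs per rack, inferred from the "NVL72" product name.
GPUS_PER_RACK = 72


def per_gpu(specs, gpu_count):
    """Divide each rack-level figure evenly across the GPUs in the rack."""
    return {name: value / gpu_count for name, value in specs.items()}


shares = per_gpu(NVL72_SPECS, GPUS_PER_RACK)
for name, value in shares.items():
    print(f"{name}: {value:.2f} per GPU")
```

Under that assumption, 3,600 PFLOPS across 72 GPUs is 50 PFLOPS per GPU, and 260 TB/s of NVLink bandwidth is roughly 3.6 TB/s per GPU, consistent with the per-GPU figures listed under the generational breakthroughs.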

NVIDIA Rubin NVL8

Flexible Scaling for Any Deployment

Scale from single-node to multi-node GPU clusters without committing to a full rack-scale architecture. Ideal for phased expansion and mixed AI workloads.

Lower Infrastructure Requirements

Fits into more standard server and rack environments, reducing the need for specialized power, cooling, and facility redesign.

Broader Platform Compatibility

Supports a wider range of configurations, networking choices, and workload types. Fits perfectly into enterprises running everything from AI training to inference and HPC.

GPU: 8x NVIDIA Rubin GPUs
Total GPU Memory | Bandwidth: 2.3 TB | 160 TB/s
CPU: 2x Intel® Xeon® 6 processors
NVIDIA NVLink Switch System: 4x
NVIDIA NVLink Bandwidth: 28.8 TB/s total
Networking: 8x OSFP ports serving 8x single-port NVIDIA ConnectX®-9 VPI
- up to 800 Gb/s NVIDIA InfiniBand and Ethernet
2x 400G QSFP112 NVIDIA BlueField®-4 DPUs
- up to 800 Gb/s NVIDIA InfiniBand and Ethernet
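
To put the NVL8 networking figures in proportion, the scale-up (NVLink) bandwidth can be compared with the aggregate scale-out bandwidth of the eight ConnectX-9 ports. The sketch below is a back-of-the-envelope calculation, not a vendor-published ratio. It assumes decimal units (1 TB = 1,000 GB), treats the "up to" figures as directly comparable, and ignores the BlueField-4 DPU ports. All constant and function names are hypothetical.

```python
# Figures copied from the NVL8 specification list above.
GPU_COUNT = 8
NVLINK_TOTAL_TB_PER_S = 28.8   # total NVLink bandwidth, TB/s
NIC_PORTS = 8                  # single-port ConnectX-9 adapters
NIC_PORT_GBIT_PER_S = 800      # "up to" rate per port, Gb/s


def gbit_to_tbyte(gbit_per_s):
    """Convert gigabits per second to terabytes per second (decimal units)."""
    return gbit_per_s / 8 / 1000


nvlink_per_gpu = NVLINK_TOTAL_TB_PER_S / GPU_COUNT
scale_out_total = gbit_to_tbyte(NIC_PORTS * NIC_PORT_GBIT_PER_S)
ratio = NVLINK_TOTAL_TB_PER_S / scale_out_total

print(f"NVLink per GPU:       {nvlink_per_gpu:.1f} TB/s")
print(f"Scale-out aggregate:  {scale_out_total:.1f} TB/s")
print(f"Scale-up / scale-out: {ratio:.0f}x")
```

Under these assumptions, the NVLink share works out to 3.6 TB/s per GPU, and the in-node fabric carries roughly 36 times the aggregate bandwidth of the eight network ports, which is why traffic that stays inside the NVLink domain is so much cheaper than traffic that crosses nodes.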

Why GIGABYTE?

Short Time to Market (TTM) for Agile Deployment

Flexible Scalability for Diverse Scenarios

One-Stop Deployment for Zero Hassle

Unified Management for Easy Maintenance

Strong Partnership for All-Round Support

Extensive Global Experience for Maximum Flexibility

GIGAPOD - One-Stop Scalable Solutions

At GIGABYTE, we offer GIGAPOD, a solution that scales from a single rack to POD scale and on to containerized data centers, with power, cabling, cooling, and all supporting infrastructure carefully designed and evaluated. The result is a simple, one-step path to adopting AI data centers.
