GB300 NVL72

NVIDIA GB300 NVL72

AvailableNVIDIA DGX

NVIDIA GB300 NVL72

Name: EMARQUE NVIDIA GB300 NVL72
Brand: EMARQUE AI
SKU: nvidia-gb300-nvl72-ai-factory
Availability: InStock

Rack-scale AI infrastructure for the era of reasoning — 36 NVIDIA Grace CPUs and 72 NVIDIA Blackwell Ultra GPUs in a single NVLink-connected, liquid-cooled rack.

Pricing on request — allocation and configuration confirmed at quotation.

NVIDIA GB300 NVL72 — built by EMARQUE in Malaysia

72Blackwell Ultra GPUs

21 TBHBM3e pool

1.4ExaFLOPS FP4 inference

Key features

Configuration overview.

Manufacturer-defined features from the published datasheet.

36 Grace CPUs + 72 Blackwell Ultra GPUs

NVIDIA GB300 NVL72 connects 36 NVIDIA Grace CPUs and 72 NVIDIA Blackwell Ultra GPUs in a rack-scale, liquid-cooled design. All 72 GPUs are interconnected via fifth-generation NVLink and present a coherent ~21 TB HBM3e memory pool.

1.8× inference vs GB200 NVL72

Per NVIDIA's published figures, NVIDIA GB300 NVL72 delivers approximately 1.8× the inference throughput and 1.5× the training performance of the prior-generation NVIDIA GB200 NVL72.

1.4 ExaFLOPS dense FP4 inference

Per NVIDIA's published figures, up to 1.4 ExaFLOPS of dense FP4 inference performance per rack. The architectural primitive behind NVIDIA's AI Factory reference designs.

NVLink Switch fabric — 130 TB/s

Fifth-generation NVLink with NVLink Switch delivers 130 TB/s aggregate intra-rack bandwidth — all 72 GPUs interconnected all-to-all, coherent memory addressing across the rack.

NVIDIA Mission Control software

Cluster-level orchestration platform from NVIDIA — health monitoring, telemetry aggregation, validated recipe deployment, job-scheduler integration. Standard operating model for NVIDIA DGX SuperPOD with GB300.

ConnectX-8 scale-out networking

Multi-rack scale-out via NVIDIA ConnectX-8 SuperNICs at 800 Gb/s, paired with NVIDIA Quantum-X800 InfiniBand or NVIDIA Spectrum-X800 Ethernet rack fabrics.

Two ways to buy

Same platform — choose the supply path.

The NVIDIA HGX baseboard is identical on both paths. The turnkey DGX is fastest to deploy; OEM HGX platforms (Dell, Giga Computing, Supermicro) give wider configuration choice.

NVIDIA

NVIDIA DGX GB300 (turnkey)

NVIDIA-built and NVIDIA-supported packaging of the NVIDIA GB300 NVL72 platform. Ships pre-configured with NVIDIA DGX OS, NVIDIA AI Enterprise, NVIDIA Mission Control, and a multi-year NVIDIA Enterprise Support contract.

NVIDIA DGX OS · NVIDIA AI Enterprise · NVIDIA Mission Control
NVIDIA Enterprise Support contract included
NVIDIA-validated reference recipes for DGX SuperPOD with GB300
Reference architecture for sovereign-compute and AI Factory builds

Giga Computing · Supermicro · Dell · ASUS

NVIDIA GB300 NVL72 — OEM platforms

The same NVIDIA GB300 NVL72 rack platform from NVIDIA's OEM partners — customer-operated, no DGX software bundle. Same NVIDIA Grace CPUs, NVIDIA Blackwell Ultra GPUs, and NVLink Switch fabric. OEM warranty and support model.

Giga Computing GIGAPOD-class rack (GB300 NVL72)
Supermicro NVL72-class rack (NVIDIA GB300 NVL72)
Dell PowerEdge XE-class rack (NVIDIA GB300 NVL72)
Customer-operated orchestration (Slurm, Kubernetes, OpenStack)

EMARQUE supplies both paths in Malaysia. Final configuration and warranty terms are confirmed in writing at quotation.

Architecture

Under the hood.

The four sub-systems that determine real-workload behaviour. We tune each before delivery.

GPU + CPU complex (per rack)

72 × NVIDIA B300 (Blackwell Ultra) SXM modules · NVLink Switch domain
36 × NVIDIA Grace 72-core Arm Neoverse V2 CPUs
Up to ~21 TB HBM3e per rack (288 GB × 72)
Up to ~1.4 ExaFLOPS dense FP4 per rack (NVIDIA published)

NVLink Switch fabric

5th-gen NVLink with NVLink Switch (intra-rack all-to-all)
130 TB/s aggregate NVLink bandwidth within the rack
Coherent memory addressing across all 72 GPUs
Tensor parallelism scales linearly — no cross-node penalty inside the rack

Scale-out fabric & networking

ConnectX-8 800 Gb/s InfiniBand or Spectrum-X Ethernet for inter-rack scale-out
Quantum-X800 InfiniBand switches at the cluster layer
GPUDirect RDMA across racks for multi-rack training
SuperPOD architecture supports scaling to hundreds of racks

Site, cooling, power

Direct-to-chip liquid cooling — CDU integration required
≈ 120 kW per rack typical load
Inlet liquid temperature 30-35 °C; warm-water cooling capable
Raised-floor or in-row CDU; structured cabling for fabric scale-out

Software & operations

NVIDIA Mission Control (cluster orchestration, telemetry, recipes)
DGX OS · NVIDIA AI Enterprise · NeMo · NIM microservices
TensorRT-LLM tuned for B300 inference and reasoning
Reference recipes for trillion-parameter training included

Next step

Get a GB300 NVL72 configuration from your Malaysian NVIDIA systems specialist.

Talk to a GB300 NVL72 specialist Compare systems Contact us

Supported workloads

Reference workload categories.

Workload categories documented in the manufacturer's reference materials. Sizing is confirmed with your technical team during scoping.

AI reasoning

Long-context inference and agentic workloads

NVIDIA positions GB300 NVL72 for the AI reasoning era. The coherent ~21 TB HBM3e memory pool supports extended-context inference and multi-step agentic workloads at production scale.

Foundation model training

Trillion-parameter training

Single-rack tensor parallelism across 72 NVLink-connected GPUs supports trillion-parameter foundation model training without cross-rack communication penalties for in-rack workloads.

DGX SuperPOD

Reference architecture for AI Factory deployments

NVIDIA GB300 NVL72 is the building block for NVIDIA DGX SuperPOD with GB300 systems. Multi-rack scale-out via NVIDIA Quantum-X800 InfiniBand or NVIDIA Spectrum-X800 Ethernet.

Sovereign AI

National-compute infrastructure

NVIDIA references the NVL72 rack as the standard planning unit for sovereign-compute and national-lab AI infrastructure projects. EMARQUE handles in-country project delivery, commissioning, and Tier-1 support for multi-rack deployments.

Full spec sheet

Every line documented at quotation.

As supplied by NVIDIA. EMARQUE handles in-country delivery, commissioning, and Tier-1 support handoff.

GPUs per rack: 72 × NVIDIA B300 (Blackwell Ultra) with NVLink 5
CPUs per rack: 36 × NVIDIA Grace 72-core Arm
GPU memory pool: Up to ~21 TB HBM3e per rack (288 GB × 72)
FP4 compute: Up to ~1.4 ExaFLOPS dense (NVIDIA published)
NVLink fabric: 5th-gen NVLink Switch — 130 TB/s aggregate, all-to-all
Networking (scale-out): ConnectX-8 800 Gb/s InfiniBand / Spectrum-X Ethernet
Cooling: Direct-to-chip liquid (CDU integration required)
Form factor: Single rack — building block for DGX SuperPOD
Software: NVIDIA Mission Control · DGX OS · AI Enterprise · NeMo · NIM
Site prep: ≈ 120 kW per rack, 30–35 °C inlet liquid, raised floor or in-row CDU

FAQ

Common questions about GB300 NVL72

What is NVIDIA GB300 NVL72?

Per NVIDIA: a rack-scale AI compute platform that connects 36 NVIDIA Grace CPUs and 72 NVIDIA Blackwell Ultra GPUs in a single liquid-cooled rack via fifth-generation NVLink and NVLink Switch. All 72 GPUs operate as a coherent NVLink Switch domain with approximately 21 TB of HBM3e memory exposed as one pool. It is the building block for NVIDIA DGX SuperPOD with GB300 systems.

What are the published performance figures?

Per NVIDIA: approximately 1.4 ExaFLOPS of dense FP4 inference performance per rack, approximately 1.8× the inference throughput and 1.5× the training performance of the prior-generation NVIDIA GB200 NVL72, and 130 TB/s aggregate NVLink bandwidth within the rack.

How does NVIDIA GB300 NVL72 differ from NVIDIA DGX B300?

NVIDIA DGX B300 is a single-node 8-GPU air-cooled system in a 10U form factor. NVIDIA GB300 NVL72 is a rack-scale 72-GPU liquid-cooled platform. The NVLink Switch fabric in NVIDIA GB300 NVL72 interconnects all 72 GPUs as a single coherent accelerator from a tensor-parallelism perspective; NVIDIA DGX B300 nodes connect to each other via NVIDIA Quantum-X800 InfiniBand at the cluster layer.

What is the site readiness requirement?

Liquid-cooled rack — CDU (Coolant Distribution Unit) integration required. Approximately 120 kW per rack typical load. Three-phase high-amperage power circuits with appropriate PDU planning. Inlet liquid temperature 30–35 °C per NVIDIA's reference design (warm-water cooling capable). Raised-floor or in-row cooling configuration. EMARQUE conducts a full site readiness assessment as part of project scoping.

What's the difference between NVIDIA DGX GB300 and NVIDIA GB300 NVL72 from OEM partners?

Both use the same NVIDIA GB300 NVL72 rack platform — same NVIDIA Grace CPUs, same NVIDIA Blackwell Ultra GPUs, same NVLink Switch fabric, same liquid-cooled rack. NVIDIA DGX GB300 is the NVIDIA-branded packaging delivered with NVIDIA DGX OS, NVIDIA Mission Control orchestration, and a multi-year NVIDIA Enterprise Support contract. OEM versions (Giga Computing, Supermicro, Dell, ASUS) ship the same hardware platform with the OEM's warranty and support model; NVIDIA AI Enterprise software is available separately. Both options are presented on this page.

What is the upgrade path beyond NVIDIA GB300 NVL72?

NVIDIA's next-generation rack-scale platform is NVIDIA Vera Rubin NVL72, now in production following the Computex 2026 announcement. Available to order through EMARQUE — configuration, allocation, and timelines are confirmed with your EMARQUE Key Account Manager on enquiry.

Also in this class

NVIDIA Vera Rubin NVL72

The next-generation NVIDIA rack-scale AI factory platform. Available to order through EMARQUE.

Request configuration & quotation.

Manufacturer specifications and warranty terms apply. EMARQUE issues a formal quotation through your Key Account Manager.

02Talk to EMARQUE

Tell us about your workload.

Model size, concurrency, latency budget, deployment site. EMARQUE returns a quote in MYR within one Malaysian business day, sized to the workload — not the salesperson’s quota.

Request a quote Contact sales

01
Key Account Manager
+6012 627 2280
02
Request for Quotation
business@emarque.co

NVIDIA GB300 NVL72

Configuration overview.

36 Grace CPUs + 72 Blackwell Ultra GPUs

1.8× inference vs GB200 NVL72

1.4 ExaFLOPS dense FP4 inference

NVLink Switch fabric — 130 TB/s

NVIDIA Mission Control software

ConnectX-8 scale-out networking

Same platform — choose the supply path.

NVIDIA DGX GB300 (turnkey)

NVIDIA GB300 NVL72 — OEM platforms

Under the hood.

Reference workload categories.

Long-context inference and agentic workloads

Trillion-parameter training

Reference architecture for AI Factory deployments

National-compute infrastructure

Every line documented at quotation.

Common questions about GB300 NVL72

Request configuration & quotation.

Tell us about your workload.

Key Account Manager

Request for Quotation