Skip to content
EMARQUE.AI
    HomeProductsNVIDIA GB300 NVL72
Shipping nowNVIDIA DGX

NVIDIA GB300 NVL72

Rack-scale AI infrastructure for the era of reasoning — 36 NVIDIA Grace CPUs and 72 NVIDIA Blackwell Ultra GPUs in a single NVLink-connected, liquid-cooled rack.

Pricing on request — allocation and configuration confirmed at quotation.
NVIDIA GB300 NVL72 — built by EMARQUE in Malaysia
72Blackwell Ultra GPUs
21 TBHBM3e pool
1.4ExaFLOPS FP4 inference
Key features

Configuration overview.

Manufacturer-defined features from the published datasheet.

36 Grace CPUs + 72 Blackwell Ultra GPUs

NVIDIA GB300 NVL72 connects 36 NVIDIA Grace CPUs and 72 NVIDIA Blackwell Ultra GPUs in a rack-scale, liquid-cooled design. All 72 GPUs are interconnected via fifth-generation NVLink and present a coherent ~21 TB HBM3e memory pool.

1.8× inference vs GB200 NVL72

Per NVIDIA's published figures, NVIDIA GB300 NVL72 delivers approximately 1.8× the inference throughput and 1.5× the training performance of the prior-generation NVIDIA GB200 NVL72.

1.4 ExaFLOPS dense FP4 inference

Per NVIDIA's published figures, up to 1.4 ExaFLOPS of dense FP4 inference performance per rack. The architectural primitive behind NVIDIA's AI Factory reference designs.

NVLink Switch fabric — 130 TB/s

Fifth-generation NVLink with NVLink Switch delivers 130 TB/s aggregate intra-rack bandwidth — all 72 GPUs interconnected all-to-all, coherent memory addressing across the rack.

NVIDIA Mission Control software

Cluster-level orchestration platform from NVIDIA — health monitoring, telemetry aggregation, validated recipe deployment, job-scheduler integration. Standard operating model for NVIDIA DGX SuperPOD with GB300.

ConnectX-8 scale-out networking

Multi-rack scale-out via NVIDIA ConnectX-8 SuperNICs at 800 Gb/s, paired with NVIDIA Quantum-X800 InfiniBand or NVIDIA Spectrum-X800 Ethernet rack fabrics.

Two ways to buy

Same platform — choose the supply path.

The NVIDIA HGX baseboard is identical on both paths. The turnkey DGX is fastest to deploy; OEM HGX platforms (Dell, Giga Computing, Supermicro) give wider configuration choice.

NVIDIA

NVIDIA DGX GB300 (turnkey)

NVIDIA-built and NVIDIA-supported packaging of the NVIDIA GB300 NVL72 platform. Ships pre-configured with NVIDIA DGX OS, NVIDIA AI Enterprise, NVIDIA Mission Control, and a multi-year NVIDIA Enterprise Support contract.

  • NVIDIA DGX OS · NVIDIA AI Enterprise · NVIDIA Mission Control
  • NVIDIA Enterprise Support contract included
  • NVIDIA-validated reference recipes for DGX SuperPOD with GB300
  • Reference architecture for sovereign-compute and AI Factory builds
Giga Computing · Supermicro · Dell · ASUS

NVIDIA GB300 NVL72 — OEM platforms

The same NVIDIA GB300 NVL72 rack platform from NVIDIA's OEM partners — customer-operated, no DGX software bundle. Same NVIDIA Grace CPUs, NVIDIA Blackwell Ultra GPUs, and NVLink Switch fabric. OEM warranty and support model.

  • Giga Computing GIGAPOD-class rack (GB300 NVL72)
  • Supermicro NVL72-class rack (NVIDIA GB300 NVL72)
  • Dell PowerEdge XE-class rack (NVIDIA GB300 NVL72)
  • Customer-operated orchestration (Slurm, Kubernetes, OpenStack)

EMARQUE supplies both paths in Malaysia. Final configuration, lead time, and warranty terms are confirmed in writing at quotation.

Architecture

Under the hood.

The four sub-systems that determine real-workload behaviour. We tune each before delivery.

GPU + CPU complex (per rack)
  • 72 × NVIDIA B300 (Blackwell Ultra) SXM modules · NVLink Switch domain
  • 36 × NVIDIA Grace 72-core Arm Neoverse V2 CPUs
  • Up to ~21 TB HBM3e per rack (288 GB × 72)
  • Up to ~1.4 ExaFLOPS dense FP4 per rack (NVIDIA published)
NVLink Switch fabric
  • 5th-gen NVLink with NVLink Switch (intra-rack all-to-all)
  • 130 TB/s aggregate NVLink bandwidth within the rack
  • Coherent memory addressing across all 72 GPUs
  • Tensor parallelism scales linearly — no cross-node penalty inside the rack
Scale-out fabric & networking
  • ConnectX-8 800 Gb/s InfiniBand or Spectrum-X Ethernet for inter-rack scale-out
  • Quantum-X800 InfiniBand switches at the cluster layer
  • GPUDirect RDMA across racks for multi-rack training
  • SuperPOD architecture supports scaling to hundreds of racks
Site, cooling, power
  • Direct-to-chip liquid cooling — CDU integration required
  • ≈ 120 kW per rack typical load
  • Inlet liquid temperature 30-35 °C; warm-water cooling capable
  • Raised-floor or in-row CDU; structured cabling for fabric scale-out
Software & operations
  • NVIDIA Mission Control (cluster orchestration, telemetry, recipes)
  • DGX OS · NVIDIA AI Enterprise · NeMo · NIM microservices
  • TensorRT-LLM tuned for B300 inference and reasoning
  • Reference recipes for trillion-parameter training included
Next step

Get a GB300 NVL72 configuration and lead time from your Malaysian NVIDIA systems specialist.

Supported workloads

Reference workload categories.

Workload categories documented in the manufacturer's reference materials. Sizing is confirmed with your technical team during scoping.

AI reasoning

Long-context inference and agentic workloads

NVIDIA positions GB300 NVL72 for the AI reasoning era. The coherent ~21 TB HBM3e memory pool supports extended-context inference and multi-step agentic workloads at production scale.

Foundation model training

Trillion-parameter training

Single-rack tensor parallelism across 72 NVLink-connected GPUs supports trillion-parameter foundation model training without cross-rack communication penalties for in-rack workloads.

DGX SuperPOD

Reference architecture for AI Factory deployments

NVIDIA GB300 NVL72 is the building block for NVIDIA DGX SuperPOD with GB300 systems. Multi-rack scale-out via NVIDIA Quantum-X800 InfiniBand or NVIDIA Spectrum-X800 Ethernet.

Sovereign AI

National-compute infrastructure

NVIDIA references the NVL72 rack as the standard planning unit for sovereign-compute and national-lab AI infrastructure projects. EMARQUE handles in-country project delivery, commissioning, and Tier-1 support for multi-rack deployments.

Full spec sheet

Every line documented at quotation.

As supplied by NVIDIA. EMARQUE handles in-country delivery, commissioning, and Tier-1 support handoff.

GPUs per rack
72 × NVIDIA B300 (Blackwell Ultra) with NVLink 5
CPUs per rack
36 × NVIDIA Grace 72-core Arm
GPU memory pool
Up to ~21 TB HBM3e per rack (288 GB × 72)
FP4 compute
Up to ~1.4 ExaFLOPS dense (NVIDIA published)
NVLink fabric
5th-gen NVLink Switch — 130 TB/s aggregate, all-to-all
Networking (scale-out)
ConnectX-8 800 Gb/s InfiniBand / Spectrum-X Ethernet
Cooling
Direct-to-chip liquid (CDU integration required)
Form factor
Single rack — building block for DGX SuperPOD
Software
NVIDIA Mission Control · DGX OS · AI Enterprise · NeMo · NIM
Site prep
≈ 120 kW per rack, 30–35 °C inlet liquid, raised floor or in-row CDU
FAQ

Common questions about GB300 NVL72

What is NVIDIA GB300 NVL72?

Per NVIDIA: a rack-scale AI compute platform that connects 36 NVIDIA Grace CPUs and 72 NVIDIA Blackwell Ultra GPUs in a single liquid-cooled rack via fifth-generation NVLink and NVLink Switch. All 72 GPUs operate as a coherent NVLink Switch domain with approximately 21 TB of HBM3e memory exposed as one pool. It is the building block for NVIDIA DGX SuperPOD with GB300 systems.

What are the published performance figures?

Per NVIDIA: approximately 1.4 ExaFLOPS of dense FP4 inference performance per rack, approximately 1.8× the inference throughput and 1.5× the training performance of the prior-generation NVIDIA GB200 NVL72, and 130 TB/s aggregate NVLink bandwidth within the rack.

How does NVIDIA GB300 NVL72 differ from NVIDIA DGX B300?

NVIDIA DGX B300 is a single-node 8-GPU air-cooled system in a 10U form factor. NVIDIA GB300 NVL72 is a rack-scale 72-GPU liquid-cooled platform. The NVLink Switch fabric in NVIDIA GB300 NVL72 interconnects all 72 GPUs as a single coherent accelerator from a tensor-parallelism perspective; NVIDIA DGX B300 nodes connect to each other via NVIDIA Quantum-X800 InfiniBand at the cluster layer.

What is the site readiness requirement?

Liquid-cooled rack — CDU (Coolant Distribution Unit) integration required. Approximately 120 kW per rack typical load. Three-phase high-amperage power circuits with appropriate PDU planning. Inlet liquid temperature 30–35 °C per NVIDIA's reference design (warm-water cooling capable). Raised-floor or in-row cooling configuration. EMARQUE conducts a full site readiness assessment as part of project scoping.

What's the difference between NVIDIA DGX GB300 and NVIDIA GB300 NVL72 from OEM partners?

Both use the same NVIDIA GB300 NVL72 rack platform — same NVIDIA Grace CPUs, same NVIDIA Blackwell Ultra GPUs, same NVLink Switch fabric, same liquid-cooled rack. NVIDIA DGX GB300 is the NVIDIA-branded packaging delivered with NVIDIA DGX OS, NVIDIA Mission Control orchestration, and a multi-year NVIDIA Enterprise Support contract. OEM versions (Giga Computing, Supermicro, Dell, ASUS) ship the same hardware platform with the OEM's warranty and support model; NVIDIA AI Enterprise software is available separately. Both options are presented on this page.

What is the typical lead time?

Lead time follows NVIDIA's allocation schedule for NVIDIA GB300 NVL72 platforms. Multi-rack NVIDIA DGX SuperPOD projects are scoped individually — typical timeline spans allocation reservation, freight, in-country commissioning, fabric build-out, and acceptance testing. EMARQUE confirms projected delivery window at order acknowledgement following NVIDIA allocation.

What is the upgrade path beyond NVIDIA GB300 NVL72?

NVIDIA's announced next-generation rack-scale platform is NVIDIA Vera Rubin NVL72, targeted for the late-2026 / 2027 window — same NVL72 rack architecture with the NVIDIA Vera CPU and NVIDIA Rubin GPU generation, plus HBM4 memory. EMARQUE accepts allocation conversations for NVIDIA Vera Rubin NVL72 for customers planning multi-year refresh cycles.

Request configuration & quotation.

Manufacturer specifications, factory lead times, and warranty terms apply. EMARQUE responds within one business day with a formal quotation and projected delivery window.

Contact Us

Get in Touch with Us

Tell us about your workload. We reply within one business day with a quote sized to fit.

  1. 01

    Key Account Manager

    +6012 627 2280
  2. 02

    Request for Quotation

    business@emarque.co