NVIDIA DGX Spark

Updated Jun 24, 2026

NVIDIA DGX Spark is a compact, deskside AI development system, marketed by NVIDIA as a "personal AI supercomputer," that puts a Grace Blackwell superchip on a desktop and delivers up to 1 petaFLOP (1,000 TOPS) of FP4 AI performance with 128 GB of unified memory. First announced as "Project DIGITS" at CES 2025 and officially launched on 15 October 2025, DGX Spark targets developers, researchers, and students who need local fine-tuning and inference without relying exclusively on the cloud.^[1]^[2]

DGX Spark is powered by the GB10 Grace Blackwell Superchip and is specified by NVIDIA for up to one FP4 petaFLOP of AI performance (theoretical, using sparsity). It features 128 GB of coherent unified memory, a 4 TB self-encrypting NVMe M.2 SSD, and a ConnectX-7 NIC that enables low-latency peer-to-peer links between two units. NVIDIA positions Spark for prototyping, local fine-tuning (up to roughly 70 billion parameters), and inference with models up to roughly 200 billion parameters; two linked units can serve a model as large as Llama 3.1 405B in FP4.^[3]^[4]

The product was initially priced at $3,999 USD. In February 2026, NVIDIA raised the Founders Edition MSRP to $4,699 following LPDDR5x memory supply constraints that increased production costs.^[5]

Background and announcement timeline

When was DGX Spark announced? (CES 2025: Project DIGITS)

NVIDIA CEO Jensen Huang unveiled what would become DGX Spark during his keynote at the Consumer Electronics Show (CES) in Las Vegas on January 6, 2025. Announced under the codename "Project DIGITS," the device was described as a personal AI supercomputer small enough to sit on a desk but capable of running models of up to roughly 405 billion parameters when two units were linked together. At the time, NVIDIA set an expected availability in May 2025 and a suggested price of around $3,000 USD.

The CES unveiling drew wide coverage because it brought NVIDIA Blackwell architecture into a form factor previously associated with consumer mini-PCs. Huang's demonstration showed the device running large language models that previously required data center hardware.^[6]

Why was Project DIGITS renamed DGX Spark? (GTC 2025)

At NVIDIA's GPU Technology Conference (GTC) on March 18, 2025, the company formally renamed Project DIGITS to NVIDIA DGX Spark. The rename placed the product inside NVIDIA's established DGX product family alongside DGX Cloud (NVIDIA's cloud AI platform) and a newly announced sibling product, DGX Station. The GTC announcement also confirmed the final production price of $3,999 USD, a roughly $1,000 increase from the initial concept price, and revised the availability window to fall 2025.^[7]

In the official announcement, Huang framed the two new machines as a new computing category: "AI has transformed every layer of the computing stack. It stands to reason a new class of computers would emerge, designed for AI-native developers and to run AI-native applications," adding that "with these new DGX personal AI computers, AI can span from cloud services to desktop and edge applications."^[27] At the GTC keynote, Huang held up a unit roughly the size of a Wi-Fi router and likened it to the original DGX-1 server shrunk down "with Pym particles."

The March 18 press release named ASUS, BOXX, Dell Technologies, HP Inc., Lambda, Lenovo, and Supermicro as the initial system manufacturers expected to ship GB10-based DGX Spark and GB300-based DGX Station systems.^[27]

At the same GTC event, NVIDIA unveiled the DGX Station, a larger deskside workstation based on the GB300 Grace Blackwell Ultra Superchip, targeting users who need to run models with one trillion or more parameters locally. This clearly separated the two products: DGX Spark for entry-level local AI development, DGX Station for demanding research workloads.

October 2025: general availability

NVIDIA confirmed that DGX Spark systems would begin shipping the week of October 13, 2025. Sales opened on October 15, 2025 through NVIDIA's own marketplace and authorized retail partners, with Micro Center serving as a launch-day retail outlet in the United States. The October launch arrived roughly five months later than the May 2025 window NVIDIA had originally indicated at CES. Channel partners and OEM variants from Acer, ASUS, Dell, Gigabyte, HP, Lenovo, and MSI followed in the weeks after the initial launch.^[1]^[8]

To mark the launch, Jensen Huang personally delivered some of the first units to prominent AI figures. Huang visited SpaceX's Starbase facility in Texas to hand-deliver a DGX Spark to Elon Musk, calling back to Huang's delivery of the original DGX-1 to Musk nine years earlier. Huang also delivered a unit to OpenAI CEO Sam Altman at OpenAI's Mission Bay office.^[9]

February 2026: price increase

On February 23, 2026, NVIDIA announced a price increase for the Founders Edition DGX Spark from $3,999 to $4,699, an 18 percent rise. NVIDIA attributed the increase to ongoing LPDDR5x memory supply constraints that raised production costs. The price change was announced on the NVIDIA Developer Forums and took effect immediately.^[10]

Hardware design

What does DGX Spark look like? (Form factor)

The DGX Spark occupies a 150 x 150 x 50.5 mm footprint and weighs approximately 1.2 kg, placing it among the most compact AI development systems ever built for this class of workload. The enclosure uses a machined metal shell with passive and active cooling. Power is supplied through an external 240 W power supply unit connected via USB-C Power Delivery, which means the device itself carries no internal AC power supply and has no proprietary power connector. All connectivity except the 10 GbE and QSFP ports uses standard consumer interfaces.

What is the GB10 Grace Blackwell Superchip?

DGX Spark uses the NVIDIA GB10 Grace Blackwell Superchip, a multi-die system-on-chip co-designed by NVIDIA and MediaTek. Both dies are fabricated on TSMC's 3nm process node, and the complete package contains approximately 208 billion transistors. NVIDIA disclosed the GB10's detailed architecture at the Hot Chips 2025 symposium in August 2025, where engineers noted the chip reached production on TSMC 3nm A0 silicon without silicon revision, attributable to its assembly from pre-validated IP blocks rather than a ground-up design.^[11]

The two dies inside the GB10 are:

The CPU die, designed by MediaTek, contains 20 Arm v9.2 cores arranged in two clusters: 10 Cortex-X925 high-performance cores and 10 Cortex-A725 efficiency cores. Each cluster has a 16 MB L3 cache, giving 32 MB total. The CPU connects to the memory subsystem and handles the operating system, software stack, and non-GPU compute.
The GPU die, designed by NVIDIA, contains 48 Streaming Multiprocessors (SMs) totaling 6,144 CUDA cores and fifth-generation Tensor Cores. It delivers up to 1 PFLOP of sparse FP4 AI compute and 31 TFLOPS of FP32 compute.
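The GPU figures above can be cross-checked with simple arithmetic. The sketch below derives the cores-per-SM ratio and the clock speed implied by the quoted FP32 peak, assuming the usual convention of two FLOPs (one fused multiply-add) per CUDA core per cycle. The implied clock is a back-of-envelope estimate under that assumption, not a published specification.

```python
# Figures quoted in the article for the GB10 GPU die.
SM_COUNT = 48
CUDA_CORES = 6144
FP32_TFLOPS = 31.0


def cores_per_sm(cuda_cores: int, sm_count: int) -> int:
    """CUDA cores per Streaming Multiprocessor."""
    return cuda_cores // sm_count


def implied_clock_ghz(fp32_tflops: float, cuda_cores: int) -> float:
    """Clock implied by peak FP32, assuming 2 FLOPs (one FMA) per core per cycle."""
    return fp32_tflops * 1e12 / (2 * cuda_cores) / 1e9


if __name__ == "__main__":
    print(cores_per_sm(CUDA_CORES, SM_COUNT))                      # 128
    print(round(implied_clock_ghz(FP32_TFLOPS, CUDA_CORES), 2))    # 2.52
```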

The two dies communicate via NVLink-C2C at 600 GB/s bidirectional bandwidth, which gives both CPU and GPU coherent access to the shared memory pool without copying data between separate address spaces. NVIDIA states the NVLink-C2C link provides five times the bandwidth of fifth-generation PCIe.^[12]^[27]

How much memory does DGX Spark have? (Unified memory architecture)

The memory subsystem is the most discussed aspect of DGX Spark's design. The system uses 128 GB of LPDDR5x-9400 memory on a 256-bit interface, delivering approximately 273 GB/s of bandwidth. This pool is shared coherently between the CPU and GPU, which means a 70 billion parameter model loaded in memory is directly addressable by both the CPU and the GPU Tensor Cores without separate transfers.
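Peak DRAM bandwidth follows from bus width and transfer rate, so the quoted figures can be sanity-checked directly. The sketch below treats the transfer rate as an input: on a 256-bit interface, about 273 GB/s corresponds to roughly 8,533 MT/s, while 9,400 MT/s would give about 301 GB/s of raw bandwidth. Which rate a shipping unit actually runs at is not asserted here.

```python
def peak_bandwidth_gbs(bus_width_bits: int, data_rate_mts: float) -> float:
    """Peak DRAM bandwidth in GB/s: (bus width in bytes) x (transfers per second)."""
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * data_rate_mts * 1e6 / 1e9


if __name__ == "__main__":
    # 256-bit interface at two candidate LPDDR5x transfer rates.
    print(round(peak_bandwidth_gbs(256, 8533), 1))  # 273.1
    print(round(peak_bandwidth_gbs(256, 9400), 1))  # 300.8
```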

The 128 GB capacity is enough to load models up to roughly 200 billion parameters at 4-bit quantization (FP4 or MXFP4 format), or up to approximately 70 billion parameters at 8-bit precision (FP8). At 16-bit precision (BF16), a 70-billion-parameter model needs roughly 140 GB for weights alone, which exceeds the pool. This puts models like Llama 3.1 70B and Mistral 7B within reach for local inference and fine-tuning without any cloud dependency, provided the larger models are quantized. The 128 GB is soldered LPDDR5x and is not user-upgradeable.
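The capacity limits above come down to parameter count times bits per parameter. The following sketch estimates weight storage only; it deliberately ignores the KV cache, activations, and memory used by the operating system, so real headroom is smaller than the result suggests. The function names are illustrative.

```python
def weight_footprint_gb(params_billion: float, bits_per_param: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes), weights only."""
    return params_billion * bits_per_param / 8


def fits_in_memory(params_billion: float, bits_per_param: float,
                   capacity_gb: float = 128.0) -> bool:
    """True if the weights alone fit in the given memory pool."""
    return weight_footprint_gb(params_billion, bits_per_param) <= capacity_gb


if __name__ == "__main__":
    for params, bits in [(200, 4), (70, 8), (70, 16)]:
        gb = weight_footprint_gb(params, bits)
        print(f"{params}B at {bits}-bit: {gb:.0f} GB, fits: {fits_in_memory(params, bits)}")
```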

The bandwidth constraint (273 GB/s) is frequently cited in reviews as the main performance ceiling for autoregressive token generation, which is a memory-bandwidth-bound operation. For comparison, Apple's M3 Ultra chip in the Mac Studio provides approximately 819 GB/s of unified memory bandwidth, roughly three times higher. This tradeoff means DGX Spark is faster than Mac Studio on compute-bound tasks (prompt processing, prefill) but slower on memory-bandwidth-bound tasks (token generation decode).^[13]
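Because each generated token requires reading the model weights from memory roughly once, a rough ceiling on single-stream decode speed is memory bandwidth divided by weight size. The sketch below applies that approximation to a hypothetical dense 70-billion-parameter model at 4-bit precision. It ignores KV-cache traffic, mixture-of-experts sparsity, batching, and software overhead, so measured numbers will be lower.

```python
def decode_upper_bound_tps(bandwidth_gbs: float, params_billion: float,
                           bits_per_param: float) -> float:
    """Upper bound on tokens/s for batch-1 decode of a dense model.

    Assumes every weight is read from memory once per generated token.
    """
    weight_gb = params_billion * bits_per_param / 8
    return bandwidth_gbs / weight_gb


if __name__ == "__main__":
    spark = decode_upper_bound_tps(273, 70, 4)   # DGX Spark figure
    other = decode_upper_bound_tps(819, 70, 4)   # Mac Studio figure quoted above
    print(round(spark, 1), round(other, 1), round(other / spark, 1))  # 7.8 23.4 3.0
```

The ratio between the two bounds is simply the ratio of the bandwidths, which is why decode-heavy comparisons track memory bandwidth so closely.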

How do you cluster two DGX Spark units? (Networking and multi-unit scaling)

DGX Spark includes an onboard Mellanox ConnectX-7 Smart NIC with a QSFP port. NVIDIA specifies this NIC at up to 200 Gb/s. The ConnectX-7 enables direct peer-to-peer links between two DGX Spark units using a single cable, without requiring a network switch. In a two-unit linked configuration, the combined 256 GB of unified memory allows running models up to roughly 405 billion parameters. NVIDIA's marquee two-node demonstration is serving Llama 3.1 405B in FP4 across a pair of Sparks using tensor parallelism.^[3]
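A quick way to see why a 405-billion-parameter model needs two units is to compute the per-node weight shard. The sketch assumes weights are split evenly across nodes, which is a simplification of tensor parallelism, and it counts weights only; KV cache and activations reduce the remaining headroom.

```python
def per_node_weight_gb(params_billion: float, bits_per_param: float, nodes: int) -> float:
    """Weight storage per node in GB, assuming an even split across nodes."""
    return params_billion * bits_per_param / 8 / nodes


def fits_per_node(params_billion: float, bits_per_param: float, nodes: int,
                  node_capacity_gb: float = 128.0) -> bool:
    """True if each node's weight shard fits in that node's memory."""
    return per_node_weight_gb(params_billion, bits_per_param, nodes) <= node_capacity_gb


if __name__ == "__main__":
    print(per_node_weight_gb(405, 4, 1), fits_per_node(405, 4, 1))  # 202.5 False
    print(per_node_weight_gb(405, 4, 2), fits_per_node(405, 4, 2))  # 101.25 True
```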

The ConnectX-7 NIC supports both InfiniBand and Ethernet protocols, giving DGX Spark the same networking silicon used in NVIDIA's data center infrastructure. This is a notable departure from consumer workstations, which typically use commodity Ethernet.

A standard 10 GbE port (RJ-45) is also present for conventional network connectivity. Wi-Fi 7 and Bluetooth 5.4 provide wireless options, and four USB-C ports (one dedicated to power delivery) handle peripheral connectivity alongside a single HDMI 2.1a port.

Power and thermals

The GB10 superchip is rated at a 140 W TDP, and the 240 W external power supply leaves headroom for the rest of the system. Real-world AI workloads have been measured at approximately 170 W typical power draw. Fitting a 140 W chip into a roughly 1.2 kg enclosure requires efficient thermal design; the unit uses heat pipes and a small active fan, which reviewers have described as quieter than a typical gaming desktop under load.^[14]

Specifications

Category	Detail
Architecture	Grace Blackwell (GB10 Superchip)
Process node	TSMC 3nm, ~208 billion transistors
CPU	20-core Arm v9.2 (10x Cortex-X925 + 10x Cortex-A725)
CPU cache	32 MB L3 (16 MB per cluster)
GPU	NVIDIA Blackwell GPU, 48 SMs, 6,144 CUDA cores, 5th-gen Tensor Cores
GPU cache	24 MB L2
AI performance	1 PFLOP sparse FP4 (theoretical); 31 TFLOPS FP32
Unified memory	128 GB LPDDR5x-9400, coherent, ~273 GB/s bandwidth
Storage	4 TB self-encrypting NVMe M.2 (Founders Edition)
High-speed networking	ConnectX-7 Smart NIC (QSFP), up to 200 Gb/s
Standard networking	1x RJ-45 10 GbE
Wireless	Wi-Fi 7, Bluetooth 5.4
Display output	1x HDMI 2.1a
USB	4x USB-C (one for power delivery)
Power supply	240 W external PSU via USB-C PD; TDP 140 W
Dimensions	150 x 150 x 50.5 mm
Weight	~1.2 kg
Operating system	NVIDIA DGX OS (Ubuntu 24.04-based, Linux 6.11 kernel)
Interconnect	NVLink-C2C at 600 GB/s (CPU-GPU)

Software stack

What operating system does DGX Spark run? (DGX OS)

DGX Spark ships with NVIDIA DGX OS preinstalled. This is a customized distribution based on Ubuntu 24.04 LTS running a Linux 6.11 kernel, preconfigured with the full NVIDIA AI software stack. Canonical has partnered with NVIDIA on the DGX OS base, with Ubuntu providing the package management and security update infrastructure.

The operating system runs on the ARM64 architecture (Arm v9.2). This means x86-compiled binaries will not run natively, and some third-party software packages that only ship x86 builds require recompilation or emulation. NVIDIA and the open-source community have progressively expanded ARM64 support for major AI frameworks since launch, and early reviews noted that NVIDIA's official containers eased setup substantially for developers who encountered missing ARM64 wheels for certain framework versions.
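Setup scripts that need to choose between ARM64 and x86-64 packages can branch on the machine architecture. The helper below is a minimal sketch using Python's standard `platform` module; the function name and the set of accepted architecture strings are illustrative choices, not part of any NVIDIA tooling.

```python
import platform
from typing import Optional

# Names commonly reported for 64-bit Arm: Linux uses "aarch64", macOS uses "arm64".
ARM64_NAMES = {"aarch64", "arm64"}


def is_arm64(machine: Optional[str] = None) -> bool:
    """Return True if the given (or current) machine string denotes 64-bit Arm."""
    name = machine if machine is not None else platform.machine()
    return name.lower() in ARM64_NAMES


if __name__ == "__main__":
    if is_arm64():
        print("ARM64 host: prefer aarch64 wheels or official ARM64 containers")
    else:
        print(f"Non-ARM64 host ({platform.machine()}): x86-64 builds apply")
```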

CUDA and GPU acceleration

DGX Spark supports CUDA for GPU-accelerated workloads, though the GB10's GPU is based on a consumer-tier Blackwell variant rather than the datacenter-grade Hopper or Blackwell chips used in DGX H100 and DGX B200 systems. As of early 2026, the GB10 GPU reports a different SM architecture identifier (SM12x) compared to datacenter Blackwell (SM100), which has caused compatibility issues with some tools in the vLLM and SGLang ecosystems that maintain separate code paths for Hopper, datacenter Blackwell, and consumer Blackwell. NVIDIA and the open-source community have been addressing these through software updates.^[15]
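Tools that keep separate code paths typically branch on the CUDA compute capability. The sketch below shows one way such a check might look, assuming the mapping described above (major version 9 for Hopper, 10 for datacenter Blackwell, 12 for consumer-tier Blackwell such as GB10). The `gpu_family` helper is hypothetical; `torch.cuda.get_device_capability()` is the PyTorch call that returns the `(major, minor)` pair, and it is queried only if PyTorch and a GPU are present.

```python
from typing import Optional


def gpu_family(major: int, minor: int) -> str:
    """Map a CUDA compute capability to a coarse architecture family.

    The mapping is an assumption for illustration: 9.x Hopper,
    10.x datacenter Blackwell, 12.x consumer-tier Blackwell.
    """
    if major == 9:
        return "Hopper"
    if major == 10:
        return "datacenter Blackwell"
    if major == 12:
        return "consumer-tier Blackwell"
    return "other"


def detect_family() -> Optional[str]:
    """Query the current GPU via PyTorch, or return None if unavailable."""
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return gpu_family(*torch.cuda.get_device_capability())


if __name__ == "__main__":
    print(gpu_family(12, 1))   # consumer-tier Blackwell
    print(detect_family())     # depends on the host
```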

Included libraries and frameworks

DGX Spark ships preconfigured with:

CUDA toolkit and cuDNN libraries
TensorRT for optimized inference
NVIDIA Container Runtime (Docker-compatible GPU containers)
NVIDIA NIM (NVIDIA Inference Microservices) for model serving
NVIDIA NeMo for model fine-tuning and training
NVIDIA RAPIDS libraries for GPU-accelerated data science (cuDF, cuML, cuGraph)
PyTorch, TensorFlow, and JAX with native ARM64 support
Access to the NVIDIA NGC catalog for pre-built containers and pre-trained models
NVIDIA Blueprints: standardized AI application reference patterns
OpenShell: NVIDIA's framework for building and running autonomous agents locally

NVIDIA's Isaac Sim / Isaac Lab robotics simulation environment, Metropolis for computer vision, and Holoscan for healthcare AI also run on DGX Spark, making it suitable as a development platform for embedded AI applications that will eventually deploy to NVIDIA Jetson devices at the edge.

Cloud migration path

A design goal of DGX Spark is to serve as the first step in a pipeline that ends in data center deployment. Models prototyped and initially fine-tuned on Spark can migrate to DGX Cloud or NVIDIA-accelerated data center infrastructure through the same CUDA-based software stack, minimizing porting work. NVIDIA markets this as "develop local, deploy at scale."^[16]

Pricing and availability

How much does DGX Spark cost? (Founders Edition pricing)

The DGX Spark Founders Edition launched on October 15, 2025 at $3,999 USD MSRP. On February 23, 2026, NVIDIA raised the price to $4,699 USD, citing LPDDR5x memory supply constraints. Approximate regional pricing at launch and after the February 2026 increase:

Region	Launch price	Post-Feb 2026
United States	$3,999 USD	$4,699 USD
United Kingdom	~£3,700 GBP	~£4,400 GBP
Germany	~€3,689 EUR	~€4,400 EUR
Japan	~¥899,980 JPY	~¥1,050,000 JPY

Retail and distribution

DGX Spark is available through NVIDIA's own marketplace (marketplace.nvidia.com) and authorized partners. In the United States, Micro Center was a launch-day retail partner. International distribution is handled through regional NVIDIA partners.

OEM and partner systems

Several major PC manufacturers announced GB10-based systems alongside or shortly after the DGX Spark Founders Edition launch. These products use the same GB10 Grace Blackwell Superchip and largely the same specifications, but differ in storage configurations, chassis design, bundled software, and price.

Manufacturer	Model	Storage	Price (approx.)	Notes
NVIDIA	DGX Spark Founders Edition	4 TB NVMe	$4,699 (current)	Reference design, includes full NVIDIA software stack
ASUS	Ascent GX10	1 TB or 2 TB NVMe	from $3,099	Stackable chassis; ships with DGX OS
Dell Technologies	Dell Pro Max with GB10	2 TB NVMe	similar to NVIDIA	Enterprise support options
HP Inc.	ZGX Nano AI Station	2 TB NVMe	similar to NVIDIA	HP enterprise warranty and deployment tools
Lenovo	ThinkStation PGX	2 TB NVMe	similar to NVIDIA	ThinkStation brand; enterprise focus
MSI	EdgeXpert GB10	1 TB NVMe	from $2,999	Budget entry point in the GB10 ecosystem
Acer	Veriton GN100	2 TB NVMe	similar to NVIDIA	Acer enterprise and education channels
Gigabyte	AI Top Atom	2 TB NVMe	similar to NVIDIA	Gigabyte enterprise integration

All of these systems share the same 128 GB LPDDR5x unified memory, 1 PFLOP FP4 AI compute, ConnectX-7 200 Gb/s networking, Wi-Fi 7, and DGX OS. The main differentiation points are SSD capacity, chassis design, and vendor support agreements.

The ASUS Ascent GX10 has received particular attention for its stackable chassis design, which allows multiple units to be physically stacked for cleaner multi-node deployments. ServeTheHome gave the Ascent GX10 a positive review, noting that the chassis engineering offers practical advantages for labs deploying several units.^[17]

Comparison with competing systems

DGX Spark occupies a niche between consumer discrete-GPU workstations (which offer higher bandwidth but less total memory) and full data center nodes (which offer far greater performance but require infrastructure investment). Its most frequently cited competitors are Apple's Mac Studio and AMD's Ryzen AI Max-based systems.

How does DGX Spark compare to the Apple Mac Studio?

Apple's Mac Studio, based on Apple Silicon, is the most common comparison target. Both systems use unified memory architectures that share memory between CPU and GPU, but they differ substantially in memory bandwidth, software ecosystem, and design philosophy.

Specification	NVIDIA DGX Spark	Apple Mac Studio (M3 Ultra)
Chip	NVIDIA GB10 Grace Blackwell	Apple M3 Ultra
CPU cores	20 Arm (10 perf + 10 eff)	up to 32 Arm (24 perf + 8 eff)
GPU compute	1 PFLOP sparse FP4	~175 TFLOPS FP16 (est.)
Unified memory	128 GB LPDDR5x	up to 512 GB LPDDR5
Memory bandwidth	~273 GB/s	~819 GB/s
High-speed NIC	ConnectX-7 200 Gb/s	none
AI software	CUDA, NVIDIA NIM, NeMo	MLX, Core ML, ONNX
OS	DGX OS (Ubuntu 24.04)	macOS
Price (approx.)	$4,699	from $3,999 (M3 Ultra, 96 GB)

The key tradeoffs are: DGX Spark has roughly 3x more theoretical AI compute due to Tensor Core acceleration and FP4 support, while Mac Studio has roughly 3x more memory bandwidth, which benefits autoregressive token generation. In benchmarks using Ollama with Llama 3.3 70B, the Mac Studio with M3 Ultra produced faster token generation (memory-bandwidth-bound) while DGX Spark produced faster prompt processing (compute-bound). For workloads that rely heavily on CUDA-native libraries (PyTorch training, RAPIDS, NIM microservices), DGX Spark has a structural software advantage; for macOS-native workflows and Apple MLX-based inference, Mac Studio is the natural choice.^[18]^[13]

A combined deployment test by the EXO Labs team demonstrated that linking a DGX Spark and a Mac Studio together over a local network (using EXO 1.0's distributed inference software) produced roughly 4x the inference speed of either unit alone, illustrating a complementary rather than purely competitive relationship between the two platforms.^[19]

How does DGX Spark compare to AMD Ryzen AI Max?

AMD's Ryzen AI Max+ 395 ("Strix Halo") is the silicon behind several competing mini-PC and compact workstation products including the Framework Desktop and the AMD Ryzen AI Halo reference platform announced at CES 2026.

Specification	NVIDIA DGX Spark	AMD Ryzen AI Max+ 395 system
Chip	NVIDIA GB10 Grace Blackwell	AMD Ryzen AI Max+ 395
CPU cores	20 Arm (10 perf + 10 eff)	16 Zen 5 (x86-64)
GPU	NVIDIA Blackwell (6,144 CUDA cores)	AMD RDNA 3.5 (40 CUs)
AI compute	1 PFLOP sparse FP4	~126 TOPS (NPU + GPU combined)
Unified memory	128 GB LPDDR5x	up to 128 GB LPDDR5x
Memory bandwidth	~273 GB/s	~256 GB/s
AI software	CUDA (ROCm not supported)	ROCm (improving), ONNX
OS	DGX OS (Ubuntu 24.04, ARM64)	Windows or Linux (x86-64)
Price (approx.)	$4,699	$2,999-3,999 (varies by OEM)

Tom's Hardware's review of DGX Spark concluded that it outperformed the AMD Ryzen AI Max+ 395 in AI inference benchmarks, particularly on prompt prefill throughput, due to NVIDIA's Tensor Core architecture and mature CUDA software stack. AMD's ROCm software has improved substantially but still trails CUDA in maturity for many research and production workflows. AMD-based systems have the advantage of running standard x86-64 software without recompilation, while DGX Spark requires ARM64 binaries, which can create friction for teams with existing x86-only toolchains.^[20]

How does DGX Spark differ from the DGX Station?

DGX Station, announced at GTC 2025 alongside DGX Spark, uses the GB300 Grace Blackwell Ultra Desktop Superchip and targets a substantially different user base.

Specification	DGX Spark	DGX Station
Chip	GB10 Grace Blackwell	GB300 Grace Blackwell Ultra
CPU	20-core Arm	72-core Arm Neoverse V2
Memory type	LPDDR5x (shared by CPU and GPU)	288 GB HBM3e (GPU) + 496 GB LPDDR5x (CPU)
Total memory	128 GB	~784 GB
AI performance	1 PFLOP FP4	20 PFLOP FP4
Networking NIC	ConnectX-7 (200 Gb/s)	ConnectX-8 (800 Gb/s)
Max single-unit model size	~200B params	~1T+ params
Starting price	$4,699	~$80,000+

DGX Station occupies the space between DGX Spark and a full rack-mounted DGX system. It can run models exceeding one trillion parameters on a single desktop unit and supports cluster configurations via its ConnectX-8 SuperNIC. DGX Station systems became available from ASUS, Dell, Gigabyte, MSI, Supermicro, and HP in 2026, with starting prices in the $80,000-$125,000 range depending on configuration and vendor support.^[21]

What is DGX Spark used for?

Prototyping and development

NVIDIA positions DGX Spark primarily as a prototyping and development platform. The combination of 128 GB unified memory, a full CUDA software stack, and integrated DGX OS allows developers to iterate on models locally before committing workloads to cloud infrastructure. Common workflows include loading pre-trained models from Hugging Face or the NVIDIA NGC catalog, running inference tests, and writing training or fine-tuning scripts that will later run on larger data center hardware.

Fine-tuning

Fine-tuning models up to 70 billion parameters is supported on a single DGX Spark unit. Parameter-efficient methods like LoRA (Low-Rank Adaptation) work well within the memory budget. NVIDIA's NeMo framework, included with DGX OS, provides validated fine-tuning pipelines for Llama, Mistral, and other open-weight models. Distributed fine-tuning using Fully Sharded Data Parallel (FSDP) across two linked DGX Spark units extends the feasible parameter count higher.^[22]
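LoRA fits within the memory budget because it trains two small low-rank matrices per adapted weight instead of the full matrix. The sketch below counts trainable parameters for a single square projection; the hidden size of 8,192 and rank of 16 are hypothetical values chosen for illustration, not settings taken from NeMo or any specific model.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Parameters in a dense d_out x d_in weight matrix."""
    return d_in * d_out


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters LoRA adds: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


if __name__ == "__main__":
    d, r = 8192, 16  # hypothetical hidden size and LoRA rank
    trainable = lora_params(d, d, r)
    total = full_params(d, d)
    print(trainable, total, f"{trainable / total:.2%}")  # 262144 67108864 0.39%
```

Gradients and optimizer state are kept only for the trainable parameters, which is why the frozen base model can stay in a quantized format while the adapters are trained.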

Inference and evaluation

For inference, DGX Spark can run models up to roughly 200 billion parameters using FP4 quantization in a single unit. LMSYS's in-depth review documented that Spark "shines" for smaller models (7B-13B range) with excellent batching throughput, and can handle 70B and 120B models for experimentation even if throughput is more limited. LMSYS found that with speculative decoding enabled through EAGLE 3 in SGLang, end-to-end inference throughput improved by up to 2x compared to standard autoregressive decoding.^[23]
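Speculative decoding helps on bandwidth-limited hardware because one pass of the large model can confirm several drafted tokens at once. The sketch below uses a simplified chain-draft model in which each drafted token is accepted independently with a fixed probability. It is not EAGLE 3's actual algorithm, it ignores the cost of running the draft model, and the acceptance rate and draft length shown are hypothetical.

```python
def expected_tokens_per_step(accept_rate: float, draft_len: int) -> float:
    """Expected tokens produced per target-model pass.

    Assumes each of `draft_len` drafted tokens is accepted independently
    with probability `accept_rate`, and the target model always contributes
    one token itself: (1 - a^(k+1)) / (1 - a).
    """
    if not 0.0 <= accept_rate <= 1.0:
        raise ValueError("accept_rate must be between 0 and 1")
    if accept_rate == 1.0:
        return float(draft_len + 1)
    return (1.0 - accept_rate ** (draft_len + 1)) / (1.0 - accept_rate)


if __name__ == "__main__":
    # Hypothetical: 70% acceptance, 4 drafted tokens per step.
    print(round(expected_tokens_per_step(0.7, 4), 2))  # 2.77
```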

Agentic AI development

NVIDIA's OpenShell framework, included with DGX Spark, supports building and testing autonomous AI agent pipelines locally. The system's 128 GB unified memory allows running multiple models simultaneously (for example, a planner model and an executor model), which is common in multi-agent architectures. NVIDIA's technical blog has documented workflows for building RAG (Retrieval-Augmented Generation) systems and agentic pipelines entirely on a single DGX Spark unit.^[16]

Robotics and physical AI

NVIDIA's Isaac Sim and Isaac Lab robotics simulation frameworks run on DGX Spark, making it a local development station for physical AI applications. Developers can train reinforcement learning policies in GPU-accelerated simulation on Spark, then deploy to NVIDIA Jetson-based embedded systems at the edge. At CES 2026, a DGX Spark powered a Reachy Mini robot in an interactive demo with Pollen Robotics and Hugging Face, demonstrating the platform's applicability to consumer robotics development.

Data science

NVIDIA RAPIDS libraries (cuDF, cuML, cuGraph) run on DGX Spark's Blackwell GPU, accelerating data processing pipelines that would otherwise run on CPU. These tools allow GPU-accelerated alternatives to pandas, scikit-learn, and NetworkX, and can operate entirely within the unified memory space without GPU-to-CPU data transfers.

Edge AI development

DGX Spark fits into NVIDIA's end-to-end edge AI workflow: prototype and train on Spark, optimize with TensorRT and TAO Toolkit, simulate in Omniverse, then deploy to Jetson devices at the edge. The NVIDIA Metropolis framework for computer vision and the Holoscan SDK for medical device AI both support DGX Spark as a development target.

Early evaluators

NVIDIA provided DGX Spark units to a range of organizations prior to general availability. Publicly disclosed early evaluators included:

Anaconda
Cadence
ComfyUI
Docker
Google
Hugging Face
JetBrains
LM Studio
Meta
Microsoft
Ollama
Roboflow (computer vision platform)
NYU Global AI Frontier Lab

Kyunghyun Cho, professor of computer and data science at the NYU Global AI Frontier Lab, stated that "DGX Spark allows us to access peta-scale computing on our desktop" and emphasized its value for rapid prototyping of AI algorithms. Roboflow published an early hands-on evaluation focused on computer vision workloads, noting strong performance for training and deploying object detection models.^[1]^[24]

Reception

Technical reviews

Early third-party reviews characterized DGX Spark as an excellent developer platform whose strengths are compactness, unified memory capacity, CUDA software maturity, and integrated setup, while consistently noting that raw token generation throughput trails systems with higher memory bandwidth.

LMSYS's in-depth review, published October 13, 2025, found that Spark "shines for smaller models" at batch sizes above one, and that the ConnectX-7 networking enables meaningful scale-out. The review documented 1,723 tokens per second prefill throughput on the gpt-oss 120B model in MXFP4 format, and approximately 38 tokens per second decode throughput.^[23]
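The gap between prefill and decode rates determines how a request's latency splits between reading the prompt and generating the answer. The sketch below plugs the two throughput figures quoted above into a simple additive model; the prompt and output lengths are hypothetical, and the model ignores batching, scheduling, and warm-up effects.

```python
def end_to_end_seconds(prompt_tokens: int, output_tokens: int,
                       prefill_tps: float, decode_tps: float) -> float:
    """Approximate request latency: prefill time plus decode time."""
    return prompt_tokens / prefill_tps + output_tokens / decode_tps


def decode_share(prompt_tokens: int, output_tokens: int,
                 prefill_tps: float, decode_tps: float) -> float:
    """Fraction of total latency spent in the decode phase."""
    total = end_to_end_seconds(prompt_tokens, output_tokens, prefill_tps, decode_tps)
    return (output_tokens / decode_tps) / total


if __name__ == "__main__":
    # Hypothetical request: 2,048-token prompt, 256-token answer.
    print(round(end_to_end_seconds(2048, 256, 1723, 38), 1))
    print(round(decode_share(2048, 256, 1723, 38), 2))
```

Even with a prompt eight times longer than the answer, most of the time in this model goes to decode, which is consistent with reviewers singling out memory bandwidth as the ceiling.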

Tom's Hardware concluded that DGX Spark "beats out AMD's Ryzen AI Max+ 395" in AI inference benchmarks, citing NVIDIA's more mature software stack and Tensor Core architecture as decisive advantages. The review noted that the $4,000 price was steep relative to AMD alternatives but justified by software ecosystem depth for CUDA-centric developers.^[20]

ServeTheHome gave the system a broadly positive assessment as "a tiny 128GB AI mini PC made for scale-out clustering," noting the ConnectX-7 NIC as an unusual and valuable feature for a form factor normally associated with single-node consumer workstations.^[25]

StorageReview described it as "the AI appliance bringing datacenter capabilities to desktops" and measured consistent performance across a range of model sizes, with memory bandwidth proving the main throughput ceiling.

IntuitionLabs summarized the product as best suited for "developers, institutions, and curious local AI enthusiasts who want a stable, dependable platform to build and explore with."

Community and industry response

Sam Altman commented on receiving his unit: "Thanks Jensen for the hand delivery of DGX Spark. Amazing to see so much compute (1 petaflop!) in such a tiny form factor." NVIDIA's social media posts documenting the Jensen-to-Musk delivery called back to Huang's delivery of the original DGX-1 server to Musk nine years earlier, framing DGX Spark as a historically resonant product. Musk, then involved in public disputes with Altman over OpenAI's direction, received his unit separately at SpaceX's Starbase facility.^[9]

Broader coverage in Wired, Ars Technica, and The Verge emphasized the "personal AI supercomputer" framing and traced the product's lineage from Project DIGITS through the GTC rename to general availability.

Limitations

Reviewers and community discussions have identified several recurring limitations of the DGX Spark platform:

Memory bandwidth ceiling. With approximately 273 GB/s of LPDDR5x bandwidth, autoregressive token generation (the decode phase of inference) is slower than on systems with higher-bandwidth memory. Apple's Mac Studio with M3 Ultra offers roughly 819 GB/s, and discrete GPU systems with GDDR7 memory can exceed 1 TB/s. For workloads that are decode-bound rather than compute-bound, DGX Spark underperforms what its theoretical AI FLOP count suggests.^[13]

ARM64 software compatibility. DGX OS runs on ARM64, which means x86-only binaries do not run natively. Most major AI frameworks have ARM64 builds, but some tools, enterprise software, and Python packages lag behind in ARM64 support. NVIDIA's official containers reduce this friction for common workflows, but developers with specialized toolchains may encounter missing packages.

Fixed memory configuration. The 128 GB LPDDR5x is soldered and non-upgradeable. Users who need more memory must either link two units via ConnectX-7 or move to a DGX Station, at significantly higher cost.

Price-to-throughput ratio. Community forums and some reviewers have noted that discrete GPU workstations with cards like the NVIDIA RTX 5090 can offer higher raw inference throughput for similar or lower cost, though they lack the 128 GB memory capacity and integrated software stack. The LMSYS benchmark showed an RTX Pro 6000 Blackwell achieving approximately 4x higher prefill and decode throughput on the same models, illustrating the gap between DGX Spark and higher-end dedicated GPU setups.^[23]

"Supercomputer" branding skepticism. Some technical observers have questioned NVIDIA's "personal AI supercomputer" marketing. The Spark's 1 PFLOP FP4 performance, while exceptional for a desktop device, sits between a consumer RTX 5070 and RTX 5070 Ti in raw GPU compute, and critics have noted that NVIDIA has applied similar supercomputer terminology to prior Jetson embedded boards. The theoretical FP4 figure relies on sparsity acceleration and does not directly compare to dense FP32 or FP16 floating-point benchmarks used in traditional HPC rankings.

Ecosystem maturity at launch. Reviewers including Simon Willison noted that while NVIDIA's official containers and DGX OS eased setup considerably, the broader ARM64 ecosystem for Python wheels was still maturing around the October 2025 launch date, with certain framework versions requiring workarounds.

Relationship with Apache Spark

The "Spark" name is a branding choice and does not indicate a technical relationship with Apache Spark, the open-source distributed data processing framework. NVIDIA separately develops the RAPIDS Accelerator for Apache Spark, a software plugin that uses NVIDIA GPUs to accelerate Apache Spark SQL and DataFrame operations with no code changes required. DGX Spark supports the RAPIDS Accelerator as part of the RAPIDS library suite, allowing it to function as a compact development platform for GPU-accelerated big data workflows.^[26]
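Enabling the RAPIDS Accelerator is a matter of Spark configuration rather than code changes. The sketch below assembles the relevant settings as `spark-submit` arguments: `spark.plugins` set to `com.nvidia.spark.SQLPlugin` and `spark.rapids.sql.enabled` are the plugin's standard switches, while the jar path is a hypothetical placeholder that depends on the installed release. Further tuning options are omitted.

```python
from typing import Dict, List


def rapids_spark_conf(jar_path: str) -> Dict[str, str]:
    """Minimal Spark settings that load the RAPIDS Accelerator plugin."""
    return {
        "spark.jars": jar_path,                         # plugin jar location (placeholder)
        "spark.plugins": "com.nvidia.spark.SQLPlugin",  # loads the accelerator
        "spark.rapids.sql.enabled": "true",             # turns on GPU SQL execution
    }


def to_submit_args(conf: Dict[str, str]) -> List[str]:
    """Render a config mapping as repeated `--conf key=value` arguments."""
    args: List[str] = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args


if __name__ == "__main__":
    # Hypothetical jar path; substitute the file shipped with your release.
    conf = rapids_spark_conf("/opt/rapids/rapids-4-spark.jar")
    print(" ".join(["spark-submit"] + to_submit_args(conf) + ["job.py"]))
```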

References

1. NVIDIA Newsroom. "NVIDIA DGX Spark Arrives for World's AI Developers." October 13, 2025. https://nvidianews.nvidia.com/news/nvidia-dgx-spark-arrives-for-worlds-ai-developers
2. NVIDIA Investor Relations. "NVIDIA Announces DGX Spark and DGX Station Personal AI Computers." March 2025. https://investor.nvidia.com/news/press-release-details/2025/NVIDIA-Announces-DGX-Spark-and-DGX-Station-Personal-AI-Computers/default.aspx
3. NVIDIA Product Page. "Personal AI Supercomputer Powered by Blackwell." https://www.nvidia.com/en-us/products/workstations/dgx-spark/
4. NVIDIA DGX Spark Hardware Overview. https://docs.nvidia.com/dgx/dgx-spark/hardware.html
5. NVIDIA Developer Forums. "2/23/2026 Price Change Announcement." February 23, 2026. https://forums.developer.nvidia.com/t/2-23-2026-price-change-announcement/361713
6. HotHardware. "NVIDIA Project DIGITS Renamed DGX Spark, DGX Station Unveiled." 2025. https://hothardware.com/news/nvidia-project-digits-renamed-dgx-spark-and-dgx-station
7. NVIDIA Newsroom. "NVIDIA Announces DGX Spark and DGX Station Personal AI Computers." https://nvidianews.nvidia.com/news/nvidia-announces-dgx-spark-and-dgx-station-personal-ai-computers
8. NVIDIA Newsroom. "NVIDIA Launches AI-First DGX Personal Computing Systems With Global Computer Makers." https://nvidianews.nvidia.com/news/nvidia-launches-ai-first-dgx-personal-computing-systems-with-global-computer-makers
9. Tom's Hardware. "Jensen Huang personally delivers DGX Spark Mini PCs to Elon Musk and Sam Altman." October 2025. https://www.tomshardware.com/tech-industry/artificial-intelligence/jensen-huang-personally-delivers-dgx-spark-mini-pcs-to-elon-musk-and-sam-altman-separately
10. Tom's Hardware. "Nvidia DGX Spark gets $700 price hike as memory shortages bite." February 2026. https://www.tomshardware.com/desktops/mini-pcs/nvidia-dgx-spark-gets-18-percent-price-increase-as-memory-shortages-bite-founders-edition-now-usd4-699-up-from-usd3-999
11. ServeTheHome. "NVIDIA Outlines GB10 SoC Architecture at Hot Chips 2025." August 2025. https://www.servethehome.com/nvidia-outlines-gb10-soc-architecture-at-hot-chips-2025/
12. MediaTek Press Room. "Newly-Launched NVIDIA DGX Spark Features GB10 Superchip Co-Designed by MediaTek." https://www.mediatek.com/press-room/newly-launched-nvidia-dgx-spark-features-gb10-superchip-co-designed-by-mediatek
13. Skorppio Blog. "NVIDIA DGX Spark vs Mac Studio: Efficiency Benchmark." https://skorppio.com/blog/dgx-spark-vs-mac-studio-efficiency-benchmark
14. Robert McDermott. "NVIDIA's DGX Spark: Mini AI Supercomputer overview and review." Medium. https://robert-mcdermott.medium.com/the-nvidia-dgx-spark-0e2ca7833c2c
15. Backend.AI Blog. "Inside NVIDIA DGX Spark: Is DGX Spark Actually Blackwell?" February 2026. https://www.backend.ai/blog/2026-02-is-dgx-spark-actually-a-blackwell
16. NVIDIA Technical Blog. "Scaling Autonomous AI Agents and Workloads with NVIDIA DGX Spark." https://developer.nvidia.com/blog/scaling-autonomous-ai-agents-and-workloads-with-nvidia-dgx-spark/
17. ServeTheHome. "ASUS Ascent GX10 Review: A New NVIDIA GB10 Solution." https://www.servethehome.com/asus-ascent-gx10-review-a-new-nvidia-gb10-solution/
18. Compute Market. "DGX Spark vs Mac Studio M4 Max -- 128GB AI Desktop 2026." https://www.compute-market.com/blog/nvidia-dgx-spark-vs-mac-studio-m4-max-local-ai-2026
19. EXO Labs. "Combining NVIDIA DGX Spark + Apple Mac Studio for 4x Faster LLM Inference with EXO 1.0." https://blog.exolabs.net/nvidia-dgx-spark/
20. Tom's Hardware. "Nvidia DGX Spark review: the GB10 Superchip powers a fast and fun AI toolbox that beats out AMD's Ryzen AI Max+ 395." https://www.tomshardware.com/pc-components/gpus/nvidia-dgx-spark-review
21. ServeTheHome. "NVIDIA DGX Station Systems Available At Last GB300 and GB200 Workstations For Your Desktop." https://www.servethehome.com/nvidia-dgx-station-systems-available-at-last-gb300-gb200-workstations-for-your-desktop/
22. Benjamin Marie. "DGX Spark: Use It for Fine-Tuning." Kaitchup Substack. https://kaitchup.substack.com/p/dgx-spark-use-it-for-fine-tuning
23. LMSYS. "NVIDIA DGX Spark In-Depth Review: A New Standard for Local AI Inference." October 13, 2025. https://www.lmsys.org/blog/2025-10-13-nvidia-dgx-spark/
24. NVIDIA Blog. "Elon Musk Gets Just-Launched NVIDIA DGX Spark: Petaflop AI Supercomputer Lands at SpaceX." https://blogs.nvidia.com/blog/live-dgx-spark-delivery/
25. ServeTheHome. "The NVIDIA DGX Spark is a Tiny 128GB AI Mini PC Made for Scale-Out Clustering." https://www.servethehome.com/the-nvidia-dgx-spark-is-a-tiny-128gb-ai-mini-pc-made-for-scale-out-clustering-arm/
26. NVIDIA RAPIDS Accelerator for Apache Spark. https://nvidia.github.io/spark-rapids/
27. NVIDIA Newsroom. "NVIDIA Announces DGX Spark and DGX Station Personal AI Computers." March 18, 2025. https://nvidianews.nvidia.com/news/nvidia-announces-dgx-spark-and-dgx-station-personal-ai-computers


