Nvidia RTX Spark May Light a Fire for Windows on Arm


Buckle up: Nvidia is “reinventing the personal computer,” according to CEO Jensen Huang. Microsoft and Nvidia have been cozying up to one another in preparation for Nvidia’s highly anticipated launch of the RTX Spark. It’s a new Arm-based system-on-chip (or “SoC”) platform that brings Nvidia’s Blackwell architecture to thin and light Windows laptops and mini desktops. The goal is to provide high-power processing performance for running personal agents, creative work and gaming, but without the space, power needs and cooling requirements usually imposed by discrete graphics.

The RTX Spark joins Qualcomm’s Snapdragon X processors running Windows on Arm, with similar claims of “all-day battery life.” Snapdragons achieve that, but one thing to remember about Nvidia’s chip is that it’s intended for far heavier workloads than Snapdragon processors. 

Those aren’t meant to “render ultralarge 90GB-plus 3D scenes, edit 12K 4:2:2 video, generate 4K AI videos, run 120B-parameter LLMs with up to 1 million tokens context using agents locally, and play AAA games at 1440p and over 100 frames per second,” all of which can tank your battery life. It remains to be seen if the Spark can live up to that under normal usage. 

This is the first of what Nvidia says it plans to be a line of chips across a variety of price segments. These first models are slated to ship this fall:

  • Microsoft Surface Laptop Ultra
  • Dell XPS 16 
  • Asus ProArt P14 and P16
  • HP Omnibook X 14, Omnibook Ultra 16
  • Lenovo Yoga Pro 9n
  • MSI Prestige N16 Flip AI

The 15-inch Surface Laptop Ultra is particularly notable because Microsoft hasn’t updated its screens in far too long, and the Surfaces (both desktop and laptop) never incorporated discrete GPUs their prices seemed to demand. The Ultra has a higher-resolution (262ppi) 15-inch mini LED touchscreen that supports HDR (with peak brightness of 2,000 nits), unlike the older, meh model. Microsoft hasn’t updated its Surface Laptop Studio in three years, and this is the chip and screen it needs if Microsoft plans to bring it back from the dead.

There will also be mini desktops. It seems to have been a resurgence of these — at least an increase in the number of manufacturers offering them — thanks to developers. The RTX Spark models will compete with AMD Ryzen AI Halo-based models for example. They’re expected from companies such as Acer, Asus, Dell, HP and Lenovo, among others.

Nvidia’s planning to have a desktop, laptop and workstation for each generation of chips.

AI Atlas art badge tag

Given current price volatility, we won’t know how much they’ll cost until they’re closer to shipping. AI’s ravenous demand for components — and the resources needed to make them — has created severe shortages of memory, processors and SSD storage, driving computer and phone prices higher and even affecting available configuration options

Spark it up

The chip is an offshoot of the DGX Spark (GB10), which powers Linux-based compact desktops specifically targeted at developers and now Windows-based DGX Station. The Spark was designed in conjunction with MediaTek, and has similar specs to the DGX: 6,144 CUDA cores, a 20-core Grace CPU, ability to access up to 128GB RAM and more. Nvidia says it supports up to 120B parameter agents with a 1M context. (For reference, AMD says its top Ryzen AI Max Pro 400 series chip can can handle up to 300B parameter models). 

surface-laptop-ultra-image-2

The RTX Spark under the hood of the Surface Laptop Ultra.

Microsoft

Its GPU specs are more or less comparable to an RTX 5070, but the unified memory architecture means it has access to a lot more RAM than 12GB. Nvidia says that system configurations can go as low as 16GB, though, which means it could potentially bottleneck when a dedicated 5070, with 12GB VRAM, might not. The company gave 100fps 1440p as its reference for gaming performance (though it wasn’t clear whether that was with or without DLSS 4.5 enabled).

Nvidia claims the chip’s overall AI performance is one PFLOPS (a billion floating point operations per second), but that’s based on FP4 calculations. On one hand, FP4 is the current darling of the data formats because it’s faster than the other floating point choices and more accurate than integer, but there are some tradeoffs. (Procyon has a great visual example of what speed versus accuracy tradeoffs can mean for image generation.) But among the consumer SoCs, this is the first to support it in hardware.

The real competition for these is the M5 Pro and M5 Max MacBook Pros, which target the same users, but the M5 line doesn’t support FP4 and FP8 data types, which may turn out to be a hindrance.

The part itself can run at anywhere from “single digits” to 80W, which means you’ll really need to pay attention to whether a laptop runs at full power or if the manufacturer is throttling it. In other words, it sounds like performance, especially on battery, may vary a lot. Typically, mobile processor power envelopes are smaller bands; for instance, the Intel Core X9 388H specifies 15W-85W.

It has an NPU, which Nvidia doesn’t seem to want to talk much about, but the systems with the Spark are considered Copilot Plus-qualifying, so it must be able to hit at least 40 TOPS

An illustration of the RTX Spark

This illustration of the RTX Spark in situ has the fuzzy, glowy look of a generated image.  

Nvidia

RTX Spark might seem powerful, but Nvidia is maintaining its strict division between pro and consumer markets. For instance, it doesn’t plan to run a certification program for applications or support ECC memory.

In addition to being one of Nvidia’s launch partners with its Surface Laptop Ultra, Microsoft has been working to make the necessary updates to Windows in order to take advantage of the new chip. 

Like Qualcomm’s Snapdragon X series processors, Windows doesn’t natively support the Arm instruction set the way it does Intel and AMD’s x86-architecture chips, which were foundational to the PC. Instead, Arm-based systems use an emulation layer called Prism to translate instructions. Emulation is partly why the early systems based on Qualcomm chips experienced performance and compatibility problems.

Windows modifications

Many of the updates to Windows that are necessary to support the hardware are under the hood, but one will be right in your face: Microsoft’s putting Spark-run agents on the Taskbar. 

A lot of the changes we’ve seen in Windows recently have been laying the groundwork for this. Prism was written specifically for Qualcomm’s SoCs, since it was the only Arm-based silicon the operating system needed to run on. Supporting the RTX Spark meant updating Prism and other core parts of Windows to efficiently distribute workloads across the CPU cores, balance cooling and performance, address and intelligently manage a larger amount of the unified memory available to the GPU (for AI processing with TensorRT) and more.

Qualcomm doesn’t have nearly as much invested in Windows gaming performance as Nvidia does, for obvious reasons. For example, Nvidia has been working with Microsoft to improve compatibility with anti-cheat software (such as Epic’s Easy Anti-Cheat), which has prevented some games from running on the devices, as well as support for the Xbox app, which is key to Microsoft’s game-on-everything strategy. 

Adobe is also reengineering parts of its imaging engines to tap into the Spark directly, notably with several new pipelines to accelerate more GPU- and AI-intensive features such as rendering complex timelines in Premiere Pro and improving natural brushes in Photoshop. While CUDA and TensorRT already operate on Nvidia’s discrete mobile GPUs, taking optimal advantage of them on this different architecture requires some rejiggering. The applications will also be able to interact with Windows agents.

Plus, Nvidia is porting OpenShell — its security protocols for running agents — to Windows, via new controls that Microsoft will reveal at its Build conference in the first week of June. OpenShell, in theory, lets you define guardrails for your agents, route queries to approved local models based on your privacy policies and let it “disguise” personal information when querying cloud-based models. 

Nvidia is trying to expand everyday agenting beyond developers, with the notion that “broad adoption has been limited by the inability to run agents securely and privately on users’ primary PCs.” I suspect the trust issues are more complicated than that. The company says that OpenShell will be incorporated into the current agenting faves, OpenClaw and Hermes.





Source link

Leave a Reply

Subscribe to Our Newsletter

Get our latest articles delivered straight to your inbox. No spam, we promise.

Recent Reviews


Researchers in South Korea developed a wearable system that uses seven smart rings to read finger and hand motions to translate American Sign Language and International Sign Language into text. The purpose is to make communicating easier between those who sign and nonsigners without needing a separate human interpreter. 

AI Atlas

According to the study, published Friday in the journal Science Advances, the system reliably recognized 100 ASL and ISL words during testing. It also performed well with users the system had not seen before, and it didn’t require recalibration for each person. Because the system detects words in sequence, it can produce sentence-level translations without extra training on grammar. 

ASL and ISL are the everyday languages of more than 72 million deaf and hard-of-hearing people. However, most hearing people do not know any words in these languages or have a very basic understanding. That gap makes certain tasks, like ordering at a restaurant or asking for help, much more difficult. 

A graphic shows two illustrated people talking in sign language, ASL and ISL. The graphic also shows the different components of the ring as well as pictures of hands modeling the rings.

A concept of how the rings work in the real world. 

American Association for the Advancement of Science (AAAS)

Existing sign language translator prototypes often rely on bulky gloves that can distract from or block natural hand movement or feel uncomfortable for the wearer, which limits real word adaption. Camera-based technologies can work well in controlled environments but are often limited to those places where a camera can be set up with a clear line of sight, the researchers wrote. 

To solve these problems, the researchers designed sensing rings for each finger that can capture precise motion and finger position while letting the hands move naturally. The rings can detect both signs that involve movement, like the words for “dance,” “fly” and “sun,” and signs that are held still, like “I” and “you.”

“These advances suggest that [the device could enable] barrier-free public translation systems for unseen users and unrestricted daily assistive interfaces,” the authors wrote in the study. 

The authors are affiliated with Yonsei University, Hankuk University of Foreign Studies and the Korea Institute of Science and Technology, among others. While the technology is still experimental, the authors wrote that the technology has the potential to ease communication difficulties. The underlying idea could also help improve controls for other systems, like virtual or augmented reality.

“Beyond sign language translation, the ring-type, wireless, and modular architecture of (wirelessly connected, ring-type sign language translators) may also be extended to other gesture-driven applications such as virtual or augmented reality control, touchless device interfaces, or rehabilitation monitoring systems where fine-grained hand movement tracking is essential,” they wrote.





Source link