Nvidia RTX 6000 Ada Generation

from DEVELOP3D June / July 2023

One can’t deny the breathtaking performance of Nvidia’s new flagship workstation GPU and its potential to completely transform viz workflows, but some will find the price offputting

Price £7,150 + VAT www.nvidia.com | www.pny.com

New GPU architectures are delivered from the top down. And the new Nvidia RTX 6000 Ada Generation is very much at the top of the stack. With a price tag of £7,150 (Ex VAT), this 48 GB professional GPU is reserved for those that take design visualisation, simulation or AI extremely seriously.

The first thing to get out of the way is the name of this new workstation-class GPU. It is built on Nvidia’s Ada Lovelace architecture, named after the English Mathematician credited with being the first computer programmer.

Recently, Nvidia has used a single letter prefix for its pro GPUs — P for Pascal, T for Turing, A for Ampere, and so on.

As ‘A’ was already taken, Nvidia initially referred to the Ada Lovelace GPU as the Nvidia RTX 6000, but soon after tagged ‘Ada Generation’ on to the end, presumably to avoid confusion with 2018’s Turing-based Nvidia Quadro RTX 6000.

We don’t know why Nvidia didn’t use an ‘L’ prefix as it has done for its ‘Ada Lovelace’ datacentre GPUs (the Nvidia L4 and L40), but this is where we are at now. The Nvidia RTX 6000 Ada Generation might be a bit of a mouthful, but at least it has a clear identity.

The workstation card

The Nvidia RTX 6000 Ada Generation is a dual slot, full height, PCIe 4.0 workstation GPU with four DisplayPort 1.4a connectors. It looks virtually identical to its predecessor, the Nvidia RTX A6000, and has a minimal angular black and gold design. The radial type fan blows hot air directly out of the rear of the workstation via the grille on the bracket.

The total board power is 300W, which is less than its triple slot ‘Ada Lovelace’ consumer counterparts — the Nvidia GeForce RTX 4090 and GeForce RTX 4080. Power is delivered via a single PCIe CEM5 16-pin connector.

The card has a phenomenal amount of processors: 142 third-gen RT Cores for ray tracing (delivering 211 TFLOPs), 568 fourth-gen Tensor Cores for AI compute (delivering 1,457 TFLOPs), and 18,176 next-gen CUDA cores for general purpose operations, boasting 91 TFLOPs of single precision performance. This is a significant jump up from the Nvidia RTX A6000 it replaces, which delivers RT Core performance of 76 TFLOPs, Tensor performance of 310 TFLOPs, and singleprecision performance of 39 TFLOPS.

The Nvidia RTX 6000 Ada comes with 48 GB of GDDR6 memory, which should be plenty for most viz-centric workflows. However, unlike its predecessor, the RTX 6000 Ada Generation does not support NVLink, so two GPUs cannot be bridged together with an adapter to create a 96 GB memory pool. While this shouldn’t matter to most users, it could be a barrier for those working with exceptionally high poly count models / high resolution textures.

It could also limit its use in engineering simulation, including Computational Fluid Dynamics (CFD).

48 GB is still double that on offer in the top-end consumer Nvidia GeForce RTX 40-Series. The RTX 6000 Ada also differentiates itself from Nvidia’s consumer cards in several other ways, including pro drivers, pro software certifications, support for Error Correction code (ECC) memory, and some niche features for pro viz, including stereo and Frame Lock for viz clusters.

It also supports Nvidia virtual GPU (vGPU) software, which allows a workstation to be repurposed into multiple GPU-accelerated virtual workstation instances. With workstation vendors, especially Lenovo and HP, actively making their new ‘Sapphire Rapids’ desktop workstations rack friendly, this feature is likely to be more important than ever before.

Finally, it boasts 3x the video encoding performance of the Nvidia RTX A6000, for streaming multiple simultaneous XR sessions using Nvidia CloudXR.

Optimised for visualisation

The Nvidia RTX 6000 Ada offers all the generational improvements you’d expect from a new GPU architecture, but there are also significant changes in the way the GPU carries out calculations to increase performance in viz-centric workflows.

Deep Learning Super Sampling 3 (DLSS) and Shader Execution Reordering (SER) are the two technologies that stand out.

Nvidia DLSS has been around for several years and with the new Nvidia RTX 6000 Ada, it is now on its third generation. It uses the GPU’s AI Tensor cores to boost performance.

With Nvidia’s previous generation

‘Ampere’ GPUs, DLSS 2 took a lowresolution current frame and the highresolution previous frame to predict, on a pixel-by-pixel basis, what a highresolution current frame would look like.

With DLSS 3, the Tensor cores generate entirely new frames rather than just pixels. It processes the new frame, and the prior frame, to discover how the scene is changing, then generates entirely new frames without having to process the graphics pipeline.

So far, we’ve only seen DLSS 3 implemented in Nvidia Omniverse, but we expect others to follow. Enscape and Autodesk VRED, for example, both support DLSS 2.

As a background to Shader Execution Reordering (SER), Nvidia explains that GPUs are most efficient when processing similar work at the same time. However, with ray tracing, rays bounce in different directions and intersect surfaces of various types. This can lead to different threads processing different shaders or accessing memory that is hard to coalesce or cache. With SER, the Nvidia RTX 6000 Ada Generation can dynamically reorganise its workload, so similar shaders are processed together. According to Nvidia, SER can give a two to three times speed up for ray tracing and a frame rate increase of up to 25%. But these are probably extremes. For offline path tracing in Unreal Engine 5.1, for example, Nvidia quotes speed improvements of 40% or more.

Engineering simulation

In product development, while visualisation is the primary use case for the Nvidia RTX 6000 Ada, the GPU can also be used for engineering simulation.

At launch, Nvidia highlighted the use of Ansys software, including Ansys Discovery and Ansys Fluent for Computational Fluid Dynamics (CFD).

Compared to the RTX A6000, the RTX 6000 Ada not only has more cores and faster cores, but significantly larger L2 cache (96 MB vs 6 MB) and increased memory bandwidth (960 GB/s vs 768 GB/s). According to Ansys, this results in ‘impressive performance gains’ for the broad Ansys application portfolio.

However, the Nvidia RTX 6000 Ada is not suited to all simulation tools. Some simulation solvers require double precision and with relatively poor FP64 performance (which at 1,423 GFLOPSs is 1/64 of its FP32 performance), the RTX 6000 Ada is unlikely to perform that well in those that do. In fact, for double precision solvers, even 2016’s Nvidia Quadro GP100 boasts better FP64 performance of 5.17 TFLOPs.

Testing the Nvidia RTX 6000 Ada DEVELOP3D put the Nvidia RTX 6000 Ada Generation through a series of real-world application benchmarks, both GPU rendering and real time visualisation.

The GPU is simply overkill for current generation CAD and BIM software, so we didn’t do any testing in that regard. However, it’s important to note that it will still be certified for the likes of Solidworks and Solid Edge, which is useful if you plan to use those types of core applications alongside viz focused tools like KeyShot, V-Ray, and Unreal Engine.

The full spec of our AMD Ryzen Threadripper Pro test machine, the Scan 3XS GWP-ME A1128T can be seen below. You can read a full review on page WS22

Scan 3XS GWP-ME A1128T

• AMD Ryzen Threadripper Pro 5995WX processor (2.7 GHz, 4.5 GHz boost) (64-cores, 128 threads)

• 256 GB (8 x 32GB) Samsung ECC

Registered DDR4 3200MHz memory

• 2TB Samsung 990 Pro PCIe 4.0 SSD

• 8TB RAID array (4 x 2TB Samsung 990 Pro NVMe PCIe 4.0 SSDs)

• Asus Pro WS WRX80E Sage SE WiFi motherboard

• Corsair H150i Elite Cappelix RGB with Noctua fans

• 1,500W Corsair HXi, 80PLUS Platinum

• Microsoft Windows 11 Pro

For comparison, we also tested the Nvidia RTX A6000 GPU inside the same machine. Nvidia’s 528.24 pro driver was used for both GPUs.

Real time 3D

Real time 3D visualisation with applications that use OpenGL, DirectX or Vulkan graphics APIs continue to be a very important part of design visualisation. Key applications include Unreal Engine, Autodesk VRED, and the architerctural focused TwinMotion and Enscape. We recorded frame rates (Frame Per Second) within Enscape, Unreal Engine, Nvidia Omniverse, and Autodesk VRED Professional, a pro viz application commonly used in automotive design.

We only tested at 4K (3,840 x 2,160) resolution. At FHD (1,920 x 1,080) , it’s a given that the Nvidia RTX 6000 Ada Generation can deliver more than enough performance.

In Enscape, we tested with five different models. Overall, our experience was incredibly smooth, even with the large RTX-enabled Enscape 3.0 building complex which uses 11 GB of GPU memory.

However, our preferred benchmark model, an urban scene from Enscape 3.1, was a little unresponsive, sometimes taking a few seconds to react to mouse or keyboard movements. We don’t know why this was, but it could be because it includes custom assets and textures and there is a conflict of some sort. Once it got going, however, we recorded a phenomenal 124.95 FPS, 63% faster than the Nvidia RTX A6000.

In Unreal Engine 4.26, the generationon-generation gains were smaller. The biggest increase came when ray tracing was enabled on our Audi test model, with the RTX 6000 Ada Generation delivering 1.5 times more FPS than the RTX A6000.

In Autodesk VRED Professional 2023, performance increases ranged from 1.63 to 1.89. The biggest came when antialiasing was disabled.

In Nvidia Omniverse Create 2022.3.3, we tested with the Brownstone building sample model. In RTX - Real-Time mode with DLSS enabled the RTX 6000 Ada was a whopping 3.62 times faster than the RTX A6000. However, there’s a case of comparing apples with pears here as the RTX 6000 Ada uses DLSS 3 while the RTX A6000 use DLSS 2 (see earlier on). In saying that, we saw no visual difference between the two. In RTX – interactive

(path tracing) mode the RTX 6000 Ada was 2.16 times faster.

Ray trace rendering

We tested with a range of GPUaccelerated ray-trace renderers, including V-Ray GPU, KeyShot, Solidworks Visualize, Nvidia Omniverse and blender.

With the V-Ray, KeyShot and blender benchmarks, the RTX 6000 Ada Generation shot ahead, outperforming the RTX A6000 by a factor of 1.93, 2.05 and 2.17 respectively. We saw similar gains in Nvidia Omniverse, with the RTX 6000 Ada taking less than half the time of the RTX A6000 to render the brownstone building scene.

However, in some of our real-world application tests, the gains were nowhere near as large. In Solidworks Visualize 2023, rendering the 1969 Camaro test scene with Nvidia Iray and the new 3DS Stellar Physically Correct global illumination engine, showed the RTX 6000 Ada to be between 32% and 40% faster than the RTX A6000.

In KeyShot 2023 the RTX 6000 Ada Generation rendered our motorbike model between 33% and 41% quicker in a range of stills and turntable animations. With KeyShot ‘s sample drone scene it went down to 23%-25%.

Conclusion

The Nvidia RTX 6000 Ada Generation is a phenomenally powerful GPU. In several visualisation workflows, it delivered more than double the performance of the previous generation Nvidia RTX A6000. And when Nvidia has full control over both software and hardware, and the GPU’s fourthgen Tensor cores kick in with DLSS 3, the boost in real-time performance

Nvidia RTX 4000 SFF Ada Generation

Nvidia is having a staggered roll out of its Nvidia RTX Ada Lovelace GPUs for desktop workstations. Following on from the ultra-high-end Nvidia RTX 6000 Ada Generation, which launched at the tail end of 2022, Nvidia has now released a second desktop GPU designed specifically for compact workstations with form factors like those of the HP Z2 SFF, HP Z2 Mini, Lenovo ThinkStation P3 Ultra, Dell Precision 3460 SFF and Dell Precision 3260 Compact. The GPU will likely be of interest to users of CAD or BIM software who want to extend their workflows into visualisation, VR and simulation with tools like KeyShot, Twinmotion, Lumion, V-Ray, Omniverse and more.

The Nvidia RTX 4000 SFF Ada Generation features 20 GB of graphics memory, nearly double that of its predecessor, the Nvidia RTX A2000

(12 GB), and is said to offer a 2x performance improvement. It also offers greater memory bandwidth, so it can transfer data to and from its memory more quickly. According to Nvidia, this results in improved graphics, compute and rendering performance.

The Nvidia RTX 4000 SFF is a low-profile, double height graphics card, which takes up two slots on the motherboard. It has four mini DisplayPort1.4a connectors.

The GPU is designed to operate with PCIe slot power alone and has a max power consumption of 70W. This is significantly lower than previous ‘4000’ class GPUs, which typically draw up to 140W. However, Nvidia confirmed that, in the future, it will also launch a standard Nvidia RTX 4000 GPU include 6,144 CUDA parallel processing cores, 192 Nvidia Tensor Cores and 48 Nvidia RT Cores. Nvidia quotes 19.2 TFLOPs single precision performance, 44.3 TFLOPs RT Core performance, and 306.8 TFLOPS Tensor core performance. This is significantly more than the Nvidia RTX A2000 (8.0 TFLOPs, 15.6 TFLOPS and 63.9 TFLOPS respectively). And while single precision performance and RT core performance is very close to the 140W Nvidia RTX A4000, Tensor performance for AI operations has doubled. both GPUs will be share the same silicon, but the Nvidia RTX 4000 SFF will be clocked lower.

(with seemingly no impact on end user experience) is simply breathtaking.

But these are the extremes of what you can expect from this super high-end workstation GPU. In some of our real world tests the performance increases were as low as 23%. But even this can save hours in the working day. When rendering out a 600 frame 4K animation in KeyShot for example, render times dropped from 6 hours 26 mins to 4 hours 50 mins.

There are some downsides to the new GPU. First, there is no NVLink, which may come as a disappointment to those really pushing the boundaries of complexity. Second it is quite power hungry, drawing 300W at peak. And finally, of course, there’s the price.

Nvidia RTX 4000 SFF specs

£7,150 (Ex VAT) is an incredible amount to pay for a single graphics card and considerably more than Nvidia previously charged for its top end workstation GPUs. The Nvidia RTX A6000, for example, only cost £3,730 (Ex VAT) in February 2021.

While some will consider £7,150 to be a price worth paying for the transformative effect it could have on their workflows, others may seek better value elsewhere.

The new AMD Radeon Pro W7900 (48 GB) is one such option (see page 28), although it does not currently offer the same breadth of software support.

More than ever, perhaps Nvidia is facing its biggest competition from itself. The consumer-focused ‘Ada Generation’ GeForce RTX 4090 comes in around

The Nvidia RTX 4000 SFF GPU costs around $1,250, and will be available from workstation manufacturers later this year.

£1,500, but you miss out on some pro features, superior build quality and access to 48 GB of memory, double that of the 4090. And for some viz artists that’s a big deal. 48 GB allows you to work with more complex datasets and render them at higher resolutions. In simulation, engineers can increase the fidelity of the solver for more accurate results.

The additional memory will not just offer potential benefits for single app workflows. These days many viz artists to use multiple apps at the same time - render in V-Ray while working on real-time experiences in Unreal Engine, for example. And the RTX 6000 Ada Generation is much more likely to allow them to do this without having to compromise their workflow or output.

Preview: AMD Radeon Pro W7800 / W7900

AMD’s new RDNA 3 pro graphics cards target price / performance to take the fight to Nvidia and its RTX 6000 Ada Generation, writes

Greg Corke

Price $2,499 (W7800) / $3,999 (W7900) www.amd.com/radeonpro

It’s been a while coming, but AMD has finally delivered its new professional graphics cards, built on the same RDNA 3 architecture of its consumer Radeon RX 7900 Series, which launched in December 2022. We say ‘delivered’ but AMD is not quite there yet. Our review cards missed the deadline for this Workstation Special Report by a matter of days.

The new boards — the Radeon Pro W7900 and Radeon Pro W7800 — target workflows including visualisation, real-time 3D, ray trace rendering, photogrammetry, VR, simulation, video editing, compositing and more. But with pro driver optimisations and certifications they can also be used for CAD and BIM.

The AMD Radeon Pro W7900 is a triple (2.5) slot GPU with 48 GB of GDDR6 memory, 61 TFLOPs of peak single precision performance and a total board power of 295W. It costs $3,999.

The AMD Radeon Pro W7800 is a dual slot GPU with 32 GB of GDDR6 memory, 45 TFLOPs of peak single precision performance and a total board power of 260W. It costs $2,499.

Both GPUs comprise multiple unified RDNA 3 compute units, each with 64 dual issue stream processors, two AI accelerators and a second gen ray tracing (RT) accelerator. According to AMD, RDNA 3 offers up to 50% more raytracing performance per compute unit than the previous generation.

Optimised ray tracing

For ray tracing, the new GPUs are compatible with Unreal Engine, Unity,

Lumion, Enscape, Solidworks Visualise, D5 Render, Maxon Redshift, plus other applications that support DirectX Raytracing (DXR), Vulkan ray tracing, or AMD Radeon ProRender, including Acca Edificius, Autodesk Inventor, Rhino, Autodesk Maya, and Blender (Cycles X).

This list should grow. AMD is also working with other software developers to help convert their existing Nvidia CUDA applications to run on the new GPUs and other AMD hardware. This is being done through AMD’s open-source toolset HIP (Heterogeneous-Compute Interface for Portability), which includes a ray tracing library, HIP RT, so developers can take advantage of the dedicated ray accelerators in AMD’s GPUs.

The new GPUs will go up against the 48 GB Nvidia RTX performance.

In SPECviewperf 2020 GeoMean, for example, AMD claims the Radeon Pro W7900 is within 7% of the performance of the Nvidia RTX A6000 Ada Generation but offers more than double the price/performance, as it costs less than half as much ($3,999 vs $8,615).

AMD also highlights support for DisplayPort 2.1, the latest version of the digital display standard which offers three times the data rate of DisplayPort 1.4. According to AMD, this means its new GPUs are future proofed for next gen displays in terms of refresh rate, pixel resolution and colour bit-depth, while pointing out that the Nvidia RTX 6000 Ada Generation supports DisplayPort 1.4.

Both the AMD Radeon Pro W7800 and W7900 feature three DisplayPort 2.1 and one Mini DisplayPort 2.1 connectors, a change from the previous generation Radeon Pro W6800 with six Mini DisplayPort 1.4.

Memory boost

With 48 GB, the Radeon Pro W7900 also marks a step up in terms of memory, with 50% more than its predecessor, the Radeon Pro W6800, putting it on par with the Nvidia RTX 6000

Memory is becoming increasingly important for viz workflows, not just to support extremely complex high-polygon datasets, but for multi-tasking as well, as product designer, Dr. Adi Pandzic, Ph.D, explains, “Large format renders require more horsepower, especially when doing 4K raytraced animations using [Solidworks] Visualize. The Radeon Pro W7900 allows me to easily keep working on the model [in Solidworks CAD] while rendering in

Rich Hurrey, president, founder, Kitestring, shares similar experiences, “The increased memory that the new AMD RDNA 3 GPUs offer, allows us to have multiple instances of Maya, Modo, and Unreal Engine open at the same time. All of this means that production work gets

Memory also differentiates the new GPUs from AMD’s consumer focused RDNA 3 GPU, the AMD Radeon RX 7900 XTX which has 24 GB. Both pro GPUs also come with AMD Software: Pro Edition. This offers pro software certifications for ‘performance and stability’, and pro features such as ViewPort Boost, which dynamically adjusts viewport resolution to boost performance, remote workstation support and more.

Look out for a review soon.