March 30, 2026 · 9 min read Nemotron-3-Super-120B: Topology Benchmark on DGX Spark The model where cluster never loses. Nemotron-3-Super benefits more from TP=2 than any other model tested -- and the SM12.1 CUTLASS patch doubles performance vs FlashInfer. #DGX Spark #benchmarking #Nemotron #vLLM #CUTLASS #AI #Local AI