Nvidia

Llama-3.1-Nemotron-Ultra-253B-v1

Name: Llama-3.1-Nemotron-Ultra-253B-v1
Author: Nvidia

A fast decision page for teams comparing performance, cost, context window, and critical capabilities without digging through raw specs.

Max Context (In)

131K

Max Output (Out)

Input (1M tokens)

Output (1M tokens)

Quick signals

Vision

Tool Calling

Structured Output (JSON)

File Attachments

Reasoning

Open Source

Nemotron 3 Super

Nvidia · 262K

Llama Nemotron Embed VL 1B V2 (free)

Nvidia · 131K

Step 3.5 Flash

Nvidia · 256K

Kimi K2.5

Nvidia · 262K