Provider analysis
NVIDIA
Publisher of the Nemotron family of open enterprise models, embedding models, and reasoning-optimized LLMs.
This provider page blends full-profile entries with broader verified listings. Use it to separate deeply evaluated flagship models from source-backed records that are tracked primarily for market visibility, access data, and freshness coverage.
Tracked models available through provider-managed APIs.
Models with downloadable weights or self-hosted distribution paths.
Total source references attached across this provider catalog.
NVIDIA
Nemotron 3 Super 120B
Nemotron
NVIDIA's flagship 120B/12B-active LatentMoE model with 1M context, trained on 25T tokens. Strong on agentic workflows, reasoning, and long-context tasks. Requires 8x H100-80GB.
- Context
- 1,048,576
- Input
- Unpublished
- Output
- Unpublished
- Coverage
- Verified 2026-03-29
NVIDIA
Nemotron-Cascade 2
Nemotron
NVIDIA's 32B (30B-A3B MoE) Nemotron-Cascade 2 trained with cascade RL and multi-domain on-policy distillation. 74.8K downloads on HuggingFace.
- Context
- 131,072
- Input
- Unpublished
- Output
- Unpublished
- Coverage
- Verified 2026-03-29
NVIDIA
Nemotron 3 Nano 4B
Nemotron
NVIDIA's compact 4B Nemotron Nano for efficient local AI with hybrid Mamba-2 architecture. Runs on consumer GPUs.
- Context
- 131,072
- Input
- Unpublished
- Output
- Unpublished
- Coverage
- Verified 2026-03-29
NVIDIA
Llama Nemotron Super 49B
Llama Nemotron
NVIDIA's Llama-based Nemotron Super 49B for high-accuracy reasoning, agentic tasks, and RAG workflows.
- Context
- 131,072
- Input
- Unpublished
- Output
- Unpublished
- Coverage
- Verified 2026-03-29
NVIDIA
Llama Nemotron Nano 4B
Llama Nemotron
NVIDIA's compact 4B Llama Nemotron Nano for edge AI with high-accuracy reasoning. Runs on consumer GPUs.
- Context
- 131,072
- Input
- Unpublished
- Output
- Unpublished
- Coverage
- Verified 2026-03-29
NVIDIA
NV-Embed v2
Nemotron embedding
NVIDIA's state-of-the-art text embedding model ranked #1 on MTEB leaderboard for retrieval and semantic similarity tasks.
- Context
- 32,768
- Input
- Unpublished
- Output
- Unpublished
- Coverage
- Verified 2026-03-29
NVIDIA
Llama-Embed-Nemotron 8B
Nemotron embedding
NVIDIA's Llama-Embed-Nemotron 8B ranked #1 on multilingual MTEB leaderboard with text and image retrieval support.
- Context
- 32,768
- Input
- Unpublished
- Output
- Unpublished
- Coverage
- Verified 2026-03-29