LLM AtlasLLM AtlasSearch models

Provider analysis

NVIDIA

Publisher of the Nemotron family of open enterprise models, embedding models, and reasoning-optimized LLMs.

Last verified: 2026-03-29Confidence: HighPrimary sources: 4

This provider page blends full-profile entries with broader verified listings. Use it to separate deeply evaluated flagship models from source-backed records that are tracked primarily for market visibility, access data, and freshness coverage.

Headquarters
Santa Clara, CA
Founded
1993
Models tracked
7
Full-profile models
0
Catalog last verified
2026-03-29
Latest model verification
2026-03-29
Newest release tracked
2026-03-19
Confidence
High
Verified listings
7
Access mix
open-weight, self-hosted, hosted, api
API models
7

Tracked models available through provider-managed APIs.

Open-weight models
7

Models with downloadable weights or self-hosted distribution paths.

Primary source links
32

Total source references attached across this provider catalog.

Provider sources

Official links used to verify the provider profile and platform coverage.

Last verified: 2026-03-29

NVIDIA

Nemotron 3 Super 120B

Nemotron

NVIDIA's flagship 120B/12B-active LatentMoE model with 1M context, trained on 25T tokens. Strong on agentic workflows, reasoning, and long-context tasks. Requires 8x H100-80GB.

Verified listing5 sources
textreasoningcodeopen-sourceopen-weightself-hostedhostedapi
Context
1,048,576
Input
Unpublished
Output
Unpublished
Coverage
Verified 2026-03-29

NVIDIA

Nemotron-Cascade 2

Nemotron

NVIDIA's 32B (30B-A3B MoE) Nemotron-Cascade 2 trained with cascade RL and multi-domain on-policy distillation. 74.8K downloads on HuggingFace.

Verified listing5 sources
textreasoningcodeopen-sourceopen-weightself-hostedhostedapi
Context
131,072
Input
Unpublished
Output
Unpublished
Coverage
Verified 2026-03-29

NVIDIA

Nemotron 3 Nano 4B

Nemotron

NVIDIA's compact 4B Nemotron Nano for efficient local AI with hybrid Mamba-2 architecture. Runs on consumer GPUs.

Verified listing5 sources
textreasoningcodeopen-sourceopen-weightself-hostedhostedapi
Context
131,072
Input
Unpublished
Output
Unpublished
Coverage
Verified 2026-03-29

NVIDIA

Llama Nemotron Super 49B

Llama Nemotron

NVIDIA's Llama-based Nemotron Super 49B for high-accuracy reasoning, agentic tasks, and RAG workflows.

Verified listing4 sources
textreasoningcodeopen-sourceopen-weightself-hostedhostedapi
Context
131,072
Input
Unpublished
Output
Unpublished
Coverage
Verified 2026-03-29

NVIDIA

Llama Nemotron Nano 4B

Llama Nemotron

NVIDIA's compact 4B Llama Nemotron Nano for edge AI with high-accuracy reasoning. Runs on consumer GPUs.

Verified listing4 sources
textreasoningcodeopen-sourceopen-weightself-hostedhostedapi
Context
131,072
Input
Unpublished
Output
Unpublished
Coverage
Verified 2026-03-29

NVIDIA

NV-Embed v2

Nemotron embedding

NVIDIA's state-of-the-art text embedding model ranked #1 on MTEB leaderboard for retrieval and semantic similarity tasks.

Verified listing5 sources
textopen-sourceopen-weightself-hostedapi
Context
32,768
Input
Unpublished
Output
Unpublished
Coverage
Verified 2026-03-29

NVIDIA

Llama-Embed-Nemotron 8B

Nemotron embedding

NVIDIA's Llama-Embed-Nemotron 8B ranked #1 on multilingual MTEB leaderboard with text and image retrieval support.

Verified listing4 sources
textimageopen-sourceopen-weightself-hostedapi
Context
32,768
Input
Unpublished
Output
Unpublished
Coverage
Verified 2026-03-29