Provider analysis

NVIDIA

Publisher of the Nemotron family of open enterprise models, embedding models, and reasoning-optimized LLMs.

Last verified: 2026-03-29
Confidence: High
Primary sources: 4

This provider page blends full-profile entries with broader verified listings. Use it to separate deeply evaluated flagship models from source-backed records that are tracked primarily for market visibility, access data, and freshness coverage.

Headquarters

Santa Clara, CA

Founded

1993

Models tracked

Full-profile models

Catalog last verified

2026-03-29

Latest model verification

2026-03-29

Newest release tracked

2026-03-19

Confidence

High

Verified listings

Access mix

open-weight, self-hosted, hosted, api

Website

https://www.nvidia.com/

API models

Tracked models available through provider-managed APIs.

Open-weight models

Models with downloadable weights or self-hosted distribution paths.

Primary source links

Total source references attached across this provider catalog.

Provider sources

Official links used to verify the provider profile and platform coverage.

Last verified: 2026-03-29

NVIDIA website (official-website)
NVIDIA NIM docs (official-docs)
NVIDIA NIM pricing (official-pricing)
NVIDIA Hugging Face (cloud-platform)

NVIDIA

Nemotron 3 Super 120B

Nemotron

NVIDIA's flagship LatentMoE model (120B total parameters, 12B active) with a 1M-token context window, trained on 25T tokens. Strong on agentic workflows, reasoning, and long-context tasks. Requires 8x H100-80GB.

Verified listing, 5 sources

text, reasoning, code, open-source, open-weight, self-hosted, hosted, api

Context: 1,048,576
Input: Unpublished
Output: Unpublished
Coverage: Verified 2026-03-29
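
As a rough sanity check on the 8x H100-80GB requirement listed above, the sketch below compares the weight footprint of a 120B-parameter model with the aggregate memory of eight 80 GB GPUs. The 2 bytes per parameter (BF16) figure is an assumption, not something the listing states, and the function and variable names are illustrative only. KV cache, activations, and runtime overhead are ignored, so the headroom figure is an upper bound.

```python
# Back-of-the-envelope weight-memory estimate (illustrative, not an official sizing guide).
# ASSUMPTION: weights stored at 2 bytes per parameter (BF16); the listing does not state precision.

def weight_memory_gb(params_billions: float, bytes_per_param: float = 2.0) -> float:
    """Return approximate weight size in GB (1 GB = 1e9 bytes)."""
    # billions of parameters * bytes per parameter = gigabytes
    return params_billions * bytes_per_param

total_params_b = 120   # total parameters, from the listing
gpu_count = 8          # from "8x H100-80GB"
gpu_memory_gb = 80     # per-GPU memory, from "H100-80GB"

weights_gb = weight_memory_gb(total_params_b)
cluster_gb = gpu_count * gpu_memory_gb
headroom_gb = cluster_gb - weights_gb  # left for KV cache, activations, overhead

print(f"weights: {weights_gb:.0f} GB, cluster: {cluster_gb} GB, headroom: {headroom_gb:.0f} GB")
```

Under these assumptions the weights occupy 240 GB of the 640 GB available. Long-context serving can consume a large share of the remainder, so this estimate says nothing about the minimum viable configuration.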


NVIDIA

Nemotron-Cascade 2

Nemotron

NVIDIA's Nemotron-Cascade 2, a 32B (30B-A3B MoE) model trained with cascade RL and multi-domain on-policy distillation. 74.8K downloads on Hugging Face.

Verified listing, 5 sources

text, reasoning, code, open-source, open-weight, self-hosted, hosted, api

Context: 131,072
Input: Unpublished
Output: Unpublished
Coverage: Verified 2026-03-29


NVIDIA

Nemotron 3 Nano 4B

Nemotron

NVIDIA's compact 4B Nemotron Nano for efficient local AI, built on a hybrid Mamba-2 architecture. Runs on consumer GPUs.

Verified listing, 5 sources

text, reasoning, code, open-source, open-weight, self-hosted, hosted, api

Context: 131,072
Input: Unpublished
Output: Unpublished
Coverage: Verified 2026-03-29


NVIDIA

Llama Nemotron Super 49B

Llama Nemotron

NVIDIA's Llama-based Nemotron Super 49B for high-accuracy reasoning, agentic tasks, and RAG workflows.

Verified listing, 4 sources

text, reasoning, code, open-source, open-weight, self-hosted, hosted, api

Context: 131,072
Input: Unpublished
Output: Unpublished
Coverage: Verified 2026-03-29


NVIDIA

Llama Nemotron Nano 4B

Llama Nemotron

NVIDIA's compact 4B Llama Nemotron Nano for edge AI with high-accuracy reasoning. Runs on consumer GPUs.

Verified listing, 4 sources

text, reasoning, code, open-source, open-weight, self-hosted, hosted, api

Context: 131,072
Input: Unpublished
Output: Unpublished
Coverage: Verified 2026-03-29


NVIDIA

NV-Embed v2

Nemotron embedding

NVIDIA's state-of-the-art text embedding model, ranked #1 on the MTEB leaderboard for retrieval and semantic similarity tasks.

Verified listing, 5 sources

text, open-source, open-weight, self-hosted, api

Context: 32,768
Input: Unpublished
Output: Unpublished
Coverage: Verified 2026-03-29


NVIDIA

Llama-Embed-Nemotron 8B

Nemotron embedding

NVIDIA's Llama-Embed-Nemotron 8B, ranked #1 on the multilingual MTEB leaderboard, with text and image retrieval support.

Verified listing, 4 sources

text, image, open-source, open-weight, self-hosted, api

Context: 32,768
Input: Unpublished
Output: Unpublished
Coverage: Verified 2026-03-29
