LLM Atlas

Leaderboard

overall rankings

Updated weekly

Each leaderboard uses transparent weighted scoring, current model context, and supporting analysis to help teams interpret the results with confidence. Only full-profile entries appear in rankings; broader catalog records remain available elsewhere on the site when only source-backed metadata is currently available.

Full-profile entries
245

Models whose metadata and scoring coverage are complete enough to be ranked meaningfully in this category.

Ranking basis
overall

Scores combine benchmark evidence, product metadata, and cost/context signals when those fields are published.
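As a rough illustration of how a weighted score can combine several signals while tolerating unpublished fields, here is a minimal sketch. The weights, the signal names (`benchmarks`, `metadata`, `cost_context`), the 0-100 scale, and the choice to renormalize over whichever signals are present are all assumptions made for this example; the page does not state the actual formula.

```python
from typing import Mapping, Optional

# Hypothetical integer weights; the real leaderboard weights are not given here.
WEIGHTS = {"benchmarks": 6, "metadata": 2, "cost_context": 2}


def overall_score(signals: Mapping[str, Optional[float]]) -> Optional[int]:
    """Return a weighted mean (0-100) over the signals that are published.

    Signals that are missing or None are skipped, and the remaining
    weights are renormalized so the result stays on the same scale.
    """
    available = {
        name: value
        for name, value in signals.items()
        if name in WEIGHTS and value is not None
    }
    if not available:
        return None  # nothing published, so no score can be computed
    total_weight = sum(WEIGHTS[name] for name in available)
    weighted_sum = sum(WEIGHTS[name] * value for name, value in available.items())
    return round(weighted_sum / total_weight)
```

For example, `overall_score({"benchmarks": 90, "metadata": 70, "cost_context": None})` averages only the two published signals. Renormalizing is one possible design choice; a scheme that penalizes missing fields instead would rank sparsely documented models lower.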

Catalog caveat
25

Tracked models without full scoring remain in the directory and provider pages, but are not relied on for analytical ranking claims.

Rank | Model | Provider | Score | Context (tokens)
#1 | GPT-5.4 | OpenAI | 93 | 1,000,000
#2 | Claude Sonnet 4.6 | Anthropic | 92 | 1,000,000
#3 | Claude Opus 4.6 | Anthropic | 91 | 1,000,000
#4 | Claude Sonnet 4.5 | Anthropic | 91 | 1,000,000
#5 | Gemini 3.1 Pro | Google DeepMind | 91 | 1,048,576
#6 | GPT-4o | OpenAI | 91 | 128,000
#7 | GPT-5.2 | OpenAI | 91 | 1,000,000
#8 | GPT-5.4 Pro | OpenAI | 91 | 1,000,000
#9 | Claude 3.7 Sonnet | Anthropic | 90 | 200,000
#10 | Gemini 2.5 Pro | Google DeepMind | 90 | 1,048,576
#11 | GPT-5.2 Pro | OpenAI | 90 | 1,000,000
#12 | Claude Opus 4.1 | Anthropic | 89 | 200,000
#13 | Claude Sonnet 4 | Anthropic | 89 | 1,000,000
#14 | Gemini 3.0 Pro | Google DeepMind | 89 | 1,048,576
#15 | Gemini 3.1 Flash | Google DeepMind | 89 | 1,048,576
#16 | GPT-5 | OpenAI | 89 | 1,000,000
#17 | Gemini 2.5 Pro TTS | Google DeepMind | 88 | 1,048,576
#18 | Claude Opus 4 | Anthropic | 87 | 200,000
#19 | Gemini 2.5 Flash | Google DeepMind | 87 | 1,048,576
#20 | Gemini 3.0 Flash | Google DeepMind | 87 | 1,048,576
#21 | GPT-5.3-Codex | OpenAI | 87 | 1,000,000
#22 | GPT-5 mini | OpenAI | 86 | 1,000,000
#23 | Mistral Large 25 | Mistral AI | 86 | 128,000
#24 | Gemini 2.0 Flash | Google DeepMind | 85 | 1,048,576
#25 | GPT-4.1 | OpenAI | 85 | 1,048,576
#26 | GPT-5.3 Instant | OpenAI | 85 | 1,000,000
#27 | Claude Haiku 4.5 | Anthropic | 84 | 200,000
#28 | Llama 4 Maverick | Meta | 84 | 1,048,576
#29 | Gemini 2.5 Flash Live | Google DeepMind | 83 | 1,048,576
#30 | Gemini 2.5 Flash Native Audio Preview | Google DeepMind | 83 | 1,048,576
#31 | Gemini 2.5 Flash-Lite | Google DeepMind | 83 | 1,048,576
#32 | Command R+ 2026 | Cohere | 82 | 128,000
#33 | Gemini 1.5 Pro | Google DeepMind | 82 | 2,097,152
#34 | Gemini 3.1 Flash-Lite | Google DeepMind | 82 | 1,048,576
#35 | o3 | OpenAI | 81 | 200,000
#36 | o4-mini | OpenAI | 81 | 200,000
#37 | GPT-4.1 mini | OpenAI | 80 | 1,048,576
#38 | o3-deep-research | OpenAI | 80 | 200,000
#39 | o4-mini-deep-research | OpenAI | 80 | 200,000
#40 | Claude Haiku 3.5 | Anthropic | 79 | 200,000
#41 | Claude Opus 3 | Anthropic | 79 | 200,000
#42 | Gemini 2.0 Flash-Lite | Google DeepMind | 79 | 1,048,576
#43 | GPT-5 nano | OpenAI | 79 | 1,000,000
#44 | Nova Pro | Amazon Web Services | 79 | 300,000
#45 | Claude Sonnet 3 | Anthropic | 78 | 200,000
#46 | Gemini 1.5 Flash | Google DeepMind | 78 | 1,048,576
#47 | GPT-4o-mini | OpenAI | 78 | 128,000
#48 | o1 | OpenAI | 75 | 200,000
#49 | Sonar Reasoning Pro | Perplexity | 75 | 200,000
#50 | Llama 3.3 70B Instruct | Meta | 74 | 128,000
#51 | Nova Lite | Amazon Web Services | 74 | 300,000
#52 | o1-mini | OpenAI | 74 | 128,000
#53 | Sonar Pro | Perplexity | 74 | 200,000
#54 | Gemini 1.5 Flash-8B | Google DeepMind | 73 | 1,048,576
#55 | Llama 4 Scout | Meta | 73 | 10,485,760
#56 | Claude Haiku 3 | Anthropic | 72 | 200,000
#57 | Claude Haiku 3 | Anthropic | 72 | 200,000
#58 | Claude Haiku 3.5 | Anthropic | 72 | 200,000
#59 | Claude Haiku 4.5 | Anthropic | 72 | 200,000
#60 | Codestral | Mistral AI | 72 | 256,000
#61 | Codestral Embed | Mistral AI | 72 | 32,768
#62 | CogVideoX | Z.AI | 72 | 128,000
#63 | CogView 4 | Z.AI | 72 | 128,000
#64 | Devstral Medium 1.0 | Mistral AI | 72 | 128,000
#65 | Doubao-Seed-1.6 | ByteDance / Doubao | 72 | 128,000
#66 | Doubao-Seed-1.6-Flash | ByteDance / Doubao | 72 | 128,000
#67 | Doubao-Seed-2.0-Code | ByteDance / Doubao | 72 | 128,000
#68 | Doubao-Seed-Code | ByteDance / Doubao | 72 | 128,000
#69 | ERNIE 3.5 128K | Baidu / ERNIE | 72 | 128,000
#70 | ERNIE 4.0 Turbo 8K | Baidu / ERNIE | 72 | 128,000
#71 | ERNIE Functions 8K | Baidu / ERNIE | 72 | 128,000
#72 | ERNIE Speed 128K | Baidu / ERNIE | 72 | 128,000
#73 | GLM-4.5 | Z.AI | 72 | 128,000
#74 | GLM-4.5V | Z.AI | 72 | 128,000
#75 | GLM-4.6 | Z.AI | 72 | 128,000
#76 | GLM-4.6V | Z.AI | 72 | 128,000
#77 | GLM-4.7 | Z.AI | 72 | 128,000
#78 | GLM-5 | Z.AI | 72 | 128,000
#79 | GLM-Image | Z.AI | 72 | 128,000
#80 | GLM-OCR | Z.AI | 72 | 128,000
#81 | gpt-audio | OpenAI | 72 | 128,000
#82 | gpt-oss-120b | OpenAI | 72 | 131,072
#83 | gpt-realtime | OpenAI | 72 | 128,000
#84 | Hunyuan Code | Tencent / Hunyuan | 72 | 128,000
#85 | Hunyuan Lite | Tencent / Hunyuan | 72 | 128,000
#86 | Hunyuan Standard | Tencent / Hunyuan | 72 | 128,000
#87 | Hunyuan T1 | Tencent / Hunyuan | 72 | 256,000
#88 | Hunyuan T1 Vision | Tencent / Hunyuan | 72 | 128,000
#89 | Hunyuan TurboS | Tencent / Hunyuan | 72 | 128,000
#90 | Hunyuan TurboS LongText 128K | Tencent / Hunyuan | 72 | 128,000
#91 | Kimi K2 | Moonshot AI / Kimi | 72 | 131,072
#92 | Kimi K2 Thinking | Moonshot AI / Kimi | 72 | 256,000
#93 | Kimi K2 Turbo Preview | Moonshot AI / Kimi | 72 | 256,000
#94 | Kimi K2.5 | Moonshot AI / Kimi | 72 | 256,000
#95 | Magistral Medium 1.2 | Mistral AI | 72 | 128,000
#96 | MiniMax-M1 | MiniMax | 72 | 204,800
#97 | MiniMax-M2 | MiniMax | 72 | 204,800
#98 | MiniMax-M2.1 | MiniMax | 72 | 204,800
#99 | MiniMax-M2.1-highspeed | MiniMax | 72 | 204,800
#100 | MiniMax-M2.5 | MiniMax | 72 | 204,800
#101 | MiniMax-M2.5-highspeed | MiniMax | 72 | 204,800
#102 | MiniMax-Text-01 | MiniMax | 72 | 204,800
#103 | MiniMax-VL-01 | MiniMax | 72 | 204,800
#104 | Mistral Embed | Mistral AI | 72 | 32,768
#105 | Mistral Large 3 | Mistral AI | 72 | 128,000
#106 | Mistral Medium 3.1 | Mistral AI | 72 | 128,000
#107 | Mistral Moderation | Mistral AI | 72 | 32,768
#108 | Mistral Small 3.1 | Mistral AI | 72 | 128,000
#109 | Mistral Small 3.2 Open | Mistral AI | 72 | 128,000
#110 | Pixtral 12B | Mistral AI | 72 | 131,072
#111 | Pixtral Large | Mistral AI | 72 | 131,072
#112 | Sonar Deep Research | Perplexity | 72 | 200,000
#113 | Vidu Q1 | Z.AI | 72 | 128,000
#114 | Voxtral Mini Open | Mistral AI | 72 | 131,072
#115 | Voxtral Mini Transcribe | Mistral AI | 72 | 131,072
#116 | Voxtral Small Open | Mistral AI | 72 | 131,072
#117 | Claude Sonnet 3 | Anthropic | 71 | 200,000
#118 | Claude Sonnet 4 | Anthropic | 71 | 1,000,000
#119 | Command A | Cohere | 71 | 256,000
#120 | Command A Reasoning | Cohere | 71 | 128,000
#121 | Command A Translate | Cohere | 71 | 128,000
#122 | Command A Vision | Cohere | 71 | 128,000
#123 | Command R+ | Cohere | 71 | 128,000
#124 | Command R7B | Cohere | 71 | 128,000
#125 | DeepSeek-Coder-V2 | DeepSeek | 71 | 128,000
#126 | DeepSeek-Math-V2 | DeepSeek | 71 | 128,000
#127 | DeepSeek-R1 | DeepSeek | 71 | 128,000
#128 | DeepSeek-R1-Distill-Llama-70B | DeepSeek | 71 | 128,000
#129 | DeepSeek-V2.5 | DeepSeek | 71 | 128,000
#130 | DeepSeek-V3 | DeepSeek | 71 | 128,000
#131 | DeepSeek-V3.1 | DeepSeek | 71 | 128,000
#132 | DeepSeek-V3.1-Base | DeepSeek | 71 | 128,000
#133 | DeepSeek-V3.2 | DeepSeek | 71 | 128,000
#134 | DeepSeek-V3.2-Exp | DeepSeek | 71 | 128,000
#135 | Devstral 2 Open | Mistral AI | 71 | 128,000
#136 | Devstral Small 2 | Mistral AI | 71 | 128,000
#137 | Embed 4 | Cohere | 71 | 128,000
#138 | ERNIE 4.5 Turbo 32K | Baidu / ERNIE | 71 | 32,768
#139 | Grok 3 | xAI | 71 | 131,072
#140 | Grok 3 Mini | xAI | 71 | 131,072
#141 | Grok 4 | xAI | 71 | 256,000
#142 | Grok 4 Fast Reasoning | xAI | 71 | 131,072
#143 | grok-image | xAI | 71 | 131,072
#144 | Jamba 3B | AI21 Labs | 71 | 256,000
#145 | Jamba Large | AI21 Labs | 71 | 256,000
#146 | Jamba Large 1.6 | AI21 Labs | 71 | 256,000
#147 | Jamba Mini | AI21 Labs | 71 | 256,000
#148 | Jamba Mini 1.6 | AI21 Labs | 71 | 256,000
#149 | Jamba Mini 1.7 | AI21 Labs | 71 | 256,000
#150 | Magistral Small 1.2 Open | Mistral AI | 71 | 128,000
#151 | Ministral 3 14B Open | Mistral AI | 71 | 128,000
#152 | Ministral 3 3B Open | Mistral AI | 71 | 128,000
#153 | Ministral 3 8B Open | Mistral AI | 71 | 128,000
#154 | Mistral Large 3 Open | Mistral AI | 71 | 128,000
#155 | Mistral Nemo 12B | Mistral AI | 71 | 128,000
#156 | Mistral OCR 2505 | Mistral AI | 71 | 32,768
#157 | morph-v3-fast-apply | Morph | 71 | 128,000
#158 | Phi-3-vision-128k-instruct | Microsoft | 71 | 128,000
#159 | Phi-3.5-mini-instruct | Microsoft | 71 | 131,072
#160 | Phi-3.5-MoE-instruct | Microsoft | 71 | 131,072
#161 | Phi-3.5-vision-instruct | Microsoft | 71 | 131,072
#162 | Phi-4-mini-flash-reasoning | Microsoft | 71 | 131,072
#163 | Phi-4-mini-instruct | Microsoft | 71 | 131,072
#164 | Phi-4-multimodal-instruct | Microsoft | 71 | 131,072
#165 | Phi-4-reasoning | Microsoft | 71 | 131,072
#166 | Phi-4-reasoning-plus | Microsoft | 71 | 131,072
#167 | Phi-4-reasoning-vision-15B | Microsoft | 71 | 131,072
#168 | Qwen2.5-1.5B-Instruct | Alibaba Qwen | 71 | 131,072
#169 | Qwen2.5-14B-Instruct | Alibaba Qwen | 71 | 131,072
#170 | Qwen2.5-32B-Instruct | Alibaba Qwen | 71 | 131,072
#171 | Qwen2.5-3B-Instruct | Alibaba Qwen | 71 | 131,072
#172 | Qwen2.5-72B-Instruct | Alibaba Qwen | 71 | 131,072
#173 | Qwen2.5-7B-Instruct | Alibaba Qwen | 71 | 131,072
#174 | Qwen2.5-Max | Alibaba Qwen | 71 | 131,072
#175 | Qwen2.5-VL-72B-Instruct | Alibaba Qwen | 71 | 131,072
#176 | Qwen2.5-VL-7B-Instruct | Alibaba Qwen | 71 | 131,072
#177 | Qwen3-Coder-Next | Alibaba Qwen | 71 | 131,072
#178 | Qwen3.5-0.8B | Alibaba Qwen | 71 | 131,072
#179 | Qwen3.5-122B-A10B | Alibaba Qwen | 71 | 131,072
#180 | Qwen3.5-27B | Alibaba Qwen | 71 | 131,072
#181 | Qwen3.5-2B | Alibaba Qwen | 71 | 131,072
#182 | Qwen3.5-35B-A3B | Alibaba Qwen | 71 | 131,072
#183 | Qwen3.5-397B-A17B | Alibaba Qwen | 71 | 131,072
#184 | Qwen3.5-4B | Alibaba Qwen | 71 | 131,072
#185 | Qwen3.5-9B | Alibaba Qwen | 71 | 131,072
#186 | Sonar | Perplexity | 71 | 128,000
#187 | warpgrep-v2 | Morph | 71 | 128,000
#188 | DeepSeek-OCR | DeepSeek | 70 | 16,384
#189 | DeepSeek-OCR-2 | DeepSeek | 70 | 16,384
#190 | DeepSeek-VL2-Small | DeepSeek | 70 | 16,384
#191 | image-01 | MiniMax | 70 | 8,192
#192 | image-01-live | MiniMax | 70 | 8,192
#193 | Janus-Pro-7B | DeepSeek | 70 | 16,384
#194 | Llama 3.1 405B Instruct | Meta | 70 | 128,000
#195 | Llama 3.1 70B Instruct | Meta | 70 | 128,000
#196 | MiniMax-Speech-02 | MiniMax | 70 | 8,192
#197 | music-2.0 | MiniMax | 70 | 8,192
#198 | Phi-4 | Microsoft | 70 | 16,384
#199 | Step-3.5-Flash | StepFun | 70 | 131,072
#200 | Claude Opus 3 | Anthropic | 69 | 200,000
#201 | Claude Opus 4 | Anthropic | 69 | 200,000
#202 | Llama 3.2 90B Vision Instruct | Meta | 69 | 128,000
#203 | phi-1 | Microsoft | 69 | 4,096
#204 | phi-1_5 | Microsoft | 69 | 4,096
#205 | phi-2 | Microsoft | 69 | 4,096
#206 | Phi-3-medium-4k-instruct | Microsoft | 69 | 4,096
#207 | Phi-3-mini-4k-instruct | Microsoft | 69 | 4,096
#208 | Phi-tiny-MoE-instruct | Microsoft | 69 | 4,096
#209 | flash-compact | Morph | 68 | 200,000
#210 | GPT Image 1 | OpenAI | 67 | 32,768
#211 | Nova Micro | Amazon Web Services | 67 | 128,000
#212 | Step3-VL-10B | StepFun | 67 | 131,072
#213 | gpt-audio-mini | OpenAI | 66 | 128,000
#214 | gpt-realtime-mini | OpenAI | 66 | 128,000
#215 | MiMo-VL-7B | Xiaomi | 66 | 131,072
#216 | chatgpt-image-latest | OpenAI | 65 | 32,768
#217 | gpt-image-1-mini | OpenAI | 65 | 32,768
#218 | gpt-oss-20b | OpenAI | 65 | 131,072
#219 | Llama 3.1 8B Instruct | Meta | 64 | 128,000
#220 | Llama 3.2 11B Vision Instruct | Meta | 64 | 128,000
#221 | Llama 3.2 1B Instruct | Meta | 61 | 128,000
#222 | Llama 3.2 3B Instruct | Meta | 61 | 128,000
#223 | MiMo-Audio-7B | Xiaomi | 61 | 131,072
#224 | Code Llama 70B Instruct | Meta | 60 | 8,192
#225 | LFM2-24B-A2B | Liquid AI | 60 | 32,768
#226 | Llama Guard 4 12B | Meta | 60 | 131,072
#227 | Meta Llama 3 70B Instruct | Meta | 60 | 8,192
#228 | Step-Audio-R1.1 | StepFun | 59 | 131,072
#229 | GPT-4o mini Transcribe | OpenAI | 58 | 128,000
#230 | GPT-4o mini TTS | OpenAI | 58 | 128,000
#231 | GPT-4o Transcribe | OpenAI | 58 | 128,000
#232 | Code Llama 34B Instruct | Meta | 57 | 8,192
#233 | Llama Guard 3 11B Vision | Meta | 57 | 131,072
#234 | Meta Llama 3 8B Instruct | Meta | 57 | 8,192
#235 | LFM2-8B-A1B | Liquid AI | 55 | 32,768
#236 | FLUX 1.1 Pro | Black Forest Labs | 51 | 512
#237 | FLUX 1.1 Pro Ultra | Black Forest Labs | 51 | 512
#238 | FLUX 1 Pro | Black Forest Labs | 49 | 512
#239 | LFM2.5-1.2B-Instruct | Liquid AI | 49 | 131,072
#240 | LFM2.5-1.2B-Thinking | Liquid AI | 49 | 131,072
#241 | LFM2-2.6B | Liquid AI | 47 | 32,768
#242 | pplx-embed-v1-4b | Perplexity | 44 | 8,192
#243 | pplx-embed-v1-0.6b | Perplexity | 43 | 8,192
#244 | NextStep-1.1 | StepFun | 42 | 512
#245 | Prompt Guard 86M | Meta | 41 | 512
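
Within each group of entries sharing the same displayed score, the rows above appear in case-insensitive alphabetical order by model name (for example, the three Gemini 2.5 Flash variants at 83). The page does not state a tie-breaking rule, so the following is only a sketch of an ordering that reproduces what is visible; the `rank` helper and its tuple layout are hypothetical, and the site may instead sort on unrounded scores.

```python
from typing import List, Tuple

Row = Tuple[str, int]  # (model name, displayed score)


def rank(rows: List[Row]) -> List[Tuple[int, str, int]]:
    """Order rows by score (highest first), then by lowercased model name.

    Returns (rank, model name, score) tuples with ranks starting at 1.
    The name-based tie-break is an assumption inferred from the listing.
    """
    ordered = sorted(rows, key=lambda row: (-row[1], row[0].lower()))
    return [(i, name, score) for i, (name, score) in enumerate(ordered, start=1)]
```

Calling `rank([("GPT-4o", 91), ("Claude Opus 4.6", 91), ("GPT-5.4", 93)])` places GPT-5.4 first and then orders the two 91-point entries by name.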

Why #1: GPT-5.4

GPT-5.4 is OpenAI's most capable and efficient frontier model for professional work. It is the first general-purpose model with native computer-use capabilities, and it combines the industry-leading coding of GPT-5.3-Codex with improved agentic workflows.

This model clears the current full-profile threshold for leaderboard methodology.

Why #2: Claude Sonnet 4.6

Anthropic's current Sonnet tier for fast frontier reasoning, coding, and long-context agent work.

This model clears the current full-profile threshold for leaderboard methodology.

Why #3: Claude Opus 4.6

Anthropic's most intelligent Claude model for complex agents, coding, and deep reasoning, with 1M token context and 128K output.

This model clears the current full-profile threshold for leaderboard methodology.