Provider analysis
MiniMax
Provider of the MiniMax M-series, speech, video, music, and agent APIs for multimodal application builders.
This provider page blends full-profile entries with broader verified listings. Use it to separate deeply evaluated flagship models from source-backed records that are tracked primarily for market visibility, access data, and freshness coverage.
Tracked models available through provider-managed APIs.
Models with downloadable weights or self-hosted distribution paths.
Total source references attached across this provider catalog.
Provider sources
Official links used to verify the provider profile and platform coverage.
MiniMax
MiniMax-Speech-02
MiniMax media
MiniMax's TTS model for high-fidelity speech synthesis with voice cloning capabilities.
- Context
- 8,192
- Input
- $0.001/1K tok
- Output
- $0.005/1K tok
- Coverage
- Full profile
MiniMax
MiniMax-VL-01
MiniMax media
MiniMax's vision-language model with 200K context for multimodal understanding and image analysis.
- Context
- 204,800
- Input
- $0.001/1K tok
- Output
- $0.005/1K tok
- Coverage
- Full profile
MiniMax
MiniMax-Text-01
MiniMax media
MiniMax's text generation models with 200K context for general-purpose language tasks.
- Context
- 204,800
- Input
- $0.001/1K tok
- Output
- $0.005/1K tok
- Coverage
- Full profile
MiniMax
MiniMax-M1
MiniMax media
MiniMax's text generation models with 200K context for general-purpose language tasks.
- Context
- 204,800
- Input
- $0.001/1K tok
- Output
- $0.005/1K tok
- Coverage
- Full profile
MiniMax
image-01
MiniMax media
MiniMax's image generation models for text-to-image creation, including a live animation variant.
- Context
- 8,192
- Input
- $0.001/1K tok
- Output
- $0.005/1K tok
- Coverage
- Full profile
MiniMax
image-01-live
MiniMax media
MiniMax's image generation models for text-to-image creation, including a live animation variant.
- Context
- 8,192
- Input
- $0.001/1K tok
- Output
- $0.005/1K tok
- Coverage
- Full profile
MiniMax
music-2.0
MiniMax media
MiniMax's music generation model for AI-composed audio tracks.
- Context
- 8,192
- Input
- $0.001/1K tok
- Output
- $0.005/1K tok
- Coverage
- Full profile
MiniMax
MiniMax-M2.5
MiniMax
MiniMax's latest M2.5 text model for coding agents, multimodal assistants, and high-speed inference.
- Context
- 204,800
- Input
- $0.002/1K tok
- Output
- $0.01/1K tok
- Coverage
- Full profile
MiniMax
MiniMax-M2.5-highspeed
MiniMax
MiniMax's latest M2.5 text model for coding agents, multimodal assistants, and high-speed inference.
- Context
- 204,800
- Input
- $0.002/1K tok
- Output
- $0.01/1K tok
- Coverage
- Full profile
MiniMax
MiniMax-M2.1
MiniMax
MiniMax's earlier M2.x models for general-purpose multimodal inference.
- Context
- 204,800
- Input
- $0.002/1K tok
- Output
- $0.01/1K tok
- Coverage
- Full profile
MiniMax
MiniMax-M2.1-highspeed
MiniMax
MiniMax's earlier M2.x models for general-purpose multimodal inference.
- Context
- 204,800
- Input
- $0.002/1K tok
- Output
- $0.01/1K tok
- Coverage
- Full profile
MiniMax
MiniMax-M2
MiniMax
MiniMax's earlier M2.x models for general-purpose multimodal inference.
- Context
- 204,800
- Input
- $0.002/1K tok
- Output
- $0.01/1K tok
- Coverage
- Full profile