DeepSeek: DeepSeek V4 Flash
About DeepSeek: DeepSeek V4 Flash
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...
More LLM Models
Anthropic: Claude Fable Latest
~anthropic
This model always redirects to the latest model in the Claude Fable family..
NVIDIA: Nemotron 3 Ultra (free)
nvidia
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE).
NVIDIA: Nemotron 3 Ultra
nvidia
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE).
Anthropic: Claude Opus 4.8 (Fast)
anthropic
Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with higher output speed at 2x pricing relative to regular Opus 4.8. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode.