ByteDance: UI-TARS 7B
bytedance 🔮 Multimodal
About ByteDance: UI-TARS 7B
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...
More Multimodal Models
MoonshotAI: Kimi K2.7 Code
moonshotai
MoonshotAI: Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts.
Nex AGI: Nex-N2-Pro (free)
nex-agi
Nex-N2-Pro is an agentic mixture-of-experts model from Nex AGI, with 17B active parameters out of 397B total.
NVIDIA: Nemotron 3.5 Content Safety (free)
nvidia
NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, fine-tuned from Google Gemma-3-4B.
Qwen: Qwen3.7 Plus
qwen
Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series.