Meta: Llama 3.2 11B Vision Instruct

meta-llama 🔮 Multimodal

About Meta: Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

Pricing

api

Specs

Provider: meta-llama
Type: 🔮 Multimodal
API: Available

View on OpenRouter →

More Multimodal Models

Google: Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image)

google

Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) is Google's fastest, most cost-efficient Gemini image model, built for high-velocity developer pipelines and rapid-fire visual exploration.

Google: Nano Banana 2 (Gemini 3.1 Flash Image)

google

Gemini 3.1 Flash Image, a.k.a.

Google: Nano Banana Pro (Gemini 3 Pro Image)

google

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro.

MoonshotAI: Kimi K2.7 Code

moonshotai

MoonshotAI: Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts.