MODELS

60+ models. One picker.

Every model we support, refreshed as new releases ship from the underlying providers.

Applies to web, iOS and Android

Chat & reasoning

•

OpenAI — GPT-5, GPT-4.x family.

•

Anthropic — Claude Opus 4.7, Sonnet 4.6, Haiku 4.5.

•

Google — Gemini 3 Pro, Flash.

•

Meta — Llama 4 family.

•

Mistral — Large, Medium.

•

Cohere — Command R+.

Image

•

Flux (Pro / Dev / Schnell), SDXL, DALL·E, Ideogram.

•

Image-to-image, ControlNet, style presets, magic prompt.

Video

•

Luma Dream Machine, Runway Gen-3, Google Veo, Kling.

•

Text-to-video and image-to-video, up to 10 seconds at 1080p.

Audio & voice

•

ElevenLabs (TTS, voice cloning), Suno (music), Whisper (transcription).

The live picker

Inside the studio, the model picker shows real-time pricing, latency, and capability tags for every supported model. Sign in to see the current list — it changes weekly as providers ship updates.

Per-model deep dives

Dedicated landing pages with pricing, capabilities, code examples and FAQs for the most-asked-about models:

•

GPT-5 on Elevence AI — OpenAI's frontier model, 400K context

•

Claude Opus & Sonnet — Anthropic's family, 200K context, best code generation

•

Gemini 3 Pro & Flash — Google's frontier, 1M context, native multimodal

•

Llama 4 — Meta's open-weight model, strong reasoning at low cost

•

FLUX — Black Forest Labs image generation, photorealistic outputs

•

DALL·E 3 — OpenAI's image model, best text rendering in images

•

Runway Gen-3 — Cinematic text-to-video with motion control

•

Luma Dream Machine — Cost-effective text-to-video at speed

•

ElevenLabs — State-of-the-art TTS, voice cloning, music generation