Model Catalog
Browse every model available through Revo Mail. Filter by provider, compare context windows and capabilities, and copy the slug directly into your code.
OpenAI's mainline GPT-5.2 — strong general-purpose reasoning, multimodal input, and tool-calling support at the lowest GPT-5 tier price.
Mid-tier GPT-5 with stronger reasoning headroom than 5.2 — a sweet spot for analytical workflows that don't need 5.5's full capability.
Flagship GPT-5.5. The most capable OpenAI model on the platform — deep reasoning, long-context comprehension, and frontier multimodal performance.
Anthropic's newest frontier Opus model. State-of-the-art coding, agentic reasoning, and long-horizon task execution.
Previous-generation flagship Opus. Excellent for complex multi-step engineering, research synthesis, and high-stakes reasoning.
The cost-efficient Opus tier. Strong reasoning at a meaningfully lower per-token price than 4.7/4.8.
Balanced Sonnet: a high-volume daily driver for chat, code, and structured output. The recommended starting point in the Claude family.
Anthropic's fastest and most cost-efficient Claude model. Ideal for high-volume chat, classification, summarisation, and lightweight agent loops where latency and cost matter most.
The latest production Gemini Flash. Sub-second latency, native multimodal input, and a 1M-token context window.
Preview build of Gemini 3.1 Pro. Frontier reasoning and multimodal understanding ahead of GA.
Ultra-cheap Gemini tier for high-volume routing, classification, and lightweight chat.
Preview Gemini 3 Flash. Solid multimodal Flash-tier model with a generous context window at a budget price.
Google's open-weight Gemma 4 (31B) served via Ollama Cloud. Excellent value for everyday chat and code-completion tasks.
Google's text-to-video generation model. Produces short, high-quality video clips from natural-language prompts.
Fast image-generation Gemini variant (gemini-3.1-flash-image-preview). Quick, cost-efficient image synthesis from text prompts.
Higher-fidelity image model (gemini-3-pro-image-preview). Better detail and prompt-following for production-grade visuals.
Moonshot AI's code-specialised K2.7 variant. Tuned for code generation, refactoring, and agentic coding workflows, with native long thinking and deep reasoning.
Moonshot AI's flagship reasoning model. Excellent at complex problem-solving, multi-step reasoning, and long-document understanding.
The cost-efficient predecessor to K2.6, retaining strong long-context handling at a fraction of the price.
DeepSeek's strongest model. Tuned for advanced coding tasks, technical reasoning, and structured output generation.
The latency-optimised variant. Sub-second responses for interactive code, chat, and agent loops at near-zero cost.
Cost-effective general-purpose model. Strong everyday performance with excellent value per token.
MiniMax's flagship long-context model with a 1M-token window. Ideal for whole-codebase analysis and large-document tasks.
The efficient long-context model. 1M-token window at a budget price — great for retrieval-heavy and document workflows.
Alibaba's latest Qwen-Plus tier. Strong multilingual performance, capable code generation, and reliable reasoning.
The previous-generation Qwen-Plus model. Solid baseline for general chat and structured tasks.
Xiaomi's flagship MiMo model. Tuned for complex agent workflows, code-focused tasks, and structured reasoning.
The cost-efficient MiMo tier. Quick, capable, and great for high-volume general-purpose calls.
Z.AI's flagship GLM model for long-horizon agentic tasks. Served via Ollama Cloud with native prefix caching.
The latest iteration of Z.AI's GLM family, served via Ollama Cloud. Strong instruction-following and bilingual reasoning.
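The catalog's filter-by-provider and compare-context-window workflow can also be sketched in code. This is a minimal illustration, not Revo Mail's actual API: the `CatalogEntry` type, the `by_provider` and `largest_context` helpers, the `example-*` slugs, and every context-window number are hypothetical. Only the two `gemini-*-image-preview` slugs come from the descriptions above, and their context windows are left unset because the catalog text does not state them.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class CatalogEntry:
    slug: str                      # identifier you would copy into your code
    provider: str                  # provider name used for filtering
    context_tokens: Optional[int]  # None when the window is not stated


# Illustrative entries only. The two Google slugs appear in the catalog text;
# the "example-*" slugs and all context_tokens values are placeholders.
CATALOG = [
    CatalogEntry("gemini-3.1-flash-image-preview", "google", None),
    CatalogEntry("gemini-3-pro-image-preview", "google", None),
    CatalogEntry("example-long-context-model", "example-provider", 1_000_000),
    CatalogEntry("example-small-model", "example-provider", 128_000),
]


def by_provider(entries: List[CatalogEntry], provider: str) -> List[CatalogEntry]:
    """Return the entries offered by one provider, preserving order."""
    return [e for e in entries if e.provider == provider]


def largest_context(entries: List[CatalogEntry]) -> Optional[CatalogEntry]:
    """Return the entry with the largest known context window, if any."""
    known = [e for e in entries if e.context_tokens is not None]
    return max(known, key=lambda e: e.context_tokens) if known else None
```

Calling `largest_context(by_provider(CATALOG, "example-provider"))` returns the placeholder long-context entry, whose `slug` field is the string you would paste into a request. Entries with an unknown window are skipped instead of being treated as zero, so a missing value never wins or loses a comparison by accident.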