Guide

Learn more about how to use AI models with TypingMind

xAI: Grok 4.20 Multi-Agent Beta logoxAI: Grok 4.20 Multi-Agent Beta via OpenRouter

Access xAI: Grok 4.20 Multi-Agent Beta via OpenRouter

Grok 4.20 Multi-Agent Beta is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information across complex tasks. Reasoning effort behavior: - low / medium: 4 agents - high / xhigh: 16 agents

3 min read
xAI: Grok 4.20 Beta logoxAI: Grok 4.20 Beta via OpenRouter

Access xAI: Grok 4.20 Beta via OpenRouter

Grok 4.20 Beta is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering consistently precise and truthful responses. Reasoning can be enabled/disabled using the `reasoning` `enabled` parameter in the API. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#controlling-reasoning-tokens)

3 min read
Hunter Alpha logoHunter Alpha via OpenRouter

Access Hunter Alpha via OpenRouter

Hunter Alpha is a 1 Trillion parameter + 1M token context frontier intelligence model built for agentic use. It excels at long-horizon planning, complex reasoning, and sustained multi-step task execution, with the reliability and instruction-following precision that frameworks like OpenClaw need. **Note:** All prompts and completions for this model are logged by the provider and may be used to improve the model.

3 min read
Healer Alpha logoHealer Alpha via OpenRouter

Access Healer Alpha via OpenRouter

Healer Alpha is a frontier omni-modal model with vision, hearing, reasoning, and action capabilities. It brings the full power of agentic intelligence into the real world: natively perceiving visual and audio inputs, reasoning across modalities, and executing complex multi-step tasks with precision and reliability. **Note:** All prompts and completions for this model are logged by the provider and may be used to improve the model.

3 min read
NVIDIA: Nemotron 3 Super (free) logoNVIDIA: Nemotron 3 Super (free) via OpenRouter

Access NVIDIA: Nemotron 3 Super (free) via OpenRouter

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer Mixture-of-Experts architecture with multi-token prediction (MTP), it delivers over 50% higher token generation compared to leading open models. The model features a 1M token context window for long-term agent coherence, cross-document reasoning, and multi-step task planning. Latent MoE enables calling 4 experts for the inference cost of only one, improving intelligence and generalization. Multi-environment RL training across 10+ environments delivers leading accuracy on benchmarks including AIME 2025, TerminalBench, and SWE-Bench Verified. Fully open with weights, datasets, and recipes under the NVIDIA Open License, Nemotron 3 Super allows easy customization and secure deployment anywhere — from workstation to cloud.

3 min read
GLM 5 Turbo logoGLM 5 Turbo from chutes

Use GLM 5 Turbo from chutes with API Key

GLM 5 Turbo from Chutes - text input, 202,752 token context

5 min read
ByteDance Seed: Seed-2.0-Lite logoByteDance Seed: Seed-2.0-Lite via OpenRouter

Access ByteDance Seed: Seed-2.0-Lite via OpenRouter

Seed-2.0-Lite is a versatile, cost‑efficient enterprise workhorse that delivers strong multimodal and agent capabilities while offering noticeably lower latency, making it a practical default choice for most production workloads across text, vision, and tools. Engineered for high-frequency visual understanding and agentic workflows, it's an ideal choice for deployment at scale with minimal latency.

3 min read
Qwen: Qwen3.5-9B logoQwen: Qwen3.5-9B via OpenRouter

Access Qwen: Qwen3.5-9B via OpenRouter

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design with early fusion of multimodal tokens, allowing the model to process and reason across text and images within the same context.

3 min read
Grok 4.20 Beta (Non-Reasoning) logoGrok 4.20 Beta (Non-Reasoning) from xai

Use Grok 4.20 Beta (Non-Reasoning) from xai with API Key

Grok 4.20 Beta (Non-Reasoning) from xAI - text, image input, 2,000,000 token context

5 min read
Grok 4.20 Multi-Agent Beta logoGrok 4.20 Multi-Agent Beta from xai

Use Grok 4.20 Multi-Agent Beta from xai with API Key

Grok 4.20 Multi-Agent Beta from xAI - text, image input, 2,000,000 token context

5 min read
Grok 4.20 Beta (Reasoning) logoGrok 4.20 Beta (Reasoning) from xai

Use Grok 4.20 Beta (Reasoning) from xai with API Key

Grok 4.20 Beta (Reasoning) from xAI - text, image input, 2,000,000 token context

5 min read
OpenAI: GPT-5.4 Pro logoOpenAI: GPT-5.4 Pro via OpenRouter

Access OpenAI: GPT-5.4 Pro via OpenRouter

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs. Optimized for step-by-step reasoning, instruction following, and accuracy, GPT-5.4 Pro excels at agentic coding, long-context workflows, and multi-step problem solving.

3 min read
OpenAI: GPT-5.4 logoOpenAI: GPT-5.4 via OpenRouter

Access OpenAI: GPT-5.4 via OpenRouter

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs, enabling high-context reasoning, coding, and multimodal analysis within the same workflow. The model delivers improved performance in coding, document understanding, tool use, and instruction following. It is designed as a strong default for both general-purpose tasks and software engineering, capable of generating production-quality code, synthesizing information across multiple sources, and executing complex multi-step workflows with fewer iterations and greater token efficiency.

3 min read
GPT-5.4 logoGPT-5.4 from openai

Use GPT-5.4 from openai with API Key

GPT-5.4 from OpenAI - text, image, pdf input, 1,050,000 token context

5 min read
GPT-5.4 Pro logoGPT-5.4 Pro from openai

Use GPT-5.4 Pro from openai with API Key

GPT-5.4 Pro from OpenAI - text, image input, 1,050,000 token context

5 min read
GPT-5.4 Pro logoGPT-5.4 Pro from venice

Use GPT-5.4 Pro from venice with API Key

GPT-5.4 Pro from Venice AI - text, image input, 1,000,000 token context

5 min read
GPT-5.4 logoGPT-5.4 from venice

Use GPT-5.4 from venice with API Key

GPT-5.4 from Venice AI - text, image input, 1,000,000 token context

5 min read
Inception: Mercury 2 logoInception: Mercury 2 via OpenRouter

Access Inception: Mercury 2 via OpenRouter

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving >1,000 tokens/sec on standard GPUs. Mercury 2 is 5x+ faster than leading speed-optimized LLMs like Claude 4.5 Haiku and GPT 5 Mini, at a fraction of the cost. Mercury 2 supports tunable reasoning levels, 128K context, native tool use, and schema-aligned JSON output. Built for coding workflows where latency compounds, real-time voice/search, and agent loops. OpenAI API compatible. Read more in the [blog post](https://www.inceptionlabs.ai/blog/introducing-mercury-2).

3 min read
Grok 4.20 (Experimental, Reasoning) logoGrok 4.20 (Experimental, Reasoning) from xai

Use Grok 4.20 (Experimental, Reasoning) from xai with API Key

Grok 4.20 (Experimental, Reasoning) from xAI - text, image input, 2,000,000 token context

5 min read
Grok 4.20 Multi-Agent (Experimental) logoGrok 4.20 Multi-Agent (Experimental) from xai

Use Grok 4.20 Multi-Agent (Experimental) from xai with API Key

Grok 4.20 Multi-Agent (Experimental) from xAI - text, image input, 2,000,000 token context

5 min read
Grok 4.20 (Experimental, Non-Reasoning) logoGrok 4.20 (Experimental, Non-Reasoning) from xai

Use Grok 4.20 (Experimental, Non-Reasoning) from xai with API Key

Grok 4.20 (Experimental, Non-Reasoning) from xAI - text, image input, 2,000,000 token context

5 min read
OpenAI: GPT-5.3 Chat logoOpenAI: GPT-5.3 Chat via OpenRouter

Access OpenAI: GPT-5.3 Chat via OpenRouter

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations smoother, more useful, and more directly helpful. It delivers more accurate answers with better contextualization and significantly reduces unnecessary refusals, caveats, and overly cautious phrasing that can interrupt conversational flow.

3 min read
Google: Gemini 3.1 Flash Lite Preview logoGoogle: Gemini 3.1 Flash Lite Preview via OpenRouter

Access Google: Gemini 3.1 Flash Lite Preview via OpenRouter

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.

3 min read
Gemini 3.1 Flash Lite Preview logoGemini 3.1 Flash Lite Preview from google

Use Gemini 3.1 Flash Lite Preview from google with API Key

Gemini 3.1 Flash Lite Preview from Google - text, image, video, audio, pdf input, 1,048,576 token context

5 min read
GPT-4o logoGPT-4o from venice

Use GPT-4o from venice with API Key

GPT-4o from Venice AI - text, image input, 128,000 token context

5 min read
GPT-4o Mini logoGPT-4o Mini from venice

Use GPT-4o Mini from venice with API Key

GPT-4o Mini from Venice AI - text, image input, 128,000 token context

5 min read
ByteDance Seed: Seed-2.0-Mini logoByteDance Seed: Seed-2.0-Mini via OpenRouter

Access ByteDance Seed: Seed-2.0-Mini via OpenRouter

Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, emphasizing fast response and flexible inference deployment. It delivers performance comparable to ByteDance-Seed-1.6, supports 256k context, four reasoning effort modes (minimal/low/medium/high), multimodal understanding, and is optimized for lightweight tasks where cost and speed take priority.

3 min read
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) logoGoogle: Nano Banana 2 (Gemini 3.1 Flash Image Preview) via OpenRouter

Access Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) via OpenRouter

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state of the art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines advanced contextual understanding with fast, cost-efficient inference, making complex image generation and iterative edits significantly more accessible. Aspect ratios can be controlled with the [image_config API Parameter](https://openrouter.ai/docs/features/multimodal/image-generation#image-aspect-ratio-configuration)

3 min read
Gemini 3.1 Flash Image (Preview) logoGemini 3.1 Flash Image (Preview) from google

Use Gemini 3.1 Flash Image (Preview) from google with API Key

Gemini 3.1 Flash Image (Preview) from Google - text, image, pdf input, 131,072 token context

5 min read
Qwen: Qwen3.5-35B-A3B logoQwen: Qwen3.5-35B-A3B via OpenRouter

Access Qwen: Qwen3.5-35B-A3B via OpenRouter

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency. Its overall performance is comparable to that of the Qwen3.5-27B.

3 min read
Qwen: Qwen3.5-27B logoQwen: Qwen3.5-27B via OpenRouter

Access Qwen: Qwen3.5-27B via OpenRouter

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of the Qwen3.5-122B-A10B.

3 min read
Qwen: Qwen3.5-122B-A10B logoQwen: Qwen3.5-122B-A10B via OpenRouter

Access Qwen: Qwen3.5-122B-A10B via OpenRouter

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. In terms of overall performance, this model is second only to Qwen3.5-397B-A17B. Its text capabilities significantly outperform those of Qwen3-235B-2507, and its visual capabilities surpass those of Qwen3-VL-235B.

3 min read
Qwen: Qwen3.5-Flash logoQwen: Qwen3.5-Flash via OpenRouter

Access Qwen: Qwen3.5-Flash via OpenRouter

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the 3 series, these models deliver a leap forward in performance for both pure text and multimodal tasks, offering fast response times while balancing inference speed and overall performance.

3 min read
LiquidAI: LFM2-24B-A2B logoLiquidAI: LFM2-24B-A2B via OpenRouter

Access LiquidAI: LFM2-24B-A2B via OpenRouter

LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for efficient on-device deployment. Built as a 24B parameter Mixture-of-Experts model with only 2B active parameters per token, it delivers high-quality generation while maintaining low inference costs. The model fits within 32 GB of RAM, making it practical to run on consumer laptops and desktops without sacrificing capability.

3 min read
Google: Gemini 3.1 Pro Preview Custom Tools logoGoogle: Gemini 3.1 Pro Preview Custom Tools via OpenRouter

Access Google: Gemini 3.1 Pro Preview Custom Tools via OpenRouter

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party or user-defined functions are available. This specialized preview endpoint significantly increases function calling reliability and ensures the model selects the most appropriate tool in coding agents and complex, multi-tool workflows. It retains the core strengths of Gemini 3.1 Pro, including multimodal reasoning across text, image, video, audio, and code, a 1M-token context window, and strong software engineering performance.

3 min read
Qwen 3.5 35B A3B logoQwen 3.5 35B A3B from venice

Use Qwen 3.5 35B A3B from venice with API Key

Qwen 3.5 35B A3B from Venice AI - text, image, video input, 256,000 token context

5 min read
OpenAI: GPT-5.3-Codex logoOpenAI: GPT-5.3-Codex via OpenRouter

Access OpenAI: GPT-5.3-Codex via OpenRouter

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results on SWE-Bench Pro and strong performance on Terminal-Bench 2.0 and OSWorld-Verified, reflecting improved multi-language coding, terminal proficiency, and real-world computer-use skills. The model is optimized for long-running, tool-using workflows and supports interactive steering during execution, making it suitable for complex development tasks, debugging, deployment, and iterative product work. Beyond coding, GPT-5.3-Codex performs strongly on structured knowledge-work benchmarks such as GDPval, supporting tasks like document drafting, spreadsheet analysis, slide creation, and operational research across domains. It is trained with enhanced cybersecurity awareness, including vulnerability identification capabilities, and deployed with additional safeguards for high-risk use cases. Compared to prior Codex models, it is more token-efficient and approximately 25% faster, targeting professional end-to-end workflows that span reasoning, execution, and computer interaction.

3 min read
GPT-5.3 Codex logoGPT-5.3 Codex from venice

Use GPT-5.3 Codex from venice with API Key

GPT-5.3 Codex from Venice AI - text, image input, 400,000 token context

5 min read
AionLabs: Aion-2.0 logoAionLabs: Aion-2.0 via OpenRouter

Access AionLabs: Aion-2.0 via OpenRouter

Aion-2.0 is a variant of DeepSeek V3.2 optimized for immersive roleplaying and storytelling. It is particularly strong at introducing tension, crises, and conflict into stories, making narratives feel more engaging. It also handles mature and darker themes with more nuance and depth.

3 min read
Google: Gemini 3.1 Pro Preview logoGoogle: Gemini 3.1 Pro Preview via OpenRouter

Access Google: Gemini 3.1 Pro Preview via OpenRouter

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation of the Gemini 3 series, it combines high-precision reasoning across text, image, video, audio, and code with a 1M-token context window. Reasoning Details must be preserved when using multi-turn tool calling, see our docs here: https://openrouter.ai/docs/use-cases/reasoning-tokens#preserving-reasoning. The 3.1 update introduces measurable gains in SWE benchmarks and real-world coding environments, along with stronger autonomous task execution in structured domains such as finance and spreadsheet-based workflows. Designed for advanced development and agentic systems, Gemini 3.1 Pro Preview improves long-horizon stability and tool orchestration while increasing token efficiency. It introduces a new medium thinking level to better balance cost, speed, and performance. The model excels in agentic coding, structured planning, multimodal analysis, and workflow automation, making it well-suited for autonomous agents, financial modeling, spreadsheet automation, and high-context enterprise tasks.

3 min read
Gemini 3.1 Pro Preview Custom Tools logoGemini 3.1 Pro Preview Custom Tools from google

Use Gemini 3.1 Pro Preview Custom Tools from google with API Key

Gemini 3.1 Pro Preview Custom Tools from Google - text, image, video, audio, pdf input, 1,048,576 token context

5 min read
Gemini 3.1 Pro Preview logoGemini 3.1 Pro Preview from google

Use Gemini 3.1 Pro Preview from google with API Key

Gemini 3.1 Pro Preview from Google - text, image, video, audio, pdf input, 1,048,576 token context

5 min read
Gemini 3.1 Pro Preview logoGemini 3.1 Pro Preview from venice

Use Gemini 3.1 Pro Preview from venice with API Key

Gemini 3.1 Pro Preview from Venice AI - text, image, audio, video input, 1,000,000 token context

5 min read
Qwen3.5 397B A17B TEE logoQwen3.5 397B A17B TEE from chutes

Use Qwen3.5 397B A17B TEE from chutes with API Key

Qwen3.5 397B A17B TEE from Chutes - text, image input, 262,144 token context

5 min read
Anthropic: Claude Sonnet 4.6 logoAnthropic: Claude Sonnet 4.6 via OpenRouter

Access Anthropic: Claude Sonnet 4.6 via OpenRouter

Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with memory, polished document creation, and confident computer use for web QA and workflow automation.

3 min read
Claude Sonnet 4.6 logoClaude Sonnet 4.6 from anthropic

Use Claude Sonnet 4.6 from anthropic with API Key

Claude Sonnet 4.6 from Anthropic - text, image, pdf input, 200,000 token context

5 min read
Claude Sonnet 4.6 logoClaude Sonnet 4.6 from venice

Use Claude Sonnet 4.6 from venice with API Key

Claude Sonnet 4.6 from Venice AI - text, image input, 1,000,000 token context

5 min read
Claude Sonnet 4.6 (EU) logoClaude Sonnet 4.6 (EU) from AWS Bedrock

Use Claude Sonnet 4.6 (EU) from AWS Bedrock with API Key

Claude Sonnet 4.6 (EU) from Amazon Bedrock - text, image, pdf input, 200,000 token context

5 min read
Claude Sonnet 4.6 logoClaude Sonnet 4.6 from AWS Bedrock

Use Claude Sonnet 4.6 from AWS Bedrock with API Key

Claude Sonnet 4.6 from Amazon Bedrock - text, image, pdf input, 200,000 token context

5 min read
Claude Sonnet 4.6 (Global) logoClaude Sonnet 4.6 (Global) from AWS Bedrock

Use Claude Sonnet 4.6 (Global) from AWS Bedrock with API Key

Claude Sonnet 4.6 (Global) from Amazon Bedrock - text, image, pdf input, 200,000 token context

5 min read
Claude Sonnet 4.6 (US) logoClaude Sonnet 4.6 (US) from AWS Bedrock

Use Claude Sonnet 4.6 (US) from AWS Bedrock with API Key

Claude Sonnet 4.6 (US) from Amazon Bedrock - text, image, pdf input, 200,000 token context

5 min read