Guide

Learn more about how to use AI models with TypingMind

NVIDIA: Nemotron 3.5 Content Safety (free) logoNVIDIA: Nemotron 3.5 Content Safety (free) via OpenRouter

Access NVIDIA: Nemotron 3.5 Content Safety (free) via OpenRouter

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, fine-tuned from Google Gemma-3-4B. It moderates both inputs to and responses from LLMs and VLMs, accepting...

3 min read
NVIDIA: Nemotron 3 Ultra (free) logoNVIDIA: Nemotron 3 Ultra (free) via OpenRouter

Access NVIDIA: Nemotron 3 Ultra (free) via OpenRouter

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

3 min read
NVIDIA: Nemotron 3 Ultra logoNVIDIA: Nemotron 3 Ultra via OpenRouter

Access NVIDIA: Nemotron 3 Ultra via OpenRouter

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

3 min read
Qwen: Qwen3.7 Plus logoQwen: Qwen3.7 Plus via OpenRouter

Access Qwen: Qwen3.7 Plus via OpenRouter

Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and image input with text output, building on the series' text capabilities with a comprehensive upgrade to its...

3 min read
Qwen 3.7 Plus logoQwen 3.7 Plus from venice

Use Qwen 3.7 Plus from venice with API Key

Qwen 3.7 Plus from Venice AI - text, image, video input, 1,000,000 token context

5 min read
MiniMax M3 logoMiniMax M3 from venice

Use MiniMax M3 from venice with API Key

MiniMax M3 from Venice AI - text, image, video input, 500,000 token context

5 min read
MiniMax: MiniMax M3 logoMiniMax: MiniMax M3 via OpenRouter

Access MiniMax: MiniMax M3 via OpenRouter

MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding,...

3 min read
StepFun: Step 3.7 Flash logoStepFun: Step 3.7 Flash via OpenRouter

Access StepFun: Step 3.7 Flash via OpenRouter

Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters...

3 min read
Claude Opus 4.8 logoClaude Opus 4.8 from anthropic

Use Claude Opus 4.8 from anthropic with API Key

Claude Opus 4.8 from Anthropic - text, image, pdf input, 1,000,000 token context

5 min read
Claude Opus 4.8 Fast logoClaude Opus 4.8 Fast from venice

Use Claude Opus 4.8 Fast from venice with API Key

Claude Opus 4.8 Fast from Venice AI - text, image input, 1,000,000 token context

5 min read
Claude Opus 4.8 logoClaude Opus 4.8 from venice

Use Claude Opus 4.8 from venice with API Key

Claude Opus 4.8 from Venice AI - text, image input, 1,000,000 token context

5 min read
Claude Opus 4.8 (JP) logoClaude Opus 4.8 (JP) from AWS Bedrock

Use Claude Opus 4.8 (JP) from AWS Bedrock with API Key

Claude Opus 4.8 (JP) from Amazon Bedrock - text, image, pdf input, 1,000,000 token context

5 min read
Claude Opus 4.8 (AU) logoClaude Opus 4.8 (AU) from AWS Bedrock

Use Claude Opus 4.8 (AU) from AWS Bedrock with API Key

Claude Opus 4.8 (AU) from Amazon Bedrock - text, image, pdf input, 1,000,000 token context

5 min read
Claude Opus 4.8 (Global) logoClaude Opus 4.8 (Global) from AWS Bedrock

Use Claude Opus 4.8 (Global) from AWS Bedrock with API Key

Claude Opus 4.8 (Global) from Amazon Bedrock - text, image, pdf input, 1,000,000 token context

5 min read
Claude Opus 4.8 (EU) logoClaude Opus 4.8 (EU) from AWS Bedrock

Use Claude Opus 4.8 (EU) from AWS Bedrock with API Key

Claude Opus 4.8 (EU) from Amazon Bedrock - text, image, pdf input, 1,000,000 token context

5 min read
Claude Opus 4.8 (US) logoClaude Opus 4.8 (US) from AWS Bedrock

Use Claude Opus 4.8 (US) from AWS Bedrock with API Key

Claude Opus 4.8 (US) from Amazon Bedrock - text, image, pdf input, 1,000,000 token context

5 min read
Claude Opus 4.8 logoClaude Opus 4.8 from AWS Bedrock

Use Claude Opus 4.8 from AWS Bedrock with API Key

Claude Opus 4.8 from Amazon Bedrock - text, image, pdf input, 1,000,000 token context

5 min read
Anthropic: Claude Opus 4.8 (Fast) logoAnthropic: Claude Opus 4.8 (Fast) via OpenRouter

Access Anthropic: Claude Opus 4.8 (Fast) via OpenRouter

Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with higher output speed at 2x pricing relative to regular Opus 4.8. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

3 min read
Anthropic: Claude Opus 4.8 logoAnthropic: Claude Opus 4.8 via OpenRouter

Access Anthropic: Claude Opus 4.8 via OpenRouter

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...

3 min read
Qwen 3.7 Max logoQwen 3.7 Max from venice

Use Qwen 3.7 Max from venice with API Key

Qwen 3.7 Max from Venice AI - text input, 1,000,000 token context

5 min read
Gemini 3.5 Flash logoGemini 3.5 Flash from venice

Use Gemini 3.5 Flash from venice with API Key

Gemini 3.5 Flash from Venice AI - text, image, audio, video input, 1,000,000 token context

5 min read
Qwen: Qwen3.7 Max logoQwen: Qwen3.7 Max via OpenRouter

Access Qwen: Qwen3.7 Max via OpenRouter

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric workloads, with particular strengths in coding, office and productivity tasks,...

3 min read
Grok Build 0.1 logoGrok Build 0.1 from venice

Use Grok Build 0.1 from venice with API Key

Grok Build 0.1 from Venice AI - text, image input, 256,000 token context

5 min read
xAI: Grok Build 0.1 logoxAI: Grok Build 0.1 via OpenRouter

Access xAI: Grok Build 0.1 via OpenRouter

Grok Build 0.1 is xAI’s fast coding model trained specifically for agentic software engineering workflows. It supports text and image inputs with text output, and is optimized for interactive coding...

3 min read
Google: Gemini 3.5 Flash logoGoogle: Gemini 3.5 Flash via OpenRouter

Access Google: Gemini 3.5 Flash via OpenRouter

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

3 min read
Gemini 3.5 Flash logoGemini 3.5 Flash from google

Use Gemini 3.5 Flash from google with API Key

Gemini 3.5 Flash from Google - text, image, video, audio, pdf input, 1,048,576 token context

5 min read
Claude Opus 4.7 Fast logoClaude Opus 4.7 Fast from venice

Use Claude Opus 4.7 Fast from venice with API Key

Claude Opus 4.7 Fast from Venice AI - text, image input, 1,000,000 token context

5 min read
Anthropic: Claude Opus 4.7 (Fast) logoAnthropic: Claude Opus 4.7 (Fast) via OpenRouter

Access Anthropic: Claude Opus 4.7 (Fast) via OpenRouter

Fast-mode variant of [Opus 4.7](/anthropic/claude-opus-4.7) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

3 min read
OpenRouter: Fusion logoOpenRouter: Fusion via OpenRouter

Access OpenRouter: Fusion via OpenRouter

Fusion turns your prompt into a small multi-model deliberation. A panel of expert models (see below) analyzes your prompt in parallel with web search and web fetch enabled, then a...

3 min read
Perceptron: Perceptron Mk1 logoPerceptron: Perceptron Mk1 via OpenRouter

Access Perceptron: Perceptron Mk1 via OpenRouter

Perceptron Mk1 (Mark One) is Perceptron's highest-quality vision-language model for video and embodied reasoning.** It accepts image and video inputs paired with natural language queries, and produces detailed visual understanding...

3 min read
inclusionAI: Ring-2.6-1T (free) logoinclusionAI: Ring-2.6-1T (free) via OpenRouter

Access inclusionAI: Ring-2.6-1T (free) via OpenRouter

Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool...

3 min read
inclusionAI: Ring-2.6-1T logoinclusionAI: Ring-2.6-1T via OpenRouter

Access inclusionAI: Ring-2.6-1T via OpenRouter

Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool...

3 min read
Google: Gemini 3.1 Flash Lite logoGoogle: Gemini 3.1 Flash Lite via OpenRouter

Access Google: Gemini 3.1 Flash Lite via OpenRouter

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

3 min read
Gemini 3.1 Flash Lite logoGemini 3.1 Flash Lite from google

Use Gemini 3.1 Flash Lite from google with API Key

Gemini 3.1 Flash Lite from Google - text, image, video, audio, pdf input, 1,048,576 token context

5 min read
Baidu Qianfan: CoBuddy (free) logoBaidu Qianfan: CoBuddy (free) via OpenRouter

Access Baidu Qianfan: CoBuddy (free) via OpenRouter

CoBuddy is a code generation model from Baidu, optimized for coding tasks and AI Agent workflows. It features high inference throughput and low end-to-end latency, with native support for tool...

3 min read
OpenAI: GPT Chat Latest logoOpenAI: GPT Chat Latest via OpenRouter

Access OpenAI: GPT Chat Latest via OpenRouter

GPT Chat Latest points to OpenAI's stable API alias `chat-latest` that always resolves to the latest Instant chat model used in ChatGPT. As OpenAI rolls out new Instant model updates...

3 min read
xAI: Grok 4.3 logoxAI: Grok 4.3 via OpenRouter

Access xAI: Grok 4.3 via OpenRouter

Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is suited for agentic workflows, instruction-following tasks, and applications requiring high factual...

3 min read
IBM: Granite 4.1 8B logoIBM: Granite 4.1 8B via OpenRouter

Access IBM: Granite 4.1 8B via OpenRouter

Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...

3 min read
Mistral: Mistral Medium 3.5 logoMistral: Mistral Medium 3.5 via OpenRouter

Access Mistral: Mistral Medium 3.5 via OpenRouter

Mistral Medium 3.5 is a dense 128B instruction-following model from Mistral AI. It supports text and image inputs with text output, and is designed for agentic workflows, coding, and complex...

3 min read
Mistral Medium 3.5 logoMistral Medium 3.5 from mistral

Use Mistral Medium 3.5 from mistral with API Key

Mistral Medium 3.5 from Mistral - text, image input, 262,144 token context

5 min read
Owl Alpha logoOwl Alpha via OpenRouter

Access Owl Alpha via OpenRouter

Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively supports tool use, and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution....

3 min read
NVIDIA: Nemotron 3 Nano Omni (free) logoNVIDIA: Nemotron 3 Nano Omni (free) via OpenRouter

Access NVIDIA: Nemotron 3 Nano Omni (free) via OpenRouter

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...

3 min read
Poolside: Laguna XS.2 (free) logoPoolside: Laguna XS.2 (free) via OpenRouter

Access Poolside: Laguna XS.2 (free) via OpenRouter

Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering...

3 min read
Poolside: Laguna M.1 (free) logoPoolside: Laguna M.1 (free) via OpenRouter

Access Poolside: Laguna M.1 (free) via OpenRouter

Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling and reasoning, with a 128K...

3 min read
Anthropic Claude Haiku Latest logoAnthropic Claude Haiku Latest via OpenRouter

Access Anthropic Claude Haiku Latest via OpenRouter

This model always redirects to the latest model in the Anthropic Claude Haiku family.

3 min read
OpenAI GPT Mini Latest logoOpenAI GPT Mini Latest via OpenRouter

Access OpenAI GPT Mini Latest via OpenRouter

This model always redirects to the latest model in the OpenAI GPT Mini family.

3 min read
Google Gemini Pro Latest logoGoogle Gemini Pro Latest via OpenRouter

Access Google Gemini Pro Latest via OpenRouter

This model always redirects to the latest model in the Google Gemini Pro family.

3 min read
MoonshotAI Kimi Latest logoMoonshotAI Kimi Latest via OpenRouter

Access MoonshotAI Kimi Latest via OpenRouter

This model always redirects to the latest model in the MoonshotAI Kimi family.

3 min read
Google Gemini Flash Latest logoGoogle Gemini Flash Latest via OpenRouter

Access Google Gemini Flash Latest via OpenRouter

This model always redirects to the latest model in the Google Gemini Flash family.

3 min read
Anthropic Claude Sonnet Latest logoAnthropic Claude Sonnet Latest via OpenRouter

Access Anthropic Claude Sonnet Latest via OpenRouter

This model always redirects to the latest model in the Anthropic Claude Sonnet family.

3 min read
OpenAI GPT Latest logoOpenAI GPT Latest via OpenRouter

Access OpenAI GPT Latest via OpenRouter

This model always redirects to the latest model in the OpenAI GPT family.

3 min read