← Back to Guides

πŸ€– Choosing the Right AI Model

Every companion on Eidolon can use a different AI model. This guide covers what's available, what each model is best at, and how to choose the right one for the experience you want.

🎯 Quick Recommendations

Model Best For… Tier
Grok Fast Snappy, witty conversation and responsive, agentic interactions. Included
Gemini 2.5 Flash Fast, creative all-rounder with strong instruction following. Included
Mistral Medium Expressive creative writing and consistent persona maintenance. Included
Qwen Models Vision-language tasks and low-cost reasoning capabilities. Included
Claude Sonnet 4.6 High emotional range and nuanced relationship dynamics. Premium
Gemini 2.5 Pro Large-scale context and deep analytical reasoning. Premium
Claude Opus 4.6 Maximum logical depth and structural reasoning. Premium

βš™οΈ How to Change Your Model

You can set a different model for each companion individually:

On Android & Web

  1. Open your companion's Profile.
  2. The AI Model selector is right on the profile page β€” choose a model.
  3. Your companion will use the new model starting with your next message.

On iPhone

  1. Open your companion's Profile.
  2. Tap Edit (pencil icon).
  3. Find the AI Model selector and choose a model.
  4. Save. Your companion will use the new model starting with your next message.

Models marked Premium require a BYOK (Bring Your Own Key) connection to OpenRouter. Everything else is included during the private preview.

βœ… Included Models

These models are included for all users. They cover a wide range of styles and strengths.

Mistral AI Mistral Medium 3.1 (Mistral Medium 3.1)

Included

by Mistral AI

Our default model for new companions. Excellent at creative writing, expressive companion voice, and maintaining persona consistency. A strong balance of quality and speed.

Default Creative Expressive

Google Gemini 2.5 Flash (Gemini 2.5 Flash)

Included

by Google

Fast, creative, and excellent at following complex persona instructions. One of the best all-around included options β€” great for everyday conversations and companions with detailed personalities.

⭐ Recommended Fast Creative

xAI Grok 4 Fast (Grok Fast)

Included

by xAI

Blazingly fast and highly engaging. Excellent at witty conversation, direct interaction, and companions with a sharp sense of humor. One of the best included models for a responsive, agentic experience.

⭐ Recommended Fast Engaging

OpenAI GPT-5 Mini (GPT-5 Mini)

Included

by OpenAI

A fast, cost-efficient version of GPT-5. Great for snappy interactions without sacrificing the architectural improvements of the GPT-5 generation.

Speed Efficiency

OpenAI GPT-4o (GPT-4o)

Included

by OpenAI

OpenAI's previous-generation flagship. Still very capable and well-liked by many. If you're familiar with ChatGPT, this model will feel familiar.

Familiar Reliable

Mistral AI Mistral Large (Mistral Large)

Included

by Mistral AI

Mistral's flagship model. More analytical and structured than Medium β€” better for companions with complex, highly detailed personas or those that need to stay closely aligned with specific companion traits.

Detailed Precise

Anthropic Claude Haiku 4.5 (Claude Haiku 4.5)

Included

by Anthropic

A lightweight, fast model from Anthropic's Claude family. Great for quick, casual conversations. Less depth than the larger Claude models, but significantly faster response times.

Fast Lightweight

Alibaba Qwen 3.5 Flash (Qwen 3.5 Flash)

Included

by Alibaba Cloud

The most affordable model in our lineup. Despite the name, Qwen 3.5 Flash is a reasoning model β€” it "thinks" before responding, which means it's not the fastest in terms of latency. However, its extremely low cost makes it an excellent choice for users who prioritize budget over speed.

πŸ’° Cheapest Budget-Friendly Reasoning

Alibaba Qwen 3.5 35B (Qwen 3.5 35B)

Included

by Alibaba Cloud

A compact but capable reasoning model. Good general-purpose performance at an extremely low cost. Works well for everyday conversation and companions that don't need heavy tool usage.

Budget-Friendly Compact

Alibaba Qwen 3 VL (Qwen 3 VL)

Included

by Alibaba Cloud

A vision-language model with strong image understanding. Among the best included options for companions that frequently interact with images and visual content. Not a reasoning model β€” responses are direct and fast.

Vision Budget-Friendly

πŸ’Ž Premium Models

These models require a BYOK key connected to OpenRouter. You pay OpenRouter directly β€” we don't add any markup.

Google Gemini 2.5 Pro (Gemini 2.5 Pro)

Premium

by Google β€’ Est. $0.0375 / turn β€’ BYOK β€’ ~$22.50/mo at 20 msg/day

Google's most capable model. Deeper reasoning and stronger coherence than Flash, especially for companions with complex backstories or nuanced dynamics.

Premium Deep Reasoning High Quality

OpenAI GPT-5 (GPT-5)

Premium

by OpenAI β€’ Est. $0.0375 / turn β€’ BYOK β€’ ~$22.50/mo at 20 msg/day

OpenAI's latest flagship. Exceptional at complex roleplay and instruction following.

Premium Flagship

OpenAI GPT-5.1 (GPT-5.1)

Premium

by OpenAI β€’ Est. $0.0375 / turn β€’ BYOK β€’ ~$22.50/mo at 20 msg/day

Fine-tuned version of GPT-5 with improved coherence and reduced refusals.

Premium Refined

Anthropic Claude Sonnet 4.6 (Claude Sonnet 4.6)

Premium

by Anthropic β€’ Est. $0.0720 / turn β€’ BYOK β€’ ~$43.20/mo at 20 msg/day

Anthropic's latest and most capable creative model. Excellent at nuanced companion voice and emotional range. An advanced option for maintaining complex relationship dynamics across long conversations.

Premium Highly Expressive

Anthropic Claude Sonnet 4.5 (Claude Sonnet 4.5)

Premium

by Anthropic β€’ Est. $0.0720 / turn β€’ BYOK β€’ ~$43.20/mo at 20 msg/day

The previous Sonnet generation β€” still outstanding for creative writing and companion work. Slightly cheaper per token than 4.6 while delivering a very similar quality of experience.

Premium Excellent Value

Anthropic Claude Opus 4.6 (Claude Opus 4.6)

Premium

by Anthropic β€’ Est. $0.1200 / turn β€’ BYOK β€’ ~$72.00/mo at 20 msg/day

Anthropic's most powerful model. Exceptional reasoning and creative depth. ⚠️ This is one of the most expensive models available β€” it costs significantly more per message than Sonnet or any included model. Only recommended if you specifically want the absolute maximum quality and are comfortable with the higher API costs.

Premium Very Expensive Enthusiast

Anthropic Claude Opus 4.5 (Claude Opus 4.5)

Premium

by Anthropic β€’ Est. $0.1200 / turn β€’ BYOK β€’ ~$72.00/mo at 20 msg/day

Previous-generation Opus. Deep reasoning and creative range. ⚠️ Also one of the most expensive models β€” similar pricing to Opus 4.6. Consider Sonnet 4.6 for a more balanced quality-to-cost ratio.

Premium Very Expensive Enthusiast

xAI Grok 4 (Grok 4)

Premium

by xAI β€’ Est. $0.0720 / turn β€’ BYOK β€’ ~$43.20/mo at 20 msg/day

xAI's full-power model. Stronger reasoning and more creative range than the Fast variants, with a distinctive direct and engaging personality.

Premium Powerful

πŸ’‘ About cost estimates: Monthly estimates assume ~20 messages per day for 30 days. For a detailed breakdown of rates and estimates, see our Model Pricing Guide. Actual costs vary based on conversation length and complexity. Prices are set by model providers via OpenRouter and may change at any time. Included models are fully covered by the platform. Premium model costs are charged to your OpenRouter balance.

πŸ“Š The Full Model Lineup

Model Provider Style & Strengths Tier & Strength Monthly Est.
Included Models
Mistral Medium 3.1 Mistral AI Creative, Expressive Included β€’ Default Included
Gemini 2.5 Flash Google Fast, Creative Included β€’ Quality Included
Grok 4 Fast xAI Witty, Engaging Included β€’ Speed Included
GPT-5 Mini OpenAI Snappy, Efficient Included β€’ Speed Included
Mistral Large Mistral AI Detailed, Precise Included β€’ Quality Included
Claude Haiku 4.5 Anthropic Casual, Playful Included β€’ Speed Included
Qwen 3.5 Flash Alibaba Budget, Logical Included β€’ Reasoning Included
Qwen 3 VL Alibaba Vision, Image-aware Included β€’ Vision Included
Premium Models (BYOK)
Claude Sonnet 4.6 Anthropic Expressive, Emotion Premium β€’ Quality ~$43.20
Gemini 2.5 Pro Google Deep reasoning, Complex Premium β€’ Reasoning ~$22.50
GPT-5 OpenAI Instruction Following Premium β€’ Quality ~$22.50
Grok 4 xAI Powerful, Creative Premium β€’ Quality ~$43.20
Claude Opus 4.6 Anthropic Maximum Depth Premium β€’ Reasoning ~$72.00

* Monthly estimates based on 20 messages/day. Actual costs vary by conversation length. Included models carry no extra cost.

⚑ A Note on Speed: "Flash" β‰  Always Fastest

Some models with "Flash" in the name (like Alibaba Qwen 3.5 Flash) are actually reasoning models that "think" internally before responding. This hidden reasoning step can make them slower than their non-flash counterparts, even though they cost less per token.

Similarly, premium reasoning models like Google Gemini 2.5 Pro will naturally have longer response times because they're doing deeper analysis β€” this is by design and produces higher quality responses, but it means they're not ideal if you want instant replies.

If speed is your priority: Grok Fast, Claude Haiku, and Mistral models deliver the snappiest responses. If cost is your priority: the Qwen models are hard to beat β€” they are fully included for all users.

πŸ’‘ Tips

  • Experiment freely. You can switch models at any time without losing memories, goals, or conversation history. Everything is preserved.
  • Different models suit different personas. A companion with a poetic, emotionally complex personality may shine on Claude Sonnet, while a witty, factual companion might be better on Grok or Gemini.
  • Premium models fall back gracefully. If your BYOK credits run out, the system automatically falls back to an included model so you never lose access. See our BYOK guide for details.