Supported Providers
BoostGPT supports 9 AI providers with 50+ models spanning from ultra-fast nano models to advanced reasoning models.OpenAI
GPT-4o, GPT-5, O1, O3 Mini + reasoning models
Gemini 2.5 Pro, Gemini 3, Flash Thinking
Anthropic
Claude Opus 4.1, Sonnet 4.5, Extended Thinking
xAI
Grok 3, Grok 4.1, Reasoning models
DeepSeek
DeepSeek V3, DeepSeek R1 (reasoning)
Mistral
Small, Medium, Large variants
Cohere
Command R+, Command A, Command R7B
Groq
Ultra-fast inference: Llama, Qwen, Mistral
Ollama
Self-hosted local models (bring your own)
Key Features
Bring Your Own API Keys
Bring Your Own API Keys
Use your own API keys for OpenAI, Google, Anthropic, xAI, DeepSeek, Mistral, Cohere, Groq, or Ollama. You control costs and rate limits.
Use Our Infrastructure (Optional)
Use Our Infrastructure (Optional)
Don’t have API keys? Use our pooled infrastructure on paid plans. We handle provisioning, rate limits, and scaling.
Reasoning Models
Reasoning Models
Access advanced reasoning models like O1, O3 Mini, DeepSeek R1, Gemini Flash Thinking, and Claude Extended Thinking for complex problem-solving.
Smart Model Selection
Smart Model Selection
Choose models based on speed, cost, reasoning capability, and context window. Use fast models for simple queries, reasoning models for complex tasks.
Self-Hosted with Ollama
Self-Hosted with Ollama
Run models locally using Ollama. Keep data private, eliminate API costs, and use custom fine-tuned models.
Model Categories
Speed-Optimized (Nano/Mini)
Ultra-fast models for simple tasks, high-volume applications, and real-time responses. Best for: FAQs, basic chat, high-traffic botsExamples: GPT-4.1 Nano, Llama 3.1 Instant (8B), Gemini 2.5 Flash Lite
Balanced (Standard)
Great all-around models balancing speed, cost, and capability. Best for: Most production use cases, customer support, content generationExamples: GPT-4o Mini, Claude Sonnet 4.5, Gemini 2.5 Flash
Advanced (Pro/Large)
Powerful models for complex tasks requiring deep understanding. Best for: Complex queries, creative writing, technical analysisExamples: GPT-5, Claude Opus 4.1, Gemini 2.5 Pro
Reasoning Models
Specialized models with extended thinking for complex problem-solving. Best for: Math, coding, logic puzzles, multi-step reasoningExamples: O1, DeepSeek R1, Gemini Flash Thinking, Claude Extended Thinking Learn more about reasoning models →
Model Comparison
| Provider | Model | Speed | Cost | Reasoning | Context | Credits |
|---|---|---|---|---|---|---|
| OpenAI | O1 | Slow | High | Exceptional | 200K | 6 |
| OpenAI | GPT-5 | Medium | High | Exceptional | 200K | 5 |
| OpenAI | GPT-4o | Medium | Medium | Excellent | 128K | 5 |
| OpenAI | GPT-4o Mini | Fast | Low | Good | 128K | 1 |
| Gemini 2.5 Pro | Slow | High | Exceptional | 2M | 3 | |
| Gemini 2.5 Flash | Fast | Medium | Excellent | 1M | 2 | |
| Anthropic | Claude Opus 4.1 | Medium | High | Exceptional | 200K | 5 |
| Anthropic | Claude Sonnet 4.5 | Medium | Medium | Excellent | 200K | 3 |
| xAI | Grok 3 Reasoning | Medium | Medium | Excellent | 128K | 4 |
| DeepSeek | DeepSeek R1 | Medium | Medium | Excellent | 64K | 2 |
| Groq | Llama 3.3 (70B) | Fast | Medium | Excellent | 128K | 3 |
| Ollama | Llama 3.3 (70B) | Medium | Free* | Excellent | 128K | 2 |
Using Your Own Models (Ollama)
The only way to use custom models in BoostGPT is through Ollama.1
Install Ollama
Download and install Ollama on your server or local machine.
2
Pull Your Model
3
Configure BoostGPT
Set your Ollama host URL in the dashboard or via SDK:
4
Deploy
Your agent will now use your self-hosted Ollama model. No API costs, full control.
Credits System
Each model has a credit cost per message. Credits are consumed when your agent generates responses.- Nano/Mini models: 0.5-1 credits (cheapest)
- Standard models: 2-3 credits
- Pro/Large models: 4-5 credits
- Reasoning models: 3-6 credits (most expensive, but highest quality)
Credits are only for agent responses. Incoming messages, training data, and API calls don’t consume credits.
Choosing the Right Model
For Creators (No-Code Dashboard)
- Go to your bot settings
- Click “Model Selection”
- Choose based on your needs:
- Fast responses → GPT-4o Mini, Gemini Flash Lite
- Best quality → GPT-5, Claude Opus, Gemini Pro
- Complex reasoning → O1, DeepSeek R1
- Low cost → Use nano/mini variants
For Developers (SDK/API)
Provider-Specific Setup
Bring Your Own Keys
Use your existing API keys from any provider
Use Our Infrastructure
Let us handle provisioning (paid plans only)