Supported Providers

BoostGPT supports 9 AI providers and 50+ models, ranging from ultra-fast nano models to advanced reasoning models.

Key Features

Use your own API keys for OpenAI, Google, Anthropic, xAI, DeepSeek, Mistral, Cohere, Groq, or Ollama. You control costs and rate limits.
Don’t have API keys? Use our pooled infrastructure on paid plans. We handle provisioning, rate limits, and scaling.
Access advanced reasoning models like O1, O3 Mini, DeepSeek R1, Gemini Flash Thinking, and Claude Extended Thinking for complex problem-solving.
Choose models based on speed, cost, reasoning capability, and context window. Use fast models for simple queries, reasoning models for complex tasks.
Run models locally using Ollama. Keep data private, eliminate API costs, and use custom fine-tuned models.

Model Categories

Speed-Optimized (Nano/Mini)

Ultra-fast models for simple tasks, high-volume applications, and real-time responses. Best for: FAQs, basic chat, high-traffic bots
Examples: GPT-4.1 Nano, Llama 3.1 Instant (8B), Gemini 2.5 Flash Lite

Balanced (Standard)

Great all-around models balancing speed, cost, and capability. Best for: Most production use cases, customer support, content generation
Examples: GPT-4o Mini, Claude Sonnet 4.5, Gemini 2.5 Flash

Advanced (Pro/Large)

Powerful models for complex tasks requiring deep understanding. Best for: Complex queries, creative writing, technical analysis
Examples: GPT-5, Claude Opus 4.1, Gemini 2.5 Pro

Reasoning Models

Specialized models with extended thinking for complex problem-solving. Best for: Math, coding, logic puzzles, multi-step reasoning
Examples: O1, DeepSeek R1, Gemini Flash Thinking, Claude Extended Thinking
Learn more about reasoning models →

Model Comparison

Provider  | Model             | Speed  | Cost   | Reasoning   | Context | Credits
----------|-------------------|--------|--------|-------------|---------|--------
OpenAI    | O1                | Slow   | High   | Exceptional | 200K    | 6
OpenAI    | GPT-5             | Medium | High   | Exceptional | 200K    | 5
OpenAI    | GPT-4o            | Medium | Medium | Excellent   | 128K    | 5
OpenAI    | GPT-4o Mini       | Fast   | Low    | Good        | 128K    | 1
Google    | Gemini 2.5 Pro    | Slow   | High   | Exceptional | 2M      | 3
Google    | Gemini 2.5 Flash  | Fast   | Medium | Excellent   | 1M      | 2
Anthropic | Claude Opus 4.1   | Medium | High   | Exceptional | 200K    | 5
Anthropic | Claude Sonnet 4.5 | Medium | Medium | Excellent   | 200K    | 3
xAI       | Grok 3 Reasoning  | Medium | Medium | Excellent   | 128K    | 4
DeepSeek  | DeepSeek R1       | Medium | Medium | Excellent   | 64K     | 2
Groq      | Llama 3.3 (70B)   | Fast   | Medium | Excellent   | 128K    | 3
Ollama    | Llama 3.3 (70B)   | Medium | Free*  | Excellent   | 128K    | 2

*Local hosting costs (compute) are not included.
View full comparison →
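When comparing models programmatically, the table above can be expressed as data and filtered by your constraints. A minimal sketch (the entries mirror a subset of the table; the helper name is illustrative, not part of the SDK):

```javascript
// Subset of the comparison table above, expressed as data.
const MODELS = [
  { provider: 'OpenAI',    model: 'O1',                speed: 'Slow',   credits: 6, context: '200K' },
  { provider: 'OpenAI',    model: 'GPT-5',             speed: 'Medium', credits: 5, context: '200K' },
  { provider: 'OpenAI',    model: 'GPT-4o Mini',       speed: 'Fast',   credits: 1, context: '128K' },
  { provider: 'Google',    model: 'Gemini 2.5 Flash',  speed: 'Fast',   credits: 2, context: '1M' },
  { provider: 'Anthropic', model: 'Claude Sonnet 4.5', speed: 'Medium', credits: 3, context: '200K' },
  { provider: 'DeepSeek',  model: 'DeepSeek R1',       speed: 'Medium', credits: 2, context: '64K' },
];

// Return model names matching a speed tier within a credit budget.
function shortlist(maxCredits, speed) {
  return MODELS.filter(m => m.credits <= maxCredits && m.speed === speed)
               .map(m => m.model);
}

console.log(shortlist(2, 'Fast')); // ['GPT-4o Mini', 'Gemini 2.5 Flash']
```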

Using Your Own Models (Ollama)

The only way to use custom models in BoostGPT is through Ollama.
1. Install Ollama

Download and install Ollama on your server or local machine.

2. Pull Your Model

    ollama pull llama3.3:70b
    # Or your custom fine-tuned model

3. Configure BoostGPT

Set your Ollama host URL in the dashboard or via the SDK:

    const bot = await client.chat({
      bot_id: 'my-bot-id',
      provider_host: 'http://localhost:11434',  // or your server IP
      model: 'llama3.3:70b',
      message: 'Hello, world!'
    });

4. Deploy

Your agent will now use your self-hosted Ollama model. No API costs, full control.
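In practice the Ollama host usually differs between local development and a deployed server. One way to keep the `client.chat` call above portable is to resolve the host from the environment; a sketch (the `OLLAMA_HOST` variable name and helper are illustrative, not a BoostGPT requirement):

```javascript
// Build the options object for client.chat() against a self-hosted Ollama model.
// OLLAMA_HOST is an illustrative env variable; 11434 is Ollama's default port.
function ollamaChatOptions(botId, message,
    host = process.env.OLLAMA_HOST || 'http://localhost:11434') {
  return {
    bot_id: botId,
    provider_host: host,    // local install or your server IP
    model: 'llama3.3:70b',
    message,
  };
}
```

In development this falls back to the local default; in production, point `OLLAMA_HOST` at your server and pass the result straight to `client.chat(...)`.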
Learn more about Ollama →

Credits System

Each model has a credit cost per message. Credits are consumed when your agent generates responses.
  • Nano/Mini models: 0.5-1 credits (cheapest)
  • Standard models: 2-3 credits
  • Pro/Large models: 4-5 credits
  • Reasoning models: 3-6 credits (most expensive, but highest quality)
Credits are only for agent responses. Incoming messages, training data, and API calls don’t consume credits.
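Since only agent responses consume credits, a rough budget follows directly from your expected response volume. A minimal sketch using the upper end of each range above (the category names and helper are illustrative):

```javascript
// Credit cost per agent response, by category (upper end of each range above).
const CREDITS_PER_RESPONSE = {
  mini: 1,       // nano/mini: 0.5-1 credits
  standard: 3,   // standard: 2-3 credits
  pro: 5,        // pro/large: 4-5 credits
  reasoning: 6,  // reasoning: 3-6 credits
};

// Rough 30-day estimate; incoming messages and API calls are free.
function monthlyCredits(responsesPerDay, category) {
  return responsesPerDay * 30 * CREDITS_PER_RESPONSE[category];
}

console.log(monthlyCredits(100, 'standard')); // 9000
```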

Choosing the Right Model

For Creators (No-Code Dashboard)

  1. Go to your bot settings
  2. Click “Model Selection”
  3. Choose based on your needs:
    • Fast responses → GPT-4o Mini, Gemini Flash Lite
    • Best quality → GPT-5, Claude Opus, Gemini Pro
    • Complex reasoning → O1, DeepSeek R1
    • Low cost → Use nano/mini variants
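The same decision guide applies when selecting a model in code; a simple lookup sketch (the need names and model identifier strings are illustrative — check your dashboard for the exact ids your plan exposes):

```javascript
// The selection guide above as a lookup table (names are illustrative).
const MODEL_FOR_NEED = {
  fast: 'gpt-4o-mini',       // fast responses
  quality: 'gpt-5',          // best quality
  reasoning: 'o1',           // complex reasoning
  cheap: 'gpt-4.1-nano',     // low cost
};

function pickModel(need) {
  return MODEL_FOR_NEED[need] ?? MODEL_FOR_NEED.fast; // sensible default
}

console.log(pickModel('reasoning')); // 'o1'
```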

For Developers (SDK/API)

const bot = await client.createBot({
  name: 'Support Bot',
  model: 'gpt-4o-mini', // Fast & affordable
  // Or for complex queries:
  // model: 'o1', // Advanced reasoning
});

Provider-Specific Setup

Next Steps