June 2025

Alibaba

🧠
Qwen 2.5-Omni flagship
Multimodal, 128 k ctx, speech
1.2T
1.8e25
400-600ms
$0.05/1K tokens
v2.5
2024-04-01
🔬
Qwen 2.5-Max
Highest reasoning & benchmark scores
800B
1.2e25
300-450ms
$0.07/1K tokens
v2.5
2024-04-01
QwQ-32B
Next-gen reasoning model
32B
0.4e25
100-200ms
$0.01/1K tokens
v1.0
2024-03-25
👁️
Qwen 2.5-VL
Vision-language tasks
60B
0.35e25
200-300ms
$0.03/1K tokens
v2.5
2024-04-01
⚡️
Qwen 2.5-Fast
Budget-friendly rapid chat
20B
0.15e25
100-180ms
$0.01/1K tokens
v2.5
2024-04-01
🧠
Qwen 3 flagship
Alibaba's next-gen flagship model, 235B params
235B
Unknown*
Unknown*
Unknown*
v3.0
2025-04-29

Amazon

💡
Amazon Nova foundational
Amazon's new generation foundation model
Unknown*
Unknown*
Unknown*
Unknown*
v1.0
2024-12-03
🤖
Nova Act agent
Browser automation and task completion
Unknown*
Unknown*
Unknown*
Unknown*
v1.0
2025-04-10

Anthropic

🧠
Claude 3 Opus
Top intelligence, 200 k ctx
1.5T*
2.2e25*
350-550ms*
$0.15/1K tokens
v3.0
2024-03-04
⚖️
Claude 3.7 Sonnet
Balanced cost vs performance
800B*
1.2e25*
300-450ms*
$0.05/1K tokens
v3.7
2024-03-20
⚡️
Claude 3.5 Haiku
Fastest, cost-efficient
25B*
0.2e25*
80-150ms*
$0.01/1K tokens
v3.5
2024-03-15
🧠
Claude Opus 4 flagship
Most powerful model for complex tasks, agent capabilities
1.8T*
2.6e25*
300-500ms*
$0.15/1K tokens
v4.0
2025-05-22
⚖️
Claude Sonnet 4 balanced
Improved steerability, reasoning, and coding
900B*
1.4e25*
250-400ms*
$0.06/1K tokens
v4.0
2025-05-22

Black Forest Labs

🌲
BlackForest-1 new
European flagship model with privacy focus
120B
0.8e25*
300-450ms*
$0.07/1K tokens
v1.0
2025-04-01

Cohere

⚖️
Command-R
Balanced generation
35B
0.25e25
180-280ms
$0.02/1K tokens
v1.0
2024-03-01
🤝
Command-R+
200 k ctx, retrieval-augmented
45B
0.3e25
220-320ms
$0.025/1K tokens
v1.0
2024-03-15

DeepSeek

🧠
DeepSeek-V3
671 B MoE flagship
671B (MoE)
1.5e25
300-500ms
$0.07/1K tokens
v3.0
2024-03-20
🔍
DeepSeek-R1
Reasoning-heavy
100B
0.5e25
150-250ms
$0.02/1K tokens
v1.0
2024-03-15
💻
DeepSeek-Coder 33B
Coding assistant
33B
0.25e25
100-180ms
$0.01/1K tokens
v1.0
2024-03-10
💬
DeepSeek-Chat
Daily conversations
30B
0.3e25
150-250ms
$0.02/1K tokens
v1.0
2024-03-15

Google

🧠
Gemini 2.5 Pro 1 M ctx
Flagship multimodal
1.2T*
2.0e25*
400-600ms*
$0.10/1K tokens
v2.5
2024-04-10
⚡️
Gemini 2.5 Flash
Tuned for speed
18B*
0.12e25*
90-160ms*
$0.01/1K tokens
v2.5
2024-04-10
⚡️
Gemini 2.0 Flash-Lite ultra-fast
Google's lightweight and fast model for quick tasks
Unknown*
Unknown*
Unknown*
Unknown*
v2.0
2025-02-05
🧠
Gemini 2.0 Pro balanced
Google's balanced Pro model from the 2.0 series
Unknown*
Unknown*
Unknown*
Unknown*
v2.0
2025-02-05
⚡️
Gemini 2.0 Flash fast
Google's fast model from the 2.0 series
Unknown*
Unknown*
Unknown*
Unknown*
v2.0
2024-12-11
🧠
Gemini 2.5 Pro Deep Think enhanced reasoning
Extended reasoning mode for complex problems
1.2T*
2.0e25*
500-800ms*
$0.12/1K tokens
v2.5
2025-05-20

Manus

🤖
Manus Agent autonomous
End-to-end task completion without intervention
Unknown*
Unknown*
Unknown*
Unknown*
v1.0
2025-03-01

Meta

🚀
Llama 4 Behemoth 2T params
Flagship model, 288B active, 2T total
2T (288B active)
2.8e25
500-800ms
$0.12/1K tokens
v4.0
2024-04-15
⚖️
Llama 4 Maverick 128E
Balanced performance, 128 experts
128B (MoE)
0.6e25
250-400ms
$0.04/1K tokens
v4.0
2024-04-10
🧠
Llama 4 Scout 10M ctx
Lightweight, 10M context, single GPU
10B
0.08e25
60-100ms
$0.005/1K tokens
v4.0
2024-04-15
📜
Llama 3 70B
Open weights, OSS baseline
70B
0.6e25
250-400ms
$0.04/1K tokens
v3.0
2024-04-15

Microsoft

🔍
Copilot Researcher agent
Research-focused agent for Microsoft 365
Unknown*
Unknown*
Unknown*
Subscription
v1.0
2025-03-15
📊
Copilot Analyst agent
Data analysis agent for Microsoft 365
Unknown*
Unknown*
Unknown*
Subscription
v1.0
2025-03-15

Mistral AI

🧠
Mistral Large
Flagship, 64 k ctx
70B
1.2e25
250-450ms
$0.08/1K tokens
v1.0
2024-02-26
🔀
Mixtral 8×22B
Sparse MoE, high throughput
176B (8×22B)
0.8e25
180-300ms
$0.04/1K tokens
v1.0
2024-02-15
💻
Codestral
Code-centric support
40B
0.3e25
120-200ms
$0.015/1K tokens
v1.0
2024-03-20
⚖️
Mistral Medium
Good generalist
50B
0.4e25
200-300ms
$0.03/1K tokens
v1.0
2024-02-20
⚡️
Mistral Small
Fast & light
15B
0.1e25
70-120ms
$0.008/1K tokens
v1.0
2024-02-20

Monica

🤖
Monica Assistant agent
Task automation with web browsing capabilities
Unknown*
Unknown*
Unknown*
Unknown*
v1.0
2025-03-01

OpenAI

🧠
o3
Airtight reasoning & full tool access
1.8T*
2.5e25*
300-500ms*
$0.06/1K tokens
v3.5
2024-03-15
⚡️
o4-mini
Advanced reasoning at high speed
400B*
1.0e25*
200-350ms*
$0.03/1K tokens
v4.0
2024-04-05
👁️
o4-mini-high
Coding & complex diagrams
500B*
1.2e25*
250-400ms*
$0.04/1K tokens
v4.0
2024-04-05
🎨
GPT-4o default
Rich multimodal & creative
1.2T*
1.8e25*
350-550ms*
$0.08/1K tokens
v4.0
2024-04-01
✍️
GPT-4.5 research preview
Writing & exploring ideas
1.5T*
2.2e25*
400-600ms*
$0.12/1K tokens
v4.5
2024-04-15
⚡️
GPT-4o mini
Faster responses
200B*
0.5e25*
150-250ms*
$0.02/1K tokens
v4.0
2024-04-01
📆
GPT-4o with scheduled tasks beta
Follow-up reminders & automations
1.2T*
1.8e25*
350-550ms*
$0.10/1K tokens
v4.0
2024-04-01
GPT-4
Leaving 30 April – migrate now
Unknown*
2.8e25
500-800ms
$0.12/1K tokens
v4.0
2024-04-15
⚡️
GPT-o4-mini latest mini
Latest iteration of OpenAI's efficient mini model
Unknown*
Unknown*
Unknown*
Unknown*
Unknown*
2025-04-16
🧠
GPT-4.1 next-gen
Successor to GPT-4, advanced capabilities
Unknown*
Unknown*
Unknown*
Unknown*
v4.1
2025-04-14

Perplexity

🔍
PPLX-70B
Mixtral-style MoE
70B
0.6e25
150-250ms
$0.02/1K tokens
v1.0
2024-03-10

Salesforce

🤖
Agentforce CRM
CRM workflow automation agent
Unknown*
Unknown*
Unknown*
Subscription
v1.0
2025-02-15

xAI

🚀
Grok 3 1M ctx
Flagship model with 1 million token context window
1.5T*
2.3e25*
450-700ms*
$0.09/1K tokens
v3.0
2024-03-28
⚡️
Grok 2 Mini
Compact model balancing speed and quality
12B*
0.09e25*
70-130ms*
$0.006/1K tokens
v2.0
2024-03-28
🧠
Grok 2 beta
Advanced reasoning, coding, and image generation
1.5T*
2.2e25*
400-600ms*
$0.12/1K tokens
v2.0
2024-04-15

Here to simplify complexity.

Genotix

Summarization

Tests how well each model distills information.

Summarize the following article in 3 bullet points: […article text…]

Creative Writing

Tests creativity, coherence, and tone.

Write a short story about a child who befriends a robot on Mars, in the style of a bedtime fairy tale.

Information Q&A

Tests factual accuracy and explanatory clarity.

What are the main causes of climate change, and how do they impact ocean levels?

Code Generation

Tests coding ability and correctness.

Write a Python function that takes a list of numbers and returns the list sorted without using built-in sort.
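
When grading responses to this prompt, it can help to have a reference answer to compare against. The following is a minimal sketch of one acceptable solution, using merge sort; the function name `merge_sort` is illustrative and not part of the benchmark, and other algorithms (insertion sort, quicksort) would be equally valid answers.

```python
def merge_sort(numbers):
    """Return a new sorted list without calling sorted() or list.sort()."""
    if len(numbers) <= 1:
        return list(numbers)

    # Split the input in half and sort each half recursively.
    mid = len(numbers) // 2
    left = merge_sort(numbers[:mid])
    right = merge_sort(numbers[mid:])

    # Merge the two sorted halves by repeatedly taking the smaller head.
    merged = []
    i = j = 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1

    # One half is exhausted; append whatever remains of the other.
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged
```

A model's answer can then be spot-checked by running both functions on the same inputs, including edge cases such as an empty list and a list with duplicates.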

Code Debugging

Tests ability to reason about and fix code.

Here is a snippet of code and the error it produces, how can I fix it? […code snippet and error message…]

Customer Support Email

Tests tone control, empathy, and professionalism.

You are a customer service agent. Respond to this customer complaint in a polite tone: "I bought your product and it broke in two days. I'm very upset."

Translation

Tests multilingual capabilities and preservation of meaning/tone.

Translate this English paragraph into French and Chinese: […English paragraph…]

Idea Brainstorming

Tests creativity and practicality of suggestions.

I'm launching a new coffee shop. Give me 5 creative marketing ideas to attract college students.

Explanation (Tutoring)

Tests simplification skills and clarity.

Explain the concept of blockchain to a 12-year-old in a few sentences.

Roleplay/Conversation

Tests conversational ability, persuasiveness, and persona maintenance.

Act as a personal fitness coach. I haven't exercised in months; encourage me with a motivational plan in a friendly tone.

Structured Output

Tests ability to follow specific output format instructions.

Analyze this customer feedback and categorize the issues. Format your response as a JSON object with the following structure: {"positive_points": ["point1", "point2"], "negative_points": ["point1", "point2"], "suggestions": ["suggestion1", "suggestion2"]}
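
Whether a response follows the requested format can be checked mechanically. The sketch below assumes the model's reply is available as a raw string and that a strict reading is wanted (exactly the three keys, each holding a list of strings); the helper name `is_valid_feedback_json` is hypothetical.

```python
import json

# The three keys named in the prompt's target structure.
EXPECTED_KEYS = ("positive_points", "negative_points", "suggestions")


def is_valid_feedback_json(raw):
    """Return True if raw parses as JSON with exactly the expected shape."""
    try:
        data = json.loads(raw)
    except (json.JSONDecodeError, TypeError):
        return False

    # The top level must be an object with exactly the expected keys.
    if not isinstance(data, dict) or set(data) != set(EXPECTED_KEYS):
        return False

    # Every key must map to a list whose items are all strings.
    return all(
        isinstance(data[key], list)
        and all(isinstance(item, str) for item in data[key])
        for key in EXPECTED_KEYS
    )
```

A looser check (for example, tolerating extra keys or a reply wrapped in prose) is a design choice; the strict version shown here penalizes any deviation from the requested structure.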

Step-by-Step Reasoning

Tests logical reasoning and problem-solving capabilities.

Solve this math problem step by step, explaining your reasoning at each stage: A store is offering a 25% discount on an item that originally costs $120. If there is also an 8% sales tax applied after the discount, what is the final price?
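
The expected answer can be verified with a short calculation: the discount brings the price to $90.00, and tax on that amount brings it to $97.20. The sketch below uses `Decimal` to avoid floating-point rounding; the function name `final_price` is illustrative.

```python
from decimal import Decimal


def final_price(original, discount_rate, tax_rate):
    """Apply the discount first, then tax on the discounted price."""
    discounted = original * (1 - discount_rate)   # 120 * 0.75 = 90.00
    with_tax = discounted * (1 + tax_rate)        # 90.00 * 1.08 = 97.20
    return with_tax.quantize(Decimal("0.01"))     # round to whole cents


print(final_price(Decimal("120"), Decimal("0.25"), Decimal("0.08")))  # 97.20
```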

System Prompt Engineering

Tests ability to follow system-level instructions and constraints.

You are an expert programming tutor specializing in Python. Your responses should: 1. Explain concepts clearly with simple examples, 2. Identify and correct errors in student code, 3. Follow educational best practices by guiding rather than solving, 4. Include explanatory comments in all code examples, 5. Reference Python 3.12 standards. Now help me understand how to implement a binary search algorithm.
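
To judge whether the code in a response is correct, a known-good implementation is useful as a baseline. The following is a minimal sketch of an iterative binary search with explanatory comments, as the system prompt requests; it assumes the input list is already sorted in ascending order, and the name `binary_search` is illustrative.

```python
def binary_search(items, target):
    """Return the index of target in the sorted list items, or -1 if absent."""
    low, high = 0, len(items) - 1

    # Keep narrowing the search window until it is empty.
    while low <= high:
        mid = (low + high) // 2

        if items[mid] == target:
            return mid            # found the target
        if items[mid] < target:
            low = mid + 1         # target can only be in the right half
        else:
            high = mid - 1        # target can only be in the left half

    return -1                     # window is empty: target is not present
```

Note that the prompt also asks the model to guide rather than solve, so a strong response may withhold a complete implementation like this one; the baseline is for checking any code the model does show.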

Context-Aware Response

Tests ability to incorporate provided context into responses.

Context: I'm a high school physics teacher preparing materials for students who struggle with mathematical concepts. Many of my students have math anxiety but are interested in practical applications. Request: Create an explanation of Newton's Second Law (F=ma) that uses minimal mathematical notation while still conveying the core concept accurately.

Multimodal Reasoning

Tests ability to reason about and describe visual content.

Look at this image of a data visualization chart and explain what trends it shows. What conclusions can be drawn from this data? What might be missing or misleading about this presentation?

Chain of Thought

Tests ability to break down complex problems into logical steps.

Let's think through this problem step by step: A train leaves Station A at 3:00 PM traveling at 60 mph. Another train leaves Station B at 4:30 PM traveling at 75 mph toward Station A. If the stations are 300 miles apart, at what time will the trains meet?
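
The expected answer is 6:03:20 PM. By 4:30 PM the first train has covered 90 miles, leaving a 210-mile gap that closes at 135 mph, which takes 14/9 hours (1 h 33 min 20 s). The sketch below checks this; it assumes the second train departs no earlier than the first and that the trains have not already met before it departs. The function name and the calendar date are arbitrary placeholders.

```python
from datetime import datetime, timedelta
from fractions import Fraction


def meeting_time(distance_miles, speed_a, depart_a, speed_b, depart_b):
    """Return when two trains heading toward each other meet."""
    # Hours train A travels alone before train B departs.
    head_start_hours = Fraction(int((depart_b - depart_a).total_seconds()), 3600)

    # Gap left between the trains at the moment train B departs.
    remaining = distance_miles - speed_a * head_start_hours

    # After that, the gap closes at the sum of the two speeds.
    hours_after_b = remaining / (speed_a + speed_b)
    return depart_b + timedelta(seconds=float(hours_after_b * 3600))


depart_a = datetime(2025, 1, 1, 15, 0)    # 3:00 PM, placeholder date
depart_b = datetime(2025, 1, 1, 16, 30)   # 4:30 PM, same day
print(meeting_time(300, 60, depart_a, 75, depart_b).strftime("%H:%M:%S"))  # 18:03:20
```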

Iterative Refinement

Tests ability to improve outputs based on feedback.

Write a short product description for a new smartphone. After I review it, I'll provide feedback, and I want you to refine the description based on my comments.

Ethical Reasoning

Tests ability to navigate complex ethical scenarios with nuance.

Consider this ethical dilemma in AI development: A healthcare algorithm must allocate limited medical resources. What ethical frameworks should guide its design? Present multiple perspectives and discuss the tradeoffs involved.

Model Performance Levels

Entry Level

Basic models suitable for simple tasks and quick responses

Mid-range

Balanced models offering good performance for general use

High Performance

Advanced models with enhanced capabilities and reasoning

Premium

Top-tier models with maximum performance and capabilities

Model Capabilities

Reasoning Focused

Models optimized for complex reasoning and problem-solving

Code Specialized

Models optimized for programming and code generation

Creative Focused

Models specialized in creative content generation

Analysis Focused

Models optimized for data analysis and interpretation

Specialized Architecture

Models with unique architectures like Mixture of Experts (MoE)

Retrieval Augmented / Large Context

Models excelling at handling large context windows or using retrieval augmentation

Model Status

Beta Version

Models in testing phase or early access

Deprecated

Older models no longer actively maintained

GitHub Copilot

Pair programmer that helps you write better code

Cursor

AI-first code editor with pair programming capabilities

Ollama

Run large language models locally

v0.dev

Generative UI. Generate UI with simple text prompts.

n8n

Workflow automation platform to connect different services

Tabnine

AI code completion assistant for developers

Amazon CodeWhisperer

AI coding companion by AWS, provides code suggestions

Snyk

Developer security platform for finding and fixing vulnerabilities

Sourcery

AI-powered coding assistant for refactoring and improving code quality

Mintlify

AI-powered platform for creating beautiful and effective documentation

Pieces.app

AI-enabled productivity tool for developers to save, enrich, and reuse code snippets

Codeium

Free AI-powered toolkit for developers, with code completion and chat features

Qodo

AI code assistant with test generation and PR review capabilities

Bolt

AI-powered app builder with no-code capabilities

Lovable

AI-powered coding assistant for web development

Perplexity

AI-powered search engine with real-time information

Deep Research

AI research assistant for comprehensive information gathering

NotebookLM

AI-powered note-taking and research tool by Google

Canva Magic Studio

AI-powered design tools for creating professional graphics

Notion AI Q&A

AI-powered knowledge management and question answering

Gamma

AI-powered presentation creation tool

ElevenLabs

AI voice generation with natural-sounding results

Suno

AI music generation platform

Tidio AI

AI-powered chatbot for customer service
