Llama 4 Maverick (400B) Meta AI 1000000
💰 Total Cost Calculation (from Plugin)
Output: $0.006000 (rounded ~ $0.01)
Output: $0.006000 (rounded ~ $0.01)
Unit: $0.000000
Fees: $0.000000
Detailed Cost Analysis (from Plugin)
For 40,000 input tokens and 10,000 output tokens:
- Input Cost: $0.006000 (rounded ~ $0.01)
- Output Cost: $0.006000 (rounded ~ $0.01)
- Total Cost: $0.012000 (rounded ~ $0.01)
- Cost per 1K tokens: $0.000240
- Tokens per dollar: 4,166,667 tokens
- Context Window: 1000000 tokens
Speed & Performance Analysis
With a processing speed of 400 tokens per second and 150ms time to first token:
- Processing Time: 2 minutes, 13.93 seconds
- Latency: 150 milliseconds to first token
- Base Throughput: 400 tokens/second
- Effective Throughput: 374 tokens/second (temperature-adjusted)
Best Use Cases
Want this applied to YOUR actual stack?
This calculator shows the math for Llama 4 Maverick (400B). Your decision needs more — current infrastructure, compliance requirements, actual workload patterns, volume tiers — that change which model is right for you.
Get a $39 personalized AI Architecture Audit. PDF tailored to your stack, delivered in under 60 seconds. 7-day no-questions-asked refund.
Get my instant AI audit — $39 →Grok 5 xAI 1000000
💰 Total Cost Calculation (from Plugin)
Output: $0.006250 (rounded ~ $0.01)
Output: $0.006250 (rounded ~ $0.01)
Unit: $0.000000
Fees: $0.000000
Advanced Cost Breakdown (from Plugin)
Detailed Cost Analysis (from Plugin)
For 40,000 input tokens and 10,000 output tokens:
- Input Cost: $0.012500 (rounded ~ $0.01)
- Output Cost: $0.006250 (rounded ~ $0.01)
- Total Cost: $0.015375 (rounded ~ $0.02)
- Cost per 1K tokens: $0.000308
- Tokens per dollar: 3,252,033 tokens
- Context Window: 1000000 tokens
Speed & Performance Analysis
With a processing speed of 520 tokens per second and 190ms time to first token:
- Processing Time: 1 minute, 43.06 seconds
- Latency: 190 milliseconds to first token
- Base Throughput: 520 tokens/second
- Effective Throughput: 486 tokens/second (temperature-adjusted)
Best Use Cases
Want this applied to YOUR actual stack?
This calculator shows the math for Grok 5. Your decision needs more — current infrastructure, compliance requirements, actual workload patterns, volume tiers — that change which model is right for you.
Get a $39 personalized AI Architecture Audit. PDF tailored to your stack, delivered in under 60 seconds. 7-day no-questions-asked refund.
Get my instant AI audit — $39 →✨ Market Recommendations AI Model Registry
← Back to Llama 4 Maverick (400B)| Rank | AI Model & Provider | Total Cost | vs Llama 4 Maverick (400B) | vs Grok 5 |
|---|---|---|---|---|
| 🏆 |
Mistral Small 3
Mistral AI
|
$0.001480 Best Value | ↓ 87.7% cheaper | ↓ 90.4% cheaper |
| 🥈 |
Grok Code Fast 1
xAI
|
$0.005210 (rounded ~ $0.01) | ↓ 56.6% cheaper | ↓ 66.1% cheaper |
| 🥉 |
Gemini 3.1 Flash Lite
Google
|
$0.005575 (rounded ~ $0.01) | ↓ 53.5% cheaper | ↓ 63.7% cheaper |
| #4 |
Mistral Large 3
Mistral AI
|
$0.007400 (rounded ~ $0.01) | ↓ 38.3% cheaper | ↓ 51.9% cheaper |
| #5 |
Gemini 2.5 Flash
Google
|
$0.008440 (rounded ~ $0.01) | ↓ 29.7% cheaper | ↓ 45.1% cheaper |
| #6 |
Grok 4.3
xAI
|
$0.015375 (rounded ~ $0.02) | ↑ 28.1% more | Same price |
| #7 |
GPT-5.4 mini
OpenAI
|
$0.016725 (rounded ~ $0.02) | ↑ 39.4% more | ↑ 8.8% more |
| #8 |
o4-mini Deep Research
OpenAI
|
$0.017300 (rounded ~ $0.02) | ↑ 44.2% more | ↑ 12.5% more |
| #9 |
o4-mini
OpenAI
|
$0.019030 | ↑ 58.6% more | ↑ 23.8% more |
| #10 |
Claude Haiku 4.5
Anthropic
|
$0.019800 | ↑ 65% more | ↑ 28.8% more |
| #11 |
Gemini 3.1 Flash
Google
|
$0.022300 (rounded ~ $0.02) | ↑ 85.8% more | ↑ 45% more |
| #12 |
Grok 4.20 Beta
xAI
|
$0.029600 | ↑ 146.7% more | ↑ 92.5% more |
| #13 |
Gemini 3.5 Flash
Google
|
$0.033450 (rounded ~ $0.03) | ↑ 178.8% more | ↑ 117.6% more |
| #14 |
GPT-5.3 Codex Spark
OpenAI
|
$0.047775 (rounded ~ $0.05) | ↑ 298.1% more | ↑ 210.7% more |
| #15 |
GPT-5.3 Instant
OpenAI
|
$0.047775 (rounded ~ $0.05) | ↑ 298.1% more | ↑ 210.7% more |
| #16 |
Claude Sonnet 4.6
Anthropic
|
$0.059400 | ↑ 395% more | ↑ 286.3% more |
| #17 |
Gemini 2.5 Pro
Google
|
$0.068250 (rounded ~ $0.07) | ↑ 468.8% more | ↑ 343.9% more |
| #18 |
Gemini 3.1 Pro
Google
|
$0.089200 | ↑ 643.3% more | ↑ 480.2% more |
| #19 |
Claude Opus 4.7
Anthropic
|
$0.099000 (rounded ~ $0.10) | ↑ 725% more | ↑ 543.9% more |
| #20 |
Claude Opus 4.8
Anthropic
|
$0.099000 (rounded ~ $0.10) | ↑ 725% more | ↑ 543.9% more |
| #21 |
Claude Opus 4.6
Anthropic
|
$0.099000 (rounded ~ $0.10) | ↑ 725% more | ↑ 543.9% more |
| #22 |
GPT-5.4
OpenAI
|
$0.111500 (rounded ~ $0.11) | ↑ 829.2% more | ↑ 625.2% more |
| #23 |
GPT-5.4 Thinking
OpenAI
|
$0.111500 (rounded ~ $0.11) | ↑ 829.2% more | ↑ 625.2% more |
| #24 |
GPT-5.5 Instant
OpenAI
|
$0.111500 (rounded ~ $0.11) | ↑ 829.2% more | ↑ 625.2% more |
| #25 |
o3 Deep Research
OpenAI
|
$0.173000 (rounded ~ $0.17) | ↑ 1341.7% more | ↑ 1025.2% more |
| #26 |
GPT-5.5
OpenAI
|
$0.223000 (rounded ~ $0.22) | ↑ 1758.3% more | ↑ 1350.4% more |
| #27 |
o3 Pro
OpenAI
|
$0.346000 (rounded ~ $0.35) | ↑ 2783.3% more | ↑ 2150.4% more |
| #28 |
GPT-5.2 Pro
OpenAI
|
$0.573300 (rounded ~ $0.57) | ↑ 4677.5% more | ↑ 3628.8% more |
| #29 |
GPT-5.2 Pro
OpenAI
|
$0.573300 (rounded ~ $0.57) | ↑ 4677.5% more | ↑ 3628.8% more |
Mistral Small 3 Mistral AI
Grok Code Fast 1 xAI
Gemini 3.1 Flash Lite Google
Mistral Large 3 Mistral AI
Gemini 2.5 Flash Google
Grok 4.3 xAI
GPT-5.4 mini OpenAI
o4-mini Deep Research OpenAI
o4-mini OpenAI
Claude Haiku 4.5 Anthropic
Gemini 3.1 Flash Google
Grok 4.20 Beta xAI
Gemini 3.5 Flash Google
GPT-5.3 Codex Spark OpenAI
GPT-5.3 Instant OpenAI
Claude Sonnet 4.6 Anthropic
Gemini 2.5 Pro Google
Gemini 3.1 Pro Google
Claude Opus 4.7 Anthropic
Claude Opus 4.8 Anthropic
Claude Opus 4.6 Anthropic
GPT-5.4 OpenAI
GPT-5.4 Thinking OpenAI
GPT-5.5 Instant OpenAI
o3 Deep Research OpenAI
GPT-5.5 OpenAI
o3 Pro OpenAI
GPT-5.2 Pro OpenAI
GPT-5.2 Pro OpenAI
Enterprise Comparison: Multi-agent DevOps and industrial automation.
Detailed cost-performance analysis for Q2 2026.