Voxtral Mini Transcribe v2 Mistral
💰 Total Cost Calculation (from Plugin)
Output: $0.002000
Output: $0.002000
Unit: $0.000000
Fees: $0.000000
Detailed Cost Analysis (from Plugin)
For 0 input tokens and 1,000 output tokens:
- Input Cost: $0.000000
- Output Cost: $0.002000
- Total Cost: $0.002000
- Cost per 1K tokens: $0.002000
- Tokens per dollar: 500,000 tokens
- Context Window: 256000 tokens
Speed & Performance Analysis
With a processing speed of 500 tokens per second and 200ms time to first token:
- Processing Time: 2.18 seconds
- Latency: 200 milliseconds to first token
- Base Throughput: 500 tokens/second
Best Use Cases
Want this applied to YOUR actual stack?
This calculator shows the math for Voxtral Mini Transcribe v2. Your decision needs more — current infrastructure, compliance requirements, actual workload patterns, volume tiers — that change which model is right for you.
Get a $39 personalized AI Architecture Audit. PDF tailored to your stack, delivered in under 60 seconds. 7-day no-questions-asked refund.
Get my instant AI audit — $39 →✨ Market Recommendations AI Model Registry
← Back to Voxtral Mini Transcribe v2| Rank | AI Model & Provider | Total Cost | vs Voxtral Mini Transcribe v2 |
|---|---|---|---|
| 🏆 |
Gemini 3.1 Flash Lite
Google
|
$0.007575 (rounded ~ $0.01) Best Value | ↑ 278.8% more |
| 🥈 |
Gemini 2.5 Flash
Google
|
$0.009265 | ↑ 363.3% more |
| 🥉 |
Gemini 3.1 Flash
Google
|
$0.030300 | ↑ 1415% more |
| #4 |
Grok 4.3
xAI
|
$0.036625 (rounded ~ $0.04) | ↑ 1731.3% more |
| #5 |
Gemini 3.5 Flash
Google
|
$0.045450 (rounded ~ $0.05) | ↑ 2172.5% more |
| #6 |
Gemini 2.5 Pro
Google
|
$0.077000 (rounded ~ $0.08) | ↑ 3750% more |
| #7 |
Gemini 2.5 Pro
Google
|
$0.077000 (rounded ~ $0.08) | ↑ 3750% more |
Gemini 3.1 Flash Lite Google
Gemini 2.5 Flash Google
Gemini 3.1 Flash Google
Grok 4.3 xAI
Gemini 3.5 Flash Google
Gemini 2.5 Pro Google
Gemini 2.5 Pro Google
Efficient Audio-to-Text at Scale
Calculate the cost of transcribing thousands of hours of audio. Optimized for speed and low cost, this model is perfect for podcast networks and call center logs.