LLM API Providers Leaderboard - Comparison of over 100 LLM endpoints
Comparison and ranking of API provider performance for over 100 LLM endpoints across key metrics including price, output speed, latency, context window, and others. For more details, including our methodology, see our FAQs.
API providers compared: OpenAI, Playground AI, Microsoft Azure, Ideogram, Mistral, Amazon Bedrock, DeepSeek, Hyperbolic, Groq, FriendliAI, Together.ai, Anthropic, Black Forest Labs, Perplexity, Google, Lambda Labs, Fireworks, Cerebras, Leonardo.Ai, Cohere, Recraft AI, Upstage, Simplismart, Speechmatics, Fish Audio, Deepinfra, Replicate, Genmo, Nebius, Adobe, MiniMax, CentML, StepFun, Runpod, Zyphra, Murf AI, Speechify, Rev AI, AssemblyAI, fal.ai, Rime, kluster.ai, Prodia, Reka AI, Hume AI, Deepgram, Gladia, Stability.ai, Baseten, Midjourney, Reve, Databricks, ElevenLabs, Vivago AI, IBM, SambaNova, xAI, Cartesia, LMNT, PlayAI, 01.AI, Alibaba Cloud, Novita, AI21 Labs, and WaveSpeed.
Model | Context Window | Model Intelligence | Price | Output tokens/s | Latency (s) | End-to-End Response Time (s) | Reasoning Time (s)
---|---|---|---|---|---|---|---
o4-mini (high) | 200k | 70 | $1.93 | 134.8 | 35.31 | 39.02 | N/A
o4-mini (high) | 200k | 70 | $1.93 | 41.8 | 170.26 | 182.21 | N/A
Gemini 2.5 Pro Preview | 1m | 68 | $3.44 | 209.7 | 26.77 | 29.16 | N/A
Grok 3 mini Reasoning (high) | 131k | 67 | $0.35 | 99.8 | 0.36 | 25.40 | 20.04
o3-mini (high) | 200k | 66 | $1.93 | 187.9 | 37.07 | 39.73 | N/A
o3-mini (high) | 200k | 66 | $1.93 | 189.2 | 41.89 | 44.53 | N/A
o3-mini | 200k | 63 | $1.93 | 188.6 | 12.27 | 14.92 | N/A
o3-mini | 200k | 63 | $1.93 | 197.8 | 14.09 | 16.61 | N/A
o1 | 200k | 62 | $26.25 | 67.1 | 43.33 | 50.79 | N/A
o1 | 200k | 62 | $26.25 | 111.1 | 25.28 | 29.78 | N/A
DeepSeek R1 | 164k | 60 | $0.95 | 35.7 | 0.55 | 80.35 | 65.79
DeepSeek R1 | 64k | 60 | $0.96 | 23.0 | 3.26 | 126.78 | 101.83
DeepSeek R1 | 128k | 60 | $2.00 | 30.3 | 1.48 | 95.40 | 77.42
DeepSeek R1 | 128k | 60 | $2.36 | 62.3 | 0.45 | 46.12 | 37.65
DeepSeek R1 Base | 128k | 60 | $1.20 | 31.1 | 0.67 | 92.29 | 75.53
DeepSeek R1 Fast | 128k | 60 | $3.00 | 83.1 | 0.69 | 34.94 | 28.24
DeepSeek R1 | 128k | 60 | $3.99 | 71.1 | 0.57 | 40.61 | 33.01
DeepSeek R1 | 128k | 60 | $2.36 | 95.3 | 0.54 | 30.42 | 24.64
DeepSeek R1 (Fast) | 164k | 60 | $4.25 | 111.9 | 0.81 | 26.26 | 20.98
DeepSeek R1 (Turbo, FP4) | 33k | 60 | $1.50 | 133.3 | 0.30 | 21.66 | 17.61
DeepSeek R1 | 64k | 60 | $0.96 | 14.1 | 0.57 | 202.19 | 166.21
DeepSeek R1 | 128k | 60 | $4.00 | 73.3 | 0.56 | 39.42 | 32.04
DeepSeek R1 Turbo | 64k | 60 | $1.15 | 28.9 | 0.88 | 99.55 | 81.34
DeepSeek R1 | 64k | 60 | $4.00 | 29.3 | 0.79 | 97.88 | 80.04
DeepSeek R1 | 16k | 60 | $5.50 | 191.3 | 0.97 | 15.86 | 12.27
DeepSeek R1 | 128k | 60 | $4.00 | 89.0 | 0.69 | 32.68 | 26.37
DeepSeek R1 | 128k | 60 | $7.00 | 28.9 | 0.58 | 99.05 | 81.17
QwQ-32B | 131k | 58 | $0.20 | 26.0 | 1.58 | 116.46 | 95.68
QwQ-32B Fast | 131k | 58 | $0.75 | 52.1 | 0.71 | 58.16 | 47.85
QwQ-32B Base | 131k | 58 | $0.23 | 40.9 | 1.00 | 74.19 | 60.96
QwQ-32B | 131k | 58 | $0.65 | 83.8 | 0.49 | 36.21 | 29.74
QwQ-32B | 131k | 58 | $0.90 | 141.2 | 0.33 | 21.51 | 17.64
QwQ-32B | 131k | 58 | $0.14 | 35.5 | 0.28 | 84.42 | 70.07
QwQ-32B | 131k | 58 | $0.32 | 398.1 | 0.29 | 7.80 | 6.26
QwQ-32B | 16k | 58 | $0.63 | 425.0 | 0.92 | 7.96 | 5.86
QwQ-32B | 131k | 58 | $1.20 | 87.5 | 0.44 | 34.63 | 28.47
o1-mini | 128k | 54 | $1.93 | 214.4 | 9.95 | 12.28 | N/A
o1-mini | 128k | 54 | $2.12 | 256.5 | 8.92 | 10.87 | N/A
DeepSeek V3 (Mar '25) | 64k | 53 | $0.48 | 25.5 | 3.45 | 23.09 | N/A
DeepSeek V3 (Mar '25) | 128k | 53 | $1.45 | 71.9 | 1.11 | 8.06 | N/A
DeepSeek V3 (Mar '25) | 128k | 53 | $1.25 | 31.8 | 1.31 | 17.04 | N/A
DeepSeek V3 (Mar '25) Fast | 128k | 53 | $3.00 | 92.0 | 0.67 | 6.10 | N/A
DeepSeek V3 (Mar '25) | 128k | 53 | $0.75 | 34.2 | 0.63 | 15.27 | N/A
DeepSeek V3 (Mar '25) | 164k | 53 | $0.80 | 77.1 | 0.57 | 7.05 | N/A
DeepSeek V3 (Mar '25) | 128k | 53 | $2.00 | 66.5 | 0.69 | 8.21 | N/A
DeepSeek V3 (Mar '25) | 160k | 53 | $0.90 | 73.7 | 0.84 | 7.62 | N/A
DeepSeek V3 (Mar '25) | 164k | 53 | $0.52 | 11.0 | 0.78 | 46.13 | N/A
DeepSeek V3 (Mar '25) | 128k | 53 | $0.63 | 31.4 | 0.88 | 16.82 | N/A
DeepSeek V3 (Mar '25) | 8k | 53 | $1.13 | 264.9 | 0.62 | 2.50 | N/A
DeepSeek V3 (Mar '25) | 128k | 53 | $1.25 | 34.2 | 2.69 | 17.31 | N/A
DeepSeek V3 (Mar '25) | 164k | 53 | $1.25 | 18.8 | 0.75 | 27.28 | N/A
GPT-4.1 mini | 1m | 53 | $0.70 | 103.1 | 0.38 | 5.23 | N/A
GPT-4.1 mini | 1m | 53 | $0.70 | 158.9 | 0.63 | 3.78 | N/A
GPT-4.1 | 1m | 53 | $3.50 | 90.9 | 0.46 | 5.96 | N/A
GPT-4.1 | 1m | 53 | $3.50 | 113.6 | 0.82 | 5.22 | N/A
DeepSeek R1 Distill Qwen 32B | 128k | 52 | $0.14 | 44.7 | 0.26 | 56.25 | 44.79
DeepSeek R1 Distill Qwen 32B | 64k | 52 | $0.30 | 20.3 | 1.10 | 124.10 | 98.41
Grok 3 | 131k | 51 | $6.00 | 54.6 | 0.55 | 9.70 | N/A
Llama 4 Maverick (FP8) | 1m | 51 | $0.30 | 130.1 | 0.42 | 4.26 | N/A
Llama 4 Maverick Vertex | 524k | 51 | $0.00 | 127.5 | 0.38 | 4.30 | N/A
Llama 4 Maverick (FP8) | 1m | 51 | $0.20 | 133.1 | 0.46 | 4.22 | N/A
Llama 4 Maverick (FP8) | 128k | 51 | $0.61 | 62.1 | 0.34 | 8.38 | N/A
Llama 4 Maverick | 131k | 51 | $0.39 | 139.0 | 0.52 | 4.12 | N/A
Llama 4 Maverick (FP8) | 131k | 51 | $0.30 | 105.0 | 0.56 | 5.32 | N/A
Llama 4 Maverick (FP8) | 1m | 51 | $0.36 | 65.3 | 0.65 | 8.31 | N/A
Llama 4 Maverick | 128k | 51 | $0.30 | 287.4 | 0.25 | 1.99 | N/A
Llama 4 Maverick | 8k | 51 | $0.92 | 804.0 | 0.92 | 1.54 | N/A
Llama 4 Maverick (FP8) | 524k | 51 | $0.41 | 111.3 | 0.24 | 4.74 | N/A
Llama 4 Maverick (FP8) | 1m | 51 | $0.35 | 112.6 | 0.46 | 4.90 | N/A
GPT-4o (March 2025) | 128k | 50 | $7.50 | 193.9 | 0.53 | 3.11 | N/A
Gemini 2.0 Pro Experimental (AI Studio) | 2m | 49 | $0.00 | 207.2 | 27.82 | 30.23 | N/A
DeepSeek R1 Distill Qwen 14B | 64k | 49 | $0.15 | 44.8 | 0.72 | 56.54 | 44.66
DeepSeek R1 Distill Qwen 14B | 128k | 49 | $1.60 | 157.7 | 0.36 | 16.22 | 12.68
DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.30 | 63.6 | 0.48 | 39.76 | 31.43
DeepSeek R1 Distill Llama 70B | 66k | 48 | $0.94 | 2,407.8 | 0.27 | 1.31 | 0.83
DeepSeek R1 Distill Llama 70B Base | 128k | 48 | $0.38 | 57.7 | 0.58 | 43.91 | 34.67
DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.34 | 30.3 | 0.48 | 83.05 | 66.06
DeepSeek R1 Distill Llama 70B | 32k | 48 | $0.39 | 50.5 | 0.90 | 50.44 | 39.63
DeepSeek R1 Distill Llama 70B | 128k | 48 | $0.81 | 387.2 | 0.19 | 6.65 | 5.17
DeepSeek R1 Distill Llama 70B | 16k | 48 | $0.88 | 303.6 | 1.56 | 9.79 | 6.59
DeepSeek R1 Distill Llama 70B | 128k | 48 | $2.00 | 124.8 | 0.38 | 20.41 | 16.03
Claude 3.7 Sonnet | 200k | 48 | $6.00 | 40.0 | 1.02 | 13.53 | N/A
Claude 3.7 Sonnet | 200k | 48 | $6.00 | 75.7 | 1.14 | 7.74 | N/A
Gemini 2.0 Flash Vertex | 1m | 48 | $0.26 | 241.5 | 0.29 | 2.36 | N/A
Gemini 2.0 Flash (AI Studio) | 1m | 48 | $0.17 | 247.9 | 0.33 | 2.34 | N/A
Reka Flash 3 | 128k | 47 | $0.35 | 56.3 | 0.98 | 45.35 | 35.50
Gemini 2.0 Flash (exp) (AI Studio) | 1m | 46 | $0.00 | 243.0 | 0.29 | 2.34 | N/A
DeepSeek V3 (Dec '24) | 66k | 46 | $0.48 | 25.6 | 3.22 | 22.77 | N/A
DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $0.25 | 28.9 | 1.39 | 18.69 | N/A
DeepSeek V3 (Dec '24) | 128k | 46 | $0.75 | 24.7 | 0.68 | 20.92 | N/A
DeepSeek V3 (Dec '24) | 128k | 46 | $2.00 | 72.9 | 0.61 | 7.47 | N/A
DeepSeek V3 (Dec '24) | 128k | 46 | $1.31 | 54.2 | 0.68 | 9.90 | N/A
DeepSeek V3 (Dec '24) | 64k | 46 | $0.59 | 20.5 | 0.49 | 24.91 | N/A
DeepSeek V3 (Dec '24) Turbo | 64k | 46 | $0.63 | 29.1 | 0.84 | 18.03 | N/A
DeepSeek V3 (Dec '24) | 64k | 46 | $0.89 | 29.2 | 0.85 | 17.99 | N/A
DeepSeek V3 (Dec '24) (FP8) | 128k | 46 | $1.25 | 33.8 | 2.97 | 17.76 | N/A
Qwen2.5 Max | 32k | 45 | $2.80 | 52.3 | 1.25 | 10.81 | N/A
Gemini 1.5 Pro (Sep) (Vertex) | 2m | 45 | $2.19 | 93.7 | 0.60 | 5.93 | N/A
Gemini 1.5 Pro (Sep) (AI Studio) | 2m | 45 | $2.19 | 94.3 | 0.55 | 5.85 | N/A
Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 47.4 | 0.79 | 11.33 | N/A
Claude 3.5 Sonnet (Oct) Vertex | 200k | 44 | $6.00 | 79.9 | 1.00 | 7.26 | N/A
Claude 3.5 Sonnet (Oct) | 200k | 44 | $6.00 | 79.9 | 0.98 | 7.24 | N/A
Sonar | 127k | 43 | $1.00 | 76.2 | 2.41 | 8.97 | N/A
Llama 4 Scout | 1m | 43 | $0.15 | 121.1 | 0.44 | 4.57 | N/A
Llama 4 Scout | 32k | 43 | $0.70 | 2,584.7 | 0.31 | 0.50 | N/A
Llama 4 Scout Vertex | 1m | 43 | $0.00 | 132.4 | 0.38 | 4.15 | N/A
Llama 4 Scout | 1m | 43 | $0.10 | 116.9 | 0.48 | 4.76 | N/A
Llama 4 Scout | 128k | 43 | $0.34 | 39.0 | 0.35 | 13.18 | N/A
Llama 4 Scout | 128k | 43 | $0.26 | 140.8 | 0.55 | 4.10 | N/A
Llama 4 Scout | 131k | 43 | $0.15 | 110.7 | 0.26 | 4.77 | N/A
Llama 4 Scout | 131k | 43 | $0.20 | 77.1 | 0.68 | 7.16 | N/A
Llama 4 Scout | 131k | 43 | $0.17 | 597.9 | 0.36 | 1.19 | N/A
Llama 4 Scout | 8k | 43 | $0.47 | 739.1 | 0.73 | 1.41 | N/A
Llama 4 Scout | 328k | 43 | $0.28 | 114.7 | 0.21 | 4.56 | N/A
Llama 4 Scout | 128k | 43 | $0.71 | 81.1 | 0.50 | 6.67 | N/A
Sonar Pro | 200k | 43 | $6.00 | 61.0 | 2.97 | 11.17 | N/A
QwQ 32B-Preview | 33k | 43 | $0.20 | 54.8 | 1.02 | 46.62 | 36.48
QwQ 32B-Preview | 33k | 43 | $0.26 | 45.9 | 0.26 | 54.71 | 43.57
QwQ 32B-Preview | 33k | 43 | $1.20 | 86.5 | 0.44 | 29.33 | 23.11
GPT-4o (Nov '24) | 128k | 41 | $4.38 | 146.6 | 0.58 | 3.99 | N/A
GPT-4o (Nov '24) | 128k | 41 | $4.38 | 129.8 | 1.08 | 4.93 | N/A
Gemini 2.0 Flash-Lite (Feb '25) (AI Studio) | 1m | 41 | $0.13 | 197.0 | 0.28 | 2.82 | N/A
Llama 3.3 70B (FP8) | 128k | 41 | $0.17 | 39.2 | 0.54 | 13.29 | N/A
Llama 3.3 70B | 33k | 41 | $0.94 | 2,350.8 | 0.29 | 0.51 | N/A
Llama 3.3 70B | 128k | 41 | $0.40 | 88.2 | 1.02 | 6.69 | N/A
Llama 3.3 70B | 128k | 41 | $0.71 | 139.0 | 0.60 | 4.20 | N/A
Llama 3.3 70B Fast | 128k | 41 | $0.38 | 133.5 | 0.55 | 4.29 | N/A
Llama 3.3 70B Base | 128k | 41 | $0.20 | 33.4 | 0.69 | 15.65 | N/A
Llama 3.3 70B | 128k | 41 | $0.50 | 143.9 | 0.51 | 3.98 | N/A
Llama 3.3 70B | 128k | 41 | $0.71 | 48.7 | 0.45 | 10.71 | N/A
Llama 3.3 70B | 128k | 41 | $0.90 | 178.4 | 0.45 | 3.25 | N/A
Llama 3.3 70B (Turbo, FP8) | 128k | 41 | $0.20 | 34.2 | 0.27 | 14.91 | N/A
Llama 3.3 70B | 128k | 41 | $0.27 | 29.1 | 0.47 | 17.62 | N/A
Llama 3.3 70B | 128k | 41 | $0.60 | 185.2 | 0.41 | 3.11 | N/A
Llama 3.3 70B | 128k | 41 | $0.39 | 91.6 | 0.62 | 6.08 | N/A
Llama 3.3 70B | 128k | 41 | $0.64 | 302.0 | 0.39 | 2.04 | N/A
Llama 3.3 70B | 128k | 41 | $0.75 | 460.3 | 0.30 | 1.38 | N/A
Llama 3.3 70B Turbo | 128k | 41 | $0.88 | 124.1 | 0.43 | 4.46 | N/A
Llama 3.3 70B | 128k | 41 | $0.70 | 45.8 | 0.55 | 11.47 | N/A
GPT-4.1 nano | 1m | 41 | $0.17 | 253.1 | 0.40 | 2.38 | N/A
GPT-4.1 nano | 1m | 41 | $0.17 | 333.0 | 0.50 | 2.00 | N/A
GPT-4o (May '24) | 128k | 41 | $7.50 | 105.0 | 0.56 | 5.32 | N/A
GPT-4o (May '24) | 128k | 41 | $7.50 | 103.3 | 0.91 | 5.75 | N/A
Llama 3.1 405B (FP8) | 128k | 40 | $0.80 | 34.8 | 0.62 | 14.98 | N/A
Llama 3.1 405B | 128k | 40 | $9.50 | 19.2 | 1.00 | 27.09 | N/A
Llama 3.1 405B | 128k | 40 | $4.00 | 42.4 | 1.04 | 12.84 | N/A
Llama 3.1 405B Standard | 128k | 40 | $2.40 | 30.8 | 1.83 | 18.05 | N/A
Llama 3.1 405B Latency Optimized | 128k | 40 | $3.00 | 31.0 | 2.13 | 18.28 | N/A
Llama 3.1 405B Base | 128k | 40 | $1.50 | 31.2 | 0.70 | 16.74 | N/A
Llama 3.1 405B Vertex | 128k | 40 | $7.75 | 30.0 | 0.40 | 17.08 | N/A
Llama 3.1 405B | 128k | 40 | $8.00 | 31.4 | 0.50 | 16.44 | N/A
Llama 3.1 405B | 128k | 40 | $3.00 | 89.4 | 0.60 | 6.20 | N/A
Llama 3.1 405B | 33k | 40 | $0.90 | 25.2 | 0.49 | 20.31 | N/A
Llama 3.1 405B | 16k | 40 | $6.25 | 177.9 | 1.24 | 4.06 | N/A
Llama 3.1 405B | 128k | 40 | $7.50 | 36.4 | 0.80 | 14.52 | N/A
Llama 3.1 405B Turbo | 128k | 40 | $3.50 | 92.0 | 0.45 | 5.88 | N/A
Llama 3.1 405B | 128k | 40 | $3.50 | 17.0 | 1.06 | 30.47 | N/A
Qwen2.5 72B | 131k | 40 | $0.40 | 30.9 | 1.58 | 17.78 | N/A
Qwen2.5 72B | 131k | 40 | $0.20 | 18.3 | 0.79 | 28.13 | N/A
Qwen2.5 72B Fast | 131k | 40 | $0.38 | 67.0 | 0.57 | 8.03 | N/A
Qwen2.5 72B | 131k | 40 | $0.90 | 44.3 | 0.45 | 11.75 | N/A
Qwen2.5 72B | 33k | 40 | $0.27 | 40.1 | 0.29 | 12.74 | N/A
Qwen2.5 72B Turbo | 131k | 40 | $1.20 | 85.4 | 0.63 | 6.49 | N/A
Qwen2.5 72B | 131k | 40 | $0.00 | 62.7 | 1.10 | 9.08 | N/A
MiniMax-Text-01 | 1m | 40 | $0.42 | 34.2 | 0.90 | 15.54 | N/A
Phi-4 | 16k | 40 | $0.15 | 123.6 | 0.51 | 4.55 | N/A
Phi-4 | 16k | 40 | $0.22 | 42.5 | 0.48 | 12.26 | N/A
Phi-4 | 16k | 40 | $0.09 | 41.0 | 0.25 | 12.46 | N/A
Command A | 256k | 40 | $4.38 | 55.0 | 0.26 | 9.34 | N/A
Gemini 1.5 Flash (Sep) (Vertex) | 1m | 39 | $0.13 | 193.5 | 0.22 | 2.80 | N/A
Gemini 1.5 Flash (Sep) (AI Studio) | 1m | 39 | $0.13 | 169.3 | 0.33 | 3.29 | N/A
Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 39.0 | 0.45 | 13.27 | N/A
Mistral Large 2 (Nov '24) | 128k | 38 | $3.00 | 36.3 | 0.52 | 14.30 | N/A
Gemma 3 27B | 128k | 38 | $0.07 | 39.1 | 0.41 | 13.19 | N/A
Grok Beta | 128k | 38 | $7.50 | 64.1 | 0.30 | 8.11 | N/A
Pixtral Large | 128k | 37 | $3.00 | 31.9 | 0.49 | 16.18 | N/A
Qwen2.5 Instruct 32B Fast | 128k | 37 | $0.20 | 79.0 | 0.54 | 6.86 | N/A
Qwen2.5 Instruct 32B Base | 128k | 37 | $0.10 | 59.4 | 0.59 | 9.00 | N/A
Llama 3.1 Nemotron 70B (FP8) | 128k | 37 | $0.17 | 43.2 | 0.52 | 12.09 | N/A
Llama 3.1 Nemotron 70B Base | 128k | 37 | $0.20 | 40.8 | 0.64 | 12.91 | N/A
Llama 3.1 Nemotron 70B Fast | 128k | 37 | $0.38 | 71.8 | 0.57 | 7.54 | N/A
Llama 3.1 Nemotron 70B | 128k | 37 | $0.27 | 35.8 | 0.32 | 14.28 | N/A
Nova Pro | 300k | 37 | $1.40 | 109.6 | 0.36 | 4.92 | N/A
Nova Pro Latency Optimized | 300k | 37 | $1.75 | 109.5 | 0.41 | 4.98 | N/A
Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 35.6 | 0.54 | 14.60 | N/A
Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 32.8 | 0.46 | 15.70 | N/A
Mistral Large 2 (Jul '24) | 128k | 37 | $3.00 | 35.9 | 0.54 | 14.46 | N/A
Qwen2.5 Coder 32B | 33k | 36 | $0.09 | 64.4 | 0.49 | 8.26 | N/A
Qwen2.5 Coder 32B | 131k | 36 | $0.20 | 52.5 | 1.10 | 10.61 | N/A
Qwen2.5 Coder 32B | 33k | 36 | $0.90 | 66.9 | 0.34 | 7.82 | N/A
Qwen2.5 Coder 32B | 33k | 36 | $0.10 | 48.8 | 0.25 | 10.49 | N/A
Qwen2.5 Coder 32B | 131k | 36 | $0.80 | 83.7 | 0.41 | 6.38 | N/A
GPT-4o mini | 128k | 36 | $0.26 | 77.4 | 0.36 | 6.82 | N/A
GPT-4o mini | 128k | 36 | $0.26 | 159.2 | 0.97 | 4.11 | N/A
Llama 3.1 70B (FP8) | 128k | 35 | $0.17 | 48.9 | 0.53 | 10.76 | N/A
Llama 3.1 70B | 128k | 35 | $0.40 | 44.7 | 1.69 | 12.89 | N/A
Llama 3.1 70B Standard | 128k | 35 | $0.72 | 31.6 | 0.64 | 16.48 | N/A
Llama 3.1 70B Latency Optimized | 128k | 35 | $0.90 | 31.6 | 0.81 | 16.65 | N/A
Llama 3.1 70B Base | 128k | 35 | $0.20 | 38.3 | 0.65 | 13.70 | N/A
Llama 3.1 70B Fast | 128k | 35 | $0.38 | 148.0 | 0.55 | 3.93 | N/A
Llama 3.1 70B Vertex | 128k | 35 | $0.00 | 72.6 | 0.25 | 7.14 | N/A
Llama 3.1 70B | 128k | 35 | $2.90 | 53.1 | 0.46 | 9.88 | N/A
Llama 3.1 70B | 128k | 35 | $0.90 | 159.4 | 0.42 | 3.56 | N/A
Llama 3.1 70B (Turbo, FP8) | 128k | 35 | $0.20 | 34.7 | 0.28 | 14.68 | N/A
Llama 3.1 70B | 128k | 35 | $0.27 | 38.5 | 0.39 | 13.38 | N/A
Llama 3.1 70B | 32k | 35 | $0.35 | 15.0 | 1.38 | 34.64 | N/A
Llama 3.1 70B Turbo | 128k | 35 | $0.88 | 113.4 | 0.41 | 4.82 | N/A
Llama 3.1 70B | 128k | 35 | $0.90 | 126.6 | 0.50 | 4.45 | N/A
Mistral Small 3.1 | 128k | 35 | $0.15 | 159.4 | 0.33 | 3.47 | N/A
Mistral Small 3.1 Vertex | 128k | 35 | $0.15 | 208.5 | 0.17 | 2.57 | N/A
Mistral Small 3 | 32k | 35 | $0.15 | 131.0 | 0.39 | 4.21 | N/A
Mistral Small 3 | 32k | 35 | $0.09 | 61.9 | 0.28 | 8.36 | N/A
Mistral Small 3 | 32k | 35 | $0.80 | 96.6 | 0.25 | 5.43 | N/A
Claude 3 Opus | 200k | 35 | $30.00 | 24.4 | 1.27 | 21.79 | N/A
Claude 3 Opus Vertex | 200k | 35 | $30.00 | 27.3 | 1.12 | 19.44 | N/A
Claude 3 Opus | 200k | 35 | $30.00 | 28.1 | 1.12 | 18.89 | N/A
Claude 3.5 Haiku Standard | 200k | 35 | $1.60 | 55.6 | 1.37 | 10.36 | N/A
Claude 3.5 Haiku Latency Optimized | 200k | 35 | $2.00 | 49.7 | 1.27 | 11.33 | N/A
Claude 3.5 Haiku Vertex | 200k | 35 | $1.60 | 65.9 | 0.65 | 8.23 | N/A
Claude 3.5 Haiku | 200k | 35 | $1.60 | 65.9 | 6.29 | 13.88 | N/A
DeepSeek R1 Distill Llama 8B | 32k | 34 | $0.04 | 54.1 | 0.71 | 46.93 | 36.98
Gemma 3 12B | 128k | 34 | $0.06 | 43.6 | 0.46 | 11.93 | N/A
Gemini 1.5 Pro (May) (Vertex) | 2m | 34 | $2.19 | 66.3 | 0.39 | 7.93 | N/A
Gemini 1.5 Pro (May) (AI Studio) | 2m | 34 | $2.19 | 65.4 | 0.44 | 8.09 | N/A
Qwen Turbo | 1m | 34 | $0.09 | 109.4 | 1.00 | 5.57 | N/A
Llama 3.2 90B (Vision) | 128k | 33 | $0.72 | 60.3 | 0.51 | 8.80 | N/A
Llama 3.2 90B (Vision) Vertex | 128k | 33 | $0.00 | 30.4 | 0.18 | 16.63 | N/A
Llama 3.2 90B (Vision) | 128k | 33 | $0.90 | 42.3 | 0.40 | 12.21 | N/A
Llama 3.2 90B (Vision) | 33k | 33 | $0.36 | 37.8 | 0.42 | 13.66 | N/A
Llama 3.2 90B (Vision) Turbo | 128k | 33 | $1.20 | 31.0 | 0.26 | 16.39 | N/A
Qwen2 72B | 33k | 33 | $0.90 | 42.0 | 0.56 | 12.48 | N/A
Qwen2 72B | 131k | 33 | $0.00 | 31.0 | 1.29 | 17.41 | N/A
Nova Lite | 300k | 33 | $0.10 | 279.1 | 0.35 | 2.14 | N/A
Gemini 1.5 Flash-8B AI Studio | 1m | 31 | $0.07 | 284.8 | 0.21 | 1.97 | N/A
Jamba 1.5 Large | 256k | 29 | $3.50 | 66.3 | 0.56 | 8.10 | N/A
Jamba 1.5 Large | 256k | 29 | $3.50 | 51.2 | 0.69 | 10.44 | N/A
Jamba 1.6 Large | 256k | 29 | $3.50 | 63.5 | 0.53 | 8.41 | N/A
Gemini 1.5 Flash (May) (Vertex) | 1m | 28 | $0.13 | 309.4 | 0.29 | 1.91 | N/A
Gemini 1.5 Flash (May) (AI Studio) | 1m | 28 | $0.13 | 307.0 | 0.25 | 1.87 | N/A
Nova Micro | 130k | 28 | $0.06 | 330.0 | 0.32 | 1.84 | N/A
Yi-Large | 32k | 28 | $3.00 | 69.1 | 0.45 | 7.68 | N/A
Claude 3 Sonnet | 200k | 28 | $6.00 | 59.1 | 0.73 | 9.19 | N/A
Claude 3 Sonnet | 200k | 28 | $6.00 | 61.5 | 0.56 | 8.68 | N/A
Codestral (Jan '25) | 256k | 28 | $0.45 | 183.1 | 0.33 | 3.06 | N/A
Codestral (Jan '25) Vertex | 128k | 28 | $0.45 | 149.5 | 0.15 | 3.49 | N/A
Llama 3 70B | 8k | 27 | $1.18 | 45.4 | 0.42 | 11.43 | N/A
Llama 3 70B | 8k | 27 | $0.40 | 32.7 | 1.03 | 16.32 | N/A
Llama 3 70B | 8k | 27 | $2.86 | 51.7 | 0.42 | 10.09 | N/A
Llama 3 70B | 8k | 27 | $2.90 | 19.0 | 0.78 | 27.16 | N/A
Llama 3 70B | 8k | 27 | $0.90 | 154.3 | 0.41 | 3.65 | N/A
Llama 3 70B | 8k | 27 | $0.27 | 35.6 | 0.55 | 14.60 | N/A
Llama 3 70B | 8k | 27 | $0.57 | 33.0 | 0.63 | 15.80 | N/A
Llama 3 70B | 8k | 27 | $0.64 | 334.3 | 0.27 | 1.77 | N/A
Llama 3 70B (Reference, FP16) | 8k | 27 | $0.90 | 130.1 | 0.67 | 4.52 | N/A
Llama 3 70B (Turbo, FP8) | 8k | 27 | $0.88 | 20.8 | 0.38 | 24.44 | N/A
Mistral Small (Sep '24) | 33k | 27 | $0.30 | 58.1 | 0.43 | 9.04 | N/A
Phi-4 Multimodal | 128k | 27 | $0.00 | 21.6 | 0.37 | 23.56 | N/A
Qwen2.5 Coder 7B Fast | 131k | 27 | $0.04 | 219.1 | 0.49 | 2.77 | N/A
Qwen2.5 Coder 7B Base | 131k | 27 | $0.01 | 200.9 | 0.51 | 3.00 | N/A
Mistral Large (Feb '24) | 33k | 26 | $6.00 | 30.6 | 0.54 | 16.89 | N/A
Mistral Large (Feb '24) | 33k | 26 | $6.00 | 42.0 | 0.43 | 12.34 | N/A
Mistral Large (Feb '24) | 33k | 26 | $6.00 | 40.1 | 0.51 | 12.98 | N/A
Mixtral 8x22B | 65k | 26 | $3.00 | 55.1 | 0.37 | 9.44 | N/A
Mixtral 8x22B Base | 65k | 26 | $0.60 | 78.2 | 0.58 | 6.98 | N/A
Mixtral 8x22B Fast | 65k | 26 | $1.05 | 100.1 | 0.53 | 5.52 | N/A
Mixtral 8x22B | 65k | 26 | $1.20 | 75.5 | 0.44 | 7.07 | N/A
Mixtral 8x22B | 65k | 26 | $1.20 | 90.9 | 0.31 | 5.81 | N/A
Phi-4 Mini | 128k | 26 | $0.12 | 221.9 | 0.43 | 2.69 | N/A
Phi-4 Mini | 128k | 26 | $0.00 | 56.3 | 0.34 | 9.22 | N/A
Phi-3 Medium 14B | 128k | 25 | $0.30 | 51.9 | 0.44 | 10.07 | N/A
Gemma 3 4B | 128k | 24 | $0.03 | 90.9 | 0.30 | 5.80 | N/A
Claude 2.1 | 200k | 24 | $12.00 | 29.4 | 1.70 | 18.68 | N/A
Claude 2.1 | 200k | 24 | $12.00 | 14.0 | 0.86 | 36.55 | N/A
Llama 3.1 8B | 128k | 24 | $0.03 | 135.9 | 0.43 | 4.11 | N/A
Llama 3.1 8B | 33k | 24 | $0.10 | 2,146.2 | 0.29 | 0.53 | N/A
Llama 3.1 8B | 128k | 24 | $0.10 | 73.3 | 1.06 | 7.89 | N/A
Llama 3.1 8B | 128k | 24 | $0.22 | 91.9 | 0.36 | 5.80 | N/A
Llama 3.1 8B Fast | 128k | 24 | $0.04 | 184.8 | 0.49 | 3.20 | N/A
Llama 3.1 8B Base | 128k | 24 | $0.03 | 66.5 | 0.53 | 8.05 | N/A
Llama 3.1 8B Vertex | 128k | 24 | $0.00 | 118.8 | 0.17 | 4.38 | N/A
Llama 3.1 8B | 128k | 24 | $0.38 | 226.0 | 0.31 | 2.52 | N/A
Llama 3.1 8B | 128k | 24 | $0.20 | 217.4 | 0.32 | 2.62 | N/A
Llama 3.1 8B | 128k | 24 | $0.04 | 47.2 | 0.24 | 10.83 | N/A
Llama 3.1 8B | 128k | 24 | $0.10 | 464.2 | 0.38 | 1.46 | N/A
Llama 3.1 8B | 16k | 24 | $0.05 | 71.2 | 0.74 | 7.77 | N/A
Llama 3.1 8B | 128k | 24 | $0.06 | 899.6 | 0.21 | 0.77 | N/A
Llama 3.1 8B | 16k | 24 | $0.13 | 1,173.1 | 0.21 | 0.64 | N/A
Llama 3.1 8B Turbo | 128k | 24 | $0.18 | 143.2 | 0.32 | 3.81 | N/A
Llama 3.1 8B | 128k | 24 | $0.15 | 467.2 | 0.18 | 1.25 | N/A
Llama 3.1 8B | 128k | 24 | $0.18 | 61.6 | 0.52 | 8.63 | N/A
Pixtral 12B | 128k | 23 | $0.15 | 106.4 | 0.34 | 5.04 | N/A
Pixtral 12B | 128k | 23 | $0.10 | 79.6 | 0.64 | 6.92 | N/A
Mistral Small (Feb '24) | 33k | 23 | $1.50 | 160.3 | 0.31 | 3.42 | N/A
Mistral Small (Feb '24) | 33k | 23 | $1.50 | 88.5 | 0.41 | 6.06 | N/A
Mistral Medium | 33k | 23 | $4.09 | 40.5 | 0.47 | 12.81 | N/A
Ministral 8B | 128k | 22 | $0.10 | 140.1 | 0.37 | 3.94 | N/A
Gemma 2 9B Fast | 8k | 22 | $0.04 | 169.5 | 0.49 | 3.44 | N/A
Gemma 2 9B Base | 8k | 22 | $0.03 | 171.2 | 0.52 | 3.44 | N/A
Gemma 2 9B | 8k | 22 | $0.04 | 43.1 | 0.47 | 12.06 | N/A
Gemma 2 9B | 8k | 22 | $0.20 | 715.7 | 0.24 | 0.94 | N/A
Gemma 2 9B | 8k | 22 | $0.30 | 134.7 | 0.22 | 3.94 | N/A
LFM 40B | 32k | 22 | $0.15 | 169.2 | 0.43 | 3.38 | N/A
Command-R+ | 128k | 21 | $6.00 | 47.7 | 0.49 | 10.98 | N/A
Command-R+ | 128k | 21 | $4.38 | 52.0 | 0.28 | 9.90 | N/A
Llama 3 8B | 8k | 21 | $0.10 | 81.3 | 0.48 | 6.63 | N/A
Llama 3 8B | 8k | 21 | $0.38 | 103.2 | 0.31 | 5.15 | N/A
Llama 3 8B | 8k | 21 | $0.38 | 73.8 | 0.39 | 7.16 | N/A
Llama 3 8B | 8k | 21 | $0.04 | 69.9 | 0.29 | 7.44 | N/A
Llama 3 8B | 8k | 21 | $0.04 | 44.5 | 0.88 | 12.12 | N/A
Llama 3 8B | 8k | 21 | $0.06 | 1,350.3 | 0.32 | 0.69 | N/A
Llama 3 8B | 8k | 21 | $0.20 | 174.7 | 0.42 | 3.28 | N/A
Gemini 1.0 Pro Vertex | 33k | 21 | $0.19 | 163.4 | 0.32 | 3.37 | N/A
Codestral (May '24) | 33k | 20 | $0.30 | 100.1 | 0.40 | 5.39 | N/A
Aya Expanse 32B | 128k | 20 | $0.75 | 120.4 | 0.16 | 4.31 | N/A
Command-R+ (Apr '24) | 128k | 20 | $6.00 | 47.7 | 0.50 | 10.99 | N/A
Command-R+ (Apr '24) | 128k | 20 | $6.00 | 70.8 | 0.24 | 7.30 | N/A
Command-R+ (Apr '24) | 128k | 20 | $6.00 | 50.3 | 0.62 | 10.56 | N/A
DBRX | 33k | 20 | $1.13 | 69.7 | 0.56 | 7.73 | N/A
Ministral 3B | 128k | 20 | $0.04 | 217.8 | 0.30 | 2.60 | N/A
Mistral NeMo | 128k | 20 | $0.15 | 128.3 | 0.37 | 4.26 | N/A
Mistral NeMo Fast | 128k | 20 | $0.12 | 164.1 | 0.53 | 3.57 | N/A
Mistral NeMo Base | 128k | 20 | $0.06 | 26.6 | 0.75 | 19.53 | N/A
Mistral NeMo | 128k | 20 | $0.06 | 54.0 | 0.24 | 9.49 | N/A
Llama 3.2 3B (FP8) | 128k | 20 | $0.02 | 215.9 | 0.45 | 2.77 | N/A
Llama 3.2 3B | 128k | 20 | $0.10 | 216.7 | 0.85 | 3.16 | N/A
Llama 3.2 3B | 128k | 20 | $0.15 | 72.4 | 0.47 | 7.38 | N/A
Llama 3.2 3B Base | 128k | 20 | $0.01 | 125.8 | 0.48 | 4.46 | N/A
Llama 3.2 3B | 128k | 20 | $0.06 | 246.1 | 0.41 | 2.44 | N/A
Llama 3.2 3B | 128k | 20 | $0.02 | 144.1 | 0.21 | 3.68 | N/A
Llama 3.2 3B | 32k | 20 | $0.04 | 71.0 | 0.67 | 7.71 | N/A
Llama 3.2 3B | 8k | 20 | $0.10 | 1,587.7 | 0.23 | 0.54 | N/A
Llama 3.2 3B Turbo | 128k | 20 | $0.06 | 162.2 | 0.33 | 3.41 | N/A
DeepSeek R1 Distill Qwen 1.5B | 128k | 19 | $0.18 | 339.1 | 0.26 | 7.63 | 5.90
Jamba 1.5 Mini | 256k | 18 | $0.25 | 164.5 | 0.32 | 3.36 | N/A
Jamba 1.5 Mini | 256k | 18 | $0.25 | 82.6 | 0.49 | 6.55 | N/A
Jamba 1.6 Mini | 256k | 18 | $0.25 | 200.6 | 0.35 | 2.84 | N/A
Mixtral 8x7B | 33k | 17 | $0.70 | 77.1 | 0.38 | 6.86 | N/A
Mixtral 8x7B | 33k | 17 | $0.51 | 59.5 | 0.33 | 8.73 | N/A
Mixtral 8x7B Fast | 33k | 17 | $0.23 | 53.6 | 0.61 | 9.93 | N/A
Mixtral 8x7B Base | 33k | 17 | $0.12 | 53.7 | 0.60 | 9.91 | N/A
Mixtral 8x7B | 33k | 17 | $0.50 | 177.9 | 0.33 | 3.14 | N/A
Mixtral 8x7B | 33k | 17 | $0.24 | 99.0 | 0.21 | 5.26 | N/A
Mixtral 8x7B | 33k | 17 | $0.63 | 95.2 | 0.48 | 5.73 | N/A
Mixtral 8x7B | 33k | 17 | $0.60 | 53.1 | 0.43 | 9.85 | N/A
Aya Expanse 8B | 8k | 16 | $0.75 | 167.8 | 0.12 | 3.10 | N/A
Command-R | 128k | 15 | $0.75 | 109.6 | 0.36 | 4.92 | N/A
Command-R | 128k | 15 | $0.26 | 85.4 | 0.19 | 6.04 | N/A
Command-R (Mar '24) | 128k | 15 | $0.75 | 109.1 | 0.36 | 4.94 | N/A
Command-R (Mar '24) | 128k | 15 | $0.75 | 176.6 | 0.15 | 2.98 | N/A
Command-R (Mar '24) | 128k | 15 | $0.75 | 80.1 | 0.47 | 6.71 | N/A
Codestral-Mamba | 256k | 14 | $0.25 | 93.0 | 0.53 | 5.90 | N/A
Mistral 7B | 8k | 10 | $0.25 | 102.3 | 0.36 | 5.25 | N/A
Mistral 7B | 8k | 10 | $0.04 | 77.1 | 0.22 | 6.71 | N/A
Mistral 7B | 32k | 10 | $0.06 | 117.3 | 0.84 | 5.10 | N/A
Mistral 7B | 8k | 10 | $0.20 | 173.8 | 0.18 | 3.06 | N/A
Llama 3.2 1B | 128k | 10 | $0.10 | 118.5 | 0.45 | 4.66 | N/A
Llama 3.2 1B Base | 128k | 10 | $0.01 | 272.7 | 0.48 | 2.31 | N/A
Llama 3.2 1B | 128k | 10 | $0.01 | 179.6 | 0.24 | 3.02 | N/A
Llama 3.2 1B | 16k | 10 | $0.05 | 2,606.2 | 0.18 | 0.37 | N/A
Llama 2 Chat 7B | 4k | 8 | $0.10 | 132.7 | 0.53 | 4.30 | N/A
o1-preview | 128k | | $26.25 | 163.0 | 20.69 | 23.76 | N/A
o1-preview | 128k | | $28.88 | 133.1 | 26.72 | 30.47 | N/A
GPT-4o (Aug '24) | 128k | | $4.38 | 100.6 | 0.56 | 5.53 | N/A
GPT-4o (Aug '24) | 128k | | $4.38 | 114.9 | 0.82 | 5.17 | N/A
o3 | 128k | | $17.50 | 91.1 | 30.18 | 35.67 | N/A
GPT-4.5 (Preview) | 128k | | $93.75 | 58.0 | 1.15 | 9.77 | N/A
Llama 3.2 11B (Vision) | 128k | | $0.16 | 143.3 | 0.47 | 3.96 | N/A
Llama 3.2 11B (Vision) | 128k | | $0.15 | 83.4 | 0.44 | 6.43 | N/A
Llama 3.2 11B (Vision) | 128k | | $0.20 | 109.8 | 0.27 | 4.82 | N/A
Llama 3.2 11B (Vision) | 128k | | $0.06 | 48.8 | 0.22 | 10.47 | N/A
Llama 3.2 11B (Vision) Turbo | 128k | | $0.18 | 115.3 | 0.24 | 4.58 | N/A
Gemma 2 27B Fast | 8k | | $0.26 | 87.1 | 0.54 | 6.28 | N/A
Gemma 2 27B Base | 8k | | $0.15 | 54.5 | 0.59 | 9.76 | N/A
Gemma 2 27B | 8k | | $0.80 | 89.1 | 0.27 | 5.88 | N/A
Gemini 2.5 Flash Preview (Reasoning) (AI Studio) | 1m | | $0.99 | 151.4 | 11.55 | 14.86 | N/A
Gemini 2.5 Flash Preview (AI Studio) | 1m | | $0.26 | 213.1 | 10.71 | 13.06 | N/A
Claude 3.5 Sonnet (June) | 200k | | $6.00 | 45.3 | 0.91 | 11.95 | N/A
Claude 3.5 Sonnet (June) Vertex | 200k | | $6.00 | 80.1 | 0.92 | 7.16 | N/A
Claude 3.5 Sonnet (June) | 200k | | $6.00 | 80.1 | 0.71 | 6.95 | N/A
Claude 3 Haiku | 200k | | $0.50 | 107.3 | 1.00 | 5.66 | N/A
Claude 3 Haiku | 200k | | $0.50 | 138.3 | 0.42 | 4.04 | N/A
Mistral Saba | 32k | | $0.30 | 84.7 | 0.40 | 6.30 | N/A
DeepSeek Coder V2 Lite Fast, FP8 | 128k | | $0.12 | 115.3 | 0.60 | 4.94 | N/A
DeepSeek Coder V2 Lite Base, FP8 | 128k | | $0.06 | 110.4 | 0.58 | 5.11 | N/A
Sonar Reasoning | 127k | | $2.00 | 77.7 | 2.04 | 34.23 | 25.75
Solar Mini | 4k | | $0.15 | 65.2 | 1.09 | 8.76 | N/A
Reka Flash | 128k | | $0.35 | 46.2 | 0.96 | 11.79 | N/A
Reka Core | 128k | | $2.00 | 27.9 | 0.97 | 18.87 | N/A
Reka Flash (Feb '24) | 128k | | $0.35 | 46.1 | 0.94 | 11.79 | N/A
Reka Edge | 128k | | $0.10 | 84.0 | 0.95 | 6.90 | N/A
Qwen1.5 Chat 110B | 32k | | $0.00 | 29.6 | 1.25 | 18.14 | N/A
GPT-4 Turbo | 128k | | $15.00 | 34.6 | 0.74 | 15.20 | N/A
GPT-4 Turbo | 128k | | $15.00 | 46.2 | 1.54 | 12.36 | N/A
GPT-4 | 8k | | $37.50 | 24.3 | 0.71 | 21.31 | N/A
Gemini 2.0 Flash-Lite (Preview) (AI Studio) | 1m | | $0.13 | 200.5 | 0.28 | 2.78 | N/A
Claude 2.0 | 100k | | $12.00 | 30.7 | 0.86 | 17.16 | N/A
OpenChat 3.5 | 8k | | $0.06 | 45.5 | 0.23 | 11.23 | N/A
Jamba Instruct | 256k | | $0.55 | 168.8 | 0.37 | 3.33 | N/A
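
The output speed, latency, and end-to-end response time figures in the table above appear to be related by simple arithmetic. As a rough sketch, assuming each measurement reflects a response of about 500 output tokens (an inference from the listed numbers, not something stated in this section; the FAQs describe the actual methodology), the end-to-end response time is approximately the latency, plus the separately listed reasoning time where a row has one, plus the time needed to generate the output tokens. The function name `estimate_response_time` and the 500-token default are hypothetical and used only for illustration:

```python
def estimate_response_time(latency_s, tokens_per_s, reasoning_s=0.0, output_tokens=500):
    """Estimate end-to-end response time in seconds.

    Assumption (inferred from the table, not confirmed by the source):
    total = latency + reasoning time + output_tokens / output speed,
    with output_tokens = 500.
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return latency_s + reasoning_s + output_tokens / tokens_per_s


# Rows copied from the table: (model, latency s, tokens/s, reasoning s, listed total s)
rows = [
    ("GPT-4.1 mini", 0.38, 103.1, 0.0, 5.23),
    ("Grok 3 mini Reasoning (high)", 0.36, 99.8, 20.04, 25.40),
    ("Llama 3.1 8B", 0.21, 899.6, 0.0, 0.77),
]

for model, latency, speed, reasoning, listed in rows:
    estimate = estimate_response_time(latency, speed, reasoning)
    print(f"{model}: estimated {estimate:.2f} s, listed {listed:.2f} s")
```

Small differences between the estimate and the listed value are expected, since the table shows rounded figures. For rows whose last value is N/A but whose latency is tens of seconds (for example the o-series models), the listed latency seems to already include the reasoning phase, so `reasoning_s` would stay at 0.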