Models (28)

All Models

MetaNew
LLM

Llama 3.3 70B

High-performance multilingual LLM optimized for dialogue and instruction following.

70B128K context
$0.88per 1M tokens
Meta
LLM

Llama 3.1 405B

The largest and most capable Llama model for complex reasoning and generation tasks.

405B128K context
$3.50per 1M tokens
Meta
LLM

Llama 3.1 70B

Balanced performance and efficiency for production workloads.

70B128K context
$0.88per 1M tokens
Meta
LLM

Llama 3.1 8B

Fast and cost-effective model for simpler tasks and high-volume applications.

8B128K context
$0.10per 1M tokens
Mistral AINew
LLM

Mistral Large 2

Flagship model with strong multilingual and coding capabilities.

123B128K context
$2.00 / $6.00per 1M tokens
Mistral AI
LLM

Mixtral 8x22B

Sparse mixture-of-experts model balancing capability and efficiency.

141B MoE64K context
$0.90per 1M tokens
Mistral AI
LLM

Mixtral 8x7B

Efficient MoE model great for general-purpose tasks.

46B MoE32K context
$0.50per 1M tokens
QwenNew
LLM

Qwen 2.5 72B

Strong multilingual model with excellent Chinese and English performance.

72B128K context
$0.90per 1M tokens
Qwen
LLM

Qwen 2.5 32B

Mid-size model with great balance of speed and capability.

32B128K context
$0.40per 1M tokens
DeepSeekNew
LLM

DeepSeek V3

State-of-the-art MoE model with exceptional reasoning capabilities.

671B MoE64K context
$0.50 / $1.50per 1M tokens
DeepSeekNew
LLM

DeepSeek R1

Reasoning-focused model trained with reinforcement learning for complex tasks.

671B MoE64K context
$3.00 / $7.00per 1M tokens
Google
LLM

Gemma 2 27B

Efficient model from Google with strong performance on diverse tasks.

27B8K context
$0.30per 1M tokens
Google
LLM

Gemma 2 9B

Lightweight model ideal for on-device and edge deployments.

9B8K context
$0.10per 1M tokens
Meta
Code

Code Llama 70B

Specialized for code generation, completion, and understanding.

70B16K context
$0.88per 1M tokens
Meta
Code

Code Llama 34B

Fast code generation with support for many programming languages.

34B16K context
$0.40per 1M tokens
DeepSeek
Code

DeepSeek Coder 33B

Top-performing code model trained on 2T tokens of code.

33B16K context
$0.40per 1M tokens
QwenNew
Code

Qwen 2.5 Coder 32B

Specialized coding model with strong completion and generation.

32B128K context
$0.40per 1M tokens
MetaNew
Vision

Llama 3.2 90B Vision

Multimodal model for image understanding and visual reasoning.

90B128K context
$1.20per 1M tokens
MetaNew
Vision

Llama 3.2 11B Vision

Efficient vision-language model for image analysis tasks.

11B128K context
$0.18per 1M tokens
Qwen
Vision

Qwen2-VL 72B

Advanced vision-language model with OCR and document understanding.

72B32K context
$1.00per 1M tokens
Stability AI
Image

FLUX.1 Pro

State-of-the-art image generation with exceptional quality and prompt adherence.

12B
$0.05image
Stability AI
Image

FLUX.1 Dev

High-quality image generation for development and testing.

12B
$0.03image
Stability AI
Image

FLUX.1 Schnell

Fast image generation optimized for speed.

12B
$0.003image
Stability AI
Image

Stable Diffusion XL

Versatile image generation with fine-tuning support.

6.6B
$0.002image
Stability AI
Embedding

BGE Large EN

High-quality English embeddings for RAG and semantic search.

335M512 context
$0.021M tokens
Stability AI
Embedding

BGE Base EN

Fast and efficient embeddings for production use.

109M512 context
$0.0081M tokens
Mistral AI
Embedding

E5 Mistral 7B

Large embedding model with 4096 dimensions for high-fidelity retrieval.

7B4K context
$0.021M tokens
OpenAI
Audio

Whisper Large V3

Industry-leading speech-to-text with multilingual support.

1.5B30s context
$0.006minute

Ready to get started?

Start building with $10 in free credits. No credit card required.