Simple, transparent pricing

Pay only for what you use. No hidden fees, no minimum commitments. Start free and scale as you grow.

Serverless Inference

Text & Chat Models

Pricing per 1M tokens. All models support streaming and function calling.

Model | Parameters | Context | Input | Output
Llama 3.3 70B | 70B | 128K | $0.88 | $0.88
Llama 3.1 405B | 405B | 128K | $3.50 | $3.50
Llama 3.1 8B | 8B | 128K | $0.10 | $0.10
Mistral Large 2 | 123B | 128K | $2.00 | $6.00
Mixtral 8x22B | 141B MoE | 64K | $0.90 | $0.90
Mixtral 8x7B | 46B MoE | 32K | $0.50 | $0.50
Qwen 2.5 72B | 72B | 128K | $0.90 | $0.90
DeepSeek V3 | 671B MoE | 64K | $0.50 | $1.50
Gemma 2 27B | 27B | 8K | $0.30 | $0.30
Gemma 2 9B | 9B | 8K | $0.10 | $0.10
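
As a rough illustration of how per-1M-token pricing translates into the cost of a single request, here is a minimal sketch. The function name `estimate_cost` and the dictionary layout are hypothetical and not part of any official SDK; the prices are copied from the table above, and the sketch assumes that input and output tokens are each billed linearly at the listed rate.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the pricing table.
# Only a few models are listed here to keep the sketch short.
PRICES_PER_MILLION = {
    "Llama 3.3 70B": (Decimal("0.88"), Decimal("0.88")),
    "Mistral Large 2": (Decimal("2.00"), Decimal("6.00")),
    "DeepSeek V3": (Decimal("0.50"), Decimal("1.50")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes linear billing: tokens * (price per 1M tokens) / 1,000,000,
    computed separately for input and output tokens.
    """
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / MILLION


if __name__ == "__main__":
    # 2,000 prompt tokens and 500 completion tokens on Mistral Large 2.
    print(estimate_cost("Mistral Large 2", 2_000, 500))
```

`Decimal` is used instead of `float` so that small per-request amounts do not pick up binary rounding noise when summed over many requests.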

Embeddings

Pricing per 1M tokens. Ideal for RAG, semantic search, and similarity matching.

Model | Dimensions | Price / 1M tokens
UAE Large V1 | 1024 | $0.016
BGE Large EN | 1024 | $0.016
BGE Base EN | 768 | $0.008
E5 Mistral 7B | 4096 | $0.020

Image Generation

Pricing per image. Higher resolutions may incur additional costs.

Model | Resolution | Price / image
FLUX.1 Pro | 1024x1024 | $0.050
FLUX.1 Dev | 1024x1024 | $0.025
FLUX.1 Schnell | 1024x1024 | $0.003
Stable Diffusion XL | 1024x1024 | $0.002
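
Because image generation is billed per image rather than per token, a batch estimate is a single multiplication. The sketch below uses a hypothetical helper, `batch_cost`, with prices copied from the table above. It assumes every image is generated at the listed 1024x1024 resolution, since the surcharge for higher resolutions is not specified here.

```python
from decimal import Decimal

# USD per 1024x1024 image, copied from the pricing table.
PRICE_PER_IMAGE = {
    "FLUX.1 Pro": Decimal("0.050"),
    "FLUX.1 Dev": Decimal("0.025"),
    "FLUX.1 Schnell": Decimal("0.003"),
    "Stable Diffusion XL": Decimal("0.002"),
}


def batch_cost(model: str, image_count: int) -> Decimal:
    """Estimate the USD cost of a batch of images (hypothetical helper).

    Assumes all images use the listed 1024x1024 resolution.
    """
    return PRICE_PER_IMAGE[model] * image_count


if __name__ == "__main__":
    print(batch_cost("FLUX.1 Dev", 400))
```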

Free Tier

Get started with $10 in free credits. No credit card required. Perfect for testing and prototyping your AI applications.

Pricing FAQ

How does billing work?

We use a pay-as-you-go model. You're billed monthly based on your usage. You can set spending limits and receive alerts to stay in control of your costs.

Is there a free tier?

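Yes! New users receive $10 in free credits. That is enough for thousands of API calls: at the rates listed above, $10 covers about 100 million tokens on Llama 3.1 8B or about 2.8 million tokens on Llama 3.1 405B, so you can thoroughly test our platform before committing.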
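
To get a feel for how far the $10 in free credits goes, here is a minimal sketch. The helper names `tokens_covered` and `calls_covered` are hypothetical, and the figure of 1,000 tokens per call is an assumption chosen for illustration, not a measured average. It uses models whose input and output prices are equal, so a single per-1M-token price applies to all tokens.

```python
from decimal import Decimal

FREE_CREDITS = Decimal("10")  # USD in free credits for new users


def tokens_covered(price_per_million: Decimal,
                   credits: Decimal = FREE_CREDITS) -> int:
    """Whole tokens that `credits` pays for at a flat per-1M-token price."""
    return int(credits / price_per_million * 1_000_000)


def calls_covered(price_per_million: Decimal, tokens_per_call: int,
                  credits: Decimal = FREE_CREDITS) -> int:
    """Whole API calls covered, assuming a fixed token count per call."""
    return tokens_covered(price_per_million, credits) // tokens_per_call


if __name__ == "__main__":
    # Assumed request size: 1,000 tokens (input + output) per call.
    print(calls_covered(Decimal("0.10"), 1_000))  # Llama 3.1 8B
    print(calls_covered(Decimal("3.50"), 1_000))  # Llama 3.1 405B
```

Under the assumed 1,000 tokens per call, the credits cover 100,000 calls on Llama 3.1 8B and 2,857 calls on Llama 3.1 405B. Actual mileage depends on your prompt and response lengths.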

Do you offer volume discounts?

Yes, we offer volume discounts for high-usage customers. Contact our sales team to discuss custom pricing for your needs.

What payment methods do you accept?

We accept all major credit cards (Visa, Mastercard, American Express) and support invoice billing for enterprise customers.

Ready to get started?

Start building with $10 in free credits. No credit card required.