Build AI Apps. Faster.

Access powerful AI models through a simple API. Low latency inference, easy fine-tuning, and enterprise-grade infrastructure.

Trusted by

    • MetaDOS
    • DOSafe
    • DOS.Me
    • DOScan
    • DOSwap
  • OverMint

Why use DOS?

Build faster, scale easier, and focus on what matters - your product.

Industry-leading inference speeds with optimized model serving. Get responses in milliseconds, not seconds.

Live Inference Metrics
Time to First Token
12ms
Throughput
847tok/s
P99 Latency
0.8sec
Request Latency (last 60s)
67% faster than leading alternatives

Run the latest open models with a single line of code

Evaluate and build with open-source models for chat, images, code, and more.

Model Library

Evaluate and build with open-source and specialized models for chat, images, videos, code, and more.

Migrate from closed models with OpenAI-compatible APIs.

Start building now

What can you build on DOS?

From chatbots to code assistants, DOS powers the next generation of AI applications.

Conversational AI

Build intelligent chatbots and virtual assistants.

Create AI-powered chat experiences with multi-turn conversations, context awareness, and natural language understanding.

AI

Support Agent

Online

AI

Hi! How can I help you today?

I need to reset my API key

AI

I've revoked your old key and generated a new one:

dos_sk_live_7f3a...x9k2

Copied to clipboard. The old key is now inactive.

Thanks! Can you also check my usage this month?

AI

Here's your usage summary:

API Calls12,847
Tokens2.4M
Cost$8.42
Type a message...

Code Assistant

Accelerate development with AI-powered coding.

Generate code, detect bugs, review pull requests, and create documentation automatically with state-of-the-art code models.

api-handler.ts
utils.py
123456789101112131415161718

import { streamText } from 'ai'

import { createDOS } from '@dos/sdk'

// AI-generated: streaming chat endpoint

 

const dos = createDOS()

 

export async function POST(req) {

const { messages } = await req.json()

 

const result = streamText({

model: dos('dos-ai'),

messages,

temperature: 0.7,

})

 

return result.toDataStreamResponse()

}

 

// Handle errors and rate limiting

AI: Add error handling with retry logic?Tab to accept

RAG & Search

Build powerful semantic search experiences.

Create knowledge bases with retrieval-augmented generation for accurate, contextual answers with citation support.

How do I configure rate limiting?
AI Answer3 sources

To configure rate limiting, add the rateLimit option to your API configuration:

const config = {
  rateLimit: {
    requests: 100,
    window: '1m'
  }
}

This limits each API key to 100 requests per minute. You can also set per-endpoint limits.

Sources

Rate Limiting Guide

docs/guides/rate-limiting.md

97%

API Configuration Reference

docs/api/config.md

89%

Security Best Practices

docs/guides/security.md

82%

Get started today

Start building with DOS in minutes. No credit card required. Get $10 free credits to explore our API.

Start building for free

Loved by developers worldwide.

Teams of all sizes trust DOS to power their AI applications. Here's what they have to say.

    • DOS has completely transformed how we build AI features. What used to take weeks now takes hours. The inference speed is incredible.

      Sarah Chen
      CTO at TechFlow
    • We switched from OpenAI to DOS and cut our AI costs by 60%. The API is drop-in compatible, so migration was seamless.

      Michael Torres
      Lead Engineer at DataPipe
    • The fine-tuning capabilities are game-changing. We trained a custom model for our legal documents in just a few hours with amazing accuracy.

      James Wilson
      Founder of LegalAI
    • DOS support team is exceptional. They helped us optimize our prompts and reduced latency by 40%. Best developer experience in the industry.

      Emily Zhang
      VP Engineering at Nexus
    • We built our entire RAG pipeline on DOS. The embeddings API is fast, accurate, and the pricing makes it viable at scale.

      David Park
      Founder of SearchBot
    • Enterprise-grade reliability with startup-friendly pricing. DOS powers our chatbot serving 100k+ users daily without breaking a sweat.

      Lisa Wang
      Head of AI at CloudScale

Frequently asked questions

Can't find what you're looking for? Reach out to our support team at support@dos.ai and we'll get back to you within 24 hours.

    • What models does DOS support?

      DOS provides access to 100+ open-source and proprietary models including Llama, Mistral, Claude, GPT-4, and many more. We continuously add new models as they become available.

    • How does pricing work?

      We use simple pay-as-you-go pricing based on tokens processed. No hidden fees, no minimum commitments. You only pay for what you use, and we offer volume discounts for high-usage customers.

    • Is there a free tier?

      Yes! New users get $10 in free credits to explore our API. This is enough to make thousands of API calls and test our platform thoroughly before committing.

    • How fast is the inference?

      Our optimized infrastructure delivers industry-leading inference speeds. Most requests complete in under 100ms for the first token, with streaming support for real-time applications.

    • Can I fine-tune models on DOS?

      Absolutely! We support fine-tuning for many popular models. Upload your training data, configure your hyperparameters, and we handle the rest. Fine-tuned models are private to your account.

    • Is my data secure?

      Security is our top priority. We are SOC2 Type II certified, GDPR compliant, and encrypt all data in transit and at rest. We never train on your data or share it with third parties.

    • Do you offer enterprise plans?

      Yes, we offer enterprise plans with dedicated resources, custom SLAs, priority support, and volume discounts. Contact our sales team to discuss your requirements.

    • What SDKs do you provide?

      We offer official SDKs for Python, Node.js, Go, and Rust. Our REST API is also compatible with OpenAI client libraries, making migration seamless.

    • How do I get support?

      All users have access to our documentation and community Discord. Paid plans include email support, and enterprise customers get dedicated Slack channels and account managers.