DOSDOS
Pricing
Get started
Models/Qwen 3.5 35B-A3B

Qwen 3.5 35B-A3B

New

by Qwen

Ultra-efficient MoE model — 35B total, 3B active parameters. Fast inference at near-8B cost with 70B-class quality.

Parameters
35B MoE (3B active)
Context Length
128K
Category
chat
Available Serverless

Run queries immediately, pay only for usage

$0.50in|$1.50out

Per 1M Tokens

Try this modelView documentation

About this model

Qwen 3.5 35B-A3B is a cutting-edge Mixture-of-Experts model that activates only 3B of its 35B parameters per token. This design delivers 70B-class quality at 8B-class speed and cost. Optimized for multilingual tasks, instruction following, and tool use.

Capabilities

Multi-turn conversationInstruction followingTool callingMultilingual (29 languages)Code generationStructured output

Use Cases

  • AI agents
  • Customer support
  • Content generation
  • Code assistance
  • Data analysis

Model Details

Provider
Qwen
Model ID
Qwen3.5-35B-A3B
Parameters
35B MoE (3B active)
Context Length
128K tokens
Category
chat

API Usage

Use the DOS API to integrate Qwen 3.5 35B-A3Binto your applications. Our API is compatible with OpenAI's client libraries for easy migration.

Model ID

Qwen3.5-35B-A3B

Python

python
from dos import DOS

client = DOS()

response = client.chat.completions.create(
    model="Qwen3.5-35B-A3B",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

print(response.choices[0].message.content)

cURL

bash
curl https://api.dos.ai/v1/chat/completions \
  -H "Authorization: Bearer $DOS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen3.5-35B-A3B",
    "messages": [
      {"role": "user", "content": "Hello, how are you?"}
    ]
  }'

Node.js

javascript
import DOS from 'dos-ai';

const client = new DOS();

const response = await client.chat.completions.create({
  model: "Qwen3.5-35B-A3B",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ]
});

console.log(response.choices[0].message.content);
View full API reference

Related Models

Meta

Llama 3.3 70B

High-performance multilingual LLM optimized for dialogue and instruction following.

$0.88 / 1M tokens
Meta

Llama 3.1 405B

The largest and most capable Llama model for complex reasoning and generation tasks.

$3.50 / 1M tokens
Mistral AI

Mistral Large 2

Flagship model with strong multilingual and coding capabilities.

$2.00 / 1M tokens
DOSDOS

AI infrastructure for everyone. Inference, agents, and safety — all in one platform.

Product

  • Models
  • Pricing
  • API Inference
  • DOSClaw
  • GPU Cloud

Developers

  • Documentation
  • API Reference
  • Status

DOS Ecosystem

  • DOSafe
  • DOS.Me
  • DOScan
  • DOSwap
  • MetaDOS

Company

  • About
  • Contact
  • Careers
  • Privacy
  • Terms

© 2026 All rights reserved.