Qwen 3.5 35B-A3B

New

Ultra-efficient MoE model — 35B total, 3B active parameters. Fast inference at near-8B cost with 70B-class quality.

Parameters

35B MoE (3B active)

Context Length

128K

About this model

Qwen 3.5 35B-A3B is a cutting-edge Mixture-of-Experts model that activates only 3B of its 35B parameters per token. This design delivers 70B-class quality at 8B-class speed and cost. Optimized for multilingual tasks, instruction following, and tool use.

Capabilities

Multi-turn conversationInstruction followingTool callingMultilingual (29 languages)Code generationStructured output

Use Cases

AI agents
Customer support
Content generation
Code assistance
Data analysis

Model Details

Provider: Qwen
Model ID: Qwen3.5-35B-A3B
Parameters: 35B MoE (3B active)
Context Length: 128K tokens
Category: chat

API Usage

Use the DOS API to integrate Qwen 3.5 35B-A3Binto your applications. Our API is compatible with OpenAI's client libraries for easy migration.

Model ID

Qwen3.5-35B-A3B

Python

python

from dos import DOS

client = DOS()

response = client.chat.completions.create(
    model="Qwen3.5-35B-A3B",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

print(response.choices[0].message.content)

cURL

bash

curl https://api.dos.ai/v1/chat/completions \
  -H "Authorization: Bearer $DOS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen3.5-35B-A3B",
    "messages": [
      {"role": "user", "content": "Hello, how are you?"}
    ]
  }'

Node.js

javascript

import DOS from 'dos-ai';

const client = new DOS();

const response = await client.chat.completions.create({
  model: "Qwen3.5-35B-A3B",
  messages: [
    { role: "user", content: "Hello, how are you?" }
  ]
});

console.log(response.choices[0].message.content);

View full API reference

Related Models

Llama 3.3 70B

High-performance multilingual LLM optimized for dialogue and instruction following.

$0.88 / 1M tokens

Llama 3.1 405B

The largest and most capable Llama model for complex reasoning and generation tasks.

$3.50 / 1M tokens

Mistral AI

Mistral Large 2

Flagship model with strong multilingual and coding capabilities.

$2.00 / 1M tokens