Models / Qwen3 8B
Qwen3 8B
Qwen Tier 1Apache-2.0Lightweight dense model for local deployment and edge inference. Supports thinking mode.
qwen/qwen3-8bContext Window
131K
Max Output
8K
Providers
1
Released
2025-04
Capabilities
chatcodethinkingtoolsstreaming
Pricing by Provider
| Provider | Input $/1M | Output $/1M | Latency p50 | Latency p95 | Status |
|---|---|---|---|---|---|
| alibaba | $0.04 | $0.12 | 180ms | 450ms |
Quick Start
Python
import magicrouter
mr = magicrouter.Client(
provider_keys={"alibaba": "your-api-key"}
)
response = mr.chat(
"qwen/qwen3-8b",
"Your prompt here"
)
print(response.choices[0].message.content)TypeScript
import { MagicRouter } from "magicrouter";
const mr = new MagicRouter({
providerKeys: { alibaba: "your-api-key" }
});
const response = await mr.chat({
model: "qwen/qwen3-8b",
messages: [{ role: "user", content: "Your prompt here" }]
});
console.log(response.choices[0].message.content);cURL
curl https://dashscope-intl.aliyuncs.com/compatible-mode/v1/chat/completions \
-H "Authorization: Bearer your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "qwen3-8b",
"messages": [{"role": "user", "content": "Your prompt here"}]
}'