by Alibaba
Qwen3 Coder 30B A3B fits on a single 80GB GPU while delivering exceptional coding performance. Only 3B active params enables blazing fast inference. 256K context handles entire codebases.
Parameters
30B (MoE)
Architecture
Mixture of Experts
Context
262K
Provider
Alibaba
Drop-in replacement for OpenAI API. Just change the base URL.
Only pay for actual GPU compute time. No idle costs.
99.9% uptime SLA, SOC 2 compliant, dedicated support.
Scales from zero to thousands of requests automatically.
See how much you'd save running Qwen3 Coder 30B A3B on Fleek
| Model Name | Qwen3 Coder 30B A3B |
| Total Parameters | 30B (MoE) |
| Active Parameters | 3B |
| Architecture | Mixture of Experts |
| Context Length | 262K tokens |
| Inference Speed | 4,500 tokens/sec |
| Provider | Alibaba |
| Release Date | Nov 1, 2025 |
| License | Apache 2.0 |
| HuggingFace | https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct |
Software Engineering benchmark - real GitHub issues
OpenAI code generation benchmark
Mostly Basic Python Problems
Click any benchmark to view the official leaderboard. Rankings among open-source models.
Join the waitlist for early access. Start free with $5 in credits.