LLMApache

Qwen3-235B

by Alibaba

Dec 1, 2025128K context$0.08/M input$0.33/M output

Qwen 3 235B is Alibaba's flagship dense model. Excellent multilingual capabilities with strong performance across all benchmarks. Apache 2.0 for unrestricted commercial use.

View on HuggingFace
Fleek Pricing
$0.0025/GPU-second
Context128K tokens
Estimated Token Cost
Input
$0.08/M
Output
$0.33/M
Based on 18,500 tokens/sec
vs CompetitorsUp to 26%

Overview

Parameters

235B

Architecture

Dense Transformer

Context

128K

Provider

Alibaba

Best For

General reasoningMultilingual tasksContent generationAnalysis

OpenAI Compatible

Drop-in replacement for OpenAI API. Just change the base URL.

Pay Per Second

Only pay for actual GPU compute time. No idle costs.

Enterprise Ready

99.9% uptime SLA, SOC 2 compliant, dedicated support.

Auto Scaling

Scales from zero to thousands of requests automatically.

Compare Pricing

FleekFireworksTogetherBaseten
Input$0.08$0.22
Output$0.33$0.88
Savings63%

Prices are per million tokens. Fleek pricing based on $0.0025/GPU-second.

Calculate Your Savings

See how much you'd save running Qwen3-235B on Fleek

Qwen3-235B
Your Fleek Cost
$33-45/mo
13.3K-18.2K GPU-sec × $0.0025
Fireworks AI
$55/mo
Your Savings28%
Annual Savings
$187

Technical Specifications

Model NameQwen3-235B
Total Parameters235B
Active ParametersN/A
ArchitectureDense Transformer
Context Length128K tokens
Inference Speed18,500 tokens/sec
ProviderAlibaba
Release DateDec 1, 2025
LicenseApache 2.0
HuggingFacehttps://huggingface.co/Qwen/Qwen-3-235B-Instruct

Ready to run Qwen3-235B?

Join the waitlist for early access. Start free with $5 in credits.

View Pricing