LLMApache

Qwen3-235B

by Alibaba

Dec 1, 2025•128K context•$0.08/M input•$0.33/M output

Qwen 3 235B is Alibaba's flagship dense model. Excellent multilingual capabilities with strong performance across all benchmarks. Apache 2.0 for unrestricted commercial use.

View on HuggingFace

Fleek Pricing

$0.0025/GPU-second

Context128K tokens

Estimated Token Cost

Input

$0.08/M

Output

$0.33/M

Based on 18,500 tokens/sec

vs CompetitorsUp to 26%

Overview

Parameters

235B

Architecture

Dense Transformer

Context

128K

Provider

Alibaba

Best For

General reasoningMultilingual tasksContent generationAnalysis

OpenAI Compatible

Drop-in replacement for OpenAI API. Just change the base URL.

Pay Per Second

Only pay for actual GPU compute time. No idle costs.

Enterprise Ready

99.9% uptime SLA, SOC 2 compliant, dedicated support.

Auto Scaling

Scales from zero to thousands of requests automatically.

Compare Pricing

	Fleek	Fireworks	Together	Baseten
Input	$0.08	$0.22	—	—
Output	$0.33	$0.88	—	—
Savings		63%	—	—

Prices are per million tokens. Fleek pricing based on $0.0025/GPU-second.

Calculate Your Savings

See how much you'd save running Qwen3-235B on Fleek

Model

Qwen3-235B

Compare To

Monthly Usage: 100M tokens

Quick Select

Your Fleek Cost

$33-45/mo

13.3K-18.2K GPU-sec × $0.0025

Fireworks AI

$55/mo

Your Savings28%

Annual Savings

$187

See all models in full calculatororUpload your bill for custom analysis

Technical Specifications

Model Name	Qwen3-235B
Total Parameters	235B
Active Parameters	N/A
Architecture	Dense Transformer
Context Length	128K tokens
Inference Speed	18,500 tokens/sec
Provider	Alibaba
Release Date	Dec 1, 2025
License	Apache 2.0
HuggingFace	https://huggingface.co/Qwen/Qwen-3-235B-Instruct