PopularNewLLMModified

Kimi K2.5

by Moonshot AI

Jan 27, 2026256K context$0.12/M input$0.47/M output

Kimi K2.5 is the latest model from Moonshot AI with native multimodal design for direct image/screenshot analysis. Full-parameter RL tuning, agent swarm capabilities, and dual Thinking/Instant modes.

View on HuggingFace
Fleek Pricing
$0.0025/GPU-second
Context256K tokens
Estimated Token Cost
Input
$0.12/M
Output
$0.47/M
Based on 34,000 tokens/sec
vs CompetitorsSave 26%

Overview

Parameters

1T (MoE)

Architecture

Mixture of Experts (384 experts)

Context

256K

Provider

Moonshot AI

Best For

Multimodal reasoningAgentic workflowsVisual analysisWeb development

OpenAI Compatible

Drop-in replacement for OpenAI API. Just change the base URL.

Pay Per Second

Only pay for actual GPU compute time. No idle costs.

Enterprise Ready

99.9% uptime SLA, SOC 2 compliant, dedicated support.

Auto Scaling

Scales from zero to thousands of requests automatically.

Compare Pricing

FleekFireworksTogetherBaseten
Input$0.12$0.60$1.00$0.60
Output$0.47$2.50$3.00$2.50
Savings70%70%70%

Prices are per million tokens. Fleek pricing based on $0.0025/GPU-second.

Calculate Your Savings

See how much you'd save running Kimi K2.5 on Fleek

Kimi K2.5
Your Fleek Cost
$47-80/mo
18.6K-32.0K GPU-sec × $0.0025
Fireworks AI
$155/mo
Your Savings59%
Annual Savings
$1.1K

Technical Specifications

Model NameKimi K2.5
Total Parameters1T (MoE)
Active Parameters32B
ArchitectureMixture of Experts (384 experts)
Context Length256K tokens
Inference Speed34,000 tokens/sec
ProviderMoonshot AI
Release DateJan 27, 2026
LicenseModified MIT
HuggingFacehttps://huggingface.co/moonshotai/Kimi-K2.5-Instruct

Benchmarks

Click any benchmark to view the official leaderboard. Rankings among open-source models.

Ready to run Kimi K2.5?

Join the waitlist for early access. Start free with $5 in credits.

View Pricing