For Developers/Models/Compare/Gemini 2.5 Flash vs Qwen 3

Gemini 2.5 Flash vs Qwen 3

Pricing, benchmarks, and use case comparison

Quick take

•Gemini 2.5 Flash is meaningfully stronger at multimodal (80 vs 30 on our capability index).
•Qwen 3 is open-weights (free to self-host); Gemini 2.5 Flash is paid API only.
•Qwen 3 has a 128K tokens (32K for 0.6B/1.7B/4B dense variants) context window vs 1,048,576 tokens (1M) input; up to 65,535 output - better for whole-repo or long-document work.

Specs comparison

	Gemini 2.5 Flash	Qwen 3
Provider	Google DeepMind	Alibaba (Qwen Team)
Type	Closed source	Open source
Context window	1,048,576 tokens (1M) input; up to 65,535 output	✓128K tokens (32K for 0.6B/1.7B/4B dense variants)
Input / 1M tokens	$0.30	✓Free (self-host)
Output / 1M tokens	$2.50	Free (self-host)
Release date	2025-06	2025-04

Scores sourced from official provider release posts and independent benchmark aggregators.

Choose Gemini 2.5 Flash if...

Choose Qwen 3 if...