New Qwen2.5-Max surpasses DeepSeek V3
11:54, 31.01.2025
Following the releases of Qwen2.5 and Qwen2.5-VL, a new model, Qwen2.5-Max, has become available. The new version of Qwen outperforms DeepSeek V3 on several benchmarks, including GPQA-Diamond, Arena-Hard, LiveCodeBench, and LiveBench.
Architecture and Model Features
Qwen2.5-Max is a large-scale Mixture-of-Experts (MoE) model. What sets it apart is its training recipe: pretraining on over 20 trillion tokens, followed by Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) based on real user preferences.
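Since the architecture details have not been published, the following is only a generic illustration of how an MoE layer works: a learned gate routes each token to a few experts, so total parameter count grows without a matching growth in per-token compute. This PyTorch sketch is a minimal, assumption-laden example, not Qwen's actual implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    """Minimal top-k gated Mixture-of-Experts layer (illustrative only)."""
    def __init__(self, d_model: int, d_ff: int, n_experts: int = 8, k: int = 2):
        super().__init__()
        self.k = k
        self.gate = nn.Linear(d_model, n_experts, bias=False)  # token router
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model). The router scores each token against every expert.
        scores = self.gate(x)                       # (tokens, n_experts)
        weights, idx = scores.topk(self.k, dim=-1)  # keep top-k experts per token
        weights = F.softmax(weights, dim=-1)        # normalize over the chosen k
        out = torch.zeros_like(x)
        # Only the selected experts run for each token -- this sparsity is what
        # lets MoE models scale parameter count without scaling per-token FLOPs.
        for e, expert in enumerate(self.experts):
            token_ids, slot = (idx == e).nonzero(as_tuple=True)
            if token_ids.numel() == 0:
                continue
            out[token_ids] += weights[token_ids, slot].unsqueeze(-1) * expert(x[token_ids])
        return out

# Example: 16 tokens of width 512, 8 experts, 2 active per token.
y = MoELayer(d_model=512, d_ff=2048)(torch.randn(16, 512))
print(y.shape)  # torch.Size([16, 512])
```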
At the moment, the model weights have not been posted on GitHub or Hugging Face; only the API and Qwen Chat are available. The absence of open weights may indicate either a rushed release or a deliberate move by the company to drive adoption of its cloud platform.
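API access goes through Alibaba Cloud's OpenAI-compatible endpoint. The sketch below assumes the DashScope compatible-mode base URL and the model name `qwen-max-2025-01-25`; both should be verified against the current documentation before use.

```python
# Usage sketch: the base_url and model name below are assumptions taken from
# Alibaba Cloud's DashScope documentation at the time of writing -- verify
# them before relying on this.
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["DASHSCOPE_API_KEY"],
    base_url="https://dashscope-intl.aliyuncs.com/compatible-mode/v1",
)

response = client.chat.completions.create(
    model="qwen-max-2025-01-25",  # assumed snapshot name for Qwen2.5-Max
    messages=[{"role": "user", "content": "Explain Mixture of Experts in one paragraph."}],
)
print(response.choices[0].message.content)
```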
Qwen has published benchmark results for the new model. In the released comparison table against LLaMA 3.1 and DeepSeek-V3, the Max version outperforms both competitors on most metrics. When compared with Claude Sonnet and GPT, however, the Max version still falls short of GPT.
The company has invested a significant budget in training data, and while the model does lead its competitors, the margin is relatively small. Because of this, some experts theorize that the capabilities of language models can be extended further by spending additional compute at inference time (test-time compute) rather than only during training.
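As a toy illustration of test-time compute scaling (not something announced for this model), a best-of-N strategy samples several candidate answers and keeps the one a scoring function prefers. Here `generate` and `score_answer` are hypothetical stand-ins for a sampling call and a verifier or reward model.

```python
# Toy best-of-N sketch of test-time compute scaling. `generate` and
# `score_answer` are hypothetical placeholders, not part of any Qwen API.
import random

def generate(prompt: str) -> str:
    # Stand-in for a temperature > 0 LLM sampling call.
    return f"candidate answer {random.randint(0, 999)} to: {prompt}"

def score_answer(answer: str) -> float:
    # Stand-in for a verifier or reward model.
    return random.random()

def best_of_n(prompt: str, n: int = 8) -> str:
    # Larger n = more inference compute = higher expected best score.
    candidates = [generate(prompt) for _ in range(n)]
    return max(candidates, key=score_answer)

print(best_of_n("What is 17 * 23?"))
```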