GPT-4.5: A new stage in the development of language models

1m, 42s

13:51, 28.02.2025

A new language model, GPT-4.5, has been released, which will be more natural than previous versions, but the pricing will be higher.

GPT-4.5 is now available as a “Research Preview” for developers and users of the Pro version. Team and Plus users are scheduled to get access next week.

A significant difference between GPT-4.5 and the o3-mini and o1 models is that the new version responds much faster due to a change in the “unsupervised learning” approach. Since the new model does not think before responding, performance is greatly improved.

GPT-4.5 is also known as Orion and is the largest trained model so far. OpenAI states that the new model will not be “borderline” - such statements from the company may be related to the training of another o3 model.

The price of the model is significantly higher than the GPT-4o and o1 versions and is $75 (for a million input tokens) and $150 (for a million output tokens). Like previous versions, this variant will have a context length of 128,000 tokens.

OpenAI stated that the 2 main approaches (reasoning and learning) will be used as mutually complementary variants. Version 4.5 is already much smarter because of pre-training. There is also a big possibility that the new version of GPT-5 will be able to combine these two features.

Benchmarking results

As for performance tests, the 4.5 model shows good results and achieves 62.5% on SimpleQA. In the same test, Grok 3 shows a score of 43.6%, and GPT-4o shows a score of 43.6%. The hallucination rate is also significantly lowered and is 37.1%. Also, the new version 4.5 dominates the tests of human judgment in everyday matters, creative intelligence, and professional questions.

In STEM tests, results vary from model to model. For example, in the AIME '24 test, the 4.5 model scores at 36.7%, the o3-mini at 87.3%, and the GPT-4o at 9.3%. In the SWE-Bench Verified test, the result is 38.8%, while the o3-mini is 61.0%, and the GPT-4o is 30.7%.

If we compare the results of all benchmarks, the figures are quite stable, and there is no significant jump in performance as SimpleQA showed.

VPS popular offers

See all products

KVM-NVMe 1024

-10%

€

/mo

€ 6.5 /mo

Billed annually

CPU

2 Epyc Cores

RAM

1 GB

Space

10 GB NVMe

Bandwidth

Unlimited
KVM-SSD 1024 Metered

-26.7%

€

/mo

€ 10 /mo

Billed annually

CPU

3 Xeon Cores

RAM

1 GB

Space

20 GB SSD

Bandwidth

1 TB
Keitaro KVM 65536

-15%

€

/mo

€ 176 /mo

Billed annually

OS

CentOS

CPU

10 Epyc Cores

RAM

64GB

Space

300 GB NVMe

Software

Keitaro

Bandwidth

Unlimited
10Ge-KVM-SSD 8192

-6.7%

€

/mo

€ 105 /mo

Billed annually

CPU

4 Xeon Cores

RAM

8 GB

Space

100 GB SSD

Bandwidth

Unlimited
KVM-HDD 16384

-10%

€

/mo

€ 45 /mo

Billed annually

CPU

6 Xeon Cores

RAM

16 GB

Space

400 GB HDD

Bandwidth

Unlimited
KVM-SSD 4096

-10%

€

/mo

€ 14.5 /mo

Billed annually

CPU

4 Xeon Cores

RAM

4 GB

Space

50 GB SSD

Bandwidth

Unlimited
wKVM-SSD 16384

-8.9%

€

/mo

€ 53 /mo

Billed annually

CPU

6 Xeon Cores

RAM

16 GB

Space

150 GB SSD

Bandwidth

Unlimited
KVM-SSD 4096 HK

-22.2%

€

/mo

€ 33 /mo

Billed annually

CPU

4 Xeon Cores

RAM

4 GB

Space

50 GB SSD

Bandwidth

300 GB
DDoS Protected SSD-KVM 2048

-16.3%

€

/mo

€ 48 /mo

Billed annually

CPU

4 Xeon Cores

RAM

2 GB

Space

30 GB SSD

Bandwidth

40 Mbps
10Ge-KVM-SSD 4096

-9.1%

€

/mo

€ 55 /mo

Billed annually

CPU

4 Xeon Cores

RAM

4 GB

Space

50 GB SSD

Bandwidth

Unlimited

GPT-4.5: A new stage in the development of language models

Benchmarking results

Was this article helpful to you?

VPS popular offers

Other articles on this topic