    New version of Gemini beats other AIs at math, science, and reasoning

    Google’s new Gemini 2.5 Pro is smarter than rival AI models at reasoning, science, and coding.

    This is according to a series of benchmark results Google posted on Thursday. In short, Gemini 2.5 Pro beats its chief competitors at nearly everything — though we’re sure the companies behind those competitors would disagree.

    According to Google’s data, Gemini 2.5 Pro has a healthy lead over OpenAI o3, Claude Opus 4, Grok 3 Beta, and DeepSeek R1 on Humanity’s Last Exam, a benchmark that evaluates a model’s math, science, knowledge, and reasoning. It also leads at code editing (per the Aider Polyglot benchmark) and beats all competitors on several factuality benchmarks, including FACTS Grounding, meaning it’s less likely to produce factually inaccurate text.

    The only benchmark in which Gemini 2.5 Pro isn’t a clear winner is the mathematics-focused AIME 2025, and even there the differences between results are pretty small.

    Thanks to these improvements, Gemini 2.5 Pro now sits atop the LMArena leaderboard with a score of 1470.

    There’s a catch, though: The final version of Gemini 2.5 Pro isn’t widely available yet. Google calls this latest version an “upgraded preview,” with a stable version coming “in a couple of weeks.” The preview should now be available in the Gemini app, though.
