Deductive Reasoning Math

AI program plays the long game to solve decades-old math problems

A game of chess requires its players to think several moves ahead, a skill that computer programs have mastered over the ...

Nature 21h

Why is mathematics education failing some of the world’s most talented children?

A study shines a light on the remarkable arithmetic skills that young people acquire outside formal schooling. Education must ...

ReadWrite 2d

DeepMind claims its AI outperforms Olympiad gold medalists in solving maths problems

Google DeepMind’s AlphaGeometry2 reportedly solved 84% of Olympiad geometry problems, surpassing gold medalists.

Amazon bets on ‘automated reasoning’ to reduce AI’s made up answers, WSJ says

Amazon is looking to “automated reasoning” to provide mathematical proof that AI models’ tendency to make up answers, or hallucinations, can ...

I tested ChatGPT o3-mini vs DeepSeek R1 vs Qwen 2.5 with 9 prompts — here’s the winner

Winner: o3-mini wins for the best combination of clarity, detail and logical flow. Qwen 2.5 is in second place with a solid ...

TechCrunch 15d

DeepSeek claims its ‘reasoning’ model beats OpenAI’s o1 on certain benchmarks

AIME employs other models to evaluate a model’s performance, while MATH-500 is a collection of word problems. SWE-bench Verified, meanwhile, focuses on programming tasks. Being a reasoning model ...

The Indian Express 21d

DeepSeek unveils DeepSeek-R1, a reasoning model that beats OpenAI-o1

It incorporates a cold-start phase with carefully curated data and multi-stage RL which ensures enhanced reasoning capabilities and readability. The DeepSeek-R1 has showcased some remarkable ...

Reuters 21d

TikTok owner ByteDance, DeepSeek lead Chinese push in AI reasoning

The latter are capable of reasoning through complex tasks and solving more challenging problems than previous models in science, coding and math. Last week, OpenAI CEO Sam Altman said they had ...

22d

Cutting-edge Chinese “reasoning” model rivals OpenAI o1—and it’s free to download

On Monday, Chinese AI lab DeepSeek released its new R1 model family under an open MIT license, with its largest version ...

NextBigFuture 22d

Open Source DeepSeek R1 Matches OpenAI O1 Math, Code and Reasoning

which are focused on mathematical reasoning and problem-solving. This performance is attributed to DeepSeek’s use of chain-of-thought reasoning, where the model explicitly shows its reasoning process, ...
