
The AI model that teaches itself to think through problems, no humans required

by Simon Osuji
September 18, 2025
in Artificial Intelligence


The multistage pipeline of DeepSeek-R1. Credit: Nature (2025). DOI: 10.1038/s41586-025-09422-z

Artificial intelligence keeps improving, but it still has limits. One of the biggest challenges has been teaching advanced AI models to reason, that is, to solve problems step by step. In a new paper published in the journal Nature, a team from DeepSeek AI, a Chinese artificial intelligence company, reports that it taught its R1 model to reason largely on its own, without human-written examples of how to work through each problem.


When many of us try to solve a problem, we typically don’t get the answer straight away. We follow a methodical process that may involve gathering information and taking notes until we get to a solution. Traditionally, training AI models to reason has involved copying our approach. However, this is a long, drawn-out process in which people show an AI model countless examples of how to work through a problem. It also means that the model is only as good as the examples it is given and can pick up human biases.

Instead of showing the R1 model every step, researchers at DeepSeek AI used a technique called reinforcement learning. This trial-and-error approach, using rewards for correct answers, encouraged the model to reason for itself.

“Rather than explicitly teaching the model how to solve a problem, we simply provide it with the right incentives and it autonomously develops advanced problem-solving strategies,” wrote the researchers in their paper.

DeepSeek’s R1 model was trained on difficult math, coding and science problems. The only reward it received was a signal that its final answer was correct. During its training, the researchers saw it develop skills such as checking its own work and exploring different strategies to find a solution. It even started to use words like “wait” as it reflected on its own thinking process. If a path led to the right answer, that strategy was reinforced. If it led to a wrong answer, the model learned not to repeat it. There was some human intervention, but only to polish R1’s skills later in the process.
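
To make the idea of rewarding only the final answer more concrete, here is a toy Python sketch. It is not DeepSeek's training code and is far simpler than training a language model: the three "strategies", the names `STRATEGIES`, `outcome_reward` and `train`, and the simple policy-gradient update are all assumptions chosen for illustration. The policy is never told how to add two numbers. It only receives a reward of 1 when its final answer is right, and over many trials it comes to prefer the strategy that earns that reward.

```python
import math
import random

# Hypothetical "strategies" the toy policy can pick from (illustration only).
# Each maps a problem (a, b) to a final answer for a + b.
STRATEGIES = {
    "guess_zero": lambda a, b: 0,
    "return_first": lambda a, b: a,
    "add_carefully": lambda a, b: a + b,
}


def softmax(prefs):
    """Turn preference scores into a probability for each strategy."""
    top = max(prefs.values())
    exps = {name: math.exp(score - top) for name, score in prefs.items()}
    total = sum(exps.values())
    return {name: value / total for name, value in exps.items()}


def outcome_reward(answer, correct):
    """Reward depends only on the final answer, not on the steps taken."""
    return 1.0 if answer == correct else 0.0


def train(steps=2000, lr=0.1, seed=0):
    """Trial-and-error loop: sample a strategy, score it, adjust preferences."""
    rng = random.Random(seed)
    prefs = {name: 0.0 for name in STRATEGIES}
    baseline = 0.0  # running average of recent rewards
    for _ in range(steps):
        a, b = rng.randint(1, 9), rng.randint(1, 9)
        probs = softmax(prefs)
        names = list(probs)
        choice = rng.choices(names, weights=[probs[n] for n in names])[0]
        reward = outcome_reward(STRATEGIES[choice](a, b), a + b)
        advantage = reward - baseline
        # Raise the chosen strategy's preference if it beat the baseline,
        # lower it otherwise (a basic policy-gradient step).
        for name in prefs:
            grad = (1.0 if name == choice else 0.0) - probs[name]
            prefs[name] += lr * advantage * grad
        baseline += 0.05 * (reward - baseline)
    return softmax(prefs)


if __name__ == "__main__":
    for name, prob in train().items():
        print(f"{name}: {prob:.3f}")
```

Running the script prints the final probability assigned to each strategy. In this toy setting only `add_carefully` ever earns a reward, so it ends up with most of the probability, which loosely mirrors how a rewarded path is reinforced and an unrewarded one fades.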

Accuracy and output length of DeepSeek-R1-Zero throughout the training process. Credit: Nature (2025). DOI: 10.1038/s41586-025-09422-z

The results were impressive. R1 performed better on math, coding and science tasks than older models trained with human guidance. One of the most noteworthy results was that it achieved an accuracy of 86.7% on the American Invitational Mathematics Examination (AIME) 2024, a tough math competition for the smartest high school students.

Even with these outstanding results, the researchers admit their model has some limitations to work through. For example, it sometimes mixed languages when given a non-English prompt and made some simple problems more complicated than they needed to be. But once these issues are ironed out, the researchers believe that an AI model that can reason for itself will lead to a new era of more capable and autonomous models.

Written by Paul Arnold, edited by Lisa Lock, and fact-checked and reviewed by Robert Egan.

More information:
Daya Guo et al, DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning, Nature (2025). DOI: 10.1038/s41586-025-09422-z

© 2025 Science X Network

Citation:
The AI model that teaches itself to think through problems, no humans required (2025, September 18)
retrieved 18 September 2025
from https://techxplore.com/news/2025-09-ai-problems-humans-required.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.




