Sunday, June 1, 2025
LBNN
  • Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • Documentaries
No Result
View All Result
LBNN

Academic researchers find a way to train an AI reasoning model for less than $50

Simon Osuji by Simon Osuji
February 7, 2025
in Artificial Intelligence
0
Academic researchers find a way to train an AI reasoning model for less than $50
0
SHARES
0
VIEWS
Share on FacebookShare on Twitter


Academic researchers find a way to train an AI reasoning model for less than $50
Sequential and parallel test-time scaling. (a): Budget forcing shows clear scaling trends and extrapolates to some extent. For the three rightmost dots, we prevent the model from stopping its thinking 2/4/6 times, each time appending “Wait” to its current reasoning trace. (b): For Qwen2.5-32B-Instruct we perform 64 evaluations for each sample with a temperature of 1 and visualize the performance when majority voting across 2, 4, 8, 16, 32, and 64 of these. Credit: arXiv (2025). DOI: 10.48550/arxiv.2501.19393

A small team of AI researchers from Stanford University and the University of Washington has found a way to train an AI reasoning model for a fraction of the price paid by big corporations that produce widely known products such as ChatGPT. The group has posted a paper on the arXiv preprint server describing their efforts to inexpensively train chatbots and other AI reasoning models.

Related posts

Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

May 31, 2025
Nike x Hyperice Hyperboot Review: Wearable Post-Run Recovery

Nike x Hyperice Hyperboot Review: Wearable Post-Run Recovery

May 31, 2025

Corporations such as Google and Microsoft have made clear their intentions to be leaders in the development of chatbots with ever-improving skills. These efforts are notoriously expensive and tend to involve the use of energy-intensive server farms.

More recently, a Chinese company called DeepSeek released an LLM equal in capabilities to those being produced by countries in the West developed at far lower cost. That announcement sent stock prices for many tech companies into a nosedive.

In this new study, the researchers claim that it is possible to train an LLM with capabilities similar to those made by OpenAI or DeepSeek for less than $50. The catch is that the researchers on this new effort used a distillation process to extract capabilities from another AI model.

To train an AI so inexpensively, the research team began with an off-the-shelf AI model made by Alibaba, a China-owned company, which created the freely available test model. The research team modified the model and called the result s1.

Preliminary training involved 1,000 question-and-answer pairs they had designed carefully to give their model a leg up on learning. They also gave it the “thinking process” behind Gemini 2.0, a freely available Google experimental model. They then trained it in just 26 minutes using 16 Nvidia H100 GPUs.

The team also tacked on what they call a little trick—they added a step called “thinking” that runs before the model provides an answer—it gives the model time to double-check its work. The result, the researchers claim, is an AI model on par with other much more well-known products, made at a fraction of the cost.

More information:
Niklas Muennighoff et al, s1: Simple test-time scaling, arXiv (2025). DOI: 10.48550/arxiv.2501.19393

Model: github.com/simplescaling/s1

Journal information:
arXiv

© 2025 Science X Network

Citation:
Academic researchers find a way to train an AI reasoning model for less than $50 (2025, February 6)
retrieved 6 February 2025
from https://techxplore.com/news/2025-02-academic-ai.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.





Source link

Previous Post

What is Cryptocurrency for Dummies?

Next Post

Snapchat+ subscribers can now create custom AI-generated stickers

Next Post
Snapchat+ subscribers can now create custom AI-generated stickers

Snapchat+ subscribers can now create custom AI-generated stickers

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

RECOMMENDED NEWS

BRICS Announce It’s Leaving The Dollar Behind, Focusing on Native Currency

BRICS Announce It’s Leaving The Dollar Behind, Focusing on Native Currency

11 months ago
Watching surgery videos allows robot to perform with skill of human doctor

Watching surgery videos allows robot to perform with skill of human doctor

7 months ago
Vodacom to cut staff in South Africa

Vodacom to cut staff in South Africa

1 year ago
Rand extends losses before Powell speech

Rand weakens after this week’s strong rally

2 years ago

POPULAR NEWS

  • Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    0 shares
    Share 0 Tweet 0
  • When Will SHIB Reach $1? Here’s What ChatGPT Says

    0 shares
    Share 0 Tweet 0
  • Matthew Slater, son of Jackson State great, happy to see HBCUs back at the forefront

    0 shares
    Share 0 Tweet 0
  • Dolly Varden Focuses on Adding Ounces the Remainder of 2023

    0 shares
    Share 0 Tweet 0
  • US Dollar Might Fall To 96-97 Range in March 2024

    0 shares
    Share 0 Tweet 0
  • Privacy Policy
  • Contact

© 2023 LBNN - All rights reserved.

No Result
View All Result
  • Home
  • Business
  • Politics
  • Markets
  • Crypto
  • Economics
    • Manufacturing
    • Real Estate
    • Infrastructure
  • Finance
  • Energy
  • Creator Economy
  • Wealth Management
  • Taxes
  • Telecoms
  • Military & Defense
  • Careers
  • Technology
  • Artificial Intelligence
  • Investigative journalism
  • Art & Culture
  • Documentaries
  • Quizzes
    • Enneagram quiz
  • Newsletters
    • LBNN Newsletter
    • Divergent Capitalist

© 2023 LBNN - All rights reserved.