• Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Intelligence
    • Policy Intelligence
    • Security Intelligence
    • Economic Intelligence
    • Fashion Intelligence
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • LBNN Blueprints
  • Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Intelligence
    • Policy Intelligence
    • Security Intelligence
    • Economic Intelligence
    • Fashion Intelligence
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • LBNN Blueprints

Microsoft’s small language model outperforms larger models on standardized math tests

Simon Osuji by Simon Osuji
March 8, 2024
in Artificial Intelligence
0
Microsoft’s small language model outperforms larger models on standardized math tests
0
SHARES
1
VIEWS
Share on FacebookShare on Twitter


Grade School Math
Credit: Deepak Gautam from Pexels

A small team of AI researchers at Microsoft reports that the company’s Orca-Math small language model outperforms other, larger models on standardized math tests. The group has published a paper on the arXiv preprint server describing their testing of Orca-Math on the Grade School Math 8K (GSM8K) benchmark and how it fared compared to well-known LLMs.

Related posts

Make the Most of Chrome’s Toolbar by Customizing It to Your Liking

Make the Most of Chrome’s Toolbar by Customizing It to Your Liking

March 1, 2026
The 5 Big ‘Known Unknowns’ of Donald Trump’s New War With Iran

The 5 Big ‘Known Unknowns’ of Donald Trump’s New War With Iran

March 1, 2026

Many popular LLMs such as ChatGPT are known for their impressive conversational skills—less well known is that most of them can also solve math word problems. AI researchers have tested their abilities at such tasks by pitting them against the GSM8K, a dataset of 8,500 grade-school math word problems that require multistep reasoning to solve, along with their correct answers.

In this new study, the research team at Microsoft tested Orca-Math, an AI application developed by another team at Microsoft specifically designed to tackle math word problems, and compared the results with larger AI models.

Microsoft points out on its Research Blog post that there is a major difference between popular LLMs such as ChatGPT and Orca-Math. The former is a large language model and the latter is a small language model—the difference is in the number of parameters that are used; typically in the thousands or a few million for SLMs, rather than the billions or trillions used by LLMs. Another difference is that, as its name suggests, Orca-Math was designed specifically to solve math problems; thus, it cannot be used to carry on conversations or answer random questions.

Orca-Math is relatively large compared to other SLMs, with 7 billion parameters, but still much smaller than most of the well-known LLMs. However, it still managed to score 86.81% on the GSM8k, close to GPT-4-0613, which got 97.0%. Others, such as Llama-2, did not fare nearly as well, with scores as low as 14.6%.

Microsoft reveals that it was able to garner such a high score by using higher-quality training data than is available to general-use LLMs and because it used an interactive learning process the AI team at Microsoft has been developing—a process that continually improves results by using feedback from a teacher. The team at Microsoft concludes that SLMs can perform as well as LLMs on certain applications when developed under specialized conditions.

More information:
Arindam Mitra et al, Orca-Math: Unlocking the potential of SLMs in Grade School Math, arXiv (2024). DOI: 10.48550/arxiv.2402.14830

Orca-Math: www.microsoft.com/en-us/resear … odel-specialization/
twitter.com/Arindam1408/status/1764761895473762738

Journal information:
arXiv

© 2024 Science X Network

Citation:
Microsoft’s small language model outperforms larger models on standardized math tests (2024, March 8)
retrieved 8 March 2024
from https://techxplore.com/news/2024-03-microsoft-small-language-outperforms-larger.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.





Source link

Previous Post

Two climate activists charged for pouring red powder on National Archives display of the US Constitution

Next Post

6 Design Tips to Make Your Brand Stand Out in Competitive Markets

Next Post
6 Design Tips to Make Your Brand Stand Out in Competitive Markets

6 Design Tips to Make Your Brand Stand Out in Competitive Markets

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

RECOMMENDED NEWS

Cryptocurrency: Top 3 AI Gem Coins To Invest Now

Cryptocurrency: Top 3 AI Gem Coins To Invest Now

2 years ago
Young Man’s Life Transformed After Mercy Ships Removes Life-Threatening Tumor

Young Man’s Life Transformed After Mercy Ships Removes Life-Threatening Tumor

1 year ago
Overland AI Rolls Out Autonomous Multi-Mission Military Vehicle

Overland AI Rolls Out Autonomous Multi-Mission Military Vehicle

11 months ago
Amazon Props Up Misleading, Junky Laptops No One Should Buy

Amazon Props Up Misleading, Junky Laptops No One Should Buy

2 weeks ago

POPULAR NEWS

  • Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    0 shares
    Share 0 Tweet 0
  • Mahama attends Liberia’s 178th independence anniversary

    0 shares
    Share 0 Tweet 0
  • The world’s top 10 most valuable car brands in 2025

    0 shares
    Share 0 Tweet 0
  • Top 10 African countries with the highest GDP per capita in 2025

    0 shares
    Share 0 Tweet 0
  • Global ranking of Top 5 smartphone brands in Q3, 2024

    0 shares
    Share 0 Tweet 0

Get strategic intelligence you won’t find anywhere else. Subscribe to the Limitless Beliefs Newsletter for monthly insights on overlooked business opportunities across Africa.

Subscription Form

© 2026 LBNN – All rights reserved.

Privacy Policy | About Us | Contact

Tiktok Youtube Telegram Instagram Linkedin X-twitter
No Result
View All Result
  • Home
  • Business
  • Politics
  • Markets
  • Crypto
  • Economics
    • Manufacturing
    • Real Estate
    • Infrastructure
  • Finance
  • Energy
  • Creator Economy
  • Wealth Management
  • Taxes
  • Telecoms
  • Military & Defense
  • Careers
  • Technology
  • Artificial Intelligence
  • Investigative journalism
  • Art & Culture
  • LBNN Blueprints
  • Quizzes
    • Enneagram quiz
  • Fashion Intelligence

© 2023 LBNN - All rights reserved.