Research team launches first-of-its-kind mini AI model with three trillion-token punch

by Simon Osuji
January 31, 2024
in Artificial Intelligence
TinyLlama, the mini AI model with a three trillion-token punch. Credit: SUTD

It’s called TinyLlama and it’s taken the research world by storm because of how much power it packs.

Developed by Associate Professor Lu Wei of the Singapore University of Technology and Design (SUTD), research assistant Mr. Zhang Peiyuan, and Ph.D. students Mr. Zeng Guangtao and Mr. Wang Tianduo, TinyLlama is a 1.1 billion-parameter open-source small language model that has outperformed other open-source models of comparable size across several benchmarks. The model was pre-trained on a total of three trillion tokens in just four months.

Current large language models (LLMs) such as ChatGPT or Google Bard, developed by large technology firms such as OpenAI or Google, run on thousands or even tens of thousands of graphics processing units (GPUs) and require users to connect online to their massive servers. TinyLlama, in contrast, was trained on just 16 GPUs and takes up only 550MB of random access memory (RAM). As a result, TinyLlama can readily be deployed on mobile devices, enabling everyone to carry a “mini ChatGPT” in their pocket wherever they go.
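The 550MB figure can be sanity-checked with simple arithmetic. The sketch below assumes the figure refers to model weights stored at 4 bits each. The article does not state the precision, so that is an assumption, and the helper `weight_memory_mb` is a hypothetical name introduced for illustration. It counts weights only, not activations or runtime overhead.

```python
def weight_memory_mb(num_params: float, bits_per_weight: int) -> float:
    """Approximate memory for model weights alone, in MB (1 MB = 10**6 bytes)."""
    return num_params * bits_per_weight / 8 / 1e6


PARAMS = 1.1e9  # parameter count reported in the article

# Compare common storage precisions; 4-bit is an assumed quantization level,
# not something the article confirms.
for bits in (32, 16, 8, 4):
    print(f"{bits:>2}-bit weights: {weight_memory_mb(PARAMS, bits):,.0f} MB")
```

Under that assumption, 1.1 billion weights at half a byte each come to about 550MB, which matches the number quoted above. At 16 bits per weight the same model would need roughly 2,200MB.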

According to Marktechpost, a California-based Artificial Intelligence news platform with a community of over 1.5 million AI professionals and developers, TinyLlama’s performance in common-sense reasoning and problem-solving tasks highlights the potential of smaller models to achieve high performance when trained with a substantial amount of data. It also opens up new possibilities for research and application in natural language processing, especially in scenarios where computational resources are limited.

Said Prof Lu, also the Director of the StatNLP Research Group, which focuses on natural language processing research, “The importance of small language models cannot be understated, and the reason why TinyLlama was specifically created to be open-sourced was that it will democratize language models by allowing smaller tech companies and research labs to build and develop their own models for a variety of applications. As researchers, our plan is to lay the foundations for small language models, with the aim of making significant scientific advancements in the field.

“Smaller tech firms as well as individual researchers and developers are increasingly demanding small language models that require less resources to run. These models, such as TinyLlama, are therefore more feasible for them to build and more optimal for edge devices such as mobile phones. The compactness of such models also allows them to cater to a multitude of applications that demand real-time machine translation without an internet connection. This means that users can access the language model offline. They need not send their personal information to the server when using it, and through the technique called ‘fine-tuning,’ we are able to improve it further,” Prof Lu added.

The team behind TinyLlama, from left to right: SUTD Ph.D. students Zeng Guangtao and Wang Tianduo, Associate Prof Lu Wei, and research assistant Zhang Peiyuan. Credit: SUTD

TinyLlama’s innovation lies in its construction. It is based on the architecture and tokenizer of Llama 2 and incorporates several state-of-the-art techniques. One of them is FlashAttention, which improves computational efficiency. Despite being smaller than some of its predecessors, TinyLlama performs exceptionally well on a range of downstream tasks. It challenges the notion that larger models are always better, demonstrating that models with fewer parameters can still be highly effective when trained on extensive and diverse datasets.
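For readers curious about where FlashAttention's efficiency comes from, the sketch below illustrates one idea behind it: processing keys and values in small blocks while keeping a running softmax, so the full list of attention scores is never held at once. This is a plain-Python illustration for a single query vector, not the actual fused GPU kernel and not TinyLlama's code. The function names, block size, and sample data are all hypothetical.

```python
import math


def attention_full(q, keys, values):
    """Standard softmax attention for one query: computes all scores first."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[j] for w, v in zip(weights, values)) / total
            for j in range(dim)]


def attention_blockwise(q, keys, values, block_size=2):
    """Same result, computed one block of keys/values at a time."""
    scale = 1.0 / math.sqrt(len(q))
    dim = len(values[0])
    running_max = -math.inf   # largest score seen so far
    running_sum = 0.0         # softmax denominator, relative to running_max
    acc = [0.0] * dim         # unnormalized weighted sum of values
    for start in range(0, len(keys), block_size):
        k_blk = keys[start:start + block_size]
        v_blk = values[start:start + block_size]
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in k_blk]
        new_max = max(running_max, max(scores))
        # Rescale earlier partial results to the new maximum.
        correction = math.exp(running_max - new_max)
        running_sum *= correction
        acc = [a * correction for a in acc]
        for s, v in zip(scores, v_blk):
            w = math.exp(s - new_max)
            running_sum += w
            for j in range(dim):
                acc[j] += w * v[j]
        running_max = new_max
    return [a / running_sum for a in acc]


# Made-up sample data for illustration only.
q = [0.5, -1.0, 2.0]
keys = [[1.0, 0.0, 1.0], [0.0, 1.0, 0.0], [1.0, 1.0, 1.0],
        [-1.0, 0.5, 2.0], [0.3, 0.3, 0.3]]
values = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0], [9.0, 10.0]]

full = attention_full(q, keys, values)
tiled = attention_blockwise(q, keys, values, block_size=2)
print("largest difference:", max(abs(a - b) for a, b in zip(full, tiled)))
```

The two functions agree up to floating-point rounding. The blockwise version only ever needs one block of scores in memory, which is the property that lets a GPU implementation keep its working data in fast on-chip memory.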

Provided by
Singapore University of Technology and Design

Citation:
Research team launches first-of-its-kind mini AI model with three trillion-token punch (2024, January 31)
retrieved 31 January 2024
from https://techxplore.com/news/2024-01-team-kind-mini-ai-trillion.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.




