Saturday, May 17, 2025
LBNN

ElevenLabs is launching its own speech-to-text model

by Simon Osuji
February 26, 2025
in Creator Economy

ElevenLabs, an AI startup that just raised a $180 million mega funding round, has been known primarily for its audio generation prowess. The company has now taken a step in another technological direction by launching its first standalone speech-to-text model, called Scribe.

The startup, valued at $3.3 billion, has helped many other companies provide text-to-speech services through its vast library of voices. However, the company is now looking to get into speech recognition and compete with the likes of Gladia, Speechmatics, AssemblyAI, Deepgram, and OpenAI’s Whisper models.

ElevenLabs’ Scribe model supports over 99 languages at launch. The company places over 25 languages in its "excellent" accuracy category, where the model's word error rate is less than 5%. This list includes English (claimed accuracy rate of 97%), French, German, Hindi, Indonesian, Japanese, Kannada, Malayalam, Polish, Portuguese, Spanish, and Vietnamese. Other languages fall into the high (5 to 10% word error rate), good (10 to 20% word error rate), and moderate (25 to 50% word error rate) categories.
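For readers unfamiliar with the metric, word error rate is conventionally computed as the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and the model's output, divided by the number of words in the reference. The sketch below illustrates that standard definition and maps a result onto the bands quoted above. It is not ElevenLabs' evaluation code: the function names are made up for illustration, the handling of band boundaries is an assumption, and values between 20% and 25% are reported as "unlisted" because the article does not assign them to a category.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from output
            insertion = curr[j - 1] + 1  # extra word in output
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def accuracy_band(wer_percent: float) -> str:
    """Map a WER percentage to the bands quoted in the article.

    Boundary handling is an assumption; the article only gives ranges.
    """
    if wer_percent < 5:
        return "excellent"
    if wer_percent <= 10:
        return "high"
    if wer_percent <= 20:
        return "good"
    if 25 <= wer_percent <= 50:
        return "moderate"
    return "unlisted"


wer = word_error_rate("the cat sat on the mat", "the cat sat on a mat")
print(f"WER: {wer:.1%} -> {accuracy_band(wer * 100)}")
```

If the claimed 97% accuracy for English is read as 100% minus the word error rate (an assumption; the article does not define it), that would correspond to a WER of roughly 3%, inside the "excellent" band.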

The company said that the model outperformed Google Gemini 2.0 Flash and Whisper Large V3 across multiple languages on the FLEURS and Common Voice benchmarks.

ElevenLabs had already developed a speech-to-text component for its AI conversational agent platform, which was released last year. However, this is the first time the company is releasing a standalone speech detection model. In a conversation with TechCrunch last month, CEO Mati Staniszewski talked about improving speech detection models.

“We want to understand what’s being said by you in a conversation better. We are working on ways to move away from only generating content and understanding and transcribing speech,” Staniszewski said at that time. “Many people say that speech-to-text is a solved problem. But for many languages, it is pretty bad. We think we can build better speech detection models because we have in-house teams to annotate data and give us quick feedback.”

The model also offers smart speaker diarization to identify who is speaking, word-level timestamps for accurate subtitles, and automatic tagging of sound events such as audience laughter. The startup is also giving customers a way to transcribe video content directly in its studio to add subtitles or captions.
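To show why word-level timestamps matter for subtitles, here is a minimal sketch that groups timed words into SubRip (SRT) cues. The input shape is hypothetical: each word is assumed to be a dict with "text", "start", and "end" keys (times in seconds). The article does not describe Scribe's actual response format, so a real integration would need to adapt this to whatever the API returns.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: list[dict], max_words: int = 7) -> str:
    """Group timed words into numbered SRT cues of at most max_words each.

    `words` uses a hypothetical shape: {"text": str, "start": float, "end": float}.
    """
    blocks = []
    for cue_number, i in enumerate(range(0, len(words), max_words), start=1):
        chunk = words[i:i + max_words]
        start = srt_timestamp(chunk[0]["start"])   # cue opens with first word
        end = srt_timestamp(chunk[-1]["end"])      # cue closes with last word
        text = " ".join(w["text"] for w in chunk)
        blocks.append(f"{cue_number}\n{start} --> {end}\n{text}")
    return "\n\n".join(blocks) + "\n"


# Illustrative data only, not real model output.
sample_words = [
    {"text": "Hello", "start": 0.0, "end": 0.4},
    {"text": "world", "start": 0.5, "end": 0.9},
    {"text": "again", "start": 1.2, "end": 1.6},
]
print(words_to_srt(sample_words, max_words=2))
```

Splitting on a fixed word count keeps the example short; a production captioning tool would more likely break cues at pauses, punctuation, or speaker changes.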

Scribe currently works only with pre-recorded audio, which means it is not yet effective for meeting transcription or voice note-taking. The company said it will release a low-latency, real-time version of the model soon.

ElevenLabs is pricing Scribe at $0.40 per hour of transcribed audio. While the rate is competitive, some of its rivals currently offer lower prices for audio transcription, with some differences in features.

© 2023 LBNN - All rights reserved.
