Tuesday, May 20, 2025
LBNN

Startup Aims for Real-Time ‘Human-Level’ AI Transcripts

By Simon Osuji
September 28, 2023
in Artificial Intelligence



College students Edward Aguilar and Sahan Reddy are taking on one of artificial intelligence’s most difficult problems: building an AI that can recognize and transcribe speech as well as a human can, in real time.

To achieve this goal, the duo formed the startup Echo Labs earlier this year and have already raised over US $2 million in pre-seed funding. They have also been accepted into a number of technology accelerator programs, including a new data and AI accelerator out of the University of Chicago, Transform, which announced on 12 September that it will provide a total investment in Echo Labs of $250,000.


For many of us, AI-powered speech recognition is already a part of daily life. It’s baked into your smart speaker and your phone’s voice assistant. Products like Otter.ai may already be transcribing your Zoom meetings in real time or jotting down ideas from an in-person brainstorming session.

Mark Hasegawa-Johnson is a professor of electrical and computer engineering at the University of Illinois Urbana-Champaign whose research looks at speech recognition through mathematical linguistic models. He says that speech recognition technology has come a long way in the past five years, and has even passed a number of benchmarks used to deem transcriptions ‘human-level.’ From a user standpoint, though, it’s clear that there’s still more ground to be covered.

“Machines still do not generalize as well as humans,” Hasegawa-Johnson says. “For example, across different topics of conversation, different speakers—especially children, speakers with disabilities, and people with second-language accents—or different acoustic recording environments.”

It’s these edge cases that Echo Labs hopes to solve. Yet, despite Echo Labs’ current focus, Aguilar says that the company began as a clever way to get out of redundant class lectures, using an application he designed called BuellerBot. At its core, this bot is made of three separate pieces: speech transcription software that joins and automatically transcribes a Zoom call, a ChatGPT prompt that generates responses to questions posed in the lectures, and a speech synthesizer that mimics Aguilar’s voice.

“I wrote some extra code that glued all that together and then it has the ability to listen to your name, unmute all that,” Aguilar says. “So you have a little version of yourself that could join automatically and then get you out of class by responding to everything. It was great.”
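The three-piece design Aguilar describes can be sketched roughly as follows. This is only an illustration of the general shape, not BuellerBot’s actual code: the `LectureBot` class and the `fake_transcribe`, `fake_reply`, and `fake_synthesize` stand-ins are hypothetical names invented here, and a real version would plug a speech recognizer, a language model, and a voice synthesizer into those three slots.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class LectureBot:
    """Hypothetical glue code: transcribe -> generate reply -> synthesize."""
    student_name: str
    transcribe: Callable[[bytes], str]      # audio in, text out
    generate_reply: Callable[[str], str]    # question in, answer text out
    synthesize: Callable[[str], bytes]      # answer text in, audio out
    muted: bool = True

    def handle_audio(self, audio_chunk: bytes) -> Optional[bytes]:
        """Return spoken audio only if the student's name was heard."""
        text = self.transcribe(audio_chunk)
        if self.student_name.lower() not in text.lower():
            return None          # nobody called on us, stay muted
        self.muted = False       # unmute to answer
        speech = self.synthesize(self.generate_reply(text))
        self.muted = True        # mute again once the answer is produced
        return speech


# Stand-ins so the sketch runs without any external service (assumptions).
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_reply(question: str) -> str:
    return f"Reply to: {question}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


bot = LectureBot("Edward", fake_transcribe, fake_reply, fake_synthesize)
ignored = bot.handle_audio(b"Let's move on to slide three.")
answered = bot.handle_audio(b"Edward, what is entropy?")
```

In this toy run, the first chunk is ignored because the name is never mentioned, and the second produces a reply.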

Aguilar says it was his roommate, who was born deaf and now uses a cochlear implant, who saw BuellerBot’s potential beyond lecture avoidance. In particular, his roommate recognized that the neural network behind BuellerBot’s transcription service could be a powerful tool for building speech accessibility into everyday life.

“Today, our focus is entirely on accessibility compliance,” Aguilar says. “Every university in the country […] is required under the [Americans with Disabilities Act] to transcribe all their internal and external content at the human level.” That compliance work is a not-insignificant portion of the nearly $26 billion transcription industry in the United States.

The startup originally planned to improve the accessibility of live transcriptions via live in-eye subtitles in augmented reality glasses, but Aguilar says that vision has recently shifted to focus instead on the neural network itself. Now, the startup’s main goal is to develop a software application that can be directly incorporated into academic platforms—similar to how BuellerBot originally worked.

Echo Labs isn’t yet sharing exactly how it will achieve this goal, but Aguilar says the approach is a “significant departure from all existing literature.” Aguilar adds, “We’re taking a more biological approach to how to understand conversations much more holistically than anything that’s on the market.”

Exactly how novel this technology is remains to be seen. Hasegawa-Johnson’s work, for example, has also looked at language processing holistically to interpret pronunciation variability or to disentangle confusing sentences by analyzing their stresses and rhythms. Likewise, biological inspiration is no stranger to the world of speech recognition, Hasegawa-Johnson says.

“It’s pretty clearly established that some degree of biologically-inspired processing can help with the front end—separating signals from background noise and reverberation, and encoding speech-related features from the audio,” Hasegawa-Johnson says. “Biologically-inspired approaches to the front-end problem have been shown to have some advantages over Fourier-transform-based front-ends.”

However, while biological solutions may be beneficial, Hasegawa-Johnson says that many universities and tech giants (including Google and Meta) typically skip them and instead focus on extracting audio features from large amounts of collected data to create “learned” front-ends.

“I have never seen a direct comparison of learned front-ends to any well-designed biologically-inspired front-end,” Hasegawa-Johnson says.

Time will tell exactly what role biological inspiration plays in Echo Labs’ AI, but it’s possible that a neuromorphic computing approach modeled on the structure of a human brain may be a part of it. In particular, Hasegawa-Johnson suggests that they might be exploring the role of spiking neurons in human brain processing.

“One fact about human processing that’s well-known but is not modeled by any widely deployed deep-learning system is that human neuronal networks communicate in spikes,” he says. “One possibility is that Echo Labs might be trying to apply spiking neural networks to automatic speech recognition—but that is pure speculation.”
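For readers unfamiliar with the idea of communicating in spikes, the sketch below shows a leaky integrate-and-fire neuron, a common simplified textbook model of a spiking unit. It says nothing about what Echo Labs is building; the function name `lif_spike_train` and the parameter values are assumptions chosen only for illustration.

```python
from typing import List, Sequence


def lif_spike_train(
    inputs: Sequence[float],
    threshold: float = 1.0,
    leak: float = 0.9,
    reset: float = 0.0,
) -> List[int]:
    """Simulate one leaky integrate-and-fire neuron over discrete time steps.

    At each step the membrane potential decays by `leak`, adds the input,
    and emits a spike (1) if it reaches `threshold`, after which it resets.
    """
    potential = reset
    spikes = []
    for current in inputs:
        potential = leak * potential + current   # leak, then integrate
        if potential >= threshold:
            spikes.append(1)                     # fire
            potential = reset                    # reset after the spike
        else:
            spikes.append(0)                     # stay silent
    return spikes


spikes = lif_spike_train([0.5, 0.5, 0.5, 0.0, 1.2])
```

The output is a sequence of 0s and 1s rather than continuous activations, which is the difference from conventional deep-learning units that Hasegawa-Johnson points to.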

Nevertheless, Shyama Majumdar, the director of Transform, says that if Echo Labs can pull off its mission, it could have a big impact on the future direction of transcription technology.

“There is no one single entity dominating the transcription market and the one that does it right will lead the way,” Majumdar says. “Echo Labs is in the right place at the right time and I am confident with their ability to take this forward in a meaningful way.”

Echo Labs plans to make its next announcement in December, which Aguilar says will include more information about new partnerships as well as more details on the nuts and bolts of the startup’s technology.



© 2023 LBNN - All rights reserved.
