New York Tech Media
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital
No Result
View All Result
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital
No Result
View All Result
New York Tech Media
No Result
View All Result
Home AI & Robotics

Speechmatics Launches Autonomous Speech Recognition Software

New York Tech Editorial Team by New York Tech Editorial Team
October 26, 2021
in AI & Robotics
0
Speechmatics Launches Autonomous Speech Recognition Software
Share on FacebookShare on Twitter

Leading speech recognition technology startup Speechmatics has launched its ‘Autonomous Speech Recognition’ software that uses the latest deep learning techniques and breakthrough self-supervised models. The system has demonstrated an ability to outperform Amazon, Google, and Microsoft. 

Stanford’s Datasets

Speechmatics is based on datasets found in Stanford’s ‘Racial Disparities in Speech Recognition’ study, and it achieved an overall accuracy of 82.8% for African American voices. For reference, Google only achieved an accuracy rate of 68.7%, while Amazon achieved 68.6%.

The level of accuracy equates to a 45% reduction in speech recognition errors, which is the equivalent of three words in an average sentence. Not only is the new Speechmatics system accurate in this regard, but it also demonstrated improvements in accuracy across accents, age, dialects, and various other sociodemographic characteristics.

There is often misunderstanding in speech recognition due to the limited amount of labelled data that algorithms can use to train themselves. Labeled data is required to be manually classified by humans, which results in a lesser amount of data available for these systems. This also limits the representation of all voices, which creates a new set of issues.

Training on Unlabeled Data

Speechmatics is making big progress in this regard as its technology is trained on massive amounts of unlabeled data sourced directly from the internet. The data comes from things like social media content and podcasts. 

Self-supervised learning has enabled the system to be trained on 1.1 million hours of audio, which is an increase from the previous 30,000 hours. This enables it to have a much wider range of representation of voices, and it helps reduce AI bias and errors in speech recognition. 

When it comes to children’s voices, Speechmatics also demonstrated an ability to outperform competitors. Children’s voices are challenging to recognize through legacy speech recognition technology, but Speechmatics managed to record a 91.8% accuracy rate. Google could only achieve 83.4% and Deepgram 82.3%. 

Katy Wigdahl is CEO of Speechmatics. 

“We are on a mission to deliver the next generation of machine learning capabilities, and through that offer more inclusive and accessible speech technology. This announcement is a huge step towards achieving that mission.” 

“Our focus in tackling AI bias has led to this monumental leap forward in the speech recognition industry and the ripple effect will lead to changes in a multitude of different scenarios,” Wigdahl continued. “Think of the incorrect captions we see on social media, court hearings where words are mis-transcribed and eLearning platforms that have struggled with children’s voices throughout the pandemic. Errors people have had to accept until now can have a tangible impact on their daily lives.” 

Allison Zhu Koenecke is lead author of the Stanford study on speech recognition.

“It’s critical to study and improve fairness in speech-to-text systems given the potential for disparate harm to individuals through downstream sectors ranging from healthcare to criminal justice.” 

Credit: Source link

Previous Post

Fintech Wise Says It Won’t Be Looking to Buy

Next Post

Samsung announces cloud gaming for Tizen TVs, offers no further details

New York Tech Editorial Team

New York Tech Editorial Team

New York Tech Media is a leading news publication that aims to provide the latest tech news, fintech, AI & robotics, cybersecurity, startups & leaders, venture capital, and much more!

Next Post
Samsung announces cloud gaming for Tizen TVs, offers no further details

Samsung announces cloud gaming for Tizen TVs, offers no further details

  • Trending
  • Comments
  • Latest
Meet the Top 10 K-Pop Artists Taking Over 2024

Meet the Top 10 K-Pop Artists Taking Over 2024

March 17, 2024
Panther for AWS allows security teams to monitor their AWS infrastructure in real-time

Many businesses lack a formal ransomware plan

March 29, 2022
Zach Mulcahey, 25 | Cover Story | Style Weekly

Zach Mulcahey, 25 | Cover Story | Style Weekly

March 29, 2022
How To Pitch The Investor: Ronen Menipaz, Founder of M51

How To Pitch The Investor: Ronen Menipaz, Founder of M51

March 29, 2022
Japanese Space Industry Startup “Synspective” Raises US $100 Million in Funding

Japanese Space Industry Startup “Synspective” Raises US $100 Million in Funding

March 29, 2022
UK VC fund performance up on last year

VC-backed Aerium develops antibody treatment for Covid-19

March 29, 2022
Startups On Demand: renovai is the Netflix of Online Shopping

Startups On Demand: renovai is the Netflix of Online Shopping

2
Robot Company Offers $200K for Right to Use One Applicant’s Face and Voice ‘Forever’

Robot Company Offers $200K for Right to Use One Applicant’s Face and Voice ‘Forever’

1
Menashe Shani Accessibility High Tech on the low

Revolutionizing Accessibility: The Story of Purple Lens

1

Netgear announces a $1,500 Wi-Fi 6E mesh router

0
These apps let you customize Windows 11 to bring the taskbar back to life

These apps let you customize Windows 11 to bring the taskbar back to life

0
This bipedal robot uses propeller arms to slackline and skateboard

This bipedal robot uses propeller arms to slackline and skateboard

0
Coffee Nova’s $COFFEE Token

Coffee Nova’s $COFFEE Token

May 29, 2025
Money TLV website

BridgerPay to Spotlight Cross-Border Payments Innovation at Money TLV 2025

May 27, 2025
The Future of Software Development: Why Low-Code Is Here to Stay

Building Brand Loyalty Starts With Your Team

May 23, 2025
Tork Media Expands Digital Reach with Acquisition of NewsBlaze and Buzzworthy

Creative Swag Ideas for Hackathons & Launch Parties

May 23, 2025
Tork Media Expands Digital Reach with Acquisition of NewsBlaze and Buzzworthy

Strengthening Cloud Security With Automation

May 22, 2025
How Local IT Services in Anderson Can Boost Your Business Efficiency

Why VPNs Are a Must for Entrepreneurs in Asia

May 22, 2025

Recommended

Coffee Nova’s $COFFEE Token

Coffee Nova’s $COFFEE Token

May 29, 2025
Money TLV website

BridgerPay to Spotlight Cross-Border Payments Innovation at Money TLV 2025

May 27, 2025
The Future of Software Development: Why Low-Code Is Here to Stay

Building Brand Loyalty Starts With Your Team

May 23, 2025
Tork Media Expands Digital Reach with Acquisition of NewsBlaze and Buzzworthy

Creative Swag Ideas for Hackathons & Launch Parties

May 23, 2025

Categories

  • AI & Robotics
  • Benzinga
  • Cybersecurity
  • FinTech
  • New York Tech
  • News
  • Startups & Leaders
  • Venture Capital

Tags

3D bio-printing acoustic AI Allseated B2B marketing Business carbon footprint climate change coding Collaborations Companies To Watch consumer tech crypto cryptocurrency deforestation drones earphones Entrepreneur Fetcherr Finance Fintech food security Investing Investors investorsummit israelitech Leaders LinkedIn Leaders Metaverse news OurCrowd PR Real Estate reforestation software start- up Startups Startups On Demand startuptech Tech Tech leaders technology UAVs Unlimited Robotics VC
  • Contact Us
  • Privacy Policy
  • Terms and conditions

© 2024 All Rights Reserved - New York Tech Media

No Result
View All Result
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital

© 2024 All Rights Reserved - New York Tech Media