New York Tech Media
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital
No Result
View All Result
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital
No Result
View All Result
New York Tech Media
No Result
View All Result
Home AI & Robotics

Pavel Osokin, Co-Founder & CEO of AMAI – Interview Series

New York Tech Editorial Team by New York Tech Editorial Team
February 14, 2022
in AI & Robotics
0
Pavel Osokin, Co-Founder & CEO of AMAI – Interview Series
Share on FacebookShare on Twitter

Pavel Osokin is the Co-Founder & CEO of AMAI, a a San Francisco-based startup that produces AI voice engines. Pavel is leading the operation and strategy of Amai with a professional ambition to install its voice technology into every phone in the world. In AMAI they developed an AI voice that could not be discerned from a real human speech by 97% of users.

You’ve been a lifelong entrepreneur having launched your first company at age 13, what was your first attempt at business and what do you feel motivated this entrepreneurial mindset?

I didn’t really call it a company, but I made my first money by reselling some things or just washing cars on the street with a bucket. My motivation was that I wanted a Coke or a Snickers, and my parents did not have any money. I could either wait for the money to appear or earn it myself. Waiting does not appeal to me.

Could you share the genesis story behind AMAI?

I asked my partner, “What do companies around the world need?” In that conversation, I realized that every business is looking for a “sale.” We started making robots that could correspond with customers and sell products via mail and messengers. On the other hand, it wasn’t something particularly new as there are many chatbots available. So, we thought that if these robots could also make calls, it would be cool. As there were few good solutions on the market, we created a prototype of our own synthesized voice, and after the first sales, abandoned the robot and focused on TTS.

What does AMAI stand for specifically? 

This stands for I’m AI (I’m artificial intelligence).

Could you discuss some of the challenges behind designing state of the art Text-to-speech technology?

Designing state-of-the art TTS offers several challenges. The first one is collecting datasets. Training a neural network requires female and male voices of varying ages, and the more, the better. Second, you need to achieve a very close resemblance to a natural voice. The best method is to test different machine learning models and to constantly experiment with different cases of voice usage: in particular, you need to find the most problematic sample and process it separately. Speaking of long-term challenges, it can be difficult to assess whether the voice has become better or worse, and in what direction it should be improved.

What are some of the challenges behind speech recognition when it comes to humans interacting with the AMAI voice AI?

There are hundreds of companies working on voice recognition because it is easier to develop. The problem that currently has no solution is recognition of a child’s voice. Children have many characteristics of speech at a young age, so it is hard to take all of them into account. Nonetheless, we’ve been working on a solution to this problem, and we are very close to announcing the result – so soon, our AI won’t have any problems interacting not just with adults, but also with children.

What are some popular use cases for AMAI?

Right now, it’s audiobook dubbing and enterprise use in call centers.

What languages are currently offered, and what languages are currently being worked on?

Our multi-speaker system includes two languages, Russian and English. The idea is that a voice created in one language can speak all the other languages in our model as well. Currently, we are collecting data for 40 more languages, and very soon we will have 42.

What’s your vision for the future of AI voice assistants?

It is my belief that voice assistants will move into the metaverse, and we are studying these opportunities now. If you integrate the assistant with smart speakers or the web browser, more people will use voice search and interact with the assistant every day. You can talk to your refrigerator or TV.

Is there anything else that you would like to share about AMAI?

AMAI uses only its own proprietary technologies.

Thank you for the interview, readers who wish to learn more should visit AMAI.

Credit: Source link

Previous Post

Product Title Matching For SKU Management With NLP

Next Post

Drones, robots and tear gas: Man arrested after 7-hour standoff in Stafford

New York Tech Editorial Team

New York Tech Editorial Team

New York Tech Media is a leading news publication that aims to provide the latest tech news, fintech, AI & robotics, cybersecurity, startups & leaders, venture capital, and much more!

Next Post
Drones, robots and tear gas: Man arrested after 7-hour standoff in Stafford

Drones, robots and tear gas: Man arrested after 7-hour standoff in Stafford

  • Trending
  • Comments
  • Latest
Meet the Top 10 K-Pop Artists Taking Over 2024

Meet the Top 10 K-Pop Artists Taking Over 2024

March 17, 2024
Panther for AWS allows security teams to monitor their AWS infrastructure in real-time

Many businesses lack a formal ransomware plan

March 29, 2022
Zach Mulcahey, 25 | Cover Story | Style Weekly

Zach Mulcahey, 25 | Cover Story | Style Weekly

March 29, 2022
10 Raunchy Movies on Netflix You Won’t Regret Watching

10 Raunchy Movies on Netflix You Won’t Regret Watching

May 20, 2024
How To Pitch The Investor: Ronen Menipaz, Founder of M51

How To Pitch The Investor: Ronen Menipaz, Founder of M51

March 29, 2022
Japanese Space Industry Startup “Synspective” Raises US $100 Million in Funding

Japanese Space Industry Startup “Synspective” Raises US $100 Million in Funding

March 29, 2022
Startups On Demand: renovai is the Netflix of Online Shopping

Startups On Demand: renovai is the Netflix of Online Shopping

2
Robot Company Offers $200K for Right to Use One Applicant’s Face and Voice ‘Forever’

Robot Company Offers $200K for Right to Use One Applicant’s Face and Voice ‘Forever’

1
Menashe Shani Accessibility High Tech on the low

Revolutionizing Accessibility: The Story of Purple Lens

1

Netgear announces a $1,500 Wi-Fi 6E mesh router

0
These apps let you customize Windows 11 to bring the taskbar back to life

These apps let you customize Windows 11 to bring the taskbar back to life

0
This bipedal robot uses propeller arms to slackline and skateboard

This bipedal robot uses propeller arms to slackline and skateboard

0
laptop on glass table

Automat-it Cuts Deployment Friction as Monce Scales AI Order Processing on AWS

April 13, 2026
Lee's Famous Recipe Chicken

Why Lee’s Famous Recipe Chicken Is Betting on Hi Auto to Quietly Rewire the Drive-Thru

April 9, 2026
computer generated image of letters

San Francisco Tribune Lists 11 HumanX Startups Moving AI Closer to the Operating Core

April 8, 2026
Impala CEO and Highrise AI CEO

The Industrialization of AI Infrastructure: What Impala and Highrise AI Reveal About the Next Scaling Frontier

April 7, 2026
Employee Time Tracking

What is an Employee Time Tracking Solution? A Definite Guide for 2026

March 31, 2026
Voltify founders

Voltify Raises $30 Million Seed Round as It Challenges $1 Trillion Rail Electrification Model

March 31, 2026

Recommended

laptop on glass table

Automat-it Cuts Deployment Friction as Monce Scales AI Order Processing on AWS

April 13, 2026
Lee's Famous Recipe Chicken

Why Lee’s Famous Recipe Chicken Is Betting on Hi Auto to Quietly Rewire the Drive-Thru

April 9, 2026
computer generated image of letters

San Francisco Tribune Lists 11 HumanX Startups Moving AI Closer to the Operating Core

April 8, 2026
Impala CEO and Highrise AI CEO

The Industrialization of AI Infrastructure: What Impala and Highrise AI Reveal About the Next Scaling Frontier

April 7, 2026

Categories

  • AI & Robotics
  • Benzinga
  • Cybersecurity
  • FinTech
  • New York Tech
  • News
  • Startups & Leaders
  • Venture Capital

Tags

AI AI QSRs Allseated Automat-it AWS B2B marketing Business CISO CISO Whisperer Collaborations Companies To Watch cryptocurrency Cybersecurity Entrepreneur Fetcherr Finance FINQ Fintech Funding Announcement hi-tech Hi Auto Impala Investing Investors investorsummit Israel israelitech Leaders LinkedIn Leaders Metaverse Mindset Minnesota omri hurwitz PointFive PR QSR Real Estate start- up startupnation Startups Startups On Demand Tech Tech leaders Unlimited Robotics VC
  • Contact Us
  • Privacy Policy
  • Terms and conditions

© 2024 All Rights Reserved - New York Tech Media

No Result
View All Result
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital

© 2024 All Rights Reserved - New York Tech Media