Liv.ai speech-to-text conversion
This article explores the technology landscape which can be used to build similar platform / service offerings like Liv.ai.
First and foremost, congratulations to Liv.ai team for leveraging existing cloud-based AI and speech recognition (Speech-to-text conversion) technologies to come up with a set of business offerings which leverages speech-to-text conversion technology to create great value for businesses. The founding team (IIT KGP Alumni – Subodh Kumar, Sanjeev Kumar and Kishore Mundra) nailed it! Doing right thing at right time at right place.
Liv.ai enables developers to convert speech-to-text by using Powerful Neural Network Models with exceptional accuracy and minimal latency. At this point, the platform supports 9 languages including Hindi, English, Bengali, Gujarati, Telugu, Tamil, Marathi, Punjabi and Kannada.
The following is the list of areas where Liv.ai technologies look to be focusing on:
The following lists down some of the business use case in relation to speech-to-text conversion technology:
The following are some of the key building blocks of a platform like liv.ai leveraging speech-to-text conversion technology:
All of the above can be achieved using following:
When considering Indian languages or rather, languages spoken in India, Google Cloud Speech API supports speech to text conversion for following Indian languages (as supported by Liv.ai):
The following are some of the salient features of Google Cloud Speech API:
And, all of the above comes at a very decent pricing from Google:
| Monthly Usage | Price per 15 seconds |
|---|---|
| 0-60 minutes | Free |
| 61-1,000,000 minutes | $0.006 |
One can also try other cloud speech APIs such as following:
In case, you wanted to share your thoughts in relation with using Google or other cloud speech APIs to build speech-to-text conversion platforms such as liv.ai, please feel free to suggest.
We’ve all been in that meeting. The dashboard on the boardroom screen is a sea…
When building a regression model or performing regression analysis to predict a target variable, understanding…
If you've built a "Naive" RAG pipeline, you've probably hit a wall. You've indexed your…
If you're starting with large language models, you must have heard of RAG (Retrieval-Augmented Generation).…
If you've spent any time with Python, you've likely heard the term "Pythonic." It refers…
Large language models (LLMs) have fundamentally transformed our digital landscape, powering everything from chatbots and…