romeronathan Posted May 25 Share Posted May 25 Basically, my use case is when a user speaks during a stream, I'm trying to pipe that audio into a speech to text model so they have real time transcription. Can somebody point me in the right direction to accomplish this using AMS? Quote Link to comment Share on other sites More sharing options...
0 Connessione Posted May 28 Share Posted May 28 I think ant media has filter plugins to help you access raw data on serverside. But for your case i would first explore the possibility of using mic input from the browser into third party speech recognition engines via their sdks. For example Google speech to text SDK has client side example of capturing mic input using SDK methods. Quote Link to comment Share on other sites More sharing options...
Question
romeronathan
Basically, my use case is when a user speaks during a stream, I'm trying to pipe that audio into a speech to text model so they have real time transcription. Can somebody point me in the right direction to accomplish this using AMS?
Link to comment
Share on other sites
1 answer to this question
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.