I am building chat application with voice record. User should be able to send text message by typing through the voice recorder. But not in English, when he speaks in his native language it need to be typed in his native language only.
I have tried speech_to_text, speech_recognition and all and I couldn't find any solution.
Please do help me.
Related
I have problem with real-time voice transcription. In my application people can make a video call each other according to their language choice, I want to show subtitles of conversations in real time to users. I used Agora SDK for video call feature. Is it possible to add real-time transcription?
I found Google Cloud Speech API but there is not enough documentation for dart. I would really be pleased if you give me some starting point
It is just a suggestion. I don’t know if it is valid or not, but you can try. You can apply the Java code or Kotlin or Xamarin, this will help you in setting 'Google Cloud Speech API'.
Excuse me for my explanation, I hope to be of benefit to you
I'm am working on a Unity app that needs custom voice recognition. By custom I mean I cant have them be English or any known language. I want to teach the engine basically new words like gibberish words I made up and I need the voice recognition to recognize them, any ideas? Also I need it to work on both iOS/Android.
Thank you in advance.
Check
https://github.com/cmusphinx/pocketsphinx-unity-demo
also the main cmusphinx website
http://cmusphinx.github.io
I need to change the speech language for a specific response. I know I can change the TTS voice for the whole app, but I have not found a way to do that for a response. In this case, the supported user locales are English and German, but the text I want Google Assistant to speak is in Korean.
Interestingly, there is no problem if the user locale is German and the text is in English. However, when I tried to create a response with Korean text, there was no audio feedback.
Unfortunately, the Actions on Google platform does not have support for in-dialog language changes. The case you've outlined may be an exception based on certain languages having support for other-language words which are supported as a subset in the primary language.
One alternative you might consider here is using recorded spoken audio through SSML. This is a popular way to insert custom audio output into your app, which may make sense for your use case.
I want to integrate voice detection in my iPhone app. The iPhone app allow the user to search the word by using their voice. But, i don't know a single info about Voice Recognition in iPhone. Can you please suggest me any ideas,tutorials or sample code for this?
You can also use Google Chrome API to integrate voice recognition on your application, but there is a big problem : the API works only with FLAC encoded files, but this encoding isn't supported natively on iOS... :/
You can see those 2 links for more information :
http://www.albertopasca.it/whiletrue/2011/09/objective-c-use-google-speech-iphone/
http://8byte8.com/blog/2012/07/voice-recognition-ios/
EDIT :
I realized an application including voice recognition using Nuance SDK, but it's not free to use. You can register for free and get a developer key that allows you to test your application for 90-days. An application example is included, you can see the code, it's very easy to implement.
Good luck :)
The best approach will probably be to:
Record the voice on the phone
Send the recording to a server that runs the speech recognition software
Then return something to the phone to indicate what it should do
This approach is favorable as there are a number of open source voice to text softwares out there & you are not limited by computing power in the backend.
Having said that, iOS has OpenEars which is based on Pocket Sphinx. It looks promising...
Well voice recognition is not correlated with iphone. All you can do is record the voice in iphone. Once done, you can either code your one voice recognition module, or find a third party API and reuse it.
You can do google search on that.
I want to ask a question about the iPhone application. Does Apple provide any API to the developers to record the phone call and convert it to text message? Thank you.
In short, there are no APIs for recording phone calls or converting text to speech. You will need to create a speech recognition engine. I suspect the iPhone hardware will not be powerful enough to handle that type of processing though.
FYI...
APIs for converting Voice/Audio data in to text
API for Voice recognition in among group
iPhone speech recognition API?