I have had severe deafness from a very young age but, fortunately, I can speak like anyone else. Verbal communication has always been difficult for me because my speech recognition is weak, even when lip-reading. I got through school and college just by reading blackboards, PowerPoint slides, books, and the Internet. I am very good at my current software development job, but lately I feel that I need to make an effort to improve the situation.
Subtitles are my lifeline in this country for understanding films and TV shows, and I have only been able to enjoy them for the last 7 years (I am 31 now).
I really feel the need to be able to see subtitles in real life when I talk with someone, even strangers. I want to develop an impromptu speech-to-text converter, and as a start, it does not even need to give me exact words; just the melody of the syllables / phonetics would be fine.
I have been searching for this for a while, but most of the results are text-to-speech tools or semi-successful speech recognition attempts aimed at giving voice commands to a computer. I would really appreciate some guidance on how to start this project. In particular, I need concrete steps: for example, how to process audio files, and what processing I should do to get approximate phonetics as quickly as possible.
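To make "processing audio files" concrete, here is a minimal sketch of one possible first step, not a full recognizer. It assumes the audio is already decoded into a list of float samples (for example, read from a mono WAV file) and uses three classic per-frame features: short-time energy (is anyone speaking?), zero-crossing rate (a rough voiced vs. unvoiced cue), and an autocorrelation pitch estimate (the "melody"). All function names and the 80-400 Hz pitch range are illustrative assumptions, not part of any library.

```python
import math

def frames(samples, frame_len, hop):
    """Split a list of samples into overlapping fixed-length frames."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def energy(frame):
    """Mean squared amplitude; close to zero for silence."""
    return sum(x * x for x in frame) / len(frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose sign differs.
    Typically low for vowels, higher for hissy consonants like 's'."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def pitch_hz(frame, rate, fmin=80.0, fmax=400.0):
    """Estimate pitch from the strongest autocorrelation lag.
    The search range fmin..fmax is an assumed range for speech.
    Returns 0.0 when no positive correlation peak is found."""
    lo = int(rate / fmax)
    hi = min(int(rate / fmin), len(frame) - 1)
    best_lag, best = 0, 0.0
    for lag in range(lo, hi + 1):
        r = sum(frame[i] * frame[i + lag]
                for i in range(len(frame) - lag))
        if r > best:
            best_lag, best = lag, r
    return rate / best_lag if best_lag else 0.0

if __name__ == "__main__":
    rate = 8000  # samples per second (assumed)
    # Synthetic test signal: 0.2 s of a 200 Hz tone standing in for a vowel.
    tone = [math.sin(2 * math.pi * 200 * n / rate) for n in range(1600)]
    for f in frames(tone, frame_len=400, hop=200):
        print(round(energy(f), 3),
              round(zero_crossing_rate(f), 3),
              round(pitch_hz(f, rate), 1))
```

Plotting the pitch value per frame over time would give a rough melody contour. Mapping frames to actual phonemes is a much harder problem and would need an acoustic model on top of features like these.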
algorithm speech-to-text phonetics
Joy dutta