Speech recognition is a complex domain with many specific algorithms, tools and methods. To create your own engine, you can start with the CMUSphinx open source speech recognition toolkit, which allows you to:
- Collection and processing of data necessary to support the Georgian language.
- Create models for Georgian
- Introduce a mechanism for speech recognition in Georgian.
- Use the engine to create a speech recognition application that runs on the desktop, on the server, or on the iPhone (via OpenEars).
CMUSphinx already supports English, German, Spanish, French, Dutch, Russian, Mandarin, Icelandic, Italian and many other languages. It is very simple to add a new one. For new people, it usually takes a month or two of concentrated work to complete the required process.
To get started, visit the homepage:
http://cmusphinx.sourceforge.net
and read the tutorial
http://cmusphinx.sourceforge.net/wiki/tutorial
If you have any questions, please post them on the forums or here!
And this is a very common misconception that you just make sounds when you speak Georgian. This does not apply to most languages ββin the world. To test the hypothesis, try recording some audio in the audio editor and check which sounds are actually being pronounced. You will be surprised. This tutorial details this issue.
Nikolay Shmyrev
source share