Hi,
congratulations for this valuable project!
I'd like to start experimenting with voice processing in order to build an automated language recognition engine.
I'm very new to this area but I can imagine a simple processing system that performs phoneme extraction from voice file, and then use an n-gram regognition system. Is the "accoustic model" of a language the right starting point for my approach?
Vincent
--- (Edited on 11/19/2007 11:38 am [GMT-0600] by f4bxw) ---
I suggest you to get some background in ASR first, read Spoken Language Processing by Huang and others:
http://dsp-book.narod.ru/SLP1.djvu
http://dsp-book.narod.ru/SLP2.djvuRabiner's tutorial or even the HTK book.
--- (Edited on 11/19/2007 12:58 pm [GMT-0600] by nsh) ---
Hi Vincent,
Another good reference/textbook is:
SPEECH and LANGUAGE PROCESSING: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, By Daniel Jurafsky and James H. Martin,
The second edition draft chapters are available for download in pdf format.
You might also look through the archives of comp.speech.research and/or the HTK email archives (you need to be registered with HTK) to see if anyone has already done some work in this area.
Good luck!
Ken
--- (Edited on 11/19/2007 2:43 pm [GMT-0500] by kmaclean) ---
There has been quite a bit of research on this. I suggest you do Google and Google Scholar searches for "language identification" and "language recognition".
Hope this helps,
David
--- (Edited on 11/20/2007 1:21 pm [GMT-0600] by DavidGelbart) ---