Italian speech recognition with sphinx's hello world

Italian

Flat

User: ingegner_ciccio
Date: 10/26/2014 4:23 am

Views: 22811
Rating: 3

Salve a tutti. Sono giorni che leggo il vostro forum e innanzitutto vi voglio fare i complimenti per ciò che fate. Vorrei sapere se qualcuno può aiutarmi. Vorrei far funzionare la demo "hello world" di sphinx4 in linguaggio JAVA, facendogli riconoscere parole italiane. Per tale scopo vorrei addestrare un modello in italiano, anche se ovviamente mi rendo conto che con pochi dati è una cosa molto difficile. Il primo problema è: qualcuno sa come impostare il file .xml o quali impostazioni cambiare per far funzionare la demo con un'altra lingua? Ho letto già molte guide sul sito di sphinx e molti altri suggerimenti su diversi forum, ma non mi è stato di grande aiuto. Ringrazio anticipatamente chi vorrà aiutarmi.

Re: Italian speech recognition with sphinx's hello world

User: nsh
Date: 10/26/2014 4:39 am

Views: 100
Rating: 4

Download latest demo

http://github.com/cmusphinx/sphinx4

Build it with maven, run TranscriberDemo from sphinx4-samples package.

There are no XML files anymore, to configure it with Italian you need to download Italian acoustic model from Voxforge:

AcousticModels.tgz

and configure the model in the source code:

configuration.setAcousticModelPath("file:voxforge_it.cd_cont_1000");

configuration.setDictionaryPath("file:voxforge_it.dict");

configuration.setLanguageModelPath("file:voxforge_it.lm.dmp");

You can learn more about models from CMUSphinx tutorial

http://cmusphinx.sourceforge.net/wiki/tutorial

Re: Italian speech recognition with sphinx's hello world

User: ingegner_ciccio
Date: 10/26/2014 5:01 am

Views: 124
Rating: 3

thanks for your quick reply. I'm trying to recognize speech from a microphone and not a wav file, so the demo I'm using is the hello world. Can I do the same thing for that? Also I need the program to recognize words that are not in the dictionary as unknown, can I activate the OutOfGrammar probability with a foreign language? So I think if I configure the acoustic model with this 3 lines of code:

configuration.setAcousticModelPath("file:voxforge_it.cd_cont_1000");

configuration.setDictionaryPath("file:voxforge_it.dict");

configuration.setLanguageModelPath("file:voxforge_it.lm.dmp");

I must eliminate this,

cm = new ConfigurationManager(SpeechRecognizer.class.getResource("speechRecognizer.config.xml"));

right?

So How can I activate OutOfGrammar probability in this case?

Re: Italian speech recognition with sphinx's hello world

User: nsh
Date: 10/26/2014 5:12 am

Views: 105
Rating: 3

> so the demo I'm using is the hello world

The demo for microphone recognition is called Dialog, we do not have hello world anymore

> I must eliminate this, cm = new ConfigurationManager(SpeechRecognizer.class.getResource("speechRecognizer.config.xml"));

right?

There is no such thing in latest code

> So How can I activate OutOfGrammar probability in this case?

First build Italian model that works reasonably well, that's already a huge task. You can think about out-of-grammar later.

Re: Italian speech recognition with sphinx's hello world

User: ingegner_ciccio
Date: 10/26/2014 5:36 am

Views: 136
Rating: 4

thanks for the clarification. I worked with the previous version of sphinx and not the 5prealpha. I should be able to recognize only a small group of words, at most 10, so would be enough for me to adjust some words from Spanish to reach my goal, but if I know from the beginning that I can not recognize the words outside the vocabulary as unknown, I can't use Dialog demo. So do you think it is possible to activate OutOfGrammar probability in dialog demo?

Re: Italian speech recognition with sphinx's hello world

User: nsh
Date: 10/26/2014 9:40 am

Views: 140
Rating: 3

> I should be able to recognize only a small group of words, at most 10, so would be enough for me to adjust some words from Spanish to reach my goal

> but if I know from the beginning that I can not recognize the words outside the vocabulary as unknown, I can't use Dialog demo. So do you think it is possible to activate OutOfGrammar probability in dialog demo?

Dialog demo has oog search activated

Re: Italian speech recognition with sphinx's hello world

User: ingegner_ciccio
Date: 10/27/2014 3:58 am

Views: 100
Rating: 4

Thanks for your answers, Nikolay. Now, I'm Trying to use the latest sphinx4 release, that you linked in a message ago. I've builded it and the AlignerDemo normally starts. When I start the Dialog Demo I've this error:

Exception in thread "main" java.lang.IllegalStateException: javax.sound.sampled.LineUnavailableException: line with format PCM_SIGNED 16000.0 Hz, 16 bit, mono, 2 bytes/frame, little-endian not supported.
at edu.cmu.sphinx.api.Microphone.<init>(Microphone.java:38)

I'm using Windows 7, with the latest Java Version. How can I solve this?

I've found that it is a common problem for some users, but I don't understand how to solve it, for run the Dialog Demo.

Thank you very much for your answers, which are ever very useful.

Re: Italian speech recognition with sphinx's hello world

User: ingegner_ciccio
Date: 10/27/2014 10:02 am

Views: 4518
Rating: 3

Ok, I solved the problem. I've modified some code lines and now I started the Demo. Italian language package doesn't contain the trained acoustic model, that I have to import to configure the model:

configuration.setAcousticModelPath("file:voxforge_it.cd_cont_1000");

I've imported the French model, to test the procedure, and it works. Now, I'm trying to import Spanish model, with the same procedure, but it doesn't works. I've an Array Out Of Bound Exception 29. How can I solve this exception?

Thank you very much for the help.

Previous • Next •


Username	Password