Click here to register.

Speech Recognition Engines

CMU Sphinx-4 in the Cloud
User: kmaclean
Date: 1/16/2010 2:00 pm
Views: 4029
Rating: 9 uses an open source stack which includes Asterisk (PSTN interface), the java Sphinx-4 speech decoder, and a mySQL database for user content on Amazon's EC2.  From their site:

Internally, uses the CMU Sphinx-4 decoder. A very fast trainer focused only on phonemes generates acoustic models. A typical training session generates models in a couple of hours on a dual core 2.6GHz box. The CMU-Cambridge Langauge Toolkit generates Langauge Models. The exact dictionary varies for every user and use case. We switch grammars at decode time depending on the user and use case. A typical grammar generation scenario is driven by user content stored in a database. These user specific words are put in a dictionary. A rules-engine generates phoneme sequences for each word. The CMU LM Toolkit generates an LM and a vocab.

--- (Edited on 1/16/2010 3:00 pm [GMT-0500] by kmaclean) ---