Click here to register.

Speech Recognition Engines

Comparing HTK and Sphinx
User: kmaclean
Date: 7/22/2008 1:43 pm
Views: 8228
Rating: 10

David Huggins-Daines (PocketSphinx developer) has an article in his wiki comparing Sphinx 3.0 and HTK speech recognition engines.  From the article:

The short story is that, after controlling for the beam widths, language model weights, number of tied states and Gaussians, and the acoustic features, Sphinx 3.0 is actually a bit more accurate than [HTK's] HVite. This is not a very meaningful test, because both are extremely slow (over 5xRT [Real Time] on a 2.8GHz Pentium4). However, it does show that there is nothing inherently wrong with the Sphinx trainer, at least on a basic maximum-likelihood training task.

The Julius speech recognition engine uses HTK acoustic models.

--- (Edited on 7/22/2008 2:43 pm [GMT-0400] by kmaclean) ---

Re: Comparing HTK and Sphinx
User: ubanov
Date: 11/10/2008 11:11 am
Views: 87
Rating: 5


Does anyone tested julius and sphinx voice recognition engines, in order to compare them (and tell me if you are agree with David Huggins-Daines or not, in the accuarate of both programs).

Thanks in advance.

--- (Edited on 11/10/2008 11:11 am [GMT-0600] by ubanov) ---

Re: Comparing HTK and Sphinx
User: kmaclean
Date: 11/12/2008 2:12 pm
Views: 747
Rating: 5

Hi Ubanov,

This page on the Simon project website is interesting, it's really the only comparison that I have found between Julius and Sphinx:

Analysis of existing software (translated from German using Google translate - original page)

They chose Julius over Sphinx and some other commercial products (based on criteria for their specific application context). 


--- (Edited on 11/12/2008 3:12 pm [GMT-0500] by kmaclean) ---

Re: Comparing HTK and Sphinx
User: kmaclean
Date: 2/17/2010 10:17 pm
Views: 1403
Rating: 4

Paper by Arthur called Comparison of the SPHINX and HTK Frameworks Processing the AN4 Corpus

--- (Edited on 2/17/2010 11:17 pm [GMT-0500] by kmaclean) ---

--- (Edited on 3/21/2011 12:59 am [GMT-0400] by kmaclean) ---