General Discussion

Flat
can we use th HUB4 or WSJ that's supported by Sphinx?
User: msatish
Date: 1/1/2007 5:24 pm
Views: 8507
Rating: 37

How do i use Hub4 or AN4 or WSJ models/dictionaries that are suported by sphinx?

Can we use them with Julius?

 

thanks
Satish

--- (Edited on 1/ 1/2007 6:24 pm [GMT-0500] by msatish) ---

Re: can we use th HUB4 or WSJ that's supported by Sphinx?
User: kmaclean
Date: 1/1/2007 8:52 pm
Views: 361
Rating: 34

Hi Satish,

My understanding is that the Sphinx HUB4 and WSJ acoustic models are in a Sphinx binary format.  Julius uses HTK ascii format files.  

I did run across a site by Keith Vertanen where he create binary HTK Acoustic Models using the WSJ corpus.  I've emailed him to see if we can get ASCII versions of his Acoustic Models - so they can be used with Julius.

Ken 

 

--- (Edited on 1/ 1/2007 9:52 pm [GMT-0500] by kmaclean) ---

Re: can we use th HUB4 or WSJ that's supported by Sphinx?
User: kmaclean
Date: 1/2/2007 12:40 pm
Views: 2251
Rating: 28

Hi Satish,

I emailed LEE Akinobu (Julius maintainer), and he provided me with the following method to convert binary HTK hmmdefs file to ASCII:

 % touch null.hed
 % HHEd -H binary_hmmdefs -w ascii_hmmdefs null.hed tiedlist

The tiedlist file is included in the AM archives.
Also please look at
http://kt-lab.ics.nitech.ac.jp/hts-users/spool/2006/msg00280.html.

I got Julian to work with Keith Vertanen's htk_wsj_all_10000_32.zip file, though recognition (with my voice) was not that good.  It may be that the Julian configuration file needs to be tweaked, or that the Acoustic Model needs to be adapted to my voice - which can be done since he provides the 'stats 'file with the distribution.

VoxForge provides some instruction for adapting Acoustic Models to your own voice (see this link), but using HTK version 3.2.1.  I could not get adaptation to work using HTK version 3.3.  I have not had a chance to try it with HTK rel 3.4. 

Ken 

 

--- (Edited on 1/ 2/2007 1:40 pm [GMT-0500] by kmaclean) ---

--- (Edited on 1/ 2/2007 1:42 pm [GMT-0500] by kmaclean) ---

PreviousNext