VoxForge
1. You need to tell HTK where all your feature vector files are located (those are the mfcc files you created in the last step). You do this with with a script file.
Therefore, create a file
called:
test.scp
2. You also need a configuration file. Create a file called
'config' in your 'voxforge/test' directory and add the following data:
| TARGETKIND = MFCC_0_D_N_Z TARGETRATE = 100000.0 SAVECOMPRESSED = T SAVEWITHCRC = T WINDOWSIZE = 250000.0 USEHAMMING = T PREEMCOEF = 0.97 NUMCHANS = 26 CEPLIFTER = 22 NUMCEPS = 12 |
3. Next use HTK's HVite command to recognize the test data as follows:
a) If you created your Acoustic Model using the How-to or the the Tutorial, execute the following command:
| $HVite -A -D -T 1 -H macros -H hmmdefs -C
config -S test.scp -l '*' -i recout.mlf -w wdnet -p 0.0 -s 5.0 ../lexicon/voxforge_lexicon tiedlist |
| $/home/yourusername/htk-3.2.1/bin.linux/HVite -A -D -T 1 -H macros -H hmmdefs -C config -S test.scp -l '*' -i recout.mlf -w wdnet -p 0.0 -s 5.0 ../lexicon/voxforge_lexicon tiedlist |
This will create the following file:
recout.mlf
| $HResults -I testref.mlf tiedlist recout.mlf |
which will display output similar to this (note: these are results for the 8kHz:16-bit VoxForge Speaker Independent Acoustic Model - build 396):
| ====================== HTK Results Analysis ======================= Date: Thu Sep 14 14:11:46 2006 Ref : testref.mlf Rec : recout.mlf ------------------------ Overall Results -------------------------- SENT: %Correct=60.00 [H=30, S=20, N=50] WORD: %Corr=96.83, Acc=76.19 [H=183, D=0, S=6, I=39, N=189] =================================================================== |
What this means is that: