1. You need to tell HTK where all your feature vector files are located (those are the mfcc files you created in the last step). You do this with with a script file.
Therefore, create a file
called:
test.scp
2. You also need a configuration file. Create a file called
'config' in your 'voxforge/test' directory and add the following data:
| TARGETKIND = MFCC_0_D_N_Z TARGETRATE = 100000.0 SAVECOMPRESSED = T SAVEWITHCRC = T WINDOWSIZE = 250000.0 USEHAMMING = T PREEMCOEF = 0.97 NUMCHANS = 26 CEPLIFTER = 22 NUMCEPS = 12 |
3. Next use HTK's HVite command to recognize the test data as follows:
a) If you created your Acoustic Model using the How-to or the the Tutorial, execute the following command:
| $HVite -A -D -T 1 -H macros -H hmmdefs -C
config -S test.scp -l '*' -i recout.mlf -w wdnet -p 0.0 -s 5.0 ../lexicon/voxforge_lexicon tiedlist |
| $/home/yourusername/htk-3.2.1/bin.linux/HVite -A -D -T 1 -H macros -H hmmdefs -C config -S test.scp -l '*' -i recout.mlf -w wdnet -p 0.0 -s 5.0 ../lexicon/voxforge_lexicon tiedlist |
This will create the following file:
recout.mlf
| $HResults -I testref.mlf tiedlist recout.mlf |
which will display output similar to this (note: these are results for the 8kHz:16-bit VoxForge Speaker Independent Acoustic Model - build 396):
| ====================== HTK Results Analysis ======================= Date: Thu Sep 14 14:11:46 2006 Ref : testref.mlf Rec : recout.mlf ------------------------ Overall Results -------------------------- SENT: %Correct=60.00 [H=30, S=20, N=50] WORD: %Corr=96.83, Acc=76.19 [H=183, D=0, S=6, I=39, N=189] =================================================================== |
What this means is that: