From tpavelk's post:
I have done some adaptation experiments of my own, but only with HDecode; I have not tried Julius yet. I took the recordings from user "ralfherzog" (6620 sentences for adaptation, taken from the training set, and 100 sentences for testing), and these are my results:
Without adaptation:
SENT: %Correct=63.64 [H=63, S=36, N=99]
WORD: %Corr=93.21, Acc=90.24 [H=783, D=2, S=55, I=25, N=840]
With adaptation:
SENT: %Correct=68.69 [H=68, S=31, N=99]
WORD: %Corr=95.12, Acc=92.26 [H=799, D=1, S=40, I=24, N=840]
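For anyone who wants to check these figures: the percentages follow from the bracketed counts, with %Corr = H/N and Acc = (H - I)/N. Below is a small sketch that recomputes them with awk. The function name htk_scores is just an illustration and not part of HTK, and the WER column is my addition (HResults does not print it in this summary).

```shell
#!/bin/sh
# Recompute HResults-style word scores from the raw counts.
# H = hits, D = deletions, S = substitutions, I = insertions, N = reference words.
# htk_scores is a hypothetical helper name, not an HTK tool.
htk_scores() {
    LC_ALL=C awk -v H="$1" -v D="$2" -v S="$3" -v I="$4" -v N="$5" 'BEGIN {
        corr = 100 * H / N            # %Corr ignores insertions
        acc  = 100 * (H - I) / N      # Acc also penalises insertions
        wer  = 100 * (D + S + I) / N  # word error rate, equal to 100 - Acc
        printf "Corr=%.2f Acc=%.2f WER=%.2f\n", corr, acc, wer
    }'
}

htk_scores 783 2 55 25 840   # without adaptation: Corr=93.21 Acc=90.24 WER=9.76
htk_scores 799 1 40 24 840   # with adaptation:    Corr=95.12 Acc=92.26 WER=7.74
```

In absolute terms the word errors (D + S + I) go from 82 to 65 on the same 840 reference words, which is a relative reduction of roughly 21%.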
So there is a visible improvement. Here are the commands used for the adaptation:
HHEd -H $hmmdir/$dir/macros -H $hmmdir/$dir/hmmdefs -M $hmmdir/classes scripts/regtree.hed tiedlist
HERest -C $config -C scripts/config.global -S $trainlist -I MLF/aligned.mlf -H $hmmdir/$dir/macros -u a -J $hmmdir/classes -K $hmmdir/xforms mllr1 -H $hmmdir/$dir/hmmdefs -h *\\%%%%*.mfc tiedlist
HERest -a -C $config -C scripts/config.rc -S $trainlist -I MLF/aligned.mlf -H $hmmdir/$dir/macros -u a -J $hmmdir/xforms mllr1 -J $hmmdir/classes -K $hmmdir/xforms mllr2 -H $hmmdir/$dir/hmmdefs -h *\\%%%%*.mfc tiedlist
and for testing:
Adapted:
hdecode -T 1 -m -H $hmmdir/$dir/hmmdefs -H $hmmdir/$dir/macros -J $hmmdir/xforms mllr2 -J $hmmdir/classes -h *\\%%%%*.mfc -C $hdecode_config -S $testlist -t 220.0 -l rec -w $language_model -p 0.0 -s 15.0 $hdecode_dict xwrdtiedlist
Speaker independent:
hdecode -H $hmmdir/$dir/hmmdefs -H $hmmdir/$dir/macros -C $hdecode_config -S $testlist -t 220.0 -l rec -w $language_model -p 0.0 -s 15.0 $hdecode_dict xwrdtiedlist
All configs, scripts, etc. were taken from the tutorial in the HTKBook 3.4.1.
@Ken: What is the reason for reviewing the HVite log from the alignment step? Is it that HTK may drop some sentences because of overpruning without reporting any errors (unless you enable tracing)? That can be solved by turning off pruning (leave out the -t option). Or is there something I missed?
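If sentences can indeed go missing from the alignment without an error, one way to spot it without reading the whole log is to compare the script file with the resulting MLF. The sketch below assumes one feature file per line in the script file and relies on the fact that every MLF entry starts with a quoted line such as "*/name.lab". The function name missing_from_mlf is made up for illustration; it prints the base names of utterances that have no entry in the MLF.

```shell
#!/bin/sh
# List utterances that appear in an HTK script file (-S list) but have no
# entry in the MLF written by the alignment step.
# missing_from_mlf is a hypothetical helper name, not an HTK tool.
missing_from_mlf() {
    scp=$1
    mlf=$2
    scp_ids=$(mktemp)
    mlf_ids=$(mktemp)
    # Reduce each feature-file path to its base name without extension.
    sed -e 's#.*[/\\]##' -e 's#\.[^.]*$##' "$scp" | sort -u > "$scp_ids"
    # MLF entries start with a quoted pattern line such as "*/utt0001.lab".
    grep '^"' "$mlf" | sed -e 's#"##g' -e 's#.*[/\\]##' -e 's#\.[^.]*$##' | sort -u > "$mlf_ids"
    # comm -23 keeps the lines found only in the first (script file) list.
    comm -23 "$scp_ids" "$mlf_ids"
    rm -f "$scp_ids" "$mlf_ids"
}

# Example call, using the file names from the commands above:
# missing_from_mlf "$trainlist" MLF/aligned.mlf
```

An empty output means every utterance in the list has an alignment. Anything printed is a candidate for a closer look in the HVite log.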
One more thing: the tutorial here is valid for older HTK versions only. HTK 3.4 does not have a separate adaptation tool (HEAdapt); everything is done by HERest.