Click here to register.

Edit Message

Visitor Name
Subject
Message

Re: Create script to train up acoustic models on speech audio from Librivox project.

One important wrinkle to using MP3 audio from Librivox (or even the WAV audio) is that some (not sure how much) of the speech audio submitted to Librivox has been 'processed' - i.e. the audio has been 'cleaned' with noise removal algorithms, audio level normalization, and/or equalization.

Not sure how this might affect a final acoustic model - the rule of thumb has been to use unaltered speech audio as much as possible. 

Ken