Click here to register.

Acoustic Model Discussions

Flat
Training recommendation?
User: Tom J.
Date: 11/9/2020 11:14 am
Views: 109
Rating: 0

I'm using Julius and am hoping someone could recommend a program to aid in training.

Essentially I'd like to say a phrase into the mic ten times and have it log to a file what it heard rather than what it thinks it heard.

My thoughts are to develop my own voca file based on my dialect, atmosphere, and specific hardware.

For instance I say:

   How are you

   How are you

   How are you

   How are you

   How are you

   How are you

   How are you

   How are you

   How are you

   How are you

And it (hopefully) logs:

   hh aw  aa r  y uw

   hh aw  aa r  y uw

   hh aw  aa r  y uw

   hh aw  aa r  y uw

   hh aw  aa r  y uw

   hh aw  aa r  y uw

   hh aw  aa r  y uw

   hh aw  aa r  y uw

   hh aw  aa r  y uw

   hh aw  aa r  y uw

Thank you for taking the time to read, I hope someone has a suggestion...

--- (Edited on 11/9/2020 11:14 am [GMT-0600] by Tom J.) ---

Re: Training recommendation?
User: kmaclean
Date: 11/9/2020 3:10 pm
Views: 8
Rating: 0

If you look at the Julius recognition output when you run Julius, you will see that it outputs phonemes it recognizes:

...
### Recognition: 2nd pass (RL heuristic best-first)
...
sentence1: <s> PHONE STEVE </s>
wseq1: 0 2 4 1
phseq1: sil | f ow n | s t iy v | sil
cmscore1: 1.000 1.000 1.000 1.000
score1: -16547.482422

--- (Edited on 11/9/2020 4:10 pm [GMT-0500] by kmaclean) ---

Re: Training recommendation?
User: Tom J.
Date: 11/12/2020 8:06 am
Views: 22
Rating: 0

Thank you for your response, I knew that was there but was hoping to do a bit more in depth comparison without having to scroll through a couple hundred lines.

I've written a REALLY crude API in C++, if theres no alternative I guess I could modify it for this.

--- (Edited on 11/12/2020 8:06 am [GMT-0600] by Tom J.) ---

Re: Training recommendation?
User: Tom J.
Date: 11/20/2020 11:07 am
Views: 22
Rating: 0

I did log phenome strings, it was as simple as a phenome to phenome dictionary.

At any rate I've taken this concept in a different direction and am attempting to hear strictly phonetic to eliminate a big dictionary.

Rather than having two threads on the same concept I'll leave this link to the current thread here.

http://voxforge.org/home/forums/message-boards/general-discussion/log-phenome#COhJIx4B9e5N2SkZBIXFjw

--- (Edited on 11/20/2020 11:07 am [GMT-0600] by Tom J.) ---

Next