General Discussion

General Discussion http://www.voxforge.org/home/forums/message-boards/general-discussion "Voice Control" used in an automobile ..... in 1960!!!??? http://www.voxforge.org/home/forums/message-boards/general-discussion/voice-control-used-in-an-automobile-.....-in-1960 I just stumbled across some articles about this car, and after everything I went through making Julius work well on my project was shocked to see the claim of voice control! http://www.voxforge.org/home/forums/message-boards/general-discussion/voice-control-used-in-an-automobile-.....-in-1960 Wed, 16 Feb 2022 21:19:28 -0600 juliuslib sample code C++ http://www.voxforge.org/home/forums/message-boards/general-discussion/juliuslib-sample-code-c I want to use Julius on a project in C++ and there are some docs and snippets of sample code but they're a little vague from a novice developer perspective. http://www.voxforge.org/home/forums/message-boards/general-discussion/juliuslib-sample-code-c Sun, 10 Jan 2021 23:41:50 -0600 Looking for guide/instructions how to set up more training files http://www.voxforge.org/home/forums/message-boards/general-discussion/looking-for-guide/instructions-how-to-set-up-more-training-files Hello I finally have julius working, and julia, and htk. I did the simple 'call/dial' 'steve/young' and numbers and it came out pretty good. I've changed to a word loop system to allow a more dictation approach over IVR but with a limited dictonary of 140 words directly related to the end use case. The end use case being a jasper system running on PI connected to the NMEA2000 can bus network on my boat. The idea being that i can vebally request information from the boat and have it give it to me, a la StarTrek: TNG :) So i have this 140 some odd words all revolving around these commands i'd like to use but the sample dictonary set up doesn't really cover it. Its catching 1 in 20 or so of those words. ergo, i need to know how to set up more training files. I have the 40 sample scripts but they don't seem to cover all the phonems i need. I just don't know how to either add to or modify those (i would presume add to). Logically, i'm guessing, i should just make vocal samples based on each of the commands i might say to the system. There's a good 30 of those alone. If i say them forward and backwards thats 60 more sample wavs i can work with. 100 total. Thing is... how do? Also, i'd like to try to adopt a grammar system that follows: Activator: Command: word loop Computer, turn ... (left/on lights/ etc) to help try my accuracy of responses. I'm not sure if it matters with the jasper engine behind it though, so if you got any experience with that i'm all ears. Thanks for all the help so far guys. I got it hearing 'computer' pretty reliably :) Onward and upward from here. http://www.voxforge.org/home/forums/message-boards/general-discussion/looking-for-guide/instructions-how-to-set-up-more-training-files Tue, 22 Dec 2020 22:51:24 -0600 Julius trained via julia script only 'hearing' "DIAL" and a number http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-trained-via-julia-script-only-hearing-dial-and-a-number Hello I finished doing the samples and the julia script for julius's model's and when running julius i get the following john@john-VirtualBox:~/voxforge/howto$ julius -input mic -C sample.jconf STAT: include config: sample.jconf pass1_best: <s> DIAL EIGHT sentence1: <s> DIAL ONE </s> pass1_best: <s> DIAL SIX sentence1: <s> DIAL TWO </s> pass1_best: <s> DIAL SIX sentence1: <s> DIAL THREE </s> pass1_best: <s> DIAL SIX sentence1: <s> DIAL FOUR </s> even though i might say 'cat' or 'dog' to it. i also noticed it wouldn't let me say the full sentance. So i'm trying to accomplish http://www.voxforge.org/home/dev/acousticmodels/linux/create/htkjulius/how-to/run-julius And i did not see: ----------------------- System Information end ----------------------- Notice for feature extraction (01), ************************************************************* * Cepstral mean normalization for real-time decoding: * * NOTICE: The first input may not be recognized, since * * no initial mean is available on startup. * ************************************************************* ------ ### read waveform input Stat: capture audio at 16000Hz Stat: adin_alsa: latency set to 32 msec (chunk = 1536 bytes) Error: adin_alsa: unable to get pcm info from card control Warning: adin_alsa: skip output of detailed audio device info STAT: AD-in thread created <<< please speak >>> When the command was submitted. Anyone know what i might be missing? http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-trained-via-julia-script-only-hearing-dial-and-a-number Mon, 21 Dec 2020 12:12:20 -0600 installation complication on r pi http://www.voxforge.org/home/forums/message-boards/general-discussion/installation-complication-on-r-pi Good evening all. I looked through the materials provided on the forum here and didn't quite find anything that fit my use case. I'm attempting to install the jasper solution on this PI http://jasperproject.github.io/documentation/configuration/#julius-stt Following this instruction set it informs me i need to adapt the profile to my own vocal patterns which makes sense, ok... http://www.voxforge.org/home/dev/acousticmodels/linux/adapt/htkjulius/download-htk I am following this guide then to attempt to do this and created the makeHTK.sh file as dictated here, changing the directory to my own /home/pi/htk http://www.voxforge.org/home/forums/message-boards/general-discussion/installation-complication-on-r-pi Sat, 19 Dec 2020 22:30:55 -0600 Julius Microphone optimization http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-microphone-optimization The jconf file has notations in it which are probably really useful but as a new user to voice recognition most of the terminology is beyond my grasp. http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-microphone-optimization Sat, 21 Nov 2020 12:23:48 -0600 Log Phenome? http://www.voxforge.org/home/forums/message-boards/general-discussion/log-phenome I'm currently working with Julius. An initial thought I had was to log the phenomes as heard by Julius so I could spot problems with my dialect vs the model. http://www.voxforge.org/home/forums/message-boards/general-discussion/log-phenome Wed, 18 Nov 2020 11:17:01 -0600 Mozilla's Open Speech Corpora status http://www.voxforge.org/home/forums/message-boards/general-discussion/mozillas-open-speech-corpora-status from Kelly Davis: http://www.voxforge.org/home/forums/message-boards/general-discussion/mozillas-open-speech-corpora-status Fri, 21 Aug 2020 07:19:25 -0500 ERROR [+2662] FindProtoModel: no proto in hSet http://www.voxforge.org/home/forums/message-boards/general-discussion/error-2662--findprotomodel-no-proto--in-hset Dear Team, http://www.voxforge.org/home/forums/message-boards/general-discussion/error-2662--findprotomodel-no-proto--in-hset Fri, 24 Jul 2020 09:52:53 -0500 Julius speaks other words which are not trained in our file http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-speaks-other-words-which-are-not-trained-in-our-file Dear team, http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-speaks-other-words-which-are-not-trained-in-our-file Fri, 24 Jul 2020 09:52:11 -0500 Running Julius Live not understanding the output format http://www.voxforge.org/home/forums/message-boards/general-discussion/-running-julius-live-not-understanding-the-output-format Dear Team, http://www.voxforge.org/home/forums/message-boards/general-discussion/-running-julius-live-not-understanding-the-output-format Wed, 22 Jul 2020 09:46:22 -0500 Corpus Specification http://www.voxforge.org/home/forums/message-boards/general-discussion/corpus-specification Hi there, http://www.voxforge.org/home/forums/message-boards/general-discussion/corpus-specification Sat, 13 Jun 2020 19:31:59 -0500 ls: cannot access '/tmp/bin.linux': No such file or directory on Ubuntu 18.04 http://www.voxforge.org/home/forums/message-boards/general-discussion/ls-cannot-access-/tmp/bin.linux-no-such-file-or-directory-on-ubuntu-18.04 I am attempting to install HTK on my Ubuntu 18.04 machine. I have managed to successfully run http://www.voxforge.org/home/forums/message-boards/general-discussion/ls-cannot-access-/tmp/bin.linux-no-such-file-or-directory-on-ubuntu-18.04 Thu, 22 Aug 2019 15:03:10 -0500 Julius now using 3-Clause BSD License http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-now-using-3-clause-bsd-license From the Julius project page: http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-now-using-3-clause-bsd-license Fri, 31 May 2019 08:58:06 -0500 adding hungarian http://www.voxforge.org/home/forums/message-boards/general-discussion/adding-hungarian I see Hungarian is not listed, in my question is, how would I go about setting a new language for collecting speach? I can't see any explanation of this on the website anywhere. http://www.voxforge.org/home/forums/message-boards/general-discussion/adding-hungarian Sun, 31 Mar 2019 15:57:14 -0500 Speaker recognition recout.mlf file http://www.voxforge.org/home/forums/message-boards/general-discussion/speaker-recognition-recout.mlf-file hello, now i'm trying to use HTK as speaker recognition, i'm only know the alphabets it's for "label". does any one know the number in recout.mlf files regarding to what? Thanks for your help http://www.voxforge.org/home/forums/message-boards/general-discussion/speaker-recognition-recout.mlf-file Wed, 05 Sep 2018 03:57:29 -0500 problem to upload using the jnlp file http://www.voxforge.org/home/forums/message-boards/general-discussion/problem-to-upload-using-the-jnlp-file Hi, i'm trying to upload speeches using the jnlp file, for spanish model. I record all sentencens, clic on "subir" (upload) and progress bar only moves a few at the beginning. I have been waiting for a long time and ... nothing. Is there any log or similar that i can view? http://www.voxforge.org/home/forums/message-boards/general-discussion/problem-to-upload-using-the-jnlp-file Mon, 11 Jun 2018 08:11:45 -0500 Is there a possibility about hmm in htk http://www.voxforge.org/home/forums/message-boards/general-discussion/is-there-a-possibility-about-hmm-in-htk i want to match a voice to compare with a Concrete hmm in htk to find out how much they far away with each other. http://www.voxforge.org/home/forums/message-boards/general-discussion/is-there-a-possibility-about-hmm-in-htk Sun, 13 May 2018 00:14:20 -0500 Mozilla Deep Speech corpus for Julius http://www.voxforge.org/home/forums/message-boards/general-discussion/mozilla-deep-speech-corpus-for-julius Hello http://www.voxforge.org/home/forums/message-boards/general-discussion/mozilla-deep-speech-corpus-for-julius Wed, 03 Jan 2018 17:07:28 -0600 how find Filter bank frequencies http://www.voxforge.org/home/forums/message-boards/general-discussion/-how-find-filter-bank-frequencies HI http://www.voxforge.org/home/forums/message-boards/general-discussion/-how-find-filter-bank-frequencies Wed, 20 Sep 2017 07:39:35 -0500 Error [+2121] HInit: Too Few Observation Sequences[0] http://www.voxforge.org/home/forums/message-boards/general-discussion/error-2121-hinit-too-few-observation-sequences0 When I run this commnad: http://www.voxforge.org/home/forums/message-boards/general-discussion/error-2121-hinit-too-few-observation-sequences0 Wed, 09 Aug 2017 14:07:52 -0500 graphic simulation for HTK http://www.voxforge.org/home/forums/message-boards/general-discussion/graphic-simulation-for-htk HI there! I have made a speech recognition system using HTK with so good results, but now I need to simulate this system. Im working with ubuntuStudio, so as I read LabView isnt a good option. I have read about using GTK+ with C or perl but I dont know if its posible. http://www.voxforge.org/home/forums/message-boards/general-discussion/graphic-simulation-for-htk Sat, 20 May 2017 05:42:46 -0500 Fillers and silence http://www.voxforge.org/home/forums/message-boards/general-discussion/fillers-and-silence Hi guys! http://www.voxforge.org/home/forums/message-boards/general-discussion/fillers-and-silence Thu, 27 Apr 2017 05:39:36 -0500 Registering at Voxfoge http://www.voxforge.org/home/forums/message-boards/general-discussion/registering-at-voxfoge http://www.voxforge.org/home/forums/message-boards/general-discussion/registering-at-voxfoge Mon, 17 Apr 2017 01:54:25 -0500 Defining fillers http://www.voxforge.org/home/forums/message-boards/general-discussion/defining-fillers Hi! http://www.voxforge.org/home/forums/message-boards/general-discussion/defining-fillers Fri, 24 Mar 2017 06:50:35 -0500 where does HTK save outputs? http://www.voxforge.org/home/forums/message-boards/general-discussion/where-does-htk-save-outputs Hi there! http://www.voxforge.org/home/forums/message-boards/general-discussion/where-does-htk-save-outputs Mon, 20 Mar 2017 06:59:12 -0500 untitled http://www.voxforge.org/home/forums/message-boards/general-discussion/untitled2 My friend told me that he saw article about person who is working on Phone version , I can't remeber the name of the software. Its not the Vox Forge but similar http://www.voxforge.org/home/forums/message-boards/general-discussion/untitled2 Fri, 17 Mar 2017 02:14:44 -0500 HInit error http://www.voxforge.org/home/forums/message-boards/general-discussion/hinit-error3 HInit -A -D -T 1 -S trainlist.txt -M model/hmm0 -H model/proto/hmm_one1 -l one1 -L data/train/lab one1 http://www.voxforge.org/home/forums/message-boards/general-discussion/hinit-error3 Fri, 03 Feb 2017 07:52:19 -0600 VoxForge for Arduino Recognizer http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-for-arduino-recognizer Hi all, http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-for-arduino-recognizer Mon, 02 Jan 2017 16:41:47 -0600 For fun: speaking with Siri in native Bahamian English http://www.voxforge.org/home/forums/message-boards/general-discussion/for-fun-speaking-with-siri-in-native-bahamian-english https://www.youtube.com/watch?v=2yf1dp0Jz_o http://www.voxforge.org/home/forums/message-boards/general-discussion/for-fun-speaking-with-siri-in-native-bahamian-english Sun, 25 Dec 2016 03:40:22 -0600 Real Time Speech Recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/real-time-speech-recognition Hello, http://www.voxforge.org/home/forums/message-boards/general-discussion/real-time-speech-recognition Tue, 20 Dec 2016 13:46:00 -0600 Nothing happens when running JULIUS http://www.voxforge.org/home/forums/message-boards/general-discussion/nothing-happens-when-running-julius Hi! http://www.voxforge.org/home/forums/message-boards/general-discussion/nothing-happens-when-running-julius Mon, 19 Dec 2016 12:14:55 -0600 HVite ERROR [+6006] http://www.voxforge.org/home/forums/message-boards/general-discussion/-hvite-error-6006 Hi guys, Im trying to run the recogniser live, and Im using the following command: http://www.voxforge.org/home/forums/message-boards/general-discussion/-hvite-error-6006 Mon, 28 Nov 2016 06:01:09 -0600 HTK LINUX http://www.voxforge.org/home/forums/message-boards/general-discussion/htk-linux Hi guys, I have recently installed Linux on a virtual machine and im trying to install HTK on it. Im getting several errors while compilling to get the executables. I only need the executables for my proyect. So, can someone pass me this executables on a comprimed file? it will help me a lot. http://www.voxforge.org/home/forums/message-boards/general-discussion/htk-linux Tue, 15 Nov 2016 10:45:29 -0600 Recording with audacity http://www.voxforge.org/home/forums/message-boards/general-discussion/recording-with-audacity Hi: http://www.voxforge.org/home/forums/message-boards/general-discussion/recording-with-audacity Wed, 28 Sep 2016 13:54:46 -0500 Polish speech models for Julius http://www.voxforge.org/home/forums/message-boards/general-discussion/polish-speech-models-for-julius2 Hello http://www.voxforge.org/home/forums/message-boards/general-discussion/polish-speech-models-for-julius2 Mon, 26 Sep 2016 17:10:10 -0500 Polish speech models for Julius http://www.voxforge.org/home/forums/message-boards/general-discussion/polish-speech-models-for-julius Hello http://www.voxforge.org/home/forums/message-boards/general-discussion/polish-speech-models-for-julius Mon, 26 Sep 2016 17:09:45 -0500 What is the stopping criteria to end HMM training in HTK? http://www.voxforge.org/home/forums/message-boards/general-discussion/what-is-the-stopping-criteria-to-end-hmm-training-in-htk What is the stopping criteria to end HMM training in HTK? http://www.voxforge.org/home/forums/message-boards/general-discussion/what-is-the-stopping-criteria-to-end-hmm-training-in-htk Thu, 04 Aug 2016 00:32:50 -0500 How to decide on development set for phone recognition? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-decide-on-development-set-for-phone-recognition Is it development set or validation set or cross-validation set, all are same? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-decide-on-development-set-for-phone-recognition Sun, 31 Jul 2016 14:02:53 -0500 gender conversion http://www.voxforge.org/home/forums/message-boards/general-discussion/gender-conversion Hi all http://www.voxforge.org/home/forums/message-boards/general-discussion/gender-conversion Tue, 05 Jul 2016 06:15:00 -0500 Error HHEd using windows 10 http://www.voxforge.org/home/forums/message-boards/general-discussion/error-hhed-using-windows-10 This is my command line: http://www.voxforge.org/home/forums/message-boards/general-discussion/error-hhed-using-windows-10 Tue, 28 Jun 2016 21:24:44 -0500 Error HVite using windows 10 http://www.voxforge.org/home/forums/message-boards/general-discussion/error-hvite-using-windows-10 "F:\htk-3.3-windows-binary\htk\HVite.exe" -H "E:\HMM_Baru\MFCC\hmm10\macros2" -H "E:\HMM_Baru\MFCC\hmm10\models" -S "E:\HMM_Baru\MFCC\Testing\baby_testing.scp" -1 * "E:\HMM_Baru\recout.mlf" -p 0.0 -s 5.0 "E:\HMM\Dictionary.txt" "E:\HMM_Baru\list_emosi.txt" http://www.voxforge.org/home/forums/message-boards/general-discussion/error-hvite-using-windows-10 Tue, 28 Jun 2016 01:43:49 -0500 Error HERest http://www.voxforge.org/home/forums/message-boards/general-discussion/error-herest ERROR [+7060] InitHMMSet: Expected newline after 3'th HMM http://www.voxforge.org/home/forums/message-boards/general-discussion/error-herest Sat, 25 Jun 2016 04:47:47 -0500 University of Edinburgh looking for voice quality feedback http://www.voxforge.org/home/forums/message-boards/general-discussion/university-of-edinburgh-looking-for-voice-quality-feedback The speech group at the university is running its annual ""Blizzard Challenge" to get feedback on progress in the quality of generated voices. From the festival-talk list: http://www.voxforge.org/home/forums/message-boards/general-discussion/university-of-edinburgh-looking-for-voice-quality-feedback Fri, 20 May 2016 07:21:23 -0500 Building model for Sphinx http://www.voxforge.org/home/forums/message-boards/general-discussion/building-model-for-sphinx Has anyone ever tried using the corpus to build a Sphinx model as described here? http://www.voxforge.org/home/forums/message-boards/general-discussion/building-model-for-sphinx Wed, 27 Apr 2016 19:13:13 -0500 Large Grammar for Julius? http://www.voxforge.org/home/forums/message-boards/general-discussion/large-grammar-for-julius Can someone point me to a large grammar for Julius built from the training samples collected by VoxForge? http://www.voxforge.org/home/forums/message-boards/general-discussion/large-grammar-for-julius Wed, 27 Apr 2016 14:46:08 -0500 Recent downloads http://www.voxforge.org/home/forums/message-boards/general-discussion/recent-downloads I was hoping to use voxforge models to improve my results with sphinx4 but downloads show sphinx links that are 6 years old. http://www.voxforge.org/home/forums/message-boards/general-discussion/recent-downloads Sat, 16 Apr 2016 03:34:04 -0500 Any ideas/solutions/helps on sound detection? http://www.voxforge.org/home/forums/message-boards/general-discussion/any-ideas/solutions/helps-on-sound-detection Hi everyone, http://www.voxforge.org/home/forums/message-boards/general-discussion/any-ideas/solutions/helps-on-sound-detection Thu, 24 Mar 2016 08:58:48 -0500 Mobile Application for VoxForge http://www.voxforge.org/home/forums/message-boards/general-discussion/mobile-application-for-voxforge Is there a way to record and submit directly from smartphone? http://www.voxforge.org/home/forums/message-boards/general-discussion/mobile-application-for-voxforge Tue, 15 Mar 2016 05:21:11 -0500 Cross benefits to Voxforge in Coursera language courses? http://www.voxforge.org/home/forums/message-boards/general-discussion/cross-benefits-to-voxforge-in-coursera-language-courses I see that Coursera is now offering a course in basic Korean https://www.coursera.org/learn/learn-korean. This has me wondering how Voxforge might benefit from and contribute to the learning by students interested in following such courses. http://www.voxforge.org/home/forums/message-boards/general-discussion/cross-benefits-to-voxforge-in-coursera-language-courses Wed, 10 Feb 2016 02:34:11 -0600 Vector_Formatter.exe http://www.voxforge.org/home/forums/message-boards/general-discussion/vector_formatter.exe What does Vector_Formatter.exe do? http://www.voxforge.org/home/forums/message-boards/general-discussion/vector_formatter.exe Sun, 31 Jan 2016 12:54:48 -0600 Julius socket server improvements - see github http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-socket-server-improvements---see-github For those who may not be aware, Julius development has moved to github at https://github.com/julius-speech/julius/ and the development is ongoing. I currently have an issue open regarding bugs and improvements to the socket server mode; please join in on the testing or discussion if you are so inclined. http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-socket-server-improvements---see-github Thu, 10 Dec 2015 04:09:16 -0600 Creating the prompts2wlist.jl file and details http://www.voxforge.org/home/forums/message-boards/general-discussion/creating-the-prompts2wlist.jl-file-and-details I have already created a prompts2wlist.jl file but when I attempt to run the command to create the wlist file I get command not found. http://www.voxforge.org/home/forums/message-boards/general-discussion/creating-the-prompts2wlist.jl-file-and-details Mon, 09 Nov 2015 19:40:23 -0600 phonetic coding of connected speech HTK http://www.voxforge.org/home/forums/message-boards/general-discussion/phonetic-coding-of-connected-speech-htk Hello to all; http://www.voxforge.org/home/forums/message-boards/general-discussion/phonetic-coding-of-connected-speech-htk Fri, 06 Nov 2015 13:19:17 -0600 Error attempting to install HTK http://www.voxforge.org/home/forums/message-boards/general-discussion/error-attempting-to-install-htk While starting to try Jasper, I've decided to go with Julius. I've gotten stuck installing on ArchLinux, where the makeHTK.sh comes in.. http://www.voxforge.org/home/forums/message-boards/general-discussion/error-attempting-to-install-htk Wed, 22 Jul 2015 01:34:44 -0500 error with HInit. http://www.voxforge.org/home/forums/message-boards/general-discussion/error-with-hinit2 http://www.voxforge.org/home/forums/message-boards/general-discussion/error-with-hinit2 Thu, 18 Jun 2015 13:47:40 -0500 Speech Recognition without Pronunciation Dictionary http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-recognition-without-pronunciation-dictionary In a paper entitled: Lexicon-Free Conversational Speech Recognition with Neural Networks by Maas, Xie, Jurafsky, and Ng, the authors describe a novel approach to creating acoustic models using the Kaldi speech toolkit without the use of a pronunciation dictionary: http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-recognition-without-pronunciation-dictionary Tue, 26 May 2015 11:24:28 -0500 new web speech recognition provider http://www.voxforge.org/home/forums/message-boards/general-discussion/new-web-speech-recognition-provider Hello Everyone, http://www.voxforge.org/home/forums/message-boards/general-discussion/new-web-speech-recognition-provider Mon, 18 May 2015 03:52:02 -0500 new speech corpus http://www.voxforge.org/home/forums/message-boards/general-discussion/new-speech-corpus Hi http://www.voxforge.org/home/forums/message-boards/general-discussion/new-speech-corpus Wed, 22 Apr 2015 06:28:03 -0500 Julius - Batch Mode http://www.voxforge.org/home/forums/message-boards/general-discussion/julius---batch-mode Hey Folks, http://www.voxforge.org/home/forums/message-boards/general-discussion/julius---batch-mode Tue, 13 Jan 2015 21:20:13 -0600 Somewhat off topic - very many ASR jobs and offer to build non-English ASR systems http://www.voxforge.org/home/forums/message-boards/general-discussion/somewhat-off-topic---very-many-asr-jobs-and-offer-to-build-non-english-asr-systems Ken - please pull this post if it is inappropriate, but as a long time contributer I may well be able to help the inhabitants of this forum out in a couple of ways: http://www.voxforge.org/home/forums/message-boards/general-discussion/somewhat-off-topic---very-many-asr-jobs-and-offer-to-build-non-english-asr-systems Sun, 11 Jan 2015 11:13:29 -0600 Voxforge Twitter account and Facebook Likes? http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-twitter-account-and-facebook-likes Hi, Would it be possible to show Twitter following, Facebook like, Reddit http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-twitter-account-and-facebook-likes Thu, 25 Dec 2014 13:41:49 -0600 Free/low cost service for speech to text http://www.voxforge.org/home/forums/message-boards/general-discussion/free/low-cost-service-for-speech-to-text http://www.voxforge.org/home/forums/message-boards/general-discussion/free/low-cost-service-for-speech-to-text Mon, 08 Dec 2014 09:12:49 -0600 Speech (Audio) to TEXT Query.. tool http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-audio-to-text-query..-tool All, http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-audio-to-text-query..-tool Mon, 08 Dec 2014 08:16:33 -0600 FLAT START MONOPHONES: Step#6 Issue ..!! http://www.voxforge.org/home/forums/message-boards/general-discussion/flat-start-monophones-step6-issue-.. All, http://www.voxforge.org/home/forums/message-boards/general-discussion/flat-start-monophones-step6-issue-.. Wed, 03 Dec 2014 03:34:37 -0600 Prompts File Creation: Step 2. http://www.voxforge.org/home/forums/message-boards/general-discussion/prompts-file-creation-step-2 You said - "HTK requires Pronunciation Dictionnary with at least 30-40 'sentences' http://www.voxforge.org/home/forums/message-boards/general-discussion/prompts-file-creation-step-2 Wed, 03 Dec 2014 03:31:33 -0600 G2P tool for hindi language http://www.voxforge.org/home/forums/message-boards/general-discussion/g2p-tool-for-hindi-language I tried to prepare a model using g2p tool and existing hindi lexicon. What I am observing is that the trained model is not accepting some hindi characters while applying the model for text. For example : à¤…à¤•à¥€à¤° is the hindi word which already exists in the hindi lexicon used for training. When I tried to generate phonemes for à¤…à¤•à¥€à¤° using the training model the g2p tool is skipping à¤… and giving phonemes for à¤•à¥€à¤°. It is the same for any new word starting with à¤…. How can I rectify this problem? http://www.voxforge.org/home/forums/message-boards/general-discussion/g2p-tool-for-hindi-language Wed, 12 Nov 2014 04:24:41 -0600 need some helps on ISOLATED WORD recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/need-some-helps-on-isolated-word-recognition Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/need-some-helps-on-isolated-word-recognition Sat, 23 Aug 2014 10:08:47 -0500 How to build the corpus from arpa file (Reverse Engineering) http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-build-the-corpus-from-arpa-file-reverse-engineering I had a arpa file from that I want to rebuild the corpus file which is used to build Language model (LM) Is there is any tools to do this reverse engineering ? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-build-the-corpus-from-arpa-file-reverse-engineering Tue, 12 Aug 2014 07:14:00 -0500 *How to write proto file when the features are not from MFCC http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-write-proto-file-when-the-features-are-not-from-mfcc I want to use HTK for action recognition. The data is from accelerometer, gyroscope and magnetometer (10 channels). I converted the data to HTK format using a C# program and could read the data correctly using HList. The proto file is as follow: http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-write-proto-file-when-the-features-are-not-from-mfcc Thu, 19 Jun 2014 06:38:35 -0500 Forced alignment did not change the transcription at all http://www.voxforge.org/home/forums/message-boards/general-discussion/forced-alignment-did-not-change-the-transcription-at-all Hi, All, http://www.voxforge.org/home/forums/message-boards/general-discussion/forced-alignment-did-not-change-the-transcription-at-all Tue, 20 May 2014 00:57:28 -0500 How Can I Help? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-can-i-help Hi, I'm very interested in helping this project in a long-term fashion. I have a background in both audio recording and open source software, and I am studying computational linguistics at school right now. I am quite excited to help this project grow but would like to know the most productive method for my involvement. http://www.voxforge.org/home/forums/message-boards/general-discussion/how-can-i-help Fri, 09 May 2014 14:13:25 -0500 http://www.voxforge.org/home/other doesn't work http://www.voxforge.org/home/forums/message-boards/general-discussion/http/www/home/other-doesnt-work Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/http/www/home/other-doesnt-work Sun, 20 Apr 2014 15:25:56 -0500 How to use HTK to speech recogntion ? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-use-htk-to-speech-recogntion- I'm a novice to this field. I'm trying to use HTK for recognize sinhala sounds. actually I'm trying to convert sounds into sinhala text for my final year project. http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-use-htk-to-speech-recogntion- Fri, 28 Mar 2014 11:54:47 -0500 Simon does not recognise anything! Ideas? http://www.voxforge.org/home/forums/message-boards/general-discussion/simon-does-not-recognise-anything-ideas Hi http://www.voxforge.org/home/forums/message-boards/general-discussion/simon-does-not-recognise-anything-ideas Tue, 25 Mar 2014 16:51:46 -0500 Upload of trainig data with Simon not possible http://www.voxforge.org/home/forums/message-boards/general-discussion/upload-of-trainig-data-with-simon-not-possible Hi http://www.voxforge.org/home/forums/message-boards/general-discussion/upload-of-trainig-data-with-simon-not-possible Tue, 25 Mar 2014 12:22:47 -0500 Aligning audio book with corresponding ebook http://www.voxforge.org/home/forums/message-boards/general-discussion/aligning-audio-book-with-corresponding-ebook Hello, I am looking for a way to do good quality forced alignment, considering OOV words. The idea is to be able to make synchronous playback of an ebook, that can contain words not part of the dictionary of the SR engine. I had a look at CMU Sphinx 4 which provides alignment functionality, but apparently OOV words are not supported. Any suggestions here? If possible Java-based. Thanks a lot in advance, Sébastien Druon http://www.voxforge.org/home/forums/message-boards/general-discussion/aligning-audio-book-with-corresponding-ebook Fri, 07 Mar 2014 09:13:35 -0600 Isolated word recognition training with HERest? http://www.voxforge.org/home/forums/message-boards/general-discussion/isolated-word-recognition-training-with-herest Hi, dear guys, http://www.voxforge.org/home/forums/message-boards/general-discussion/isolated-word-recognition-training-with-herest Sun, 16 Feb 2014 23:31:29 -0600 How to add Albanian language to this site http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-add-albanian-language-to-this-site How to add Albanian (Shqip) language on this website? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-add-albanian-language-to-this-site Mon, 27 Jan 2014 09:36:30 -0600 HTK Lbuild gives messy symbol output http://www.voxforge.org/home/forums/message-boards/general-discussion/htk-lbuild-gives-messy-symbol-output Hi, everyone, http://www.voxforge.org/home/forums/message-boards/general-discussion/htk-lbuild-gives-messy-symbol-output Sat, 14 Dec 2013 17:22:41 -0600 How to apply G2P model from Sequitur to create TTS http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-apply-g2p-model-from-sequitur-to-create-tts Dear all http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-apply-g2p-model-from-sequitur-to-create-tts Sun, 01 Dec 2013 20:56:52 -0600 slave_feat.pl creates an empty folder http://www.voxforge.org/home/forums/message-boards/general-discussion/slave_feat.pl-creates-an-empty-folder slave_feat.pl creates an empty folder http://www.voxforge.org/home/forums/message-boards/general-discussion/slave_feat.pl-creates-an-empty-folder Wed, 06 Nov 2013 05:06:04 -0600 Phoneme level Recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/phoneme-level-recognition Hi, I am trying to do phoneme level recognition and want to see the accuracy. But I am getting the following error when using HVite http://www.voxforge.org/home/forums/message-boards/general-discussion/phoneme-level-recognition Fri, 01 Nov 2013 02:30:42 -0500 Evaluation of various Speech Recognition Engines http://www.voxforge.org/home/forums/message-boards/general-discussion/evaluation-of-various-speech-recognition-engines I'm interested in performing a thorough evaluation of various speech recognition engines, however I currently don't have the means to do so. http://www.voxforge.org/home/forums/message-boards/general-discussion/evaluation-of-various-speech-recognition-engines Tue, 22 Oct 2013 19:58:30 -0500 Applet error: bad major version http://www.voxforge.org/home/forums/message-boards/general-discussion/applet-error-bad-major-version Hi http://www.voxforge.org/home/forums/message-boards/general-discussion/applet-error-bad-major-version Sat, 14 Sep 2013 13:14:16 -0500 submitting recordings: Illegal URL redirect http://www.voxforge.org/home/forums/message-boards/general-discussion/submitting-recordings-illegal-url-redirect I'm getting an "Illgal URL redirect" when running the recorder from http://www.voxforge.org/home/read. I found a suggestion for a French speaker that they use http://read.voxforge1.org/r0_1_10/SubmitSpeechWebGUI-FR.php instead. http://www.voxforge.org/home/forums/message-boards/general-discussion/submitting-recordings-illegal-url-redirect Fri, 23 Aug 2013 20:35:50 -0500 speaker recognition task http://www.voxforge.org/home/forums/message-boards/general-discussion/speaker-recognition-task Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/speaker-recognition-task Thu, 22 Aug 2013 16:47:59 -0500 problems submitting with Simon http://www.voxforge.org/home/forums/message-boards/general-discussion/problems-submitting-with-simon I am unable to provide samples to the open source community via the Simon uploader. When I click on "Contribue Samples" It gives me a timeout error (after only like 2 seconds) saying it could not connect to voxforge. http://www.voxforge.org/home/forums/message-boards/general-discussion/problems-submitting-with-simon Mon, 19 Aug 2013 15:34:32 -0500 How to compute ROC using HREsults? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-compute-roc-using-hresults Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-compute-roc-using-hresults Tue, 23 Jul 2013 13:49:30 -0500 MFCC generation http://www.voxforge.org/home/forums/message-boards/general-discussion/mfcc-generation How and where in Hcopy, MFCCs are generated..? http://www.voxforge.org/home/forums/message-boards/general-discussion/mfcc-generation Sun, 21 Jul 2013 15:32:37 -0500 Contribute smaller sections on Voxforge - "Some text in view has voice data/images/gifs/videos/examples/diagrams/extra detail/requests for additions associated with it. Highlight text that has them? See them? Fulfill requests for points/bounties?" http://www.voxforge.org/home/forums/message-boards/general-discussion/contribute-smaller-sections-on-voxforge---some-text-in-view-has-voice-data/images/gifs/videos/examples/diagrams/extra-detail/requests-for-additions-associated-with-it.-highli http://www.voxforge.org/home/forums/message-boards/general-discussion/contribute-smaller-sections-on-voxforge---some-text-in-view-has-voice-data/images/gifs/videos/examples/diagrams/extra-detail/requests-for-additions-associated-with-it.-highli Mon, 17 Jun 2013 11:01:34 -0500 Australian English pronunciation dictionary http://www.voxforge.org/home/forums/message-boards/general-discussion/australian-english-pronunciation-dictionary Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/australian-english-pronunciation-dictionary Tue, 04 Jun 2013 07:04:53 -0500 grammar file for text independent speaker recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/grammar-file-for-text-independent-speaker-recognition What is grammar file for text independent speaker recognition?? http://www.voxforge.org/home/forums/message-boards/general-discussion/grammar-file-for-text-independent-speaker-recognition Mon, 27 May 2013 01:52:05 -0500 Error in my mkdfa run.... http://www.voxforge.org/home/forums/message-boards/general-discussion/error-in-my-mkdfa-run... I'm running in windows vista... http://www.voxforge.org/home/forums/message-boards/general-discussion/error-in-my-mkdfa-run... Fri, 24 May 2013 00:00:39 -0500 HMMIRest lattice read in problem http://www.voxforge.org/home/forums/message-boards/general-discussion/hmmirest-lattice-read-in-problem Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/hmmirest-lattice-read-in-problem Thu, 25 Apr 2013 10:22:03 -0500 how to properly reference voxforge ? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-properly-reference-voxforge- Dear all, http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-properly-reference-voxforge- Thu, 07 Mar 2013 12:51:08 -0600 A source of wave files... http://www.voxforge.org/home/forums/message-boards/general-discussion/a-source-of-wave-files.. Wave files are natively on store bought cds but they can be pricey. http://www.voxforge.org/home/forums/message-boards/general-discussion/a-source-of-wave-files.. Wed, 20 Feb 2013 11:08:15 -0600 Is anyone familiar with Hidden Markov Models and their application to speech recognition? http://www.voxforge.org/home/forums/message-boards/general-discussion/is-anyone-familiar-with-hidden-markov-models-and-their-application-to-speech-recognition I have been curious about the algorithms used in speech recognition. I have read about them and how Hidden Markov Models are relivant to speech regocnition algorithms. However I find my understading of what a Hidden Markov Model actually is to be incomplete. http://www.voxforge.org/home/forums/message-boards/general-discussion/is-anyone-familiar-with-hidden-markov-models-and-their-application-to-speech-recognition Tue, 19 Feb 2013 08:21:22 -0600 MOOC on Natural Language Processing http://www.voxforge.org/home/forums/message-boards/general-discussion/mooc-on-natural-language-processing FYI the course on Coursera.org given by Michael Collins of Columbia University has just opened a few days early for those eager to get started. http://www.voxforge.org/home/forums/message-boards/general-discussion/mooc-on-natural-language-processing Tue, 12 Feb 2013 14:21:01 -0600 Audio Input not Supported (Error 6306) http://www.voxforge.org/home/forums/message-boards/general-discussion/audio-input-not-supported-error-6306 I created an Australian English Acoustic Model based on the ANDOSL corpus with the already compiled HTK 3.3 components. The recognition with HVite.exe works brilliant. http://www.voxforge.org/home/forums/message-boards/general-discussion/audio-input-not-supported-error-6306 Mon, 14 Jan 2013 22:23:27 -0600 Out-of-the-box dictation http://www.voxforge.org/home/forums/message-boards/general-discussion/out-of-the-box-dictation Hello Everybody http://www.voxforge.org/home/forums/message-boards/general-discussion/out-of-the-box-dictation Thu, 22 Nov 2012 10:33:33 -0600 quickstart failure http://www.voxforge.org/home/forums/message-boards/general-discussion/quickstart-failure I am using ubuntu 12.04. I have tried a couple of methods to get Julius working on this computer and am a bit frustrated. Resorting to the quickstart download I get this....... http://www.voxforge.org/home/forums/message-boards/general-discussion/quickstart-failure Sun, 18 Nov 2012 21:06:15 -0600 Building Mandarin acoustic models http://www.voxforge.org/home/forums/message-boards/general-discussion/building-mandarin-acoustic-models Hello! http://www.voxforge.org/home/forums/message-boards/general-discussion/building-mandarin-acoustic-models Sun, 04 Nov 2012 20:10:51 -0600 stop reading http://www.voxforge.org/home/forums/message-boards/general-discussion/stop-reading 1) the problem that i have always had with speach to text systems that i have tryed in the passt is that i speak creatively far more exspressivly than i read. http://www.voxforge.org/home/forums/message-boards/general-discussion/stop-reading Sun, 21 Oct 2012 08:08:09 -0500 How to make Julius_AcousticModels_16kHz-16bit_MFCC_O_D work with Julius QuickStart? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-make-julius_acousticmodels_16khz-16bit_mfcc_o_d-work-with-julius-quickstart Hi! http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-make-julius_acousticmodels_16khz-16bit_mfcc_o_d-work-with-julius-quickstart Thu, 04 Oct 2012 04:21:52 -0500 audio-visual problem http://www.voxforge.org/home/forums/message-boards/general-discussion/audio-visual-problem Dears I http://www.voxforge.org/home/forums/message-boards/general-discussion/audio-visual-problem Mon, 01 Oct 2012 13:47:26 -0500 How to add new language for audio submissions? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-add-new-language-for-audio-submissions Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-add-new-language-for-audio-submissions Mon, 03 Sep 2012 05:38:15 -0500 Coursera.org - Machine Learning http://www.voxforge.org/home/forums/message-boards/general-discussion/coursera.org---machine-learning Just wondering if anyone else here is doing Andrew Ng's Machine learning offering at coursera.org? Very interesting so far and could provide a lot of insights into voice analysis techniques. http://www.voxforge.org/home/forums/message-boards/general-discussion/coursera.org---machine-learning Sun, 26 Aug 2012 06:47:02 -0500 question about building dictation system using language model http://www.voxforge.org/home/forums/message-boards/general-discussion/question-about-building-dictation-system-using-language-model hello, guys. http://www.voxforge.org/home/forums/message-boards/general-discussion/question-about-building-dictation-system-using-language-model Fri, 24 Aug 2012 12:03:34 -0500 extracting features after forced alignment http://www.voxforge.org/home/forums/message-boards/general-discussion/extracting-features-after-forced-alignment http://www.voxforge.org/home/forums/message-boards/general-discussion/extracting-features-after-forced-alignment Fri, 24 Aug 2012 06:53:40 -0500 mp3 player with voice recording http://www.voxforge.org/home/forums/message-boards/general-discussion/mp3-player-with-voice-recording I have a mp3 player with voice recording (16k, stereo, ADPCM) http://www.voxforge.org/home/forums/message-boards/general-discussion/mp3-player-with-voice-recording Fri, 27 Jul 2012 19:05:24 -0500 need help in aligning lists of wav files http://www.voxforge.org/home/forums/message-boards/general-discussion/need-help-in-aligning-lists-of-wav-files hi everyone im a beginner in htk and i need some help about force aligning large numbers of wav files.. i was able to force align wav files one by one just like in the voxforge tutorial (Automated Audio Segmentation Using Forced Alignment) my methods are almost the same... so the problem is that i want to segment all of each wav files in the cmu_us_slt_arctic but the problems is that it has 1 thousand wav files and aligning each off them takes time. is there anyway to script it or something. help is greatly appreciated. http://www.voxforge.org/home/forums/message-boards/general-discussion/need-help-in-aligning-lists-of-wav-files Tue, 10 Jul 2012 08:49:39 -0500 Forced Alignment and Features http://www.voxforge.org/home/forums/message-boards/general-discussion/forced-alignment-and-features Once a wav file has been force aligned using HVite,how can one extract the features of only the phonetic segments and not the entire .wav file? Is this doable in HTK? http://www.voxforge.org/home/forums/message-boards/general-discussion/forced-alignment-and-features Mon, 09 Jul 2012 22:39:08 -0500 Librivox contributions and dates/numbers http://www.voxforge.org/home/forums/message-boards/general-discussion/librivox-contributions-and-dates/numbers In reviewing a possible audio file I came across a lot of dates in one section, 1800, 1839 and so on. http://www.voxforge.org/home/forums/message-boards/general-discussion/librivox-contributions-and-dates/numbers Fri, 18 May 2012 10:55:26 -0500 Native speaker only? http://www.voxforge.org/home/forums/message-boards/general-discussion/native-speaker-only Hi! http://www.voxforge.org/home/forums/message-boards/general-discussion/native-speaker-only Wed, 09 May 2012 07:19:59 -0500 How's the project doing? http://www.voxforge.org/home/forums/message-boards/general-discussion/hows-the-project-doing First and foremost I wish to thank the VoxForge project. As my body is showing increase signs of age and RSI, I've been more and more curious about alternative input methods. http://www.voxforge.org/home/forums/message-boards/general-discussion/hows-the-project-doing Sun, 15 Apr 2012 15:17:55 -0500 End-Point Detection Algorithm in HTK http://www.voxforge.org/home/forums/message-boards/general-discussion/end-point-detection-algorithm-in-htk Hello there, http://www.voxforge.org/home/forums/message-boards/general-discussion/end-point-detection-algorithm-in-htk Wed, 04 Apr 2012 17:14:57 -0500 Alignement using GIZA++ http://www.voxforge.org/home/forums/message-boards/general-discussion/alignement-using-giza Good morning everybody , http://www.voxforge.org/home/forums/message-boards/general-discussion/alignement-using-giza Fri, 30 Mar 2012 04:08:59 -0500 User Dependant Countinous Dictation [m i right?] http://www.voxforge.org/home/forums/message-boards/general-discussion/user-dependant-countinous-dictation-m-i-right I want ask if what i'm saying is it right or wrong I searched throughly for 8 to 9 hours and came to this conclusion: http://www.voxforge.org/home/forums/message-boards/general-discussion/user-dependant-countinous-dictation-m-i-right Wed, 28 Mar 2012 19:26:23 -0500 text parser http://www.voxforge.org/home/forums/message-boards/general-discussion/text-parser I am looking for a code that can parse a text given a JSGF grammar, supplying the action tags that appear in the gramamr. http://www.voxforge.org/home/forums/message-boards/general-discussion/text-parser Tue, 06 Mar 2012 02:29:03 -0600 How to get output from "cygwin results" http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-get-output-from-cygwin-results We are working on a project. We want to control a robot by verbal commands. http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-get-output-from-cygwin-results Fri, 24 Feb 2012 12:07:26 -0600 julius 4.2.1. troubles with the Sample.jconf-file http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-4.2.1.-troubles-with-the-sample.jconf-file hi, i have many troubles to run julius 4.2.1 on 64bit Ubuntu 10.10 - install ok - running julius under julius 4.2.1 to run: ./julius -input mic -C Sample_1.jconf STAT: include config: Sample_1.jconf ERROR: m_chkparam: you should specify at least one LM to run Julius! on my *.jconf-file i have changed the line -LM lm_1 have somebody a simple jconf-file for me only for a test: speak on the mic: "PHONE STEVE" thanks for help, hints and tricks regards nomad my Sample_1.jconf (only lines without # ) ----------------------------------------------- ###################################################################### #### GLOBAL OPTIONS ###################################################################### #### Misc. Options #### Audio Input # my change -input mic # live microphone #### #### Speech segment detection by level and zero-cross #### default: on for microphone, off for other sources -cutsilence # detection on #### Gaussian Mixture Model #### Decoding option -realtime # force real-time processing #-norealtime # force non real-time processing : #### #### -AM am_1 -AM am_2 #### -LM lm_1 (LM spec..) #### -LM lm_2 (same LM spec..) #### -SR search1 am_1 lm_1 #### -SR search2 am_2 lm_2 #### ## Create a new AM configuration set, and switch current to it. ## You should give a unique name. #my change -AM name -AM am_1 ## Create a new LM configuration set, and switch current to it. ## You should give a unique name. #my change -LM name -LM lm_1 ## Create a new Search configuration set with AM and LM, and switch ## current to it. AM and LM name can be either name or ID number. # my change #-SR name am_name_or_id lm_name_or_id #-SR sr_1 am_1 lm_1 ## Switch current AM to special one reserved for GMM, to specify ## analysis parameter for GMM. Be sure not to confuse with normal AM ## configuration. # -AM_GMM ## When using instance declarations, global options should be placed ## at top before any instance declaration, or after this option below. ## This option is only a switcher and can be used anywhere anytime. -GLOBAL ## This option disables the strict section checkings and back to 4.0 -nosectioncheck ###################################################################### #### LANGUAGE MODEL (-LM) ###################################################################### #### ACOUSTIC MODEL (-AM) (-AM_GMM) ###################################################################### #### Acoustic analysis parameters are included in this section, since #### the AM defines the required parameter. You can use different MFCC #### type for each AM. #### For GMM, the same parameter should be specified after "-AM_GMM" #### #### When using multiple AM, the values of "-smpPeriod", "-smpFreq", #### "-fsize" and "-fshift" should be the same among all AM. #### ## Acoustic model #-gprune {safe|heuristic|beam|none|default} # Gaussian pruning method -gprune safe #-iwcd1 {max|avg|best 3} # Inter-word triphone approximation method -iwcd1 max #-iwsppenalty -1.0 # pause insertion penalty for "-iwsp" #-gshmm hmmfile # HMM for Gaussian mixture selection #-gsnum 24 # Threshold number of HMM for gshmm ## Analysis #-smpPeriod 625 # sampling period (ns) (= 10000000 / smpFreq) # my change (ev.48000 -smpFreq 16000 # sampling rate (Hz) #-fsize 400 # window size (samples) #-fshift 160 # frame shift (samples) #-preemph 0.97 # pre-emphasis coef. #-fbank 24 # number of filterbank channels #-ceplif 22 # cepstral liftering coef. #-rawe # use raw energy #-norawe # disable "-rawe" (this is default) #-enormal # normalize log energy #-noenormal # disable "-enormal" (this is default) #-escale 1.0 # scaling log energy for enormal #-silfloor 50.0 # energy silence floor in dB for enormal #-delwin 2 # delta window (frames) #-accwin 2 # acceleration window (frames) #-hifreq -1 # cut-off hi frequency (Hz) (-1: disable) #-lofreq -1 # cut-off low frequency (Hz) (-1: disable) #-zmeanframe # frame-wise DC offset removal (same as HTK) #-nozmeanframe # disable "-zmeanframe" (this is default) #################################################################### #### RECOGNIZER (-SR) ###################################################################### #### #### Default values for beam width and LM weights will change #### according to compile-time setup of JuliusLib and model specification. #### Please see the startup log for the actual values. #### #### #### parameter (common) #### #-inactive # start this process with inactive status #-1pass # perform only the 1st pass, omit 2nd pass #-no_ccd # switch off the phone context dependency #-force_ccd # force on the phone context dependency #-cmalpha 0.05 # CM alpha value #-iwsp # append a skippable sp at all word ends #-transp 0.0 # transition penalty for transparent words #### #### parameter (1st pass) #### #-lmp weight penalty # LM weight and word insertion penalty (pass1) #my change -penalty1 penalty -penalty1 5.0 #-penalty1 penalty # word insertion penalty for grammar (pass1) #-b width # beam width (# of nodes) #-bs score # beam width (score) #-nlimit 3 # with enable-wpair-nlimit, set max N at nodes #-progout # progressive output while decoding #-proginterval 300 # output interval in msec for "-progout" #### #### parameter (2nd pass) #### #-lmp2 weight penalty # LM weight and word insertion penalty (pass2) #my change #-penalty2 penalty # word insertion penalty for grammar (pass2) -penalty2 20.0 #my change -b2 width -b2 200 # envelope beam width of 2nd pass (#word) #my change -sb 80.0 ev. 200.0 -sb 80.0 # envelope score width at 2nd pass #-s 500 # hypotheses stack size on 2nd pass (#hypo) #-m 2000 # hypotheses overflow threshold (#hypo) #-n n # num of sentences to find #-output 1 # num of sentences to output as result #-lookuprange 5 # hypo. lookup range at word expansion (#frame) #-looktrellis # expand only trellis words in grammar #-fallback1pass # output 1st pass result when 2nd pass fails ################################################################# end of file http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-4.2.1.-troubles-with-the-sample.jconf-file Tue, 07 Feb 2012 06:02:27 -0600 how to download audio files http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-download-audio-files Hi. What is the preferred method of downloading the audio files? Sure, a person could manually click on each link but that takes forever. Is there an FTP site? Thanks. http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-download-audio-files Mon, 06 Feb 2012 19:59:37 -0600 no .dfa or .dict file generated please help, all files installed as .dSYM http://www.voxforge.org/home/forums/message-boards/general-discussion/no-.dfa-or-.dict-file-generated-please-help-all-files-installed-as-.dsym I Downloaded Julius-4.2.1 on Mac OS 10.6.8 http://www.voxforge.org/home/forums/message-boards/general-discussion/no-.dfa-or-.dict-file-generated-please-help-all-files-installed-as-.dsym Fri, 20 Jan 2012 10:45:28 -0600 Using CMU Commands http://www.voxforge.org/home/forums/message-boards/general-discussion/using-cmu-commands Good Morning every body http://www.voxforge.org/home/forums/message-boards/general-discussion/using-cmu-commands Tue, 20 Dec 2011 03:39:22 -0600 How to add new language(Ukrainian) to this site http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-add-new-languageukrainian-to-this-site How to add ukrainian (ÑƒÐºÑ€Ð°Ñ—Ð½ÑÑŒÐºÐ°) on this website? I wanna help. I'm native speaker. http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-add-new-languageukrainian-to-this-site Tue, 13 Dec 2011 18:29:52 -0600 Hvite + Hresults http://www.voxforge.org/home/forums/message-boards/general-discussion/hvite--hresults below was the file generated by hvite(named results): http://www.voxforge.org/home/forums/message-boards/general-discussion/hvite--hresults Tue, 06 Dec 2011 07:28:28 -0600 HTK on gait recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/htk-on-gait-recognition Hi, i was using HTK for gait recognition and i having some problem with hvite and herest. http://www.voxforge.org/home/forums/message-boards/general-discussion/htk-on-gait-recognition Mon, 05 Dec 2011 08:07:04 -0600 Japanese Speech Corpus http://www.voxforge.org/home/forums/message-boards/general-discussion/japanese-speech-corpus Hi All, http://www.voxforge.org/home/forums/message-boards/general-discussion/japanese-speech-corpus Mon, 28 Nov 2011 19:38:06 -0600 How to install cygwin http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-install-cygwin Hello everybody, http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-install-cygwin Fri, 11 Nov 2011 12:26:40 -0600 Help with sphinx4 config http://www.voxforge.org/home/forums/message-boards/general-discussion/help-with-sphinx4-config Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/help-with-sphinx4-config Thu, 10 Nov 2011 06:49:35 -0600 Video transcription? http://www.voxforge.org/home/forums/message-boards/general-discussion/video-transcription I need to transcribe the conversation from some videos. Its in .mov format. Is this possible? http://www.voxforge.org/home/forums/message-boards/general-discussion/video-transcription Wed, 09 Nov 2011 07:42:50 -0600 Speech recognition / acoustic analysis - Consulting http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-recognition-/-acoustic-analysis---consulting Hi folks, The company I work for may soon engage in a project where some level of speech recognition or acoustic analysis might be necessary. We are seeking for advice from someone who has practical experience in that field to: - Point us out the *open source* software/library that would best suit our requirements. - Provide us (through a Skype session) an overview of how it works and how the development process would look like. Background: - The application aims to teach users how to pronounce Te Reo Maori words. It will play a sample utterance of a single word and record the user pronouncing it back. It must compare the user's utterance against the sample pronunciation and provide a score indicating how close/good that was. Assumptions and constraints: - We've only got one sample pronunciation of each word (male and female versions) available at the moment. We don't envisage investing more time recording more pronunciations. - Only open source software and libraries are acceptable. - The application will only deal with isolated utterances and must be speaker independent. We don't envisage having a training process in the application. - Comparison could be done either through voice recognition algorithms or acoustic analysis (i.e. formants, frequency, pitch, etc.), as long as the score provided is good/realistic. We appreciate expertise in the area and are able to contribute financially, to acknowledge the time involved. If you think you can help us and is interested in doing it, please get in touch with me at teolupus.ext [at] gmail.com. If not, but you know someone who might be the right person to support us, could you please send this message through? Kind regards, Bruno http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-recognition-/-acoustic-analysis---consulting Tue, 04 Oct 2011 04:10:30 -0500 Running Julius on 64-bit Ubuntu 10.04 http://www.voxforge.org/home/forums/message-boards/general-discussion/running-julius-on-64-bit-ubuntu-10.04 Is there any trick to getting the Linux quickstart to work on a 64-bit laptop running Ubuntu 10.04? I confirmed my laptop's mic works by recording speech with Audacity, but when I the README's suggesting "./julian -input mic -C julian.jconf", it seems to initialize, and I get the prompt: http://www.voxforge.org/home/forums/message-boards/general-discussion/running-julius-on-64-bit-ubuntu-10.04 Thu, 29 Sep 2011 21:20:06 -0500 Why simon can not be activate at all? http://www.voxforge.org/home/forums/message-boards/general-discussion/why-simon-can-not-be-activate-at-all Hi all, I'm using linux ubuntu 11.04. I already installed simon 0.3 (simon_0.3.0-1ubuntu8_i386.deb) I worked "by the book" I installed all relevant packages, HTK, voxforge. the bottom line: Simon can not be activate! The reason - when I try to activate: I quote the popup: "Could not start recognition because the system reports that the recognition is not ready. Please check if you have defined a vocabulary, an appropriate grammar and recorded a few trainings samples. The system will then, upon synchronization, generate the model which will be used for the recognition." http://www.voxforge.org/home/forums/message-boards/general-discussion/why-simon-can-not-be-activate-at-all Mon, 05 Sep 2011 09:10:55 -0500 HInit error http://www.voxforge.org/home/forums/message-boards/general-discussion/hinit-error2 I have been trying to use HInit to train the HMM prototypes for a http://www.voxforge.org/home/forums/message-boards/general-discussion/hinit-error2 Sun, 04 Sep 2011 23:26:07 -0500 How could julius process long rawfile input and how to extract the text result http://www.voxforge.org/home/forums/message-boards/general-discussion/how-could-julius-process-long-rawfile-input-and-how-to-extract-the-text-result Is there any code already developed for these issues? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-could-julius-process-long-rawfile-input-and-how-to-extract-the-text-result Tue, 23 Aug 2011 11:07:39 -0500 Where can I find the full transcription of Audio files http://www.voxforge.org/home/forums/message-boards/general-discussion/where-can-i-find-the-full-transcription-of-audio-files Hi guys, http://www.voxforge.org/home/forums/message-boards/general-discussion/where-can-i-find-the-full-transcription-of-audio-files Mon, 08 Aug 2011 11:10:17 -0500 I can not run julius-3.5.2-quickstart-linux http://www.voxforge.org/home/forums/message-boards/general-discussion/i-can-not-run-julius-3.5.2-quickstart-linux Hi there, http://www.voxforge.org/home/forums/message-boards/general-discussion/i-can-not-run-julius-3.5.2-quickstart-linux Sat, 06 Aug 2011 02:18:07 -0500 Festival lex.lookup http://www.voxforge.org/home/forums/message-boards/general-discussion/festival-lex.lookup I note that in the output from a request to lex.lookup in Festival that the response for "internet" (as per the example in http://www.voxforge.org/home/dev/autoaudioseg/step-2) is http://www.voxforge.org/home/forums/message-boards/general-discussion/festival-lex.lookup Fri, 05 Aug 2011 10:02:10 -0500 playing a segment of a speech file http://www.voxforge.org/home/forums/message-boards/general-discussion/playing-a-segment-of-a-speech-file Hello everyone, http://www.voxforge.org/home/forums/message-boards/general-discussion/playing-a-segment-of-a-speech-file Sun, 10 Jul 2011 16:05:13 -0500 Error [+2221] when running HRest http://www.voxforge.org/home/forums/message-boards/general-discussion/error-2221-when-running-hrest I have been trying to follow up on a tutorial to build a YES/NO Recognition System. http://www.voxforge.org/home/forums/message-boards/general-discussion/error-2221-when-running-hrest Tue, 05 Jul 2011 13:07:55 -0500 HInit error http://www.voxforge.org/home/forums/message-boards/general-discussion/hinit-error http://www.voxforge.org/home/forums/message-boards/general-discussion/hinit-error Thu, 30 Jun 2011 10:24:26 -0500 SRILM bigram Versus HTK bigram http://www.voxforge.org/home/forums/message-boards/general-discussion/srilm-bigram-versus-htk-bigram hi http://www.voxforge.org/home/forums/message-boards/general-discussion/srilm-bigram-versus-htk-bigram Mon, 20 Jun 2011 05:06:37 -0500 reverse trigram http://www.voxforge.org/home/forums/message-boards/general-discussion/reverse-trigram Hi all, i've built accoustic model and using grammar. when i try to using julius live system can recognize word/sentence well. but, when i try to build language model using htk to make reverse trigram live with julian and using accoutic model, the performance is worse than using grammar. i try language model using 3136 training sentence and 180 word list (wlist). when i executing julius there is no error message. is it true that the performance using language model is worse than using grammar? here is the step that i already did in language model htk: REVERSE TRIGRAM: - reverse all train corpus - make unigram and bigram - make trigram LBuild -f TEXT -T 1 -c 3 1 -n 3 -l lm_2k\bg1 lm_2k\2k.wmap lm_2k\tg1_1_new.arpa holmes.1\data.0 holmes.1\data.1 holmes.1\data.2 lm_2k\data.0 in julius: - mkbingram -nlr bg1_new.arpa -nrl tg1_1_new.arpa julius2.bin from the step above, is there wrong step or missing step? thanks. http://www.voxforge.org/home/forums/message-boards/general-discussion/reverse-trigram Thu, 28 Apr 2011 02:44:17 -0500 error with HInit. http://www.voxforge.org/home/forums/message-boards/general-discussion/error-with-hinit I have been trying to use HInit and have been getting different errors. The first one being the following: In this example, I am just trying to recognize a phoneme for the letter 'a'. http://www.voxforge.org/home/forums/message-boards/general-discussion/error-with-hinit Thu, 21 Apr 2011 07:44:04 -0500 Support your project http://www.voxforge.org/home/forums/message-boards/general-discussion/support-your-project Hi , we are from Turkey. We interested in your project much. We have online game.We will give gifts some people who talks on voxforge everyday. http://www.voxforge.org/home/forums/message-boards/general-discussion/support-your-project Sat, 09 Apr 2011 10:21:47 -0500 ERROR 3219 HVite: HMM list file name expected http://www.voxforge.org/home/forums/message-boards/general-discussion/error-3219-hvite-hmm-list-file-name-expected Hello, http://www.voxforge.org/home/forums/message-boards/general-discussion/error-3219-hvite-hmm-list-file-name-expected Wed, 30 Mar 2011 16:11:58 -0500 Utilize HTK Live recognizer http://www.voxforge.org/home/forums/message-boards/general-discussion/utilize-htk-live-recognizer Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/utilize-htk-live-recognizer Wed, 30 Mar 2011 11:37:05 -0500 Using Keith Vertanen's AM and LM in Julius http://www.voxforge.org/home/forums/message-boards/general-discussion/using-keith-vertanens-am-and-lm-in-julius http://www.voxforge.org/home/forums/message-boards/general-discussion/using-keith-vertanens-am-and-lm-in-julius Thu, 17 Mar 2011 12:31:25 -0500 License terms http://www.voxforge.org/home/forums/message-boards/general-discussion/license-terms Hello, http://www.voxforge.org/home/forums/message-boards/general-discussion/license-terms Wed, 16 Mar 2011 19:57:41 -0500 Speech controlled Program http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-controlled-program Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-controlled-program Wed, 02 Mar 2011 02:55:06 -0600 Download Nightly Build within my GPL application http://www.voxforge.org/home/forums/message-boards/general-discussion/download-nightly-build-within-my-gpl-application Hi! http://www.voxforge.org/home/forums/message-boards/general-discussion/download-nightly-build-within-my-gpl-application Tue, 22 Feb 2011 05:49:43 -0600 2011 Nightly Builds? http://www.voxforge.org/home/forums/message-boards/general-discussion/2011-nightly-builds hi everyone, http://www.voxforge.org/home/forums/message-boards/general-discussion/2011-nightly-builds Mon, 21 Feb 2011 08:13:20 -0600 Read not working. http://www.voxforge.org/home/forums/message-boards/general-discussion/read-not-working http://imagebin.org/135904 http://www.voxforge.org/home/forums/message-boards/general-discussion/read-not-working Wed, 02 Feb 2011 20:19:23 -0600 Speech samples and/or acoustic model for a particular dialect http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-samples-and/or-acoustic-model-for-a-particular-dialect Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-samples-and/or-acoustic-model-for-a-particular-dialect Wed, 02 Feb 2011 05:24:49 -0600 How to using htk on window http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-using-htk-on-window Hi. http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-using-htk-on-window Sun, 16 Jan 2011 04:00:10 -0600 New Audio Source http://www.voxforge.org/home/forums/message-boards/general-discussion/new-audio-source I'm not sure if this is a dumb question: http://www.voxforge.org/home/forums/message-boards/general-discussion/new-audio-source Thu, 06 Jan 2011 20:30:08 -0600 acoustics http://www.voxforge.org/home/forums/message-boards/general-discussion/acoustics Ken, http://www.voxforge.org/home/forums/message-boards/general-discussion/acoustics Wed, 22 Dec 2010 12:56:57 -0600 speech enabled websites http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-enabled-websites Hey folks, http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-enabled-websites Thu, 09 Dec 2010 07:47:07 -0600 How to start Turkish support in VoxForge? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-start-turkish-support-in-voxforge Hello, http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-start-turkish-support-in-voxforge Tue, 09 Nov 2010 14:34:35 -0600 Generated files in Subversion http://www.voxforge.org/home/forums/message-boards/general-discussion/generated-files-in-subversion While building databases I meet the same issue over and over again - side of the repository is unnecessary huge due to autogenerated files like *.mfc and resampled *.wav. I strongly recommend to remove such files from the repository since they do not add any information only increase pain with duplicated bugs. http://www.voxforge.org/home/forums/message-boards/general-discussion/generated-files-in-subversion Wed, 27 Oct 2010 16:58:16 -0500 Search indexed files? http://www.voxforge.org/home/forums/message-boards/general-discussion/search-indexed-files http://www.voxforge.org/home/forums/message-boards/general-discussion/search-indexed-files Wed, 20 Oct 2010 18:33:12 -0500 New from W3C: HTML Speech Incubator Group Charter http://www.voxforge.org/home/forums/message-boards/general-discussion/new-from-w3c-html-speech-incubator-group-charter From their web page: http://www.voxforge.org/home/forums/message-boards/general-discussion/new-from-w3c-html-speech-incubator-group-charter Wed, 29 Sep 2010 18:33:30 -0500 Problem with HInit http://www.voxforge.org/home/forums/message-boards/general-discussion/problem-with-hinit Hi everyone, http://www.voxforge.org/home/forums/message-boards/general-discussion/problem-with-hinit Fri, 24 Sep 2010 10:24:55 -0500 any conversations in the available audio http://www.voxforge.org/home/forums/message-boards/general-discussion/any-conversations-in-the-available-audio I'm looking for recordings of conversation that I can use for a speech application I'm working on. I've downloaded a few folders but so far they all consist of a single person speaking. http://www.voxforge.org/home/forums/message-boards/general-discussion/any-conversations-in-the-available-audio Thu, 23 Sep 2010 10:03:31 -0500 How to add new words... ? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-add-new-words...- I probably misunderstand the forums, but it seems like this thread that I posted to didn't bump because I didn't see it at the top of the category. So I'm posting a redirect... http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-add-new-words...- Tue, 14 Sep 2010 10:17:43 -0500 Convert sphinx .arpa language model to .DMP file issue http://www.voxforge.org/home/forums/message-boards/general-discussion/convert-sphinx-.arpa-language-model-to-.dmp-file-issue Hello everyone, http://www.voxforge.org/home/forums/message-boards/general-discussion/convert-sphinx-.arpa-language-model-to-.dmp-file-issue Mon, 13 Sep 2010 08:42:16 -0500 add Ralf's German dictionary 0.2.1 to Voxforge http://www.voxforge.org/home/forums/message-boards/general-discussion/add-ralfs-german-dictionary-0.2.1-to-voxforge Hi Ken, http://www.voxforge.org/home/forums/message-boards/general-discussion/add-ralfs-german-dictionary-0.2.1-to-voxforge Sat, 11 Sep 2010 09:11:38 -0500 Trying to follow guide to make my own acoustic model but problems... http://www.voxforge.org/home/forums/message-boards/general-discussion/trying-to-follow-guide-to-make-my-own-acoustic-model-but-problems.. http://www.voxforge.org/home/forums/message-boards/general-discussion/trying-to-follow-guide-to-make-my-own-acoustic-model-but-problems.. Fri, 10 Sep 2010 15:10:07 -0500 language model traning problem with utf-8 text corpus http://www.voxforge.org/home/forums/message-boards/general-discussion/language-model-traning-problem-with-utf-8-text-corpus Hello everyone, http://www.voxforge.org/home/forums/message-boards/general-discussion/language-model-traning-problem-with-utf-8-text-corpus Fri, 10 Sep 2010 05:00:35 -0500 Isolated speech data http://www.voxforge.org/home/forums/message-boards/general-discussion/isolated-speech-data Hello, people. http://www.voxforge.org/home/forums/message-boards/general-discussion/isolated-speech-data Mon, 06 Sep 2010 00:46:14 -0500 Wrong dictionary file with julius? http://www.voxforge.org/home/forums/message-boards/general-discussion/wrong-dictionary-file-with-julius I've got julius 4.1.5, the latest nightly 8khz acoustic model from this site, and the grammar from here: http://www.keithv.com/software/giga/lm_giga_64k_nvp_3gram.zip http://www.voxforge.org/home/forums/message-boards/general-discussion/wrong-dictionary-file-with-julius Tue, 31 Aug 2010 19:52:12 -0500 More than one language model as input to Julius http://www.voxforge.org/home/forums/message-boards/general-discussion/more-than-one-language-model-as-input-to-julius Hi Ken http://www.voxforge.org/home/forums/message-boards/general-discussion/more-than-one-language-model-as-input-to-julius Mon, 30 Aug 2010 14:14:53 -0500 Clarification on Julius Language Model http://www.voxforge.org/home/forums/message-boards/general-discussion/clarification-on-julius-language-model Hi http://www.voxforge.org/home/forums/message-boards/general-discussion/clarification-on-julius-language-model Fri, 27 Aug 2010 11:26:02 -0500 Dictionary format http://www.voxforge.org/home/forums/message-boards/general-discussion/dictionary-format Hello, http://www.voxforge.org/home/forums/message-boards/general-discussion/dictionary-format Wed, 25 Aug 2010 06:19:19 -0500 Windows 7 Speech Recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/windows-7-speech-recognition Windows 7 Speech Recognition is really pretty good but I look for open-source to lead the way. http://www.voxforge.org/home/forums/message-boards/general-discussion/windows-7-speech-recognition Sun, 22 Aug 2010 14:45:08 -0500 Research to Improve Speech Recognition Software http://www.voxforge.org/home/forums/message-boards/general-discussion/research-to-improve-speech-recognition-software This was featured in ACM TechNews. There isn't much details, but I'm copying it here in case someone finds it interesting. http://www.voxforge.org/home/forums/message-boards/general-discussion/research-to-improve-speech-recognition-software Fri, 13 Aug 2010 17:40:44 -0500 Using president's transcribed speeches http://www.voxforge.org/home/forums/message-boards/general-discussion/using-presidents-transcribed-speeches I did a quick search through the forums and didn't see any mention of this. The whitehouse.gov has a corpus of transcribed presidents' speeches along with the audio files. Have these been incorporated in Voxforge? http://www.voxforge.org/home/forums/message-boards/general-discussion/using-presidents-transcribed-speeches Thu, 29 Jul 2010 12:13:52 -0500 Voice Recognition Command System http://www.voxforge.org/home/forums/message-boards/general-discussion/voice-recognition-command-system Hello: http://www.voxforge.org/home/forums/message-boards/general-discussion/voice-recognition-command-system Fri, 09 Jul 2010 23:23:09 -0500 HVite - How to use it for labeling? http://www.voxforge.org/home/forums/message-boards/general-discussion/hvite---how-to-use-it-for-labeling Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/hvite---how-to-use-it-for-labeling Thu, 08 Jul 2010 10:10:12 -0500 Sphinx 4/ Audio cache and Speech recognizer issues http://www.voxforge.org/home/forums/message-boards/general-discussion/sphinx-4/-audio-cache-and-speech-recognizer-issues I have a an art gallery opening tomorrow. Yeah I know. I've been trying to fix this memory issue for two weeks now with no avail. The skinny: Here's the code I've started with. http://pastebin.org/366431 Basically, here's what the program is supposed to do: It's supposed to initialise a speech recognizer, and a microphone. Supposed to find tweets specified by an external file that has the grammar words "love" "hate" "sad" etc. It's supposed to speak the words when recognized and while it is speaking the microphone turns off. Then it is supposed to listen for the key word which is spoken and loop this procedure. The problem I'm having is that i'm running out of memory and getting out of memory java heap error. I've upped my cache to 500 mb and changed the configuration files to match, and I'm still running out. This is due to one thing and one thing only: the recognizer allocation. It's somehow meant to be in the constructor. I'm not sure how to get the allocater out of the loop and have it work. I'm supremely desperate and I've looked everywhere. They may not let me exhibit if I can't get the error fixed. Here is an alternate code that someone suggested. It's not right but maybe it's closer than the first. http://pastebin.org/367378 If anyone knows how to solve this issue i would greatly appreciate it! http://www.voxforge.org/home/forums/message-boards/general-discussion/sphinx-4/-audio-cache-and-speech-recognizer-issues Tue, 29 Jun 2010 20:37:11 -0500 transcribe from wav files? http://www.voxforge.org/home/forums/message-boards/general-discussion/transcribe-from-wav-files I downloaded http://www.repository.voxforge1.org/downloads/Main/Tags/Releases/0_1_1-build726/Julius-3.5.2-Quickstart-Linux_(0_1_1-build726).tgz, and, per the README, ran http://www.voxforge.org/home/forums/message-boards/general-discussion/transcribe-from-wav-files Tue, 22 Jun 2010 13:37:36 -0500 JVXML demo code build error http://www.voxforge.org/home/forums/message-boards/general-discussion/jvxml-demo-code-build-error Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/jvxml-demo-code-build-error Wed, 09 Jun 2010 16:53:55 -0500 iPhone http://www.voxforge.org/home/forums/message-boards/general-discussion/iphone Hi there, I'm planning on using voice recognition for 3-4 simple commands(single word) in an iPhone application I'm writting. http://www.voxforge.org/home/forums/message-boards/general-discussion/iphone Sun, 09 May 2010 18:04:53 -0500 Gaussian Markov Model (GMM) http://www.voxforge.org/home/forums/message-boards/general-discussion/gaussian-markov-model-gmm Hey, http://www.voxforge.org/home/forums/message-boards/general-discussion/gaussian-markov-model-gmm Wed, 05 May 2010 10:59:05 -0500 Need Help: Modify julius code? http://www.voxforge.org/home/forums/message-boards/general-discussion/need-help-modify-julius-code Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/need-help-modify-julius-code Sun, 02 May 2010 03:10:04 -0500 Is voice recognition worth the hassle? http://www.voxforge.org/home/forums/message-boards/general-discussion/is-voice-recognition-worth-the-hassle I'm considering doing a project for a particular department at my school in the area of artificial intelligence. I am only considering it because I've been literally waiting for YEARS (10??) for speech recognition (I last looked at sphinx 3) to catch up to everyone else. By that what I mean is the documentation and implementation of a small portion of the project seems terribly time consuming and it also seems necessary that I find out HOW this section of the project works rather than add it on as a stand alone portion of the greater whole. I've written a great deal of the project in the past in seperate programs and am only now considering a culmination of the AI as a really nice expert system/prolog based AI if I can get certain bells and whistles to work on it. One being this ;) http://www.voxforge.org/home/forums/message-boards/general-discussion/is-voice-recognition-worth-the-hassle Thu, 29 Apr 2010 20:46:42 -0500 How to deal with Out-Of-Vocabulary (OOV) Words http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-deal-with-out-of-vocabulary-oov-words Hi all, http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-deal-with-out-of-vocabulary-oov-words Thu, 15 Apr 2010 09:25:24 -0500 Grammer Files http://www.voxforge.org/home/forums/message-boards/general-discussion/grammer-files After reading through this site, I see that this project seems only to be aimed at creating a solid acoustic model for various software. I'm working with Juilis, and was looking into the grammar files wondering how to expand on them easily. http://www.voxforge.org/home/forums/message-boards/general-discussion/grammer-files Sun, 11 Apr 2010 20:47:58 -0500 Cambridge HTK server offline temporarily http://www.voxforge.org/home/forums/message-boards/general-discussion/cambridge-htk-server-offline-temporarily For those who may have tried to access the server at http://htk.eng.cam.ac.uk/ in the last few days and found that it is not behaving correctly, I have this message back from the admin at the engineering dept: http://www.voxforge.org/home/forums/message-boards/general-discussion/cambridge-htk-server-offline-temporarily Thu, 08 Apr 2010 11:33:04 -0500 Running Julian After Step 10 http://www.voxforge.org/home/forums/message-boards/general-discussion/running-julian-after-step-10 Hello, http://www.voxforge.org/home/forums/message-boards/general-discussion/running-julian-after-step-10 Thu, 01 Apr 2010 15:53:07 -0500 Basic help needed please http://www.voxforge.org/home/forums/message-boards/general-discussion/basic-help-needed-please http://www.voxforge.org/home/forums/message-boards/general-discussion/basic-help-needed-please Fri, 26 Mar 2010 05:15:49 -0500 Getting the phonems directly instead words http://www.voxforge.org/home/forums/message-boards/general-discussion/getting-the-phonems-directly-instead-words Guys, http://www.voxforge.org/home/forums/message-boards/general-discussion/getting-the-phonems-directly-instead-words Thu, 11 Mar 2010 05:13:53 -0600 Using and Setting Up Grammar http://www.voxforge.org/home/forums/message-boards/general-discussion/using-and-setting-up-grammar All, http://www.voxforge.org/home/forums/message-boards/general-discussion/using-and-setting-up-grammar Mon, 01 Mar 2010 21:55:07 -0600 Recognition results statistics http://www.voxforge.org/home/forums/message-boards/general-discussion/recognition-results-statistics Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/recognition-results-statistics Mon, 01 Mar 2010 11:00:07 -0600 transcribing seminar videos http://www.voxforge.org/home/forums/message-boards/general-discussion/transcribing-seminar-videos i have a group of seminar videos, I am looking at setting up a website like metavid ( http://metavid.org/ ) witch is based on the wikipedia software http://www.voxforge.org/home/forums/message-boards/general-discussion/transcribing-seminar-videos Thu, 25 Feb 2010 06:21:05 -0600 finding language model weight http://www.voxforge.org/home/forums/message-boards/general-discussion/finding-language-model-weight hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/finding-language-model-weight Sun, 21 Feb 2010 23:01:48 -0600 convert .mfc on sphinx into .mfc of htk http://www.voxforge.org/home/forums/message-boards/general-discussion/convert-.mfc-on-sphinx-into-.mfc-of-htk hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/convert-.mfc-on-sphinx-into-.mfc-of-htk Sat, 06 Feb 2010 11:47:40 -0600 frusterated http://www.voxforge.org/home/forums/message-boards/general-discussion/frusterated usually my forum posts are a little short tempered due to my frusteration with trying and researching and trying something and posting as a last resort so please bear with me... http://www.voxforge.org/home/forums/message-boards/general-discussion/frusterated Wed, 03 Feb 2010 18:06:29 -0600 diff julius julian http://www.voxforge.org/home/forums/message-boards/general-discussion/diff-julius-julian I guess I don't really understand the difference between julius and julian. I've read the technical stuff, but can somebody simplify the explanation? http://www.voxforge.org/home/forums/message-boards/general-discussion/diff-julius-julian Tue, 02 Feb 2010 20:55:53 -0600 Grammer? http://www.voxforge.org/home/forums/message-boards/general-discussion/grammer In the Julius manual, they always refer to grammer like its some sort of object. What does this mean? http://www.voxforge.org/home/forums/message-boards/general-discussion/grammer Tue, 02 Feb 2010 21:06:25 -0600 Game development http://www.voxforge.org/home/forums/message-boards/general-discussion/game-development Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/game-development Mon, 01 Feb 2010 05:25:51 -0600 Voxforge java speech applet and Firefox 3.6 http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-java-speech-applet-and-firefox-3.6 I just upgraded my Firefox to 3.6 and had a lot of problems getting my java plugin to work with the new version. Most of the instructions about installing the symbolic link state something like http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-java-speech-applet-and-firefox-3.6 Mon, 25 Jan 2010 05:10:31 -0600 Getting CSR that is comparable to Dragon http://www.voxforge.org/home/forums/message-boards/general-discussion/getting-csr-that-is-comparable-to-dragon What would be the approach to get either Sphinx, Julius or HTK to be as accurate as Dragon Naturally Speaking preferred or pro version in English (USA and or UK)? Has anyone achieved anything close? i.e. accurate CSR with a large english vocab. I assume that a much larger speech corpus than what VoxForge currently has would be required, along with a much larger dictionary than what is currently available. Even if VoxForge reaches 140 hours of English, would this be enough? What other things would have to be done to the 140 hours to have it be good enough? http://www.voxforge.org/home/forums/message-boards/general-discussion/getting-csr-that-is-comparable-to-dragon Fri, 08 Jan 2010 13:22:16 -0600 Sources of text http://www.voxforge.org/home/forums/message-boards/general-discussion/sources-of-text Is there any legal restriction on submitting recordings made from reading news articles from online news sites? http://www.voxforge.org/home/forums/message-boards/general-discussion/sources-of-text Thu, 07 Jan 2010 15:08:27 -0600 HTK Site not up? http://www.voxforge.org/home/forums/message-boards/general-discussion/htk-site-not-up I'm new to VoxForge (read: VF and all related areas of study) and tried to follow the tutorial for creating an acoustic model from my own voice, but ran into a problem when trying to get HTK. Simply put, I can't reach the site. I get errors like "Server not found" or "Connection to the Server was reset", etc. I tried the Google cache of the page and it reports that their latest cached copy was from Christmas Eve (24 Dec 2009 17:20:17 GMT). Other parts of Cambridge's Engineering Dept. appear to be up. Is this just temporary or is something up? I can't get any HTK stuff, so I'm stalled for now. Any help would be appreciated. http://www.voxforge.org/home/forums/message-boards/general-discussion/htk-site-not-up Thu, 31 Dec 2009 14:58:52 -0600 Deep learning for spoken language identification http://www.voxforge.org/home/forums/message-boards/general-discussion/deep-learning-for-spoken-language-identification The VoxForge corpus was used in research by Gregoire Montavon, Machine Learning Group, Berlin Institute of Technology, Germany on Deep learning for spoken language identification. From the abtract: http://www.voxforge.org/home/forums/message-boards/general-discussion/deep-learning-for-spoken-language-identification Tue, 15 Dec 2009 16:52:31 -0600 VoxForge applet parameters and source http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-applet-parameters-and-source Hi! http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-applet-parameters-and-source Mon, 07 Dec 2009 13:56:25 -0600 Can't run Julius/Julian in Ubuntu Karmic http://www.voxforge.org/home/forums/message-boards/general-discussion/cant-run-julius/julian-in-ubuntu-karmic When I execute ./julian -input mic -C julian.jconf it results in this: http://www.voxforge.org/home/forums/message-boards/general-discussion/cant-run-julius/julian-in-ubuntu-karmic Tue, 24 Nov 2009 11:32:17 -0600 Split development? http://www.voxforge.org/home/forums/message-boards/general-discussion/split-development Sam wrote in an email: http://www.voxforge.org/home/forums/message-boards/general-discussion/split-development Mon, 16 Nov 2009 13:32:52 -0600 Several questions / ideas http://www.voxforge.org/home/forums/message-boards/general-discussion/several-questions-/-ideas email from Sam: http://www.voxforge.org/home/forums/message-boards/general-discussion/several-questions-/-ideas Mon, 16 Nov 2009 12:48:27 -0600 Sphinx4 in NetBeans; PocketSphinx demo http://www.voxforge.org/home/forums/message-boards/general-discussion/sphinx4-in-netbeans-pocketsphinx-demo Hello :-)! Once I asked on this forum for help and KMacLean gave me really useful answer :-). This is why I thought that maybe you can help me a little bit with two things. First thing (most important) is that I've got Sphinx4 installed in Windows and I'd like to create my own application which uses Sphinx4 in NetBeans. In order to do it, first of all I'd like to recompile examplary application (HelloDigits) in NetBeans. However, there are some problems with building it, connected with structure of directories in Sphinx4. The second problem (less important than first one) is how to run examplary applications in PocketSphinx. Because I already posted those questions on sourceforge (and I wait really much for the answer to the second one) in order to avoid crossposting, let me give links to those topics here: Sphinx4 in NetBeans -> https://sourceforge.net/projects/cmusphinx/forums/forum/382337/topic/3453042 (post #1) how to run demos in PocketSphinx -> https://sourceforge.net/projects/cmusphinx/forums/forum/5471/topic/3445960 (post #4) Thanks very much for help in advance :-)! Greetings :-)! http://www.voxforge.org/home/forums/message-boards/general-discussion/sphinx4-in-netbeans-pocketsphinx-demo Sun, 08 Nov 2009 13:44:07 -0600 input by spelling a good in between objective? http://www.voxforge.org/home/forums/message-boards/general-discussion/input-by-spelling-a-good-in-between-objective It will probably take a while before usable OS dictation software will be a realistticobjective, however, I can imagine that an application to input text using spelling is not as unlikely at all. http://www.voxforge.org/home/forums/message-boards/general-discussion/input-by-spelling-a-good-in-between-objective Tue, 03 Nov 2009 13:28:49 -0600 Julius, HTK or Sphinx for mobile phone http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-htk-or-sphinx-for-mobile-phone Hello :-)! I'd like to create application for mobile phone such that: 1. user enters this application, 2. user speaks ten digits, then says stop, application checks control sum, if the sum is correct it asks the user for some additional informations, if not it asks whether the user accepts improper control sum or wants to repeat digits 3. based on this talk, application creates little text file and sends it to server (probably with Tomcat) The other possible approach (however that first one is better) is somehow different: 1. user calls special number, it connects him to server with Asterisk 2. the same as above 3. the only one difference is that there is no need to send it, it is just saved on the server. At first I thought about CMU Sphinx, i.e. PocketSphinx (C language) for first approach or Sphinx4 (Java) for second one. However now I think that it may be better idea to check other freeware, open-source systems like HTK or Julius. Which of these three (Sphinx, HTK, Julius) is better for first, and which for the second approach and why? (I cannot buy super-expensive mobile phone so hardware resources on phone are limited). Can you give me, please, some tutorials which say about (beginning from most important): 1. where to download the system, how to compile, install and run it on computer 2. how to create algorithm of recognition (as explained in second point above) 3. how to run it on mobile phone! 4. how to create my own model (what kind of models? acoustic and language I guess?). I need to create my own because I need other language that English and I am almost sure there won't be already existing, available for free model for my language (it is Polish language). However, maybe English model would be good enough, even if there may be some differences in phonemes. Thank you very much for your help in advance :-)! http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-htk-or-sphinx-for-mobile-phone Sun, 25 Oct 2009 06:01:46 -0500 Bluetooth device as audio in/out http://www.voxforge.org/home/forums/message-boards/general-discussion/bluetooth-device-as-audio-in/out I've been experimenting with a bluetooth dongle and a Jabra BT2040 as an audio input and output device. The idea is a handsfree communication with a Julius SRE in place of the wired USB headset I normally use. (Using Opensuse 11.1 + Blueman) http://www.voxforge.org/home/forums/message-boards/general-discussion/bluetooth-device-as-audio-in/out Sat, 10 Oct 2009 06:58:10 -0500 Relatiob with LibriVox http://www.voxforge.org/home/forums/message-boards/general-discussion/relatiob-with-librivox What is the relation of this project and a LibriVox? Do LibriVox utterances are incorporated into VoxForge or not? http://www.voxforge.org/home/forums/message-boards/general-discussion/relatiob-with-librivox Thu, 01 Oct 2009 05:02:00 -0500 Speech to Phenome to Speech http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-to-phenome-to-speech Just curious, http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-to-phenome-to-speech Thu, 10 Sep 2009 18:47:16 -0500 University of Washington Research Lectures http://www.voxforge.org/home/forums/message-boards/general-discussion/university-of-washington-research-lectures I was scanning the FTA satellites last evening and found this interesting lecture on extracting meaning from sentences in a voice interaction context from UWTV: http://www.voxforge.org/home/forums/message-boards/general-discussion/university-of-washington-research-lectures Thu, 10 Sep 2009 04:53:28 -0500 How to get the time of each Phoneme/Viseme in audio file for lipsync? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-get-the-time-of-each-phoneme/viseme-in-audio-file-for-lipsync hi,all I want to do the lipsync for my cg animation. I think I have to get the time of each Phoneme/Viseme in the audio file. My questions are: 1.How to get the time of each Phoneme/Viseme from audio file (e.g.wav/ogg/...)?for example, i need the information like this: http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-get-the-time-of-each-phoneme/viseme-in-audio-file-for-lipsync Tue, 01 Sep 2009 07:19:44 -0500 Dynamic phoneme generation http://www.voxforge.org/home/forums/message-boards/general-discussion/dynamic-phoneme-generation Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/dynamic-phoneme-generation Mon, 24 Aug 2009 14:51:53 -0500 sphinx + voxforge http://www.voxforge.org/home/forums/message-boards/general-discussion/sphinx--voxforge hey voxforge team http://www.voxforge.org/home/forums/message-boards/general-discussion/sphinx--voxforge Sat, 22 Aug 2009 12:28:37 -0500 htk silence removal http://www.voxforge.org/home/forums/message-boards/general-discussion/htk-silence-removal Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/htk-silence-removal Wed, 12 Aug 2009 00:32:10 -0500 phonetic dictionary create edit format http://www.voxforge.org/home/forums/message-boards/general-discussion/phonetic-dictionary-create-edit-format Hello all, I hope to make it easier to improve a phonetic dictionary with g2p4j. I welcome advice and anyone wishing to help with this project: http://g2p4j.sourceforge.net/ I post here instead of continuing audio-discussions medical-technical-language-voice-files since my interest for now is understanding phonetic dictionaries. I have compared machine made dictionaries of technical words. I find that Sequitur G2P and Festival introduced different errors. Both require a high percentage of manual corrections. Sequitur lacks the schwa ax. Could this be added? Technical words poorly follow the letter to phoneme model of general English. Examples: ey not ah to mean "not" (Greek) abacterial (ey/ah) b ae k t ih r iy ax l o vowel ow should not shift to ah adrenocortical ah d r eh n (ow/ah) k ao r t ah k ah l Although it may be overly ambitious, I use a tag for word alternatives. The same dictionary can be sorted to give preference to a region or patched and sorted to adapt a general acoustic model to an individual. I would appreciate any references that detail the role of alternate http://www.voxforge.org/home/forums/message-boards/general-discussion/phonetic-dictionary-create-edit-format Sun, 02 Aug 2009 07:44:05 -0500 GPL on audio files http://www.voxforge.org/home/forums/message-boards/general-discussion/gpl-on-audio-files Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/gpl-on-audio-files Tue, 07 Jul 2009 10:47:37 -0500 Guide to setting up speech recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/guide-to-setting-up-speech-recognition Hello, http://www.voxforge.org/home/forums/message-boards/general-discussion/guide-to-setting-up-speech-recognition Sun, 14 Jun 2009 13:22:17 -0500 Programming Library to Edit Audio http://www.voxforge.org/home/forums/message-boards/general-discussion/programming-library-to-edit-audio We have some 10-minute wav files that contain speech. There are well-defined pauses in the speech. We need to write a program that separates out the speech between these pauses and outputs them as separate audio files. We also need to be able to save the elapsed time at the end of each pause. http://www.voxforge.org/home/forums/message-boards/general-discussion/programming-library-to-edit-audio Mon, 08 Jun 2009 18:26:36 -0500 Seeking audio for generation of text-to-speech voices http://www.voxforge.org/home/forums/message-boards/general-discussion/seeking-audio-for-generation-of-text-to-speech-voices I have written a text-to-speech engine which lets people create their own voices, and wish to create more voices for it. I found the CMU Festival voices (like AWB, RMS, JMK, etc.), but need more voices. Are any of the voices here that are good for TTS? (aka: 1000+ sentences) http://www.voxforge.org/home/forums/message-boards/general-discussion/seeking-audio-for-generation-of-text-to-speech-voices Mon, 08 Jun 2009 16:51:58 -0500 preferred way to download all 16kHz english audio files http://www.voxforge.org/home/forums/message-boards/general-discussion/preferred-way-to-download-all-16khz-english-audio-files Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/preferred-way-to-download-all-16khz-english-audio-files Sun, 24 May 2009 20:11:17 -0500 Vote for VoxForge http://www.voxforge.org/home/forums/message-boards/general-discussion/vote-for-voxforge I just voted for VoxForge in the category: "Most Likely to Change the Way You Do Everything" http://www.voxforge.org/home/forums/message-boards/general-discussion/vote-for-voxforge Sat, 23 May 2009 04:43:24 -0500 Interns/Coops/etc. interested sphinx-related project work? http://www.voxforge.org/home/forums/message-boards/general-discussion/interns/coops/etc.-interested-sphinx-related-project-work Hi - http://www.voxforge.org/home/forums/message-boards/general-discussion/interns/coops/etc.-interested-sphinx-related-project-work Mon, 11 May 2009 06:55:25 -0500 Error in ./HTK_Compile_Model.sh http://www.voxforge.org/home/forums/message-boards/general-discussion/error-in-./htk_compile_model.sh http://www.voxforge.org/home/forums/message-boards/general-discussion/error-in-./htk_compile_model.sh Mon, 04 May 2009 13:57:18 -0500 Using VoxForge in a project http://www.voxforge.org/home/forums/message-boards/general-discussion/using-voxforge-in-a-project I want to use VoxForge to convert speech into text for words like "input", "output","int","float" etc.. http://www.voxforge.org/home/forums/message-boards/general-discussion/using-voxforge-in-a-project Sat, 02 May 2009 21:51:03 -0500 Whole-Word Recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/whole-word-recognition Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/whole-word-recognition Wed, 29 Apr 2009 09:43:27 -0500 LM Perplexity for LVCSR task http://www.voxforge.org/home/forums/message-boards/general-discussion/lm-perplexity-for-lvcsr-task Hi http://www.voxforge.org/home/forums/message-boards/general-discussion/lm-perplexity-for-lvcsr-task Fri, 24 Apr 2009 00:02:19 -0500 audio CAPTCHA suggestion http://www.voxforge.org/home/forums/message-boards/general-discussion/audio-captcha-suggestion You should consider building your voice recognition system into captcha inputs that people could embed into their own webpages. It would still be a visual captcha, since the person would have to read the word/letters - but you would provide the option of submitting it verbally to further the cause of open speech recognition. Would obviouslly have to be on words that were good enough to give a fair probability score when they're being explicitly looked for. http://www.voxforge.org/home/forums/message-boards/general-discussion/audio-captcha-suggestion Thu, 23 Apr 2009 21:46:40 -0500 Using adapted models with julius http://www.voxforge.org/home/forums/message-boards/general-discussion/using-adapted-models-with-julius Hello http://www.voxforge.org/home/forums/message-boards/general-discussion/using-adapted-models-with-julius Sat, 18 Apr 2009 05:05:11 -0500 Submissions Stalled http://www.voxforge.org/home/forums/message-boards/general-discussion/submissions-stalled I tried submitting several voice samples over a week ago, but I still haven't shown up on the metrics page. It looks like the "nightly" batch file that processes the samples might not be running anymore. My (and many other) samples are showing up on the "Speech Submissions Awaiting Processing" page (http://read.voxforge1.org/r0_1_6/endpage.php). http://www.voxforge.org/home/forums/message-boards/general-discussion/submissions-stalled Wed, 15 Apr 2009 14:43:30 -0500 Text To Speech http://www.voxforge.org/home/forums/message-boards/general-discussion/text-to-speech3 Hi there, http://www.voxforge.org/home/forums/message-boards/general-discussion/text-to-speech3 Tue, 14 Apr 2009 06:43:00 -0500 transcribing expermints http://www.voxforge.org/home/forums/message-boards/general-discussion/transcribing-expermints I want to do some expermints. I don't have any expectaions, so save the "it isn't going to work good enough" becaue all I am hoping for at first is to see how good/bad things are with various amounts of effort. http://www.voxforge.org/home/forums/message-boards/general-discussion/transcribing-expermints Tue, 07 Apr 2009 21:55:28 -0500 Free Speech Isolated Word Recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/free-speech-isolated-word-recognition Hi, http://www.voxforge.org/home/forums/message-boards/general-discussion/free-speech-isolated-word-recognition Mon, 06 Apr 2009 08:47:30 -0500 Error while creating reverse trigram http://www.voxforge.org/home/forums/message-boards/general-discussion/error-while-creating-reverse-trigram Hello http://www.voxforge.org/home/forums/message-boards/general-discussion/error-while-creating-reverse-trigram Mon, 06 Apr 2009 07:12:10 -0500 Compare speeches http://www.voxforge.org/home/forums/message-boards/general-discussion/compare-speeches Hello, I need to compare some voice speeches of different people to see if they say the same and if they pronounce it correctly. For this I am using Julius but the problem is that it always tries to match the speech with the grammar even if the word it is said isn't in it. I thought that maybe using dictation I could get better results, because I don't need to have an accurate output (it doesn't have to be the same as what it is said on the speech), if the output is the same for two speeches of different people where they say the same and correctly it's enough. What do you think? Is to possible to say to Julius not to give an output if what was said is not part of the grammar? I hope you understand what I want to do. Thank you very much and sorry for my English. http://www.voxforge.org/home/forums/message-boards/general-discussion/compare-speeches Fri, 03 Apr 2009 04:46:22 -0500 Old copyright line found http://www.voxforge.org/home/forums/message-boards/general-discussion/old-copyright-line-found It seems that most of pages on this site have up-to-date copyright line bottom of the page, but I scrolled http://read.voxforge1.org/r0_1_6/endpage.php down and found an old "2005-2007" there. http://www.voxforge.org/home/forums/message-boards/general-discussion/old-copyright-line-found Sat, 14 Mar 2009 07:41:14 -0500 Searching for voice test files to test voice quality, voice quality test files used in PESQ and other methods http://www.voxforge.org/home/forums/message-boards/general-discussion/searching-for-voice-test-files-to-test-voice-quality-voice-quality-test-files-used-in-pesq-and-other-methods Hi, Would greately appreciate any reference to test files http://www.voxforge.org/home/forums/message-boards/general-discussion/searching-for-voice-test-files-to-test-voice-quality-voice-quality-test-files-used-in-pesq-and-other-methods Thu, 05 Mar 2009 05:59:32 -0600 Just sayin Hello http://www.voxforge.org/home/forums/message-boards/general-discussion/just-sayin-hello I am new here and have read the archive of forum posts. I am somewhat new to speech recognition and have been bugging nsh quite a bit on freenode #cmusphinx. http://www.voxforge.org/home/forums/message-boards/general-discussion/just-sayin-hello Wed, 18 Feb 2009 04:23:31 -0600 NATO alphabet and voice recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/nato-alphabet-and-voice-recognition Just wondering about the applicability of the NATO alphabet in voice recognition. Originally intended to help with clear communication over radio, I'm wondering if the same principles apply with voice recognition, or perhaps there are other alphabets which fit with VR more effectively. http://www.voxforge.org/home/forums/message-boards/general-discussion/nato-alphabet-and-voice-recognition Sat, 07 Feb 2009 08:40:42 -0600 Julius Isolated Word http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-isolated-word Hi All, http://www.voxforge.org/home/forums/message-boards/general-discussion/julius-isolated-word Fri, 06 Feb 2009 22:58:31 -0600 Scaling a grammar http://www.voxforge.org/home/forums/message-boards/general-discussion/scaling-a-grammar I am using Julius + HTK and the voxforge scripts (latest versions of everything). http://www.voxforge.org/home/forums/message-boards/general-discussion/scaling-a-grammar Mon, 02 Feb 2009 09:23:22 -0600 Problems running Julius http://www.voxforge.org/home/forums/message-boards/general-discussion/problems-running-julius Hello, http://www.voxforge.org/home/forums/message-boards/general-discussion/problems-running-julius Fri, 09 Jan 2009 08:56:06 -0600 generic english .grammar and .voca files? http://www.voxforge.org/home/forums/message-boards/general-discussion/generic-english-.grammar-and-.voca-files Hi All, http://www.voxforge.org/home/forums/message-boards/general-discussion/generic-english-.grammar-and-.voca-files Tue, 30 Dec 2008 11:36:09 -0600 mfc extension http://www.voxforge.org/home/forums/message-boards/general-discussion/mfc-extension2 What do I need to open files with the .mfc extension? http://www.voxforge.org/home/forums/message-boards/general-discussion/mfc-extension2 Sun, 28 Dec 2008 10:19:46 -0600 mfc extension http://www.voxforge.org/home/forums/message-boards/general-discussion/mfc-extension What do I need to open files with the .mfc extension? http://www.voxforge.org/home/forums/message-boards/general-discussion/mfc-extension Sun, 28 Dec 2008 10:19:45 -0600 cross checking industry standards and best practices http://www.voxforge.org/home/forums/message-boards/general-discussion/cross-checking-industry-standards-and-best-practices 1. Any best practices or industry benchmark figures to compare time spent by customer in the IVR http://www.voxforge.org/home/forums/message-boards/general-discussion/cross-checking-industry-standards-and-best-practices Fri, 19 Dec 2008 17:41:07 -0600 ESV.org as an audiobook resource http://www.voxforge.org/home/forums/message-boards/general-discussion/esv.org-as-an-audiobook-resource ESV.org is the website for the English Standard Bible. You can go to a chapter of this bible (eg http://www.gnpcb.org/esv/search/?q=Genesis+1 ) and click on the "Listen" link. Each word is then read by a professional narrator, and the text (in context) is displayed on the screen. http://www.voxforge.org/home/forums/message-boards/general-discussion/esv.org-as-an-audiobook-resource Thu, 18 Dec 2008 08:48:35 -0600 How did Google collected a speech for their ASR http://www.voxforge.org/home/forums/message-boards/general-discussion/how-did-google-collected-a-speech-for-their-asr Google also had a useful set of data correlating speech samples with http://www.voxforge.org/home/forums/message-boards/general-discussion/how-did-google-collected-a-speech-for-their-asr Tue, 25 Nov 2008 13:24:53 -0600 DFA/ sp-phone http://www.voxforge.org/home/forums/message-boards/general-discussion/dfa/-sp-phone I began a few days ago with some julius-testing. But I have a problem. I don't understand the syntax of the sample.dfa-file. My configuration can recognize just two words now (but that actually works great, 100% accuracy;)). Is there a tutorial or a readme about this? http://www.voxforge.org/home/forums/message-boards/general-discussion/dfa/-sp-phone Sun, 23 Nov 2008 09:55:40 -0600 beginning or getting startet http://www.voxforge.org/home/forums/message-boards/general-discussion/beginning-or-getting-startet can someone tell me how to begin. i ´ve desperatly been looking for a voice recognition. and now since i ´ve finally found this, i cannot get startet, because i don´t understand how it works. i would be thankfull, if someone could give me an introduction. thanx in advance. joel david http://www.voxforge.org/home/forums/message-boards/general-discussion/beginning-or-getting-startet Fri, 07 Nov 2008 17:55:56 -0600 Funtionality request for Voxforge http://www.voxforge.org/home/forums/message-boards/general-discussion/funtionality-request-for-voxforge Hi again, Another thing, thinking about voice donor process, I have one idea... let's see if you think it could be viable. In english there are some programs that do voice recognition, but in spanish there is NO program that can do anything. (This happens in a lot of languages). If a possible voice donor, see the number of voice records that we have in any language like spanish, it seems that it's very dificult to reach the first 140 hours objetive. Then a person, like me, could forgot the donor, or he could think about preparing training data for sphinx2 or sphinx3 or julian, in order to have basic functions of recognition for his own voice... the process of creating this it's not easy. Could it be possible to have a application in voxforge that help the people in all this training process. Let's say that in the initial process you have the read applet. Then a person could record his voice (100 phrases could be about 10 minutes of voice), and voxforge will retain all this voice under the GNU license. Then let's say that Voxforge processes the voice of one person and prepare with HTK, the HMM files that this person could use it in order to use any of the voice recognition engines (the selected by voxforge). This could be a little gift that voxforge gives to a voice donor... and it could help convincing a person to donor his voice. If this process is something that could be automatically done, it could be possible to write some messages in some linux "freak" forums, explaining the easy method of having voice recognition for their desktop... What do you thing about this?!? could it be a way in order to persuade to donor his voice. Regards. http://www.voxforge.org/home/forums/message-boards/general-discussion/funtionality-request-for-voxforge Wed, 29 Oct 2008 17:19:24 -0500 New iPhone voice dialing apps http://www.voxforge.org/home/forums/message-boards/general-discussion/new-iphone-voice-dialing-apps Here are a couple of new apps for the iPhone that use Open Source speech recognition engines. Although the apps themselves are closed source, it is interesting to see what can be done with PocketSphinx and Julius. http://www.voxforge.org/home/forums/message-boards/general-discussion/new-iphone-voice-dialing-apps Mon, 29 Sep 2008 21:56:17 -0500 Legal information http://www.voxforge.org/home/forums/message-boards/general-discussion/legal-information Hi everyone, http://www.voxforge.org/home/forums/message-boards/general-discussion/legal-information Tue, 23 Sep 2008 13:47:45 -0500 Translate the VoxForge-applet to your own language. http://www.voxforge.org/home/forums/message-boards/general-discussion/translate-the-voxforge-applet-to-your-own-language You can translate the speech submission applet of VoxForge and make the first prompts for your language now at http://translations.launchpad.net/voxforge Daniël http://www.voxforge.org/home/forums/message-boards/general-discussion/translate-the-voxforge-applet-to-your-own-language Tue, 09 Sep 2008 09:16:12 -0500 FreeCLAS - "Free Commons of Linguistically Annotated Speech". http://www.voxforge.org/home/forums/message-boards/general-discussion/freeclas---free-commons-of-linguistically-annotated-speech From the a comp.speech.research post: FreeCLAS (http://www.ihear.com/FreeCLAS) is a new project to build a a data base of high-quality speech data. "High quality" means annotated data that have been validated by humans. Building such a data base has been expensive because it requires substantial investment of people's attention. As a result, high-quality speech data is not generally available. FreeCLAS uses a wiki. This is a call for people to join the wiki to build it. Embedded in the wiki is a tool, shva, which opens from your browser to let you hear, view and annotate any utterance in FreeCLAS. At this point, there is a seed data base of a small collection of utterances annotated in en-US and IPA. shva and other related software downloadable from FreeCLAS are all Free Software, licensed under GPL or other compatible licenses. The speech data is under the Creative Commons attribute-share-alike license. Their focus seems to be more collecting linguistic annotations of speech by getting users to provide/validate time stamps of utterances. This is a little different what VoxForge is doing. We are basically trying to collect speech prompts (15-20 words long), with little regard for accurate timings - since the HTK/SPhinx acoustic model training process can do this automatically (with short utterances) What is really interesting (from VoxForge standpoint at least) is their ALingA (GPLv3) annotation Java applet. I can't get the app the run on my PC (I have a 64-bit machine, which they don't provide support for...yet). However, from the screen shots, it looks very impressive for a Java applet. They use the JavaFX libraries, which is Sun's answer to creating rich Internet applications (RIAs)... i.e. Sun approach to creating a Flash-like environment. It might be a useful starting point for a speech submission annotation validator for VoxForge (but just to allow other users to validate that an utterance matches the prompt line). Ken http://www.voxforge.org/home/forums/message-boards/general-discussion/freeclas---free-commons-of-linguistically-annotated-speech Thu, 04 Sep 2008 12:33:52 -0500 Making VoxForge corpus useful for ASR research http://www.voxforge.org/home/forums/message-boards/general-discussion/making-voxforge-corpus-useful-for-asr-research Hello, I am posting to share some thoughts regarding ASR research and the planned 1.0 release of the VoxForge corpus. The goal of VoxForge is to create speech corpora for use by the FOSS http://www.voxforge.org/home/forums/message-boards/general-discussion/making-voxforge-corpus-useful-for-asr-research Wed, 03 Sep 2008 21:42:55 -0500 Acoustic model 0.1.2 http://www.voxforge.org/home/forums/message-boards/general-discussion/acoustic-model-0_1_2 When do we release 0.1.2? Ticket 202 (20% of 140 goal is fixed, according to the metrics page we have now 36%, but misses a month of speech submissions => 37%, 38%.) http://www.dev.voxforge.org/projects/Main/ticket/202 I thought 376 (http://www.dev.voxforge.org/projects/Main/ticket/376) is fixed (according to the forum thread.) I don't know about other tickets but 366 http://www.dev.voxforge.org/projects/Main/ticket/366 doesn't seem to be a showstopper for the English acoustic model. "Update Acoustic Model creation scripts and Tutorials (and Howtos) to Julius 4.0" which is supposed to acoustic model 0.1.3, I think that has a bigger priority to my opinion. Let me know what you think about it. (What was it again: "release soon, release often" isn't it? http://www.voxforge.org/home/forums/message-boards/general-discussion/acoustic-model-0_1_2 Mon, 01 Sep 2008 14:50:43 -0500 Different depths of voice http://www.voxforge.org/home/forums/message-boards/general-discussion/different-depths-of-voice Hi there, http://www.voxforge.org/home/forums/message-boards/general-discussion/different-depths-of-voice Wed, 27 Aug 2008 21:34:09 -0500 Processing of speech submissions delayed http://www.voxforge.org/home/forums/message-boards/general-discussion/processing-of-speech-submissions-delayed2 I am travelling for the first 3 weeks of August. Because of this, processing of audio submissions will be delayed until I return (if I have time, I may be able to do some of these remotely). You can still submitt speech - it will just collect on the submission server. I will process all submissions when I get back. thanks, Ken http://www.voxforge.org/home/forums/message-boards/general-discussion/processing-of-speech-submissions-delayed2 Sat, 09 Aug 2008 23:23:07 -0500 Page after submitting. http://www.voxforge.org/home/forums/message-boards/general-discussion/page-after-submitting Hi Ken, http://read.voxforge1.org/r0_1_4/endpage.php is currently in English and links again to http://www.voxforge.org/home/read. I've created the Dutch translation. Bedankt voor je bijdrage! Hieronder is de lijst te zien met bijdragen die momenteel wachten om opgenomen te worden in het corpus van VoxForge. Je spraak zal worden bewerkt en gecontroleerd worden om in het proces van deze avond worden opgenomen. Om de tijd te zien die je aan spraak hebt bijgedragen en hoe dicht we bij ons doel zijn kijk op [url=http://www.voxforge.org/home/downloads/metrics]de statistieken van de spraak-bijdrage[/url] (alleen nog in het Engels.) VoxForge's streven is om minstens 10 - 15 minuten spraak per spraakdonor te verkrijgen om aan ons doel van 140 uur te voldoen voor de eerste versie van de spraakcorpus en akoestische modellen van VoxForge. Om een goede dekking te leveren van de taal hebben we honderden verschillende zinnen aangemaakt. Maak je geen zorgen als je een aantal zinnen al hebt gedoneerd, dat is ook erg handig, een persoon zegt immers nooit twee dezelfde dingen op precies dezelfde manier! [url=http://voxforge.org/nl/read]Klik hier om opnieuw een bijdrage te leveren![/url] En vertel alsjeblieft ook vrienden en familie over VoxForge en vraag ze om donor te worden! Bedankt namens het VoxForge-team. -- http://www.voxforge.org/home/forums/message-boards/general-discussion/page-after-submitting Sat, 09 Aug 2008 07:54:51 -0500 Developing grammar http://www.voxforge.org/home/forums/message-boards/general-discussion/developing-grammar Hi, I work in a project for create an english course online. we are using julius + voxforge for reading exercise . Basically, I created a new file .voca and .grammar, then i compile with mkdfa.pl for generate the files .dfa .dict .term. So, i used these files for recognise the phrase that the student pronounce, then i compare the phrase spoken with the phrase tha i was waiting. The english exercise is an phrase that student listening and after he pronounce the phrase and the julius + voxforge recognise the phrase, if the phrase recognized is equals the phrase listening the exercise is correct. Then, i would know if anybody created grammar, because my grammar is very large and i want tips for otimaze it. Thanks http://www.voxforge.org/home/forums/message-boards/general-discussion/developing-grammar Wed, 06 Aug 2008 12:25:54 -0500 Writing a command and control application with voice recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/writing-a-command-and-control-application-with-voice-recognition The Bloc d’en RainCT blog has a post describing a simple program to control Rhythmbox with your voice on Ubuntu. It uses the Julius speech recognition engine and a VoxForge acoustic model. http://www.voxforge.org/home/forums/message-boards/general-discussion/writing-a-command-and-control-application-with-voice-recognition Sun, 27 Jul 2008 08:51:54 -0500 Application Java http://www.voxforge.org/home/forums/message-boards/general-discussion/application-java Hi, I am developing a application in java for speech recognition and i'm using voxforge with julius for this. Anybody here already do this? thanks... http://www.voxforge.org/home/forums/message-boards/general-discussion/application-java Tue, 22 Jul 2008 08:41:36 -0500 VoxForge User Submissions http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-user-submissions Is it time to grow the volunteer base for voxforge? From what I've read about the problem, what you need is lots of people to record their voice so you can produce acoustic models. Now if you really need people why don't you try and team up with the WikiMedia Foundation, if you can get even a small amount of wiki people to help out you'd vastly increase your incoming audio. That is of course if you do need more people. You may just be looking for more programmers with better ideas for the software and processing side of things. http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-user-submissions Mon, 14 Jul 2008 22:11:03 -0500 untitled http://www.voxforge.org/home/forums/message-boards/general-discussion/untitled btw i run Firefox (but I tried it also with IE on Win XP) http://www.voxforge.org/home/forums/message-boards/general-discussion/untitled Mon, 14 Jul 2008 13:45:48 -0500 Voxforge Submission Applet does not work due to router http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-submission-applet-does-not-work-due-to-router With some routers you can not upload recorded data via the Submission Applet because of the default firewall configuration in the router. I've tested it (I've a Thomson router). Is there a way to fix that in the Submission Applet? http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-submission-applet-does-not-work-due-to-router Thu, 10 Jul 2008 12:00:15 -0500 MyVox Telephony to VoIP Gateway http://www.voxforge.org/home/forums/message-boards/general-discussion/myvox-telephony-to-voip-gateway I just came across the MyVox website, which "turns any phone into a microphone hooked up to your application". It's ad supported, and the audio ads are a little "over-the-top" (although they are short: 5-7 seconds), but it is a very interesting model that might finally get the "Voice Web" up and running with things like speech recognition based web searching, etc., without the need to invest serious money in telephony infrastructure. This might also be interesting from a VoxForge perspective as another alternative to collect speech (maybe modifying trevarthen's VoxForgeIVR app to perform this task). Ken http://www.voxforge.org/home/forums/message-boards/general-discussion/myvox-telephony-to-voip-gateway Thu, 19 Jun 2008 11:29:26 -0500 How do you control license violation? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-do-you-control-license-violation I would like to get your input on the following. As far as I understand, the GPL license doesn't allow non-free derivative works out of a work licensed under GPL. However, I believe that it is impossible to know whether an acoustic model has been compiled out of voxforge's audio corpora. Basically, the creation of an acoustic model requires: pre-processing -> feature vector extraction -> classification For example, an Hidden Markov model is composed of state transition probabilities and of pairs of means and variances for the observation probability distributions... There's no way to be sure that an acoustic model comes from voxforge's audio corpora and the commercial product in question will never have to ship the audio corpora, only the acoustic models... In this respect, how do you control that a commercial product doesn't use your corpora? Thanks for your input Mathieu http://www.voxforge.org/home/forums/message-boards/general-discussion/how-do-you-control-license-violation Wed, 28 May 2008 09:25:56 -0500 Blizzard 2008 listening tests are open http://www.voxforge.org/home/forums/message-boards/general-discussion/blizzard-2008-listening-tests-are-open In order to better understand and compare research techniques in building corpus-based speech synthesizers on the same data, the Blizzard Challenge has been devised. The basic challenge is to take the released speech database, build a synthetic voice from the data and synthesize a prescribed set of test sentences. The sentences from each synthesizer will then be evaluated through listening tests. We're pleased to announce that the Blizzard 2008 listening test is now available. We need your help in getting as many subjects as possible to participate. There are different start pages for various listener types - please take care to use the correct one: English ====== Speech experts: http://groups.inf.ed.ac.uk/blizzard/blizza...egister-ES.html Volunteers: http://groups.inf.ed.ac.uk/blizzard/blizza...egister-ER.html Mandarin ======= Speech experts: http://groups.inf.ed.ac.uk/blizzard/blizza...egister-MS.html Volunteers: http://groups.inf.ed.ac.uk/blizzard/blizza...egister-MR.html Subjects do not need to be native speakers - we gather information about this in the questionnaire at the end of the listening test. The listening test can be completed in one session, or over multiple sessions. If you speak both English and Mandarin, then you may participate in both listening tests if you wish. Please publicise this as widely as possible on your mailing lists, blogs, web pages, or whatever. Also, please ensure that as many members of your research group, other colleagues, students, family, etc participate as possible! We need hundreds of subjects for each language. Remember to direct them to the appropriate start page. Note that we are also running several other listener groups for paid subjects; if you would like to organise a group of paid subjects, then please contact me. We can also set up groups for other specific listener types, if we know there is a large enough pool of available subjects with certain characteristics; contact me if you want to discuss this. Please report problems to blizzard@festvox.org http://www.voxforge.org/home/forums/message-boards/general-discussion/blizzard-2008-listening-tests-are-open Sun, 18 May 2008 12:01:25 -0500 Querying a database using open source voice control software http://www.voxforge.org/home/forums/message-boards/general-discussion/querying-a-database-using-open-source-voice-control-software There is an article on Linux.com (written by Colin Beckingham) that outlines the steps the author took to create a small system that can query a database using speech ... with help from HTK, Julius, Audacity, Festival and the VoxForge tutorial and howto. From the introductory paragraph of the article: http://www.voxforge.org/home/forums/message-boards/general-discussion/querying-a-database-using-open-source-voice-control-software Fri, 16 May 2008 12:21:32 -0500 Conversation with Richard Stallman re: VoxForge and FSF http://www.voxforge.org/home/forums/message-boards/general-discussion/conversation-with-richard-stallman-re-voxforge-and-fsf From Richard Stallman: The Free Software Foundation and the GNU Project would like to help http://www.voxforge.org/home/forums/message-boards/general-discussion/conversation-with-richard-stallman-re-voxforge-and-fsf Wed, 14 May 2008 08:58:30 -0500 Readying VoxForge for high traffic http://www.voxforge.org/home/forums/message-boards/general-discussion/readying-voxforge-for-high-traffic Here is a transcript of an e-mail I have sent to the following just now: Ken McLean, maintainer, VoxForge project CC'd to: Richard Stallman, founder of GNU/Linux, FSF David Huggins Daines, current sphinx maintainer Nickolai (nshm), sphinx developer Hi Ken! My name is Sam. I'm working on redesigning HCI, both at work and in my free time. I'm concerned that GNU/Linux doesn't as yet have any support for continuous speech recognition. I think that within the next 10 years there will be a major shift towards speech recognition, and I think it is important that GNU/Linux is not left out. I am in touch with the sphinx development community, including David Huggins Daines, the current sphinx maintainer. David confirmed that the problem is that there is no sizeable speech database (lots of .wav phrases together with their associated .txt), and hence sphinx cannot generate a decent voice model. I am also in touch with Richard Stallman, founder of GNU/Linux and the FSF. He is willing to publicise VoxForge to the FSF community. This will hit a lot of people, and there are a lot of linux users who really want to contribute but don't know how to code. This could generate a lot of traffic for VoxForge. Can VoxForge handle it? We should discuss this before throwing the gates open. Here are three things that hit me straight away: Firstly, To get people to contribute, it is important to have some simple feedback system. It is the difference between one one hand laying a brick, which disappears, and being told that one day a castle will appear, and on the other laying a brick on a partially built castle. Is there any chance you could include a usage graph on the website main page? x-axis: each pixel is one day y-axis: the number of phrases contributed that day And a thermometer! You know like those thermometers they use at fundraisers? This is how much we need to make a continuous speech recogniser, this is how much we have got... The key is to create a 'we can make it happen' vibe... Once people see the thermometer is starting to heat up I'm sure there will be a lot of people who put hours of effort in. Secondly, I just tried it out, imagining I'm a linux fan who has just seen an article by Richard Stallman in a linux magazine. I log onto Voxforge.org ... I didn't get very far. A dialogue box appeared telling me 'the page you're viewing requires java. More information is available on the Microsoft website.' And that was it. The page it takes me to has a link saying 'Information on the Java Security Warning pop-up', and this is quite a long page with a lot of information. It doesn't offer any solution to my problem; I have not been presented with any option to download the java virtual machine and. So I know better than try to get anything meaningful out of the Microsoft website! I go to google, put in 'download java virtual machine vista'... And have to take it from there. But this is going to put off a lot of people, maybe >90%. Is it possible for the browser to ascertain whether java is installed or not, and if it isn't, offer a link straight to downloading the appropriate java virtual machine executable for the operating system that person is using? ie minimize the amount of clicks and reading required... The third issue is the phrases themselves: where do you get them from? Nickolai (a major sphinx contributor, cc'd) and I were discussing ways to make entering speech more fun, so people would be encouraged to do it. A few ideas: To speak something out loud is a great aid to learning. Maybe we can find some resource of historical & scientific facts song lyrics (may be a bad idea because people would sing instead of speak.. But maybe that would be OK??) Movie scripts. I swear, if you find a good movie script (like Star Trek IV) you will get people who read through the entire movie. Making Voice-books. people can kill two birds with one stone - they can read in a document, or a chapter from a book, creating a Voice-book while adding to the database. Have two text boxes: URL[ ], starting from [ ] So if I put in URL[http://www.chordie.com/chord.pere/www.ultimate-guitar.com/print.php?what=tab&id=456256], starting from [I'm afraid] It starts presenting text from this location, one sentence at a time. hit spacebar to advance. Of course you may need several people to speak the same phrases. If this is true, these ideas could be adapted: you could have a pool of 'this is what the last hundred visitors chose to read out', and next to each one, a number which represents how many people have read from that source. So you can either click something existing, or choose something new. This could be a lot of fun - who knows what songs / movies / literature / jokes people are going to put up? Sam (sunfish7@gmail.com) http://www.voxforge.org/home/forums/message-boards/general-discussion/readying-voxforge-for-high-traffic Tue, 06 May 2008 11:40:04 -0500 Text to Speech http://www.voxforge.org/home/forums/message-boards/general-discussion/text-to-speech2 Would using a text to speech engine to read documents be a legitimate source of generating audio instead of manually reading it? http://www.voxforge.org/home/forums/message-boards/general-discussion/text-to-speech2 Thu, 24 Apr 2008 17:09:31 -0500 Donations? http://www.voxforge.org/home/forums/message-boards/general-discussion/donations Have you considered taking donations, and then using that to fund voice samples through amazon's Mechanical Turk system. I figure that a reasonable low wage of £3(sorry i'm in the uk, i don't know what a reasonable low wage is in the US) per hour of audio, would get interest. I guess it depends on whether there's enough interest for enough donations to be made to fund a significant job on mechanical turk. http://www.voxforge.org/home/forums/message-boards/general-discussion/donations Fri, 18 Apr 2008 14:21:00 -0500 Google App Engine http://www.voxforge.org/home/forums/message-boards/general-discussion/google-app-engine I've been playing with the Google App Engine SDK since it has been released (and am on the waiting list to get on the actual site). It runs Python, and can use the Django framework. It has a 500 meg storage limitation. From the Google App Engine blog: Google App Engine -- a developer tool that enables you to run your http://www.voxforge.org/home/forums/message-boards/general-discussion/google-app-engine Tue, 15 Apr 2008 20:13:04 -0500 Language models for speech and OCR? Grammar checkers? http://www.voxforge.org/home/forums/message-boards/general-discussion/language-models-for-speech-and-ocr-grammar-checkers I have been looking at speech recognition and OCR lately. I also read that language models can help speech recognition engines determine the most likely result from ambiguous input. It occurred to me that this is similar for OCR - OCR guesses letters and then has to determine which word is most likely based on what it thinks it "saw" and what the word is most likely to be. Grammar checkers in word processors must also determine the likelihood of entered text. This may be an off-the-wall suggestion, but would it be sensible for FSF to try and develop a GPLv3 language model that could be used for all three? I would have thought that a good language model was something important to Voxforge's aims (the ability to create speech recognition apps without the need for commercial resources). http://www.voxforge.org/home/forums/message-boards/general-discussion/language-models-for-speech-and-ocr-grammar-checkers Fri, 28 Mar 2008 21:27:22 -0500 Keep VoxForge alive http://www.voxforge.org/home/forums/message-boards/general-discussion/keep-voxforge-alive We need people to do this, to keep VoxForge alive: #1 Blog about VoxForge updates, link to this website on your site, make a screencast, etc etc #2 Tell your friends about VoxForge, tell them to submit some speech if they have some time (not only English people, we need speech from other languages too.) #3 Show your friends Gnome Voice control / Sphinx or something. #4 Submit speech by yourself, or develop things that are important for VoxForge, take a look on the GSoC ideas. #5 Make VoxForge popular by doing things you think they are good! http://www.voxforge.org/home/forums/message-boards/general-discussion/keep-voxforge-alive Tue, 25 Mar 2008 08:00:17 -0500 Links on news http://www.voxforge.org/home/forums/message-boards/general-discussion/links-on-news I don't see it was discussed somewhere btw, it would be nice to have a recommended reading page/thread. It would be nice to have a few documents that can get newbie into speech technology quickly. For a start I suggest to use David Gelbart's collection: http://www.icsi.berkeley.edu/~gelbart/edu.html But the question is a bit different, what resources do you use to track recent news in speech technologies? Probably some feed or blog is a good place to read. Something like http://www.speechtechblog.com for example http://www.voxforge.org/home/forums/message-boards/general-discussion/links-on-news Sun, 23 Mar 2008 03:21:49 -0500 Things to improve Voxforge http://www.voxforge.org/home/forums/message-boards/general-discussion/things-to-improve-voxforge Hi all, I am not a technical guy but I do see lot of short-comings for community participation & the lack of it. It might sound unpleasant or whatever but this is my take on stuff :- 1 . Forums need a usability shot :- The forums are in a bad bad shape. I don't know what forum software are u guys using, but its just not something which many people could use/understand. Most of the forum softwares have a 'search' query thing where people can know if somebody has asked something before, so things are more organised. It should be visible all the time. 2. FAQ :- This FAQ should be visible all the times. An FAQ which answers all the oft. repeated queries such as :- i. How can I record the sounds? ii. Which softwares do I need to record the sounds? iii. How many sound clips are needed? iv. from which countries? v. what accents? vi. any particular mike or hardware which would be useful? and so on & so forth. This would make for huge gains. 3. There needs to be a blog which is accessible from the top itself so people know what new improvements are happening. 4. Break the requirements into small doable targets which are in the form of graph. Also tell what improvements would one have when we touch that target. 5. Make frequent releases of the corpus done & interact & blog the resulting improvements made in the various open source speech recognition engines. 6. Lastly, give alternative ways to do ftp submissions to the site. Give some generic instructions for people using console-based or graphical ftp clients to upload stuff. Perhaps there could be a way to tag them so they are attached to the job number automatically. Feel free to suggest & improve the suggestions :) http://www.voxforge.org/home/forums/message-boards/general-discussion/things-to-improve-voxforge Sat, 22 Mar 2008 00:06:51 -0500 Google Summer of Code 2008 - status http://www.voxforge.org/home/forums/message-boards/general-discussion/google-summer-of-code-2008---status Unfortunately, we did not get accepted to this year's Google Summer of Code 2008 project (an amazing program that offers student developers stipends to write code http://www.voxforge.org/home/forums/message-boards/general-discussion/google-summer-of-code-2008---status Mon, 17 Mar 2008 19:36:10 -0500 Recognise its a voice, without needing to know whats said http://www.voxforge.org/home/forums/message-boards/general-discussion/recognise-its-a-voice-without-needing-to-know-whats-said Hi all. Im looking for something that just needs to recognise when something is being said (not what is said). Not quite silence detection because I want it to be able to distinguish between music and talking. A bit of a tall order? Any views or input greatly appreciated Dylan http://www.voxforge.org/home/forums/message-boards/general-discussion/recognise-its-a-voice-without-needing-to-know-whats-said Sun, 16 Mar 2008 07:30:37 -0500 Sentences from OpenTaal http://www.voxforge.org/home/forums/message-boards/general-discussion/sentences-from-opentaal The project OpenTaal (at opentaal.org) has collected a huge amount of Dutch sentences. OpenTaal is a Dutch project for creating dictionaries, grammar checking, synonyms and much more is coming. They made also 'Wordsharvester', a little app that collects and counts words from all over the web. Here is the link (more than 300 MB): http://opentaal.org/opentaalbank/test/zinnen.tgz They have also made a collection with the most used combinations of words (2,3,4 and 5 words), but that's currently not accessible due to a mysql error. I would greatly thank OpenTaal for the work that they did (and do)! Well, my question is, what can we exactly do with this huge information? How can we implement the information in the best way? Are there any ways of doing this yet? http://www.voxforge.org/home/forums/message-boards/general-discussion/sentences-from-opentaal Sat, 15 Mar 2008 12:20:16 -0500 untitled http://www.voxforge.org/home/forums/message-boards/general-discussion/untitled3 A new thing. http://voxforge.org/uploads/Ve/ei/VeeirBUXntnzL2oUm4s59A/Tekening.svg Open it with inkscape, in ff2 it looks nasty, ff3 crashes http://www.voxforge.org/home/forums/message-boards/general-discussion/untitled3 Fri, 14 Mar 2008 03:38:17 -0500 untitled http://www.voxforge.org/home/forums/message-boards/general-discussion/untitled4 A new one http://voxforge.org/uploads/yr/1r/yr1rPgM2e9PT7OYSztZiuQ/shutupcomputer.png http://www.voxforge.org/home/forums/message-boards/general-discussion/untitled4 Fri, 14 Mar 2008 16:06:08 -0500 Banner / slogan http://www.voxforge.org/home/forums/message-boards/general-discussion/banner-/-slogan I made this one http://voxforge.org/uploads/_5/X7/_5X7KruGxG-NxJkduIQEEg/8015banner.png Maybe a slogan: "Voxforge, because you want it to listen!" Sounds nice by me but I am not a native English speaker :) http://www.voxforge.org/home/forums/message-boards/general-discussion/banner-/-slogan Thu, 13 Mar 2008 13:58:51 -0500 Ekiga http://www.voxforge.org/home/forums/message-boards/general-discussion/ekiga Feature request: use Ekiga for collecting/contributing speech. http://www.voxforge.org/home/forums/message-boards/general-discussion/ekiga Tue, 11 Mar 2008 11:14:17 -0500 Google Summer of Code 2008 - application http://www.voxforge.org/home/forums/message-boards/general-discussion/google-summer-of-code-2008---application http://www.voxforge.org/home/forums/message-boards/general-discussion/google-summer-of-code-2008---application Mon, 03 Mar 2008 23:25:21 -0600 Voxforge buttons and banners and logo's and slogans. http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-buttons-and-banners-and-logos-and-slogans Is there already a page with buttons and banners for VoxForge Like http://www.spreadfirefox.com/?q=affiliates/homepage that people and websites can put it somewhere? The users of Firefox have also increased by this form of advertising and so can VoxForge. By the way, I prefer a more clear logo. Like a microfone http://www.midigraphics.co.kr/upload/img/product/Beta58.jpg or a speaker http://www.webgraffix.com/PSPImages/Speaker.jpg http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge-buttons-and-banners-and-logos-and-slogans Tue, 04 Mar 2008 10:21:38 -0600 Training speaker dependent models from scratch http://www.voxforge.org/home/forums/message-boards/general-discussion/training-speaker-dependent-models-from-scratch I have some english audio lectures from my professor at school I want to use Julius as a continuous dictation ASR to create some rough transcripts for. The lectures have technical terms that a general english language model/dictionary will not suffice. How should I start (from scratch)? What's existing in the public domain that I can use? How do I create a language model? How do I train an acoustic model? How much transcribed data do I need? How much training and testing data should I prepare? http://www.voxforge.org/home/forums/message-boards/general-discussion/training-speaker-dependent-models-from-scratch Sat, 01 Mar 2008 17:53:01 -0600 Use of the Quickstart nightly package with a HTK_AcousticModel nightly http://www.voxforge.org/home/forums/message-boards/general-discussion/use-of-the-quickstart-nightly-package-with-a-htk_acousticmodel-nightly Hello there, I have been using the quickstart linux nightly package for a few days and have ran into problems trying to use the HTK_AcousticModel nightly. From what I can see, the nightly Julius quickstart package contains the hmmdefs and tiedlist from the HTK_AcousticModel already. But julius is still using the sample.dict and sample.dfa of just 23 words. I would like to use the larger 11,000 word dictionary from the HTK_AcousticModel package but I’ve struggled to find any information on preparing Julius-friendly dict & dfa files. Can you point me in the right direction? Kind Regards, Oko http://www.voxforge.org/home/forums/message-boards/general-discussion/use-of-the-quickstart-nightly-package-with-a-htk_acousticmodel-nightly Wed, 27 Feb 2008 05:40:35 -0600 Google Summer of Code 2008 http://www.voxforge.org/home/forums/message-boards/general-discussion/google-summer-of-code-2008 What is Google Summer of Code? http://www.voxforge.org/home/forums/message-boards/general-discussion/google-summer-of-code-2008 Mon, 25 Feb 2008 13:47:55 -0600 Submit your speech and win Ipod Touch http://www.voxforge.org/home/forums/message-boards/general-discussion/submit-your-speech-and-win-ipod-touch I just wonder do we know about this initiative: http://www.voice2type.com/submit_speech https://sourceforge.net/forum/message.php?msg_id=4752651 http://www.voxforge.org/home/forums/message-boards/general-discussion/submit-your-speech-and-win-ipod-touch Thu, 31 Jan 2008 06:56:45 -0600 Forum Post Rating http://www.voxforge.org/home/forums/message-boards/general-discussion/forum-post-rating Hi everyone, In order encourage new users to submit questions/comments to VoxForge, we've decided to remove the "thumbs down" rating for posts in all the forums. You can still "thumbs-up" a post to give quick positive feedback. But if you don't like what was written, you can either post a reply to offer some constructive feedback or just ignore it. thanks, Ken http://www.voxforge.org/home/forums/message-boards/general-discussion/forum-post-rating Mon, 31 Dec 2007 20:32:07 -0600 Language Hebrew http://www.voxforge.org/home/forums/message-boards/general-discussion/language-hebrew Hello, First of all, great job with this project! I think that this project will be one the most popular projects in the open source world! I searched the forums a little bit, and found nothing regards Hebrew, so I guess no one worked on Hebrew yet ... Therefore, I willing to help open and maintain a section for Hebrew (with everything needed, speech recordings etc). However, I really don't know what I need to do in order to start such thing, so every help will be appreciated. Regards, Ofir http://www.voxforge.org/home/forums/message-boards/general-discussion/language-hebrew Sun, 23 Dec 2007 03:16:16 -0600 Testing nightly build of accustic models http://www.voxforge.org/home/forums/message-boards/general-discussion/testing-nightly-build-of-accustic-models Hey everybody, i can't figure out how to setup a grammar that is based on the latest nightly builds of the acoustic models. For testing i have configured a the following voca: % NS_B <s> sil % NS_E </s> sil % COMMAND_START_V TIME t ay m sp DEPARTURE d ix p aa r ch er sp where i have configured two words that i selected from the dict file. here's the grammar: S : NS_B SENT NS_E SENT: COMMAND_START_V if i start julian i get some error msgs regarding missing phones: ###### check configurations ###### initialize input device ###### build up system Reading in HMM definition...(ascii)...limit check passed defined HMMs: 6757 logical names: 8564 in HMMList base phones: 44 used in logical done Making pseudo bi/mono-phone for IW-triphone...1062 added as logical...done reading [grammar/nabaztag.dfa] and [grammar/nabaztag.dict]... Reading in dictionary... line 3: triphone "ay-m+sp" not found line 3: triphone "m-sp+*" or biphone "m-sp" not found > 2 [TIME] t ay m sp line 4: triphone "ch-er+sp" not found line 4: triphone "er-sp+*" or biphone "er-sp" not found > 2 [DEPARTURE] d ix p aa r ch er sp ////// Missing phones: ay-m+sp ch-er+sp er-sp+* or biphone er-sp m-sp+* or biphone m-sp ////////////////////// error in reading grammar/nabaztag.dict: 2 words failed out of 4 words which seems to be correct as they are not listed in the tiedlist file. can anybody give me a hint why these phones are missing? the master prompts files still lists some sentences using both "time" and "depature" sebastian http://www.voxforge.org/home/forums/message-boards/general-discussion/testing-nightly-build-of-accustic-models Mon, 03 Dec 2007 15:57:07 -0600 Gender & T's http://www.voxforge.org/home/forums/message-boards/general-discussion/gender--ts I would like to contribute to this project, however the first question is what gender am I? Well I am a transsexual and do not ascribe to either, and would still like to help by contributing to the voice sample project but am excluded How about being inclusive? http://www.voxforge.org/home/forums/message-boards/general-discussion/gender--ts Mon, 03 Dec 2007 07:48:30 -0600 Make Sure Your Audio Editor Uses libFLAC version 1.2.1 http://www.voxforge.org/home/forums/message-boards/general-discussion/make-sure-your-audio-editor-uses-libflac-version-1_2_1 From research.eeye.com: Multiple Vulnerabilities in .FLAC File Format and Various Media Applications Overview: eEye Digital http://www.voxforge.org/home/forums/message-boards/general-discussion/make-sure-your-audio-editor-uses-libflac-version-1_2_1 Mon, 19 Nov 2007 22:08:41 -0600 Language recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/language-recognition Hi, congratulations for this valuable project! I'd like to start experimenting with voice processing in order to build an automated language recognition engine. I'm very new to this area but I can imagine a simple processing system that performs phoneme extraction from voice file, and then use an n-gram regognition system. Is the "accoustic model" of a language the right starting point for my approach? Vincent http://www.voxforge.org/home/forums/message-boards/general-discussion/language-recognition Mon, 19 Nov 2007 11:38:22 -0600 MojoMove - LibriVox Community Podcast site and forum http://www.voxforge.org/home/forums/message-boards/general-discussion/mojomove---librivox-community-podcast-site-and-forum MojoMove is a new site that houses Podcasts that sometimes get incorporated into the LibriVox community podcast feed. They also have a Forum http://www.voxforge.org/home/forums/message-boards/general-discussion/mojomove---librivox-community-podcast-site-and-forum Mon, 08 Oct 2007 13:05:15 -0500 Childes/Talkbank http://www.voxforge.org/home/forums/message-boards/general-discussion/childes/talkbank Sorry if you know this link already. It seems missing on this site. I've just discovered our friends: http://talkbank.org/ - transcribed adults conversation under GPL http://childes.psy.cmu.edu/ - childrens under GPL http://www.voxforge.org/home/forums/message-boards/general-discussion/childes/talkbank Sat, 20 Oct 2007 08:44:22 -0500 Software Freedom Day - Saturday, September 15th http://www.voxforge.org/home/forums/message-boards/general-discussion/software-freedom-day---saturday-september-15th Software Freedom Day is a global, grassroots effort to educate the http://www.voxforge.org/home/forums/message-boards/general-discussion/software-freedom-day---saturday-september-15th Fri, 14 Sep 2007 08:00:36 -0500 AVIOS Student programming speech/multimodal application programming contest http://www.voxforge.org/home/forums/message-boards/general-discussion/avios-student-programming-speech/multimodal-application-programming-contest The Applied Voice Input/Output Society (AVIOS) has announced their second student application contest. Applications must involve speech input and/or output, but may be pure speech or multimodal. Cash and/or equipment prizes valued at over $1000 will be awarded to teams of student programmers who design and create applications judged by industry experts to be the most robust, useful, creative, innovative, and user friendly. The contest encourages students to develop applications using speech technologies such as automatic speech recognition and text to speech synthesis and to combine them with other modalities. This year students may use any of a variety of platforms, including Microsoft SAPI 5.3 in Windows Vista, CMU's RavenClaw/Olympus, Opera's X+V, Speech Application Language Tags (SALT), Voxeo Prophecy, as well as any of several on-line VoiceXML development environments (BeVocal Cafe, Loquendo Cafe, TellmeStudio, VoiceGenie Developer Workshop, and Voxpilot Voxbuilder). AVIOS president K.W. (Bill) Scholz explains: "Students will build creative and innovative applications that will lead the speech industry forward into new areas. The contest also provides a forum for students to show what they can do with the power of speech applications." Results from last year’s contest and more information about this year’s contest are at http://avios.com/contest.htm http://www.voxforge.org/home/forums/message-boards/general-discussion/avios-student-programming-speech/multimodal-application-programming-contest Wed, 12 Sep 2007 17:10:33 -0500 New submissions will be covered under GPL v3 http://www.voxforge.org/home/forums/message-boards/general-discussion/new-submissions-will-be-covered-under-gpl-v3 Just a note to let you all know that we've changed the license on the VoxForge site to GPLv3. Therefore, any new speech submissions to the VoxForge site will now be covered under GPLv.3. Since all speech submitted to VoxForge thus far included this notice: http://www.voxforge.org/home/forums/message-boards/general-discussion/new-submissions-will-be-covered-under-gpl-v3 Wed, 22 Aug 2007 20:56:04 -0500 Speech Submission Feedback http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-submission-feedback Here is a thread with respect to another user's (very valid) opinion as to the state of the VoxForge Submission System: http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-submission-feedback Sun, 22 Jul 2007 21:41:16 -0500 text to speech http://www.voxforge.org/home/forums/message-boards/general-discussion/text-to-speech can you tell me please how to make txt files into mp3 files of greek language? a list of tools and voices that can do this thanks http://www.voxforge.org/home/forums/message-boards/general-discussion/text-to-speech Fri, 06 Jul 2007 01:47:43 -0500 Speech synthesis using Acoustic Model? http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-synthesis-using-acoustic-model Hi, I am wondering if it is possible to use the HMMs of the trained Acoustic Model to synthesize speech. It should be possible to generate the most likely output sequence of MFCC frames given any input sequence of phonemes. Would this synthesized speech resemble the voice of the speaker who trained the AM (assuming that the AM was trained by a single speaker)? Maybe this is the standard synthesizer method in combined speech recognition & synthesis tools? Can someone point me to examples, possibly with technical description of the synthesizer? If this approach is not recommended, why (bad speech quality, waste of computing time,...)? Thanks John http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-synthesis-using-acoustic-model Mon, 02 Jul 2007 05:41:14 -0500 CMU language modeling toolkit installing in cygwin http://www.voxforge.org/home/forums/message-boards/general-discussion/cmu-language-modeling-toolkit-installing-in-cygwin Hi, I tried to install CMU language modeling toolkit in cygwin, but everytime I got error message. Does anyone know how to install it in cygwin? Best regards, Abdul. http://www.voxforge.org/home/forums/message-boards/general-discussion/cmu-language-modeling-toolkit-installing-in-cygwin Tue, 19 Jun 2007 04:56:54 -0500 New Logo http://www.voxforge.org/home/forums/message-boards/general-discussion/new-logo Many thanks to Zachary Whitley for submitting a stylized SVG version of the old VoxForge logo, which I then converted to 3D using GIMP. Ken http://www.voxforge.org/home/forums/message-boards/general-discussion/new-logo Wed, 30 May 2007 09:35:27 -0500 Unclear cause of errors when using trigram LM in Julius http://www.voxforge.org/home/forums/message-boards/general-discussion/unclear-cause-of-errors-when-using-trigram-lm-in-julius Hello, I have been trying to use HMM models that I've created with HTK and an ARPA LM in Julius (version 3.5.3, multipath compile option enabled, under Linux). Although the LM works without problems in HTK, Julius generates many warnings when parsing the LM and, finally, an error. All warnings are of this type: > Warning: context (z_y:_t_,z_j_@_n_) not exist in LR 2-gram (ignored) and this is the error: > Error: 2-gram has no upper 3-gram, but not 0.0 back-off weight What I don't understand is why HTK doesn't complain and Julius generates many warnings and an error. Does anyone know what these warnings mean? In other words: what is wrong with the LM, and why does Julius complain? I am loading the bigram and trigram LMs with the -nlr and -nrl options. The trigram LM file also contains the unigram and bigram LM. I found out that if I remove all unigrams and bigrams from the trigram file, Julius *does* start up in interactive mode. However, Julius gives an extra warning: > Reading in RL 3-gram... > Warning: 1-gram total num differ! may cause read error > Warning: 2-gram total num differ! may cause read error > reading 1-gram part... I am not sure if this warning is important or not. Can the results be trusted if I start Julius in this way? I have made one modification to the LM I used for HTK, in order to get it working in Julius. The LM used for HTK was in so-called "modified ARPA" (see HTK book) format, in which the back-off weights are optional. Julius doesn't (seem to) support this, so I filled in '0' everywhere a back-off weight was required but not filled in. Is this a good thing to do? Additionally, Julius doesn't load all trigrams: it stops after loading about 60% of all trigrams. Does anyone know why Julius would do this? Is it possible to find out if a triphone is responsible for stopping the loading? > 3-gram read 2500000 (57%) > <cut> > 3-gram read 2592555 end I would really appreciate your help! Best regards, Wout Output from Julius: include config: conf.jconf ###### check configurations ###### build up system Reading in HMM definition...(ascii)...limit check passed defined HMMs: 6849 logical names: 130051 in HMMList base phones: 51 used in logical done Making pseudo bi/mono-phone for IW-triphone...5150 added as logical...done Reading in dictionary... 4996 words...done Reading in LR 2-gram... reading 1-gram part... 1-gram read 4996 end reading 2-gram part... 2-gram read 0 (0%) 2-gram read 100000 (7%) 2-gram read 200000 (15%) <some lines removed> 2-gram read 1300000 (99%) 2-gram read 1306905 end done Reading in RL 3-gram... reading 1-gram part... 1-gram read 4996 end reading 2-gram part... Warning: (E_@_l_,2:_) not exist in LR 2-gram (ignored) Warning: (E_@_n_,2:_) not exist in LR 2-gram (ignored) Warning: (r_o:_,2:_) not exist in LR 2-gram (ignored) Warning: (t_s_e:_,2:_) not exist in LR 2-gram (ignored) <removed 100.000 lines> Warning: (z_a_x_,z_y:_t_) not exist in LR 2-gram (ignored) Warning: (z_i:_p_,z_y:_t_) not exist in LR 2-gram (ignored) Warning: (z_u:_,z_y:_t_) not exist in LR 2-gram (ignored) Warning: (z_y:_,z_y:_t_) not exist in LR 2-gram (ignored) 2-gram read 1306905 end reading 3-gram part... 3-gram read 0 (0%) Warning: context (2:_,Q_u:_6:_) not exist in LR 2-gram (ignored) Warning: context (2:_,Q_u:_6:_) not exist in LR 2-gram (ignored) Warning: context (2:_,Q_u:_6:_) not exist in LR 2-gram (ignored) Warning: context (2:_,d_i:_) not exist in LR 2-gram (ignored) Warning: context (2:_,r_) not exist in LR 2-gram (ignored) Warning: context (6:_,@_) not exist in LR 2-gram (ignored) Warning: context (6:_,Q_I_u:_) not exist in LR 2-gram (ignored) Warning: context (6:_,Q_I_u:_) not exist in LR 2-gram (ignored) Warning: context (6:_,Q_I_u:_) not exist in LR 2-gram (ignored) Warning: context (6:_,Q_I_u:_) not exist in LR 2-gram (ignored) Warning: context (6:_,Q_I_u:_) not exist in LR 2-gram (ignored) Warning: context (6:_,Q_I_u:_) not exist in LR 2-gram (ignored) Warning: context (6:_,Q_I_u:_) not exist in LR 2-gram (ignored) Warning: context (6:_,Q_I_u:_) not exist in LR 2-gram (ignored) <removed 1.500.000 lines> Warning: context (z_y:_t_,z_aI_n_) not exist in LR 2-gram (ignored) Warning: context (z_y:_t_,z_aI_n_) not exist in LR 2-gram (ignored) Warning: context (z_y:_t_,z_e:_6:_n_) not exist in LR 2-gram (ignored) Warning: context (z_y:_t_,z_i:_t_) not exist in LR 2-gram (ignored) Warning: context (z_y:_t_,z_j_@_n_) not exist in LR 2-gram (ignored) Warning: context (z_y:_t_,z_j_@_n_) not exist in LR 2-gram (ignored) Warning: context (z_y:_t_,z_j_@_n_) not exist in LR 2-gram (ignored) Warning: context (z_y:_t_,z_j_@_n_) not exist in LR 2-gram (ignored) Warning: context (z_y:_t_,z_o:_n_) not exist in LR 2-gram (ignored) 3-gram read 2592555 end Error: 2-gram has no upper 3-gram, but not 0.0 back-off weight Terminated http://www.voxforge.org/home/forums/message-boards/general-discussion/unclear-cause-of-errors-when-using-trigram-lm-in-julius Wed, 23 May 2007 08:17:48 -0500 VoxForge -- Ubuntu collaboration http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge----ubuntu-collaboration Hello, I'm the accessibility coordinator for the Ubuntu project. I'd like to see some progress made on the Linux speech recognition front and I think the VoxForge initiative is a great way to start. Pooling resources is the only way to go! I see a few ways we may collaborate. For a start we could work together on GSoC projects. We have a fairly good track record with Google, having mentored about 20 projects each year from 2005. I did two myself in the accessibility field last year and will do three this year. Accessibility is such a narrow field that I feel several projects like Ubuntu, VoxForge, Orca and eSpeak should consider making a common project application to Google for 10-15 places. (btw, I realise that VoxForge is only partly about accessibility) Several of the projects listed in your GSoC forum section would be suitable for collaboration between our projects, and certainly the voice recording client. If we distribute that with Ubuntu (in universe at least) we might see decent participation numbers. It would be great if the same application also facilitated auditing of text-to-speech output. Ubuntu 7.04 just shipped with the eSpeak TTS engine with a handful of languages, but most of them could use some work. I would think parts of that could be recycled in the Dialog Manager GUI as well. I'm currently working on a specification for the speech recognition front-end and will post a link here once I've completed the first draft of it. Henrik http://www.voxforge.org/home/forums/message-boards/general-discussion/voxforge----ubuntu-collaboration Thu, 26 Apr 2007 18:33:58 -0500 dictation http://www.voxforge.org/home/forums/message-boards/general-discussion/dictation hello I am interested to speech dictation for greek is there a program to do it? thanks http://www.voxforge.org/home/forums/message-boards/general-discussion/dictation Mon, 23 Apr 2007 10:32:53 -0500 Simon Dialog Manager and Julian Speech Recognition http://www.voxforge.org/home/forums/message-boards/general-discussion/simon-dialog-manager-and-julian-speech-recognition My post to the Simon SourcForge Forum Hi bedahr, http://www.voxforge.org/home/forums/message-boards/general-discussion/simon-dialog-manager-and-julian-speech-recognition Thu, 19 Apr 2007 10:24:08 -0500 German Speech Recognition Suite (GPL) http://www.voxforge.org/home/forums/message-boards/general-discussion/-german-speech-recognition-suite-gpl Post from Peter Grasch (see original post): http://www.voxforge.org/home/forums/message-boards/general-discussion/-german-speech-recognition-suite-gpl Mon, 16 Apr 2007 11:26:01 -0500 Google Voice Local Search http://www.voxforge.org/home/forums/message-boards/general-discussion/google-voice-local-search http://www.voxforge.org/home/forums/message-boards/general-discussion/google-voice-local-search Sat, 07 Apr 2007 11:37:48 -0500 Asterisk-based User Speech Submission System http://www.voxforge.org/home/forums/message-boards/general-discussion/asterisk-based-user-speech-submission-system Submission by trevarthan (see original post here) http://www.voxforge.org/home/forums/message-boards/general-discussion/asterisk-based-user-speech-submission-system Wed, 04 Apr 2007 10:36:59 -0500 Corpus Thresholds http://www.voxforge.org/home/forums/message-boards/general-discussion/corpus-thresholds Is there already some statistics about the actual size of Corpus being built and an estimate of the "distance" to a working threshold? http://www.voxforge.org/home/forums/message-boards/general-discussion/corpus-thresholds Tue, 03 Apr 2007 11:44:14 -0500 Recognise one sentence and save it.. http://www.voxforge.org/home/forums/message-boards/general-discussion/recognise-one-sentence-and-save-it__ My purpose of using speach recognition is to recognise one sentence, sending the result to another programfile and then exit. I would like to call julian using a systemcommand or some equal from a c-file (OS is linux) and then recognise one sentence, which result is sent to the c-file or saved to a textfile I can load from the c-file afterwards. I am a litte unsure about how to do this, and need help. My surgestion is to generate the grammar and record a voice-file with a matching sentence. Then execute julian from the c-file and make it listen to the voicefile to adjust input. Then it should record voice from a mic and try to recognise input. After this it should send the result or write it to a textfile and then exit.. How can I do this..? - Grant P.s. Sorry if it may have doubleposted this question, it did not seem to work the first time.. http://www.voxforge.org/home/forums/message-boards/general-discussion/recognise-one-sentence-and-save-it__ Wed, 28 Mar 2007 04:33:43 -0500 How to recognise one sentence and then exit http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-recognise-one-sentence-and-then-exit2 My purpose of using speach recognition is to recognise one sentence, sending the result to another programfile and then exit. I would like to call julian using a systemcommand or some equal from a c-file (OS is linux) and then recognise one sentence, which result is sent to the c-file or saved to a textfile I can load from the c-file afterwards. I am a litte unsure about how to do this, and need help. My surgestion is to generate the grammar and record a voice-file with a matching sentence. Then execute julian from the c-file and make it listen to the voicefile to adjust input. Then it should record voice from a mic and try to recognise input. After this it should send the result or write it to a textfile and then exit.. How can I do this..? - Grant http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-recognise-one-sentence-and-then-exit2 Wed, 28 Mar 2007 04:31:20 -0500 How to recognise one sentence and then exit http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-recognise-one-sentence-and-then-exit My purpose of using speach recognition is to recognise one sentence, sending the result to another programfile and then exit. I would like to call julian using a systemcommand or some equal from a c-file (OS is linux) and then recognise one sentence, which result is sent to the c-file or saved to a textfile I can load from the c-file afterwards. I am a litte unsure about how to do this, and need help. My surgestion is to generate the grammar and record a voice-file with a matching sentence. Then execute julian from the c-file and make it listen to the voicefile to adjust input. Then it should record voice from a mic and try to recognise input. After this it should send the result or write it to a textfile and then exit.. How can I do this..? - Grant http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-recognise-one-sentence-and-then-exit Wed, 28 Mar 2007 04:28:54 -0500 Google Summer of Code Application http://www.voxforge.org/home/forums/message-boards/general-discussion/google-summer-of-code-application Hi, The Google Summer of Code Mentor Application site opened Monday (March 5). I http://www.voxforge.org/home/forums/message-boards/general-discussion/google-summer-of-code-application Tue, 06 Mar 2007 00:03:02 -0600 How can I make new words..? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-can-i-make-new-words__ Hi.. I'm working on a project using Julian for speechrecognition and have followed the instruction regarding the grammar, but cannot find out how, if possible, I can create new words not listed in the dictionary-file/lexicon downloaded. I have created the grammarfiles but do not know how I can recognice these words. When I make the acoustic Model I will declare the pronaunsation of them, but compiling the grammar it just complaining about errors regarding these words, how can I help this..? Daniel http://www.voxforge.org/home/forums/message-boards/general-discussion/how-can-i-make-new-words__ Thu, 22 Feb 2007 04:51:00 -0600 Google Summer of Code http://www.voxforge.org/home/forums/message-boards/general-discussion/google-summer-of-code http://www.voxforge.org/home/forums/message-boards/general-discussion/google-summer-of-code Fri, 16 Feb 2007 13:14:58 -0600 SPICE: Speech Processing - Interactive Creation and Evaluation Toolkit for New Languages http://www.voxforge.org/home/forums/message-boards/general-discussion/spice-speech-processing---interactive-creation-and-evaluation-toolkit-for-new-languages http://www.voxforge.org/home/forums/message-boards/general-discussion/spice-speech-processing---interactive-creation-and-evaluation-toolkit-for-new-languages Tue, 13 Feb 2007 09:42:43 -0600 How to get many more contributions http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-get-many-more-contributions If you'd like to encourage many more contributions, how about developing a Macromedia Flash voice recorder and embedding it in your website? This could make it really quick and easy for people to contribute, and persuade many "casual visitors" to record a few of the scripts. Cheers, Jon (www.orangejon.com) http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-get-many-more-contributions Thu, 18 Jan 2007 18:48:25 -0600 Speech Recognition Engine comparison http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-recognition-engine-comparison Hello All, I am just curious to know if it possible to build a good recognition system comparable to commercial ones by Philips, Nuance etc. by using Open source technologies like Juilus, Sphinx etc. I am working hard to get a ASR (Sphinx ) working with Hub4/WSJ but it seems to be a distant fruit. Any wikis or tutorials that help much better? Any one ready to work on a ground-breaking idea using a mash up of Web2.0 Technologies and ASR?? thanks Satish mummsat@iit.edu http://www.voxforge.org/home/forums/message-boards/general-discussion/speech-recognition-engine-comparison Wed, 03 Jan 2007 18:50:30 -0600 can we use th HUB4 or WSJ that's supported by Sphinx? http://www.voxforge.org/home/forums/message-boards/general-discussion/can-we-use-th-hub4-or-wsj-thats-supported-by-sphinx How do i use Hub4 or AN4 or WSJ models/dictionaries that are suported by sphinx? Can we use them with Julius? thanks Satish http://www.voxforge.org/home/forums/message-boards/general-discussion/can-we-use-th-hub4-or-wsj-thats-supported-by-sphinx Mon, 01 Jan 2007 17:24:28 -0600 How to create reverse 3 gram for julius? http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-create-reverse-3-gram-for-julius3 I am able to create a back-off bigram using HTK with Julius uses for its first pass. But I really have no idea how to create the REVERSE 3 gram needed for the second pass. Could anyone shed some light? Thanks. http://www.voxforge.org/home/forums/message-boards/general-discussion/how-to-create-reverse-3-gram-for-julius3 Wed, 20 Dec 2006 23:11:54 -0600 GPL applicable to 'derived hardware'? http://www.voxforge.org/home/forums/message-boards/general-discussion/gpl-applicable-to-derived-hardware Hi, I wonder, how the GPL would be applied to the following situation: A huge number of users like you and me collect a speech corpus that allows to build acoustic models of similar quality compared to commercial systems. Now a hardware developer comes up with some speech recognition tool that is designed to work exactly with VoxForge's model. He doesn't distribute the model with his device, but provides a link to voxforge.org where customers may download the model. - Is this situation covered by GPL? If not, I'm reluctant to spend time on something that others may turn into profit without returning anything to the community. If yes: - Does the GPL require the device developer to distribute his device together with the model sources, OR - Does the GPL reuqire him to distribute his device together with the model sources PLUS any sources and hardware description of his own device? The latter situation is most desirable, because everybody who benefits from the acoustic models has to provide some improvement in return. But I doubt GPL includes anything except for directly derived code (or models). Your opinion? http://www.voxforge.org/home/forums/message-boards/general-discussion/gpl-applicable-to-derived-hardware Sun, 17 Dec 2006 16:10:58 -0600 beginner's questions http://www.voxforge.org/home/forums/message-boards/general-discussion/beginners-questions Hi, congratulations for this valuable project! I'd like to start experimenting with HTK and have a few questions. 1) Where do I find english documentation for Julius? On the Japanese sourceforge site first of all I see a lot of '?????'. Is it an alternative to HTK, or do I need to install it? 2) Will there be any compiled Acoustic Models (trained HMMs)? This is what I thought to find below downloads->acoustic models, instead there are word-phoneme dictionaries. Probably I am misunderstanding something? 3) If I submit speech, do I just have to provide a text transcription, or also a phonetic transcription? What list of phonemes do you use? The format of the text transcriptions seems to differ between the corpora, which rules should I follow? 4) Is there a standard, how the MFCC coefficients are calculated? There are a lot of options concerning frequency bands, triangular/hamming filter functions in mel/linear space etc. Arno http://www.voxforge.org/home/forums/message-boards/general-discussion/beginners-questions Fri, 15 Dec 2006 13:51:05 -0600 svn checkout http://www.voxforge.org/home/forums/message-boards/general-discussion/svn-checkout Where can I svn co the repo I see at http://www.dev.voxforge.org/browser/VoxForge/Trunk http://www.voxforge.org/home/forums/message-boards/general-discussion/svn-checkout Mon, 06 Nov 2006 00:43:25 -0600 How do someone create Language Model Julius/Julian http://www.voxforge.org/home/forums/message-boards/general-discussion/how-do-someone-create-language-model-julius/julian Hi All, http://www.voxforge.org/home/forums/message-boards/general-discussion/how-do-someone-create-language-model-julius/julian Sun, 22 Oct 2006 13:37:03 -0500