
Speech Recognition in the News

Command handler using CMU Sphinx
By thomas@jaivox.com - 1/17/2013

The Google API uses Google's own (hidden) speech recognizer.

I have created a simple (one Java file) voice-based command handler for Linux using Sphinx. The current example handles only 10 voice commands, but it can be extended by adding more commands to a text file. It is available at http://www.jaivox.com/speechcommand.html
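
As a rough illustration of keeping commands in a text file, here is a minimal Python sketch. It is not the author's Java code or file format: the "phrase = action" line layout and the function names parse_commands and lookup are assumptions made purely for the example.

```python
def parse_commands(text):
    """Parse 'phrase = action' lines into a dict.

    Hypothetical format: blank lines and lines starting with # are skipped.
    """
    commands = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        phrase, sep, action = line.partition("=")
        if not sep:
            continue  # ignore lines without an '=' separator
        commands[phrase.strip().lower()] = action.strip()
    return commands


def lookup(recognized, commands):
    """Return the action for a recognized phrase, or None if it is unknown."""
    return commands.get(recognized.strip().lower())
```

With a layout like this, adding a command means adding one line to the file; the recognizer output is only ever compared against the parsed phrases.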

Voice recognition on Ubuntu (using Google)!
By bboyjkang - 12/2/2012

Voice recognition on Ubuntu!

A small test to show the ability to use Google's voice recognition from the Ubuntu desktop. In theory this could be made into something like an Ubuntu desktop assistant.

This is just a test, and both accuracy and speed could be a lot better if this were written as a real app rather than a script.

http://www.youtube.com/watch?v=uM2Yb-PwP6o

Voice Recognition on Ubuntu, Part 2!

Previously I showed the ability to do voice recognition with Google's servers without Chrome. Now I have built a working demo to show possible uses of an Ubuntu voice assistant.

http://www.youtube.com/watch?v=0kMWto5enlM

Ubuntu Speech Input
By Fredtechno - 7/13/2012

I have created a system that empowers Ubuntu Desktops with dictation from an Android app.

The site (with forum) is at http://ubuntuspeechinput.zymichost.com/

It is selling well on Google Play, with lots of positive feedback. It is a simple solution for those wanting to dictate to many Ubuntu programs without the hassle of configuring sound cards or spending hours training any software.

Ubuntu HUD, and future plans to include speech recognition
By kmaclean - 1/24/2012 - 1 Reply

From this article: Ubuntu rips up drop-down menus

Ubuntu is set to replace the 30-year-old computer menu system with a “Head-Up Display” that allows users to simply type or speak menu commands.
[...]
Ubuntu plans to integrate voice recognition with HUD in future releases, allowing users to dictate commands to their PC. 

HUD is described as follows:

Basically rather than navigating menus to find an application function, just tap ALT and type what you want the application to do.

Some fuzzy logic matches what you typed with the application menus, and the most relevant commands are displayed.  To complete the action just press return, or select one of the alternative functions presented in the auto-complete. 
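
To make the matching step concrete, here is a minimal Python sketch of ranking menu entries against typed text. It is not Ubuntu's HUD implementation: the menu list, the match_commands name, and the 0.3 similarity cutoff are all assumptions chosen for illustration, and the similarity measure is the standard library's difflib.SequenceMatcher.

```python
import difflib

# Hypothetical menu entries; a real HUD would read these from the running application.
MENU_COMMANDS = [
    "File > Save As",
    "File > Export to PDF",
    "Edit > Undo",
    "Edit > Preferences",
    "View > Zoom In",
]


def match_commands(query, commands=MENU_COMMANDS, limit=3):
    """Return up to `limit` commands ranked by similarity to the typed query."""
    query = query.lower()
    scored = []
    for command in commands:
        # Compare against the final menu label, which is what a user tends to type.
        label = command.split(">")[-1].strip().lower()
        score = difflib.SequenceMatcher(None, query, label).ratio()
        scored.append((score, command))
    # Highest similarity first; drop weak matches (0.3 is an arbitrary cutoff).
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [command for score, command in scored[:limit] if score > 0.3]
```

The first result plays the role of the most relevant command, and the remaining results correspond to the alternatives offered in the auto-complete list.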

From Mark Shuttleworth's blog:

Voice is the natural next step

Searching is fast and familiar, especially once we integrate voice recognition, gesture and touch. We want to make it easy to talk to any application, and for any application to respond to your voice. The full integration of voice into applications will take some time. We can start by mapping voice onto the existing menu structures of your apps. And it will only get better from there.

Julius and online speech recognition
By Leslaw Pawlaczyk - 12/28/2011 - 2 Replies

Hello

I would like to present a new website, which I just launched with help from some of my friends, dedicated to recognizing speech stored in multimedia files. The automatically transcribed speech is then used to create subtitles, which are played using smooth streaming and Silverlight 4.0. The website supports transcription in Polish and English. You can find out more at http://www.silesiaresearch.com. I hope that this website can popularize speech recognition in general and also show the unique benefits of keyword searching in media files.

Thanks

Leslaw Pawlaczyk

gnomeSpeak
By ghanitha - 4/12/2011 - 7 Replies

Hi,

gnomeSpeak is a two-way voice application using GVC and Festival. The prototype is aimed at helping the visually impaired. It currently supports English and Tamil.

I would appreciate your feedback on it.

http://gnomespeak.blogspot.com/2011/03/gnomespeak-lets-both-talk.html

cheers!

Hanitha

Voice control of Windows using Julius
By Leslaw Pawlaczyk - 4/1/2011 - 3 Replies

Hello

My team and I have just released new open source software under the LGPL license for controlling Windows using voice commands. The software uses Julius as its speech recognition engine. Currently we support Polish acoustic models, so anyone with knowledge of a Slavic language is welcome to download it and try it. Once again, thanks go to Prof. Lee and his team for developing Julius.

http://sourceforge.net/projects/skrybotdomowy/files/Releases/InstalatorSkrybotKomendy-1.3.1.294.exe/download

Thank you
Leslaw Pawlaczyk
http://skrybot.pl/en/

Google Chrome 11 beta includes server-based speech recognition
By kmaclean - 3/24/2011

From the Google Chrome blog:

Today, we’re updating the Chrome beta channel with a couple of new capabilities, especially for web developers. Fresh from the work that we’ve been doing with the HTML Speech Incubator Group, we’ve added support for the HTML speech input API. With this API, developers can give web apps the ability to transcribe your voice to text. When a web page uses this feature, you simply click on an icon and then speak into your computer’s microphone. The recorded audio is sent to speech servers for transcription, after which the text is typed out for you.

You can try it out yourself on Google's html5rocks.com website (you need Google Chrome 11 beta installed).

It works on Linux - I tried it on Fedora 14 and Ubuntu 10.04 with no problems.

Open source dictation for Polish
By Leslaw Pawlaczyk - 6/8/2010 - 4 Replies

Hello everyone

I want to announce the first release of an open source project called Skrybot doMowy, which is based on the well-known Julius decoder. This software is the result of three years of research and is an LVCSR dictation system for the Windows platform, available from http://sourceforge.net/projects/skrybotdomowy/

The aim of this software, which is written in C#, is to allow other software engineers to write their own plugins and extensions to the dictation system.

Currently the program supports only Polish acoustic and language models, which makes it usable for dictating emails or simple documents. It has a live view of the microphone input, allowing users to monitor the volume of their speech.

Another aim was to make speech dictation available for free to everyone, with quality similar to commercial programs.

I encourage other researchers and programmers to contact me and potentially develop GUI versions for other languages, as well as acoustic and language models for their native languages. We are considering supporting a British English version of this software soon; however, we still need to develop such models.

More details can be found at http://skrybot.pl/en/

Best regards

Leslaw Pawlaczyk

Google building speech capabilities for browsers
By kmaclean - 5/27/2010 - 1 Reply

According to this InfoWorld article, Google is building speech-recognition technologies not just for Chrome, but for all browsers. 

Ian Fette, product manager for the Google Chrome team, said (at the Google I/O conference in San Francisco late last week): 

We're hoping that the text-to-speech APIs as well as the voice input, voice recognition ship in Chrome but also become a Web standard that is implementable by any browser out there.