From a post by Sam:
ways to make entering speech more fun, so people would be encouraged to do it.
A few ideas:
my reply:
>A few ideas: ...
The main issue with sourcing new text is to determine who owns Copyright (if any ...) in the text being read. Project Gutenberg might be better in this regard.
>1. To speak something out loud is a great aid to learning. Maybe we can find
>some resource of historical & scientific facts
A future release of the Speech Submission Java Applet will allow users to submit the text of their choice - we will still need to figure out how to confirm a submitted text is free from Copyright restrictions.
>song lyrics (may be a bad idea because people would sing instead of
>speak.. But maybe that would be OK??)
I'm not sure how singing might affect things (other than maybe being painful to review ... for my singing at least :) ), but the general rule is that you train with the same audio you want to recognize - so it may be that such audio might only be useful if we want to recognize singing voices.
>Movie scripts. I swear, if you find a good movie script (like Star Trek IV)
>you will get people who read through the entire movie.
These
would likely have Copyright issues - any recording of the dialog of a
Copyrighted movie would be considered a derivative work. Might be able
to find some open source movie scripts that people might be interested
in.
>Making Voice-books. people can kill two birds with one stone - they can
>read in a document, or a chapter from a book, creating a Voice-book while
>adding to the database.
We already do this in association with LibriVox. See these links:
We also did a project with MojoMove411 to short readings (poetry & prose)