Click here to register.

English Speech Files

Flat
Post Formatting
User: mfread
Date: 10/12/2006 1:43 pm
Views: 4972
Rating: 30

I have a couple questions regarding post formatting.

 

So far I have posted using the type "text" and copy and pasted content from my template text file (on windows, with windows line endings). I just paste in the prompts text and then copy and paste the whole thing into the submission box. 

 If I preview this, the text is run together without proper line breaks. I also noticed that there are sometimes html tags in the preview (mostly <br />).

If I click preview again without making any changes at all, then two copies of the text appears in the preview window. One ran together, and one formatted properly.

I have tried pasting the template into word and then saving as filtered html, then changing the content type to html but the result is that the first run together text displays first and then the second copy is plain text with all the html tags.

I am unable to view any of my submissions yet, so I don't know how these posts are coming out. I do know that there are definately bugs in the submission process on your side. Probably several.

 

At the very least the form does not recognize line endings properly when previewing, parsing the form, or both. There is probably also a bug related to changing the content type after previewing. And there is something causing the first preview to display regardless of how many subsequent previews you do (the top preview remains whatever you previewed first, not the last thing you previewed).

 

--- (Edited on 10/12/2006 1:43 pm [GMT-0500] by mfread) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: Post Formatting
User: kmaclean
Date: 10/12/2006 2:02 pm
Views: 280
Rating: 23

Hi,

First off, thanks for all the submissions, I will be committing them to our Subversion repository shortly.

Yes there is a problem with the submission system, and I think it is related to people changing the Content Type to "Text" and then copying and pasting from an HTML page thinking that the HTML  would be stripped off (which is what I would have thought my CMS would do ...).  But the best thing is to just leave the Content type set to "Mixed Text and HTML", and that seems to cause the least problems,

I am in the process of putting up the following notice where people submit audio:

Note: please leave your Content Type as "Mixed Text & HTML" in your submission form, and preview your submission before submitting it to make sure everything is OK.

Some have you have changed it to 'text', and it seems like you posted html to your form, and this is causing display problems with your submission (i.e. your post displays incorrectly with HTML code being visible - which can be fixed after I approve the submission )

until I can remove the 'text' option from the submission form. 

Ken 

--- (Edited on 10/12/2006 3:02 pm [GMT-0400] by kmaclean) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: Post Formatting
User: kmaclean
Date: 10/12/2006 2:04 pm
Views: 265
Rating: 18

"And there is something causing the first preview to display regardless of how many subsequent previews you do (the top preview remains whatever you previewed first, not the last thing you previewed)."

That's a new one to me, I'll look into it,

Ken 

--- (Edited on 10/12/2006 3:04 pm [GMT-0400] by kmaclean) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: Post Formatting
User: kmaclean
Date: 10/12/2006 2:08 pm
Views: 258
Rating: 17

This might be a caching issue ... I think some pages only get updated 1 minute after a change is made.
--- (Edited on 10/12/2006 3:08 pm [GMT-0400] by kmaclean) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: Post Formatting
User: kmaclean
Date: 10/12/2006 2:21 pm
Views: 228
Rating: 15

Updated caching on the page to refresh after 10 seconds, but the problem with preview is still there, I'm going to have to look at the Style Template and see if the problem is there.
--- (Edited on 10/12/2006 3:21 pm [GMT-0400] by kmaclean) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: Post Formatting
User: kmaclean
Date: 10/17/2006 9:10 pm
Views: 129
Rating: 9

Style template does not have user modifiable variables - would need to update WebGUI code to get at these variables.  Leave workaround (i.e. content type as "Text & HTML") as is - need to upgrade WebGUI to the 7.0 series.
--- (Edited on 10/17/2006 10:10 pm [GMT-0400] by kmaclean) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: Post Formatting
User: kmaclean
Date: 10/29/2006 8:56 pm
Views: 120
Rating: 15

I think I isolated & fixed this problem - see ticket #97

Ken 

--- (Edited on 10/29/2006 9:56 pm [GMT-0500] by kmaclean) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: Post Formatting
User: kmaclean
Date: 10/12/2006 2:37 pm
Views: 134
Rating: 30

"There is probably also a bug related to changing the content type after previewing."

I was struggling with this problem myself, and finally figured out that I needed to approve the submission before you or I can change the content type.

Ken 

--- (Edited on 10/12/2006 3:37 pm [GMT-0400] by kmaclean) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: Post Formatting
User: kmaclean
Date: 11/7/2006 9:26 am
Views: 117
Rating: 13

Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Previous