VoxForge
morphologically rich pronunciation dictionaries
Current pronunciation dictionaries are simple lists of words and their pronunciations. While this works well for morphologically simple languages such as English, this is unnatural, inefficient and unmaintainable for morphologically richer languages, such as all Roman languages, German, Slavic languages and even more so for agglutinative languages such as Finno-Ugric languages and Turkish.
The purpose of this project is to extend the W3C Pronunciation Lexicon Specification allowing for different parts of speech, define their default derivations and to also support irregular forms if these occur for specific word types.
Solutions should contain an application to define allowed parts of speech and their behaviour for different languages and an ideally flexible application that can be used to build pronunciation resources for these languages.