Homework Project: LSA 2011, Text to Speech Synthesis
Due:
Submission procedure explained below.
A. TTS Evaluation
1. Propose a set of criteria for evaluating TTS systems and
justify this set. These criteria should be objective and results should
be reproducible. I.e., they should not be based primarily on listener
preferences, although you may also include subjective criteria as a secondary
set. Try to be as creative as possible. Consider many different
kinds of inputs, such as equations, addresses, temporal expressions, and inputs
appropriate to different types of applications. Try to exercise each of
the components of the system insofar as possible: text normalization,
pronunciation, intonation assignment, unit selection, etc.
- Develop
a set of inputs which will test each criterion. There should be a
sufficient number of input text sentences to demonstrate convincingly how
the system performs wrt each.
3.
For each of the following TTS interactive demo
sites: ATT,
Best,
Neospeech,
NaturalReader,
FlameReader, Cepstral,
Verbose,
Ultra Hal,
Acapela,
Nuance Vocalizer. Mary from DFKI,
and more freeware...
Test at least 3
voices for languages you know. Save input text and output .wav files,
appropriately marked, together with the conclusions you draw from each.
Note the major errors you discover.
4.
Provide an overall assessment of each TTS system and, within the system,
of the voices you test. What are the major strengths and weakness?
Which is the best system/voice?
Submit your
files in one zip file either in email or put the file on a website.
|