Corpora available at CU:
On line:
- ATIS corpus:
- Audix corpus: stories read by professional newsreader.
ToBI labeled.
- BDC corpus: Spontaneous and read versions of people
giving directions for getting to places in Cambridge MA.
4 speakers. ToBI labeled. /home/julia/data/bdc/data
- Communicator corpus: Read speech by professional reader.
ToBI labeled.
- Cue phrase corpus: Invited talk by Ron Brachman. Cue
phrases identified.
- ToBI examples: For Windows,
for Linux/Solaris
- TOOT corpus: Spoken dialogue system recordings of
subjects getting train information.
On cassette or dat:
- Cooking dialogues
- Harry Gross shows
Corpora obtainable from the Linguistic Data Consortium
- Broadcast News
- Call Home
- CMU kids corpus
- Santa Barbara corpus
- Trains
-