Princeton University Library Catalog

Boston University radio speech corpus.

Format:
Data file
Language:
English
Published/Created:
[Philadelphia, Pennsylvania] : Linguistic Data Consortium, [1996]
Description:
1 online resource
Restrictions note:
Use of these data is restricted to Princeton University students, faculty, and staff for non-commercial statistical analysis and research purposes only.
Summary note:
"The Boston University Radio Speech Corpus was collected primarily to support research in text-to-speech synthesis, particularly generation of prosodic patterns. The corpus consists of professionally read radio news data, including speech and accompanying annotations, suitable for speech and language research.... The main radio news portion of the corpus consists of over seven hours of news stories recorded in the WBUR radio studio during broadcasts over a two year period.... Each story read by an announcer was digitized in paragraph size units.... The paragraphs were annotated with the orthographic transcription, phonetic alignments, part-of-speech tags and prosodic markers."--Resource home page, LDC website.
Notes:
  • Authors: Mari Ostendorf, Patti Price, Stefanie Shattuck-Hufnagel.
  • Data type: sound.
  • Data source: microphone speech.
  • Data language: English.
  • Data accessible via the Data and Statistical Services (DSS) website.
Source of description:
Title from Princeton University's Data and Statistical Services website (viewed on October 5, 2016).
Subject(s):
Form/Genre:
Corpora (Linguistics)
Other title(s):
Boston University radio speech corpus speech and annotations
Title from ReadMe file:
  • Boston University radio news corpus on CD-ROM
ISBN:
  • 9781585630608
  • 1585630608
Publisher no.:
LDC96S36
OCLC:
931778326