Princeton University Library Catalog
North American news text, complete.
Format
Data file
Language
English
Published/Created
[Philadelphia, Pa.] : Linguistic Data Consortium, [2008]
Description
1 online resource
Availability
Available Online
Princeton University Data and Statistical Services Studies (DSS)
Details
Subject(s)
Newspapers--Language--Databases
English language--Data processing--Databases
Computational linguistics--Databases
Author
Graff, David Andrew, 1962-
Issuing body
Linguistic Data Consortium
Library of Congress genre(s)
Databases
Restrictions note
Use of these data is restricted to Princeton University students, faculty, and staff for non-commercial statistical analysis and research purposes only.
Summary note
"A collection of English news text from the Los Angeles Times, Washington Post, New York Times, Reuters and the Wall Street Journal. This corpus was originally released in 1995 as the North American News Text Corpus (LDC95T21) and is reissued to complement the release of the Brown Laboratory for Linguistic Information Processing (BLLIP) North American News Text sets (LDC2008T13, LDC2008T14), which consist of Penn Treebank-style parsing of that news text. North American News Text is reissued in two versions: North American News Text, Complete LDC2008T15, the members-only original version ... ; and North American News Text, General Release LDC2008T16 (which does not include text from the Wall Street Journal) ... The directory structure of each of these publications has been restructured to be identical to the directory structure of the BLLIP releases."--Resource home page, LDC website.
Notes
Author, David Graff.
Data type: text.
Data source: newswire.
Data language: English.
Issuing body
Made available by the Linguistic Data Consortium as part of their Corpora collection, hosted on the University of Pennsylvania website.
Source of description
Original version record; title from Princeton University's Data and Statistical Services website (viewed on November 8, 2017).
Other title(s)
Linguistic corpora.
Publisher no.
LDC2008T15