OLAC Record
oai:www.ldc.upenn.edu:LDC2005T08

Metadata
Title:Discourse Graphbank
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Wolf, Florian, et al. Discourse Graphbank LDC2005T08. Web Download. Philadelphia: Linguistic Data Consortium, 2005
Contributor:Wolf, Florian
Gibson, Edward
Fisher, Amy
Knight, Meredith
Date (W3CDTF):2005
Date Issued (W3CDTF):2005-03-15
Description:*Introduction* Discourse Graphbank contains 135 newswire texts totalling 70,000 words annotated with coherence relations. The project, Florian Wolf's PhD thesis, aimed to define a descriptively adequate data structure for representing discourse coherence structures, to investigate the impact of discourse coherence structures on other linguistic processes and natural language applications (e.g. anaphora resolution, summarization and information retrieval), and to develop and test discourse parsing algorithms. *Data* The source data consists of Associated Press and Wall Street Journal newswire data from TIPSTER Complete (LDC93T3A) annotated with coherence relations. The data was annotated by two independent annotators with 88% agreement. The annotators marked 11 types of coherence relations: Resemblance relations (Parallel, Contrast, Example, Generalization, Elaboration); Cause-Effect relations (Explanation, Violated Expectation, Condition); the Temporal Sequence relation; the Attribution relation; and the Same relation. *Samples* For an example of the data in this corpus, please view this sample (JPG). *Updates* None at this time.
Identifier:LDC2005T08
https://catalog.ldc.upenn.edu/LDC2005T08
ISBN: 1-58563-320-8
ISLRN: 983-656-398-539-6
DOI: 10.35111/7snd-y397
Language:English
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2005T08
Rights Holder:Portions © 1988-1990 Associated Press, © 1986-1989 Dow Jones & Company, Inc., © 2005 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2005T08
DateStamp:  2021-11-15
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Wolf, Florian; Gibson, Edward; Fisher, Amy; Knight, Meredith. 2005. Discourse Graphbank. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Text iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2005T08
Up-to-date as of: Mon Mar 25 7:19:46 EDT 2024