OLAC Record

Title:AIDA Scenario 1 Practice Topic Annotation
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Tracey, Jennifer, et al. AIDA Scenario 1 Practice Topic Annotation LDC2024T02. Web Download. Philadelphia: Linguistic Data Consortium, 2024
Contributor:Tracey, Jennifer
Strassel, Stephanie
Getman, Jeremy
Bies, Ann
Griffitt, Kira
Graff, David
Caruso, Christopher
Date (W3CDTF):2024
Date Issued (W3CDTF):2024-02-15
Description:*Introduction* AIDA Scenario 1 Practice Topic Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of annotations for 212 English, Russian and Ukrainian web documents (text, image and video) from AIDA Scenario 1 Practice Topic Source Data (LDC2023T11). The DARPA AIDA (Active Interpretation of Disparate Alternatives) program aimed to develop a multi-hypothesis semantic engine to generate explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supported AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages. Each phase of the AIDA program centered on a specific scenario, or broad topic area, with related subtopics designated as either practice subtopics or evaluation subtopics. The Phase 1 scenario focused on political relations between Russia and Ukraine in the 2010s. This corpus contains annotations for the set of practice documents designated for annotation in Phase 1. *Data* Annotations are presented as tab separated files in the following categories for each topic. * Mentions: single references in source data to a real-world entity or filler, event, or relation. There are three mentions tables for each topic, one for entities and fillers, one for relations, and one for events. * Slots: pre-defined roles in an event or relation filled by an argument (entity mention). There are two slots tables per topic, one for relations and one for events. * Linking: entity mentions "linked" to entries in the knowledge base as a method of indicating the real-world entity to which an entity referred. *Sponsorship* This material is based upon work supported by Air Force Research Laboratory (AFRL) and the Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-18-C-0013. *Samples* Please view the following samples: * Mentions * Slots * Linking *Updates* None at this time.
Extent:Corpus size: 12464 KB
ISLRN: 462-429-870-532-3
DOI: 10.35111/ffya-kx44
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2024T02
Rights Holder:Portions © 2024 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2024T02
DateStamp:  2024-06-11
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher. 2024. Linguistic Data Consortium.
Terms: area_Europe country_GB country_RU country_UA dcmi_Text iso639_eng iso639_rus iso639_ukr olac_primary_text

Up-to-date as of: Sun Jun 16 7:35:12 EDT 2024