OLAC Record
oai:www.ldc.upenn.edu:LDC2014S08

Metadata
Title:United Nations Proceedings Speech
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Chay, Kevin, Cecilia Elizalde, and Michal Ziemski. United Nations Proceedings Speech LDC2014S08. Hard Drive. Philadelphia: Linguistic Data Consortium, 2014
Contributor:Chay, Kevin
Elizalde, Cecilia
Ziemski, Michal
Date (W3CDTF):2014
Date Issued (W3CDTF):2014-10-15
Description:*Introduction* United Nations Proceedings Speech was developed by the United Nations (UN) and contains approximately 8,500 hours of recorded proceedings in the six official UN languages: Arabic, Chinese, English, French, Russian and Spanish. The data was recorded in 2009-2012 from sessions 64-66 of the General Assembly (GA) and First Committee (FC) (Disarmament and International Security), and from meetings 6434-6763 of the Security Council. Recordings were made using a customized system, following a daily instruction circulated internally by the Meetings Management Section. Most of the subjects and information related to a particular meeting or session are published in the UN Journal, available at http://www.un.org/en/documents/journal.asp
*Data* The data is presented as either mp3 or flac-compressed wav files. All files are 16-bit, single-channel audio sampled at either 22,050 Hz or 8,000 Hz, organized by committee and session number, then by language. The folder labeled "Floor" holds the recordings from the microphone used by the particular speaker. Those files may include other languages, for instance if the speaker's language was not among the six official UN languages. File names for GA and FC data take the form LYY_ZZ_format.format, and file names for Security Council data take the form LYYYY_ZZ_format.format, where L is a one-letter language designation, YY (or YYYY) is the meeting number, ZZ is the audio segment number, and format.format is the wav or mp3 designation. Note that not all files are present for every language.
*Samples* Please listen to the following samples: Floor, Arabic, Chinese, English, French, Russian, Spanish.
*Updates* None at this time.
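The file naming convention in the description (LYY_ZZ_format.format and LYYYY_ZZ_format.format) can be split into its parts with a regular expression. The following is a minimal sketch, not part of the corpus documentation. It assumes that the meeting and segment numbers are plain digit runs and that the format token and extension are alphanumeric. The names `NAME_RE`, `ParsedName` and `parse_name`, the sample file name, and the language letter "E" are all hypothetical, since the record does not list the actual letter codes or any real file names.

```python
import re
from typing import NamedTuple, Optional

# Assumed shape: one letter, digits, "_", digits, "_", format token, ".", extension.
# Digit widths are left open because the description gives both LYY and LYYYY forms.
NAME_RE = re.compile(
    r"^(?P<lang>[A-Za-z])(?P<meeting>\d+)_(?P<segment>\d+)"
    r"_(?P<fmt>[A-Za-z0-9]+)\.(?P<ext>[A-Za-z0-9]+)$"
)


class ParsedName(NamedTuple):
    language: str   # one-letter language designation (L)
    meeting: int    # meeting number (YY or YYYY)
    segment: int    # audio segment number (ZZ)
    fmt: str        # format token before the dot
    ext: str        # file extension after the dot


def parse_name(filename: str) -> Optional[ParsedName]:
    """Return the parsed parts, or None if the name does not fit the assumed shape."""
    m = NAME_RE.match(filename)
    if m is None:
        return None
    return ParsedName(
        m["lang"], int(m["meeting"]), int(m["segment"]), m["fmt"], m["ext"]
    )


# Hypothetical file name, invented for illustration only.
print(parse_name("E6434_01_mp3.mp3"))
```

Returning `None` for names that do not match keeps the sketch safe to run over a whole directory, where other files may sit alongside the audio.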
Extent:Corpus size: 510192016 KB
Format:Sampling Rate: 22050
Sampling Format: flac
Identifier:LDC2014S08
https://catalog.ldc.upenn.edu/LDC2014S08
ISBN: 1-58563-693-2
ISLRN: 527-011-778-815-0
Language:English
Mandarin Chinese
Standard Arabic
French
Russian
Spanish
Language (ISO639):eng
cmn
arb
fra
rus
spa
License:United Nations Proceedings Speech: https://catalog.ldc.upenn.edu/license/united-nations-proceedings-speech.pdf
Medium:Distribution: Hard Drive
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Rights Holder:Portions © 2009-2012, 2014 United Nations, © 2014 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2014S08
DateStamp:  2019-01-03
GetRecord:  OAI-PMH request for simple DC format
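The GetRecord entries above refer to OAI-PMH requests for this record. As a rough sketch of how such a request URL is formed, the code below builds the query string from the protocol's `verb`, `identifier` and `metadataPrefix` arguments. The base URL is a placeholder, because this record does not state the repository endpoint. The function name `get_record_url` is hypothetical, and the use of `olac` as the prefix for the OLAC format is an assumption (`oai_dc` is the protocol's standard prefix for simple Dublin Core).

```python
from urllib.parse import urlencode

# Placeholder endpoint: the record does not give the real OAI-PMH base URL.
BASE_URL = "https://example.org/oai"


def get_record_url(identifier: str, metadata_prefix: str,
                   base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# "olac" is an assumed prefix for the OLAC format; "oai_dc" is simple Dublin Core.
print(get_record_url("oai:www.ldc.upenn.edu:LDC2014S08", "olac"))
print(get_record_url("oai:www.ldc.upenn.edu:LDC2014S08", "oai_dc"))
```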

Search Info

Citation: Chay, Kevin; Elizalde, Cecilia; Ziemski, Michal. 2014. Linguistic Data Consortium.
Terms: area_Asia area_Europe country_CN country_ES country_FR country_GB country_RU country_SA dcmi_Sound iso639_arb iso639_cmn iso639_eng iso639_fra iso639_rus iso639_spa olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2014S08
Up-to-date as of: Sun Sep 1 18:18:54 EDT 2019