OLAC Record
oai:www.ldc.upenn.edu:LDC2005S08

Metadata
Title:BBN/AUB DARPA Babylon Levantine Arabic Speech and Transcripts
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:BBN Technologies (with American University of Beirut a subcontractor), et al. BBN/AUB DARPA Babylon Levantine Arabic Speech and Transcripts LDC2005S08. Web Download. Philadelphia: Linguistic Data Consortium, 2005
Contributor:BBN Technologies (with American University of Beirut a subcontractor)
Makhoul, John
Zawaydeh, Bushra
Choi, Frederick
Stallard, David
Date (W3CDTF):2005
Date Issued (W3CDTF):2005-01-15
Description:*Introduction* BBN/AUB DARPA Babylon Levantine Arabic Speech and Transcripts consists of transcribed, spontaneous speech recorded from subjects speaking Levantine colloquial Arabic. Levantine Arabic is the dialect of Arabic spoken in Lebanon, Jordan, Syria, and Palestine. It is significantly different from Modern Standard Arabic. It is a spoken rather than a written language, and includes different words and pronounciations from Modern Standard Arabic. The corpus was developed with funding from the Defense Advanced Research Project Agency (DARPA), as part of the Babylon program. The Babylon program was intended to advance the state of the art in speech-to-speech translation systems by creating new technology and by developing systems for field use. BBN was funded under Babylon to develop a limited English/Arabic refugee/medical speech translation system for a handheld computer, and it collected this corpus as part of its work. The corpus may be useful for speech recognition in Levantine colloquial Arabic, including for speech translation and spoken dialog systems. *Samples* To see an example of this corpus, we have provided a audio sample and transcription.
Format:Sampling Rate: 16000
Sampling Format: pcm
Identifier:LDC2005S08
https://catalog.ldc.upenn.edu/LDC2005S08
ISBN: 1-58563-296-1
ISLRN: 500-300-564-790-5
Language:North Levantine Arabic
South Levantine Arabic
Language (ISO639):apc
ajp
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2005S08
Rights Holder: Portions © 2003 BBNT Solutions LLC, © 2004, 2005 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2005S08
DateStamp:  2019-01-03
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: BBN Technologies (with American University of Beirut a subcontractor); Makhoul, John; Zawaydeh, Bushra; Choi, Frederick; Stallard, David. 2005. Linguistic Data Consortium.
Terms: area_Asia country_JO country_SY dcmi_Sound iso639_ajp iso639_apc olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2005S08
Up-to-date as of: Fri Jun 21 11:27:35 EDT 2019