Resources in and about the Standard Arabic language


ISO 639-3: arb

The combined catalog of all OLAC participants contains the following resources that are relevant to this language:

Other known names and dialect names: Al-'Arabiyya, Al-Fusha, Arabic, Standard, Classical Arabic, Koranic Arabic, Literary Arabic, Modern Literary Arabic, Modern Standard Arabic, Quranic Arabic

Primary texts

  1. Arabic Newswire Part 1. David Graff and Kevin Walker. 2001. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2001T55
  2. Arabic Treebank: Part 1 v 2.0. Mohamed Maamouri, Ann Bies, Hubert Jin, and Tim Buckwalter. 2003. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2003T06
  3. Arabic Treebank: Part 1 - 10K-word English Translation. Moussa Bamba, Mohamed Maamouri, Hubert Jin, Ann Bies, and Xiaoyi Ma. 2003. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2003T07
  4. Arabic Gigaword. David Graff. 2003. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2003T12
  5. Multiple-Translation Arabic (MTA) Part 1. Kevin Walker, Moussa Bamba, David Miller, Xiaoyi Ma, Chris Cieri, and George Doddington. 2003. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2003T18
  6. Arabic Treebank: Part 2 v 2.0. Mohamed Maamouri, Ann Bies, Tim Buckwalter, and Hubert Jin. 2004. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2004T02
  7. TIDES Extraction (ACE) 2003 Multilingual Training Data. Alexis Mitchell, Stephanie Strassel, Mark Przybocki, JK Davis, George Doddington, Ralph Grishman, Adam Meyers, Ada Brunstein, Lisa Ferro, and Beth Sundheim. 2004. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2004T09
  8. Arabic Treebank: Part 3 v 1.0. Mohamed Maamouri, Ann Bies, Tim Buckwalter, and Hubert Jin. 2004. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2004T11
  9. Arabic News Translation Text Part 1. Xiaoyi Ma, Dalal Zakhary, and Moussa Bamba. 2004. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2004T17
  10. Arabic English Parallel News Part 1. Several. 2004. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2004T18
  11. Prague Arabic Dependency Treebank 1.0. Jan Hajic, Otakar Smrz, Petr Zemanek, Petr Pajas, Jan Snaidauf, Emanuel Beska, Jakub Kracmar, and Kamila Hassanova. 2004. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2004T23
  12. TDT4 Multilingual Broadcast News Speech Corpus. Junbo Kong and David Graff. 2005. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2005S11
  13. Arabic Treebank: Part 1 v 3.0 (POS with full vocalization + syntactic analysis). Mohamed Maamouri (project head), Ann Bies, Tim Buckwalter, and Hubert Jin. 2005. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2005T02
  14. Multiple-Translation Arabic (MTA) Part 2. Xiaoyi Ma. 2005. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2005T05
  15. ACE 2004 Multilingual Training Corpus. Alexis Mitchell, Stephanie Strassel, Shudong Huang, and Ramez Zakhary. 2005. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2005T09
  16. TDT4 Multilingual Text and Annotations. Stephanie Strassel, Junbo Kong, and David Graff. 2005. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2005T16
  17. Arabic Treebank: Part 3 (full corpus) v 2.0 (MPG + Syntactic Analysis). Mohamed Maamouri (project head), Ann Bies, Tim Buckwalter, Hubert Jin, and Wigdan Mekki. 2005. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2005T20
  18. Arabic Treebank: Part 4 v 1.0 (MPG Annotation). Mohamed Maamouri (Project head), Ann Bies, Tim Buckwalter, Hubert Jin, and Wigdan Mekki. 2005. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2005T30
  19. Arabic Broadcast News Speech. Mohamed Maamouri, David Graff, Christopher Cieri,. 2006. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2006S46
  20. Arabic Gigaword Second Edition. David Graff, Ke Chen, Junbo Kong, and Kazuaki Maeda. 2006. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2006T02
  21. ACE 2005 Multilingual Training Corpus. Christopher Walker, Stephanie Strassel, Julie Medero, and Kazuaki Maeda. 2006. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2006T06
  22. TDT5 Multilingual Text. David Graff, Junbo Kong, Kazuaki Maeda, Stephanie Strassel. 2006. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2006T18
  23. TDT5 Topics and Annotations. Meghan Glenn, Stephanie Strassel, Junbo Kong, Kazuaki Maeda. 2006. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2006T19
  24. Arabic Broadcast News Transcripts. Mohamed Maamouri, David Graff, Christopher Cieri. 2006. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2006T20
  25. 2003 NIST Rich Transcription Evaluation Data. Jonathan Fiscus, George Doddington, Audrey Le, Greg Sanders, Mark Przybocki, David Pallett. 2007. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2007S10
  26. GALE Phase 1 Distillation Training. Olga Babko-Malaya, Zhiyi Song, Ramez Zakhary, Julie Medero, Kazuaki Maeda, Stephanie Strassel. 2007. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2007T20
  27. GALE Phase 1 Arabic Broadcast News Parallel Text - Part 1. Xiaoyi Ma, Dalal Zakhary. 2007. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2007T24
  28. Arabic Gigaword Third Edition. David Graff. 2007. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2007T40
  29. TRECVID 2005 Keyframes & Transcripts. Peter Wilkins, Christian Petersohn, Kevin Walker. 2007. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2007V01
  30. GALE Phase 1 Arabic Blog Parallel Text. Xiaoyi Ma, Dalal Zakhary. 2008. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2008T02
  31. GALE Phase 1 Arabic Broadcast News Parallel Text - Part 2. Xiaoyi Ma, Dalal Zakhary. 2008. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2008T09
  32. 2007 NIST Language Recognition Evaluation
    Test Set.
    Alvin Martin, Audrey Le. 2009. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2009S04
  33. GALE Phase 1 Arabic Newsgroup Parallel Text - Part 1. Xiaoyi Ma, Dalal Zakhary and Stephanie Strassel. 2009. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2009T03
  34. 2008 NIST Metrics for Machine Translation (MetricsMATR08) Development Data. Mark Przybocki, Kay Peterson, Sébastien Bronsart. 2009. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2009T05
  35. Unified Linguistic Annotation Text Collection. Linguistic Data Consortium. 2009. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2009T07
  36. Language Understanding Annotation Corpus. Mona Diab, Bonnie Dorr, Lori Levin, Teruko Mitamura, Rebecca Passonneau, Owen Rambow, Lance Ramshaw. 2009. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2009T10
  37. REFLEX Entity Translation Training/DevTest. Christopher Walker, Zhiyi Song, Stephanie Strassel, Julie Medero and Kazuaki Maeda. 2009. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2009T11
  38. Arabic Newswire English Translation Collection. Xiaoyi Ma, Dalal Zakhary. 2009. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2009T22
  39. Arabic Gigaword Fourth Edition. Robert Parker, David Graff, Ke Chen, Junbo Kong, and Kazuaki Maeda. 2009. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2009T30
  40. NIST Open Machine Translation 2008 Evaluation (MT08) Selected Reference and System Translations. NIST Multimodal Information Group. 2010. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2010T01

Lexical resources

  1. Buckwalter Arabic Morphological Analyzer Version 1.0. Tim Buckwalter. 2002. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2002L49
  2. Buckwalter Arabic Morphological Analyzer Version 2.0. Tim Buckwalter. 2004. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2004L02

Language descriptions

  1. Standard ARABIC - statements. Amina CHENTIR. 2009. Laboratoire parole et langage (LPL). oai:crdo.fr:crdo000745
  2. Conflated field structures in Potawatomi and in Arabic. Hollenbach, Barbara E.; Pike, Kenneth L. 1964. International Journal of American Linguistics 30. oai:sil.org:12059
  3. WALS Online Resources for Arabic (Modern Standard). Haspelmath, Martin (editor); Dryer, Matthew S. (editor); Gil, David (editor); Comrie, Bernard (editor). 2008-05-01. Max Planck Digital Library (http://mpdl.mpg.de/). oai:wals.info:languoid/ams

Other resources about the language

  1. English-Arabic Treebank v 1.0. Ann Bies. 2006. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2006T10
  2. ISI Arabic-English Automatically Extracted Parallel Text. Dragos Stefan Munteanu, Daniel Marcu. 2007. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2007T08
  3. OntoNotes Release 3.0. Ralph Weischedel, Sameer Pradhan, Lance Ramshaw, Kaufman, Michelle Franchini, Mohammed El-Bachouti, Nianwen Xue, Martha Palmer, Mitchell Marcus, Ann Taylor, Craig Greenberg, Eduard Hovy, Robert Belvin, Ann Houston. 2009. The LDC Corpus Catalog. oai:www.ldc.upenn.edu:LDC2009T24
  4. An Introduction to Modern Literary Arabic. Cowan, David. 1958. University Press. oai:refdb.wals.info:2245
  5. A New Arabic Grammar of the Written Language. Haywood, J. A.; Nahmad, H. M. 1965. Lund Humphries. oai:refdb.wals.info:2400
  6. The structure of Arabic: from sound to sentence. Nasr, Raja T. 1967. Librairie du Liban. oai:refdb.wals.info:5189
  7. LINGUIST List Resources for Arabic, Standard. Anthony Aristar, Director of Linguist List (editor); Helen Aristar-Dry, Director of Linguist List (editor). 2010-02-16. The LINGUIST List (www.linguistlist.org). oai:linguistlist.org:lang_arb
  8. Arabic, Standard: a language of Saudi Arabia. Lewis, M. Paul (editor). 2009. SIL International (www.sil.org). oai:ethnologue.com:arb
  9. Eritrea national reading survey, September 2002. Davis, Patricia M.; Idris, Saleh Mahmud; Walter, Stephen L.; Tekruray, Daniel Teclemariam; Osman Ali, Mohammed; Okbatsion, Solomon Berhane; Mohammed, Ibrahim; Hoatson, Kristen; Ghebremedhin, Tesfai Bariagaber; Hammadu, Osman; Awed, Sulus Beyed; Ashkaba, John Abraha; Abushush, Dawd Adem; Hoatson, Todd. 2005. Ministry of Education of the State of Eritrea. oai:sil.org:47813
  10. Modern standard Arabic for southern Sudan: Alphabet book. Diku, David; Fodo, William. 1983. Institute of Regional Languages. oai:sil.org:19716
  11. Modern standard Arabic for southern Sudan: Alphabet story book. Diku, David; Fodo, William. 1983. Institute of Regional Languages and Summer Institute of Linguistics. oai:sil.org:19717
  12. The use of Arabic in the Krongo Jebels and at Tabanya. Kilpatrick, Eileen. 1978. Language Survey Series 3. oai:sil.org:21429
  13. Surrey Syncretisms Database. Baerman, Matthew; Brown, Dunstan; Corbett, Greville. 2002. University of Surrey. oai:surrey.smg.surrey.ac.uk:syncretism
  14. Arabic. Laycock, Don (recorder). 1960. Pacific And Regional Archive for Digital Sources in Endangered Cultures (PARADISEC). oai:paradisec.org.au:DL1-038

Other resources in the language

  1. Sura Alla-a (Les noms de Dieu). Haggar, Inouss (editor); Walters, K. (dit Daouda) (editor). 2004. Partenariat Niger. oai:sil.org:45338

Other search terms: dialect, vernacular, discourse, documentation, lexicon, dictionary, vocabulary, wordlist, phrase book, grammar, syntax, morphology, phonology, orthography