OLAC Record
oai:lindat.mff.cuni.cz:11372/LRT-1181

Metadata
Title:COWAL - combined word aligner
Bibliographic Citation:http://hdl.handle.net/11372/LRT-1181
Contributor:Tufiş, Dan
Ion, Radu
Ceauşu, Alexandru
Ştefănescu, Dan
Date (W3CDTF):2014-07-30T21:27:46Z
Date Available:2014-07-30T21:27:46Z
Description:COWAL is a wrapper of two stand-alone word aligners [[http://www.clarin.eu/tools/yawa-yet-another-word-aligner|YAWA]] and [[http://www.clarin.eu/tools/meba-word-aligner|MEBA]]. COWAL merges the alignments produced by each stand-alone aligner and then uses a trained SVM classifier to prune the unlikely alignment links. The classifier is based on the [[http://www.csie.ntu.edu.tw/~cjlin/papers/quadworkset.pdf|LIBSVM kit]], used with the default parameters (C-SVC classification and radial basis kernel function). The classifier was trained with positive and negative hand-validated examples of word alignment links. With the current F-measure of 83.98%, COWAL won the first place in the lexical alignment competition held with the occasion of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05) workshop on “Building and Using Parallel Texts: Data Driven Machine Translation and Beyond”, Ann Arbor, USA. More detailed descriptions are available in [[http://www.racai.ro/~tufis/papers|the following papers]]: -- Dan Tufiş (2007). Exploiting Aligned Parallel Corpora in Multilingual Studies and Applications. In Toru Ishida, Susan R. Fussell, and Piek T.J.M. Vossen (eds.), Intercultural Collaboration. First International Workshop (IWIC 2007), volume 4568 of Lecture Notes in Computer Science, pp. 103-117. Springer-Verlag, August 2007. ISBN 978-3-540-73999-9. -- -- Dan Tufiş, Radu Ion, Alexandru Ceauşu, and Dan Ştefănescu (2006). Improved Lexical Alignment by Combining Multiple Reified Alignments. In Toru Ishida, Susan R. Fussell, and Piek T.J.M. Vossen (eds.), Proceedings of the 11th Conference EACL2006, pp. 153-160, Trento, Italy, April 2006. Association for Computational Linguistics. ISBN 1-9324-32-61-2. -- Dan Tufiş, Radu Ion, Alexandru Ceauşu, and Dan Ştefănescu (2005). Combined Aligners. In Proceedings of the ACL Workshop on Building and Using Parallel Texts: Data-Driven Machine Translation and Beyond, pp. 107-110, Ann Arbor, USA, June 2005. Association for Computational Linguistics. ISBN 978-973-703-208-9.
Identifier (URI):http://hdl.handle.net/11372/LRT-1181
Language:English
Romanian
Language (ISO639):eng
ron
Publisher:Research Institute for Artificial Intelligence, Romanian Academy of Sciences
Type:toolService
Type (DCMI):Software

OLAC Info

Archive:  LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11372/LRT-1181
DateStamp:  2016-04-06
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Tufiş, Dan; Ion, Radu; Ceauşu, Alexandru; Ştefănescu, Dan. 2014. Research Institute for Artificial Intelligence, Romanian Academy of Sciences.
Terms: area_Europe country_GB country_RO dcmi_Software iso639_eng iso639_ron


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11372/LRT-1181
Up-to-date as of: Sun Nov 26 2:07:45 EST 2017