OLAC Record
oai:catalogue.elra.info:ELRA-E0037

Metadata
Title:CLEF Domain Specific Test Suites (2004-2008) ? Evaluation Package
Abstract:The CLEF Domain Specific Test Suites (2004-2008) contain the data used for the Domain Specific track of the CLEF campaigns carried out from 2004 to 2008. This track tested the performance of monolingual, bilingual and multilingual Information Retrieval (IR) systems on multilingual collections of scientific articles.
Access Rights:Rights available for: Evaluation Use
Date Available (W3CDTF):2010-11-03
Date Issued (W3CDTF):2010-11-03
Date Modified (W3CDTF):2010-11-03
Description:Written Corpora
The Cross-Language Evaluation Forum (CLEF) promotes RThe Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) creating test-suites of reusable data which can be employed by system developers for benchmarking purposes. The CLEF Domain SpecificTest Suites (2004-2008) contain the data used for the Domain Specific track of the CLEF campaigns carried out from 2004 to 2008. This track tested the performance of monolingual, bilingual and multilingual Information Retrieval (IR) systems on multilingual collections of scientific articles. The CLEF Test Suite is composed of: ? Data Collections ? Topics ? Guidelines ? Relevance assessments ? Official campaign results ? Working notes papers The Data Collections consist of the following datasets: ? German Indexing and Retrieval Test database (302,638 documents, 524 Mb): Data collection (social sciences) including a German corpus (151,319 documents) and a pseudo-English corpus which is in fact a translation of the German corpus into English (does not contain as much textual information as the German version). ? Cambridge Scientific Abstracts - Sociological Abstracts (20,000 documents, 38.5 Mb): Database of Sociological Abstracts from Cambridge Scientific Abstracts. ? Russian Social Science Corpus (94,581 documents, 65 Mb): Russian sociology database data from the Russian Social Science Corpus. ? Institute of Scientific Information for Social Sciences (Russian Academy of Science) (145,802 documents, 12 Mb): The INION-ISISS corpus consists of bibliographical data from the ISISS database (03.02.2006) covering economics (~99,000 documents) and social sciences (46,000 documents). The full package consists of 617 Mb and is stored on 1 CD.
Identifier:ELRA-E0037
http://catalog.elra.info/product_info.php?products_id=1128
Language:English
German
Russian
Language (ISO639):eng
deu
rus
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-E0037
DateStamp:  2010-11-03
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2010. ELRA (European Language Resources Association).
Terms: area_Europe country_DE country_GB country_RU dcmi_Text iso639_deu iso639_eng iso639_rus olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-E0037
Up-to-date as of: Mon Feb 27 0:31:38 EST 2017