OLAC Record

Title:CLEF Domain Specific Test Suites (2004-2008) – Evaluation Package
Access Rights: Rights available for: evaluationUse
Date Available (W3CDTF):2010-11-03
Date Issued (W3CDTF):2010-11-03
Date Modified (W3CDTF):2010-11-03
Description:The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) creating test-suites of reusable data which can be employed by system developers for benchmarking purposes.The CLEF Domain SpecificTest Suites (2004-2008) contain the data used for the Domain Specific track of the CLEF campaigns carried out from 2004 to 2008. This track tested the performance of monolingual, bilingual and multilingual Information Retrieval (IR) systems on multilingual collections of scientific articles.The CLEF Test Suite is composed of: •Data Collections•Topics•Guidelines•Relevance assessments•Official campaign results•Working notes papersThe Data Collections consist of the following datasets: •German Indexing and Retrieval Test database (302,638 documents, 524 Mb):Data collection (social sciences) including a German corpus (151,319 documents) and a pseudo-English corpus which is in fact a translation of the German corpus into English (does not contain as much textual information as the German version).•Cambridge Scientific Abstracts - Sociological Abstracts (20,000 documents, 38.5 Mb):Database of Sociological Abstracts from Cambridge Scientific Abstracts.•Russian Social Science Corpus (94,581 documents, 65 Mb):Russian sociology database data from the Russian Social Science Corpus.•Institute of Scientific Information for Social Sciences (Russian Academy of Science) (145,802 documents, 12 Mb):The INION-ISISS corpus consists of bibliographical data from the ISISS database (03.02.2006) covering economics (~99,000 documents) and social sciences (46,000 documents).The full package consists of 617 Mb and is stored on 1 CD.
ISLRN: 609-362-685-537-2
Identifier (URI):http://catalog.elra.info/en-us/repository/browse/ELRA-E0037/
Language (ISO639):deu
Medium:Not specified
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-E0037
DateStamp:  2010-11-03
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2010. ELRA (European Language Resources Association).
Terms: area_Europe country_DE country_GB country_RU dcmi_Text iso639_deu iso639_eng iso639_rus olac_primary_text

Up-to-date as of: Wed Nov 17 9:14:04 EST 2021