OLAC Record
oai:catalogue.elra.info:ELRA-E0036

Metadata
Title:CLEF AdHoc-News Test Suites (2004-2008) ? Evaluation Package
Abstract:The CLEF AdHoc-News Test Suites (2004-2008) contain the data used for the main AdHoc track of the CLEF campaigns carried out from 2004 to 2008. This track tested the performance of monolingual, bilingual and multilingual Information Retrieval (IR) systems on multilingual news collections.
Access Rights:Rights available for: Evaluation Use
Date Available (W3CDTF):2010-11-03
Date Issued (W3CDTF):2010-11-03
Date Modified (W3CDTF):2010-11-03
Description:Written Corpora
The Cross-Language Evaluation Forum (CLEF) promotes RThe Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) creating test-suites of reusable data which can be employed by system developers for benchmarking purposes. The CLEF AdHoc-News Test Suites (2004-2008) contain the data used for the main AdHoc track of the CLEF campaigns carried out from 2004 to 2008. This track tested the performance of monolingual, bilingual and multilingual Information Retrieval (IR) systems on multilingual news collections. The CLEF Test Suite is composed of: ? News Data Collections ? Topics ? Guidelines ? Relevance assessments ? Official campaign results ? Working notes papers The News Data Collections consist of the following datasets: ? Bulgarian o Sega 2002 (33,356 documents, 120 Mb) o Standart 2002 (35,839 documents, 93 Mb) ? Czech o Mladna frontaDnes (68,842 documents, 143 Mb) o Lidove Noviny (12,893 documents, 35 Mb) ? Dutch o NRC Handelsblad 1994/95 (84,121 documents, 299 Mb) o Algemeen Dagblad 1994/95 (106,483 documents, 241 Mb) ? English o Glasgow Herald 1995 (56,472 documents, 154 Mb) o Los Angeles Times 1994 (113,005 documents, 425 Mb) o Los Angeles Times 2002 (135,153 documents, 434 Mb) ? Finnish o Aamulehti late-1994/95 (55,344 documents, 137 Mb) ? French o Le Monde 1994 (44,013 documents, 157 Mb) o Le Monde 1995 (47,646 documents, 156 Mb) o SDA French 1994 (43,178 documents, 86 Mb) o SDA French 1995 (42,615 documents, 88 Mb) ? German o Frankfurter Rundschau 1994 (139,715 documents, 320 Mb) o Der Spiegel 1994/95 (13,979 documents, 63 Mb) o SDA German 1994 (71,677 documents, 144 Mb) o SDA German 1995 (69,438 documents, 141 Mb) ? Hungarian o Magyar Hirlap 2002 (49,530 documents, 105 Mb) ? Italian o La Stampa 1994 (58,051 documents, 193 Mb) o SDA Italian 1994 (50,527 documents, 85 Mb) o SDA Italian 1995 (48,980 documents, 85 Mb) ? Persian o Hamshahri 1996-2002 (166,774 documents, 611 Mb) ? Portuguese o P?blico 1994 (51,751 documents, 164 Mb) o P?blico 1995 (55,070 documents, 176 Mb) o Folha de S?o Paulo 1994 (51,875 documents, 108 Mb) o Folha de S?o Paulo 1995 (52,038 documents, 116 Mb) ? Russian o Izvestia 1995 (16,716 documents, 68 Mb) ? Spanish o EFE 1994 (215,738 documents, 509 Mb) o EFE 1995 (238,307 documents, 577 Mb) ? Swedish o Tidningarnas Telegrambyr? 1994/95 (142,819 documents, 352 Mb) The full package consists of 2.43 Gb and is stored on 1 DVD.
Identifier:ELRA-E0036
http://catalog.elra.info/product_info.php?products_id=1127
Language:Bulgarian
Czech
Dutch, Flemish
English
Finnish
French
German
Hungarian
Italian
Persian
Portuguese
Russian
Spanish, Castilian
Swedish
Language (ISO639):bul
ces
nld
eng
fin
fra
deu
hun
ita
fas
por
rus
spa
swe
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-E0036
DateStamp:  2010-11-03
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2010. ELRA (European Language Resources Association).
Terms: area_Europe country_BG country_CZ country_DE country_ES country_FI country_FR country_GB country_HU country_IT country_NL country_PT country_RU country_SE dcmi_Text iso639_bul iso639_ces iso639_deu iso639_eng iso639_fas iso639_fin iso639_fra iso639_hun iso639_ita iso639_nld iso639_por iso639_rus iso639_spa iso639_swe olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-E0036
Up-to-date as of: Fri Jun 23 1:06:10 EDT 2017