Summary of the paper

Title Arabic, English and French: Three Languages in a Filtering Systems Evaluation Project
Authors Romaric Besançon, Djamel Mostefa, Ismaïl Timimi, Stéphane Chaudiron, Mariama Laïb and Khalid Choukri
Abstract The InFile project (INformation, FILtering, Evaluation) is a cross-language adaptive filtering evaluation campaign, sponsored by the French National Research Agency. The project is organized by CEA-LIST, ELDA and the GERIICO laboratory of the University Lille3. It has an international scope: it was a pilot track of the CLEF 2008 campaign and a main track of the CLEF 2009 campaign. The corpus is a collection of about 1.4 million newswires in three languages (Arabic, English and French), provided by Agence France-Presse (AFP) and selected from a three-year period. The profile corpus (the corpus of requests) consists of 51 profiles, of which 30 concern general news and events (national and international affairs, politics, sports…) and 21 concern scientific and technical information. This paper presents the InFile evaluation paradigm in general and focuses on a study of the Arabic part of the corpus in particular. It also discusses the coverage mismatch between profiles and Arabic documents, and the conceptual and terminological gaps in the transfer between English/French and Arabic.
Topics Exploitation of LRs in different types of applications (information extraction, information retrieval, speech dictation, translation, summarisation, web services, semantic web, etc.),
Monolingual and multilingual LRs,
Evaluation in multilingual Arabic language processing
Full paper Arabic, English and French: Three Languages in a Filtering Systems Evaluation Project
Bibtex @InProceedings{BESANON09.78,
  author = {Romaric Besançon and Djamel Mostefa and Ismaïl Timimi and Stéphane Chaudiron and Mariama Laïb and Khalid Choukri},
  title = {Arabic, English and French: Three Languages in a Filtering Systems Evaluation Project},
  booktitle = {Proceedings of the Second International Conference on Arabic Language Resources and Tools},
  year = {2009},
  month = {April},
  date = {22-23},
  address = {Cairo, Egypt},
  editor = {Khalid Choukri and Bente Maegaard},
  publisher = {The MEDAR Consortium},
  isbn = {2-9517408-5-9},
  language = {english}
  }
