Name
Provider
Availability
Size
Language(s)
Insertion date
ACE (Automatic Content Extraction) 2005 Corpus LDC from Linguistic Data Consortium 300K words per language English, Chinese, Arabic 2010-09-29
Acquis , UNs , Meedan , LDC2004T17 Freely Available 0.3 GB Arabic,English,Danish 2010-09-29
Collective Action Framing Corpus From Owner English, Modern Standard Arabic, Mandarin Chinese, Spanish 2010-09-29
IR Multilingual Resources at UniNE Freely Available ~hundred of words in stoplists, ~thousand of most frequent words in given language English, French, German, Italian, Spanish, Portuguese, Finnish, Swedish, Arabic, Russian, Hungarian, 2010-09-29
Multilingual Question-Answer Pair Corpus From Owner English, Arabic, Romanian, Spanish, French, Turkish 2010-09-29
MultiUN Freely Available Arabic, Chinese, English, French, Russian, Spanish, German 2010-09-29
NIST Open Machine Translation (OpenMT) Evaluation Join the evaluation and get the corpus English, Chinese, Arabic, Urdu 2010-09-29
Qurany Freely Available Arabic, English 2010-09-29
The Quran and Tafsir Corpus From Owner Arabic, English 2010-09-29
TIDES Extraction (ACE) 2003 Multilingual Training Data LDC From Data Center(s) English, Chinese, Arabic 2010-09-29
ACE 2007 LDC From Data Center(s) Arabic, English 2010-09-29
BTEC IWSLT evaluation 80K sentences English, Chinese, Arabic 2010-09-29
Multilingual MPQA Freely Available 5 MB Arabic, French, German 2010-09-29
Powered by ELDA © 2009 ELDA/ELRA