Version 3.0 is now available. Go to http://langtech.jrc.it/JRC-Acquis.html to get the latest, extended version.
This is the download page for Version 2.2 of the aligned multilingual corpus JRC-ACQUIS. The dataset contains resources for the following languages: Czech, Danish, German, Greek, English, Spanish, Estonian, Finnish, French, Hungarian, Italian, Lithuanian, Latvian, Maltese, Dutch, Polish, Portuguese, Romanian, Slovak, Slovene, Swedish.
News: Version 2.2 contains alignment data. The ACQUIS corpus has been reduced to those texts that are actually in the original language (for more information, read the "news" page).
By downloading these resources, you agree to the usage conditions.
This multilingual parallel corpus has been compiled by the Language Technology team of the European Commission's Joint Research Centre (JRC) in the context of the workshop "Exploiting parallel corpora in up to 20 languages", held in Arona, Italy, on 26 and 27 September 2005.