GERMAN

UZH Logo b4c Logo

Corpora

Credit Suisse News Corpus

February 2019: This corpus contains around 1600 "News" articles from the Credit Suisse Website in German, French, Italian and English. It is freely available for research. More information in the readme-file.
Download: Credit Suisse News Corpus Release 5.0 (~ 2 million tokens per language, with PoS tags and lemmas; 73 MByte).

Credit Suisse PDF Bulletin Corpus

July 2021: This corpus contains 416 magazines in total. It features approximately 2500 articles each in German, French, and Italian and around 1200 in English. It is freely available for research. More information in the readme-file.
Download: Credit Suisse PDF Bulletin Corpus Release 6.0 (~ 3 million tokens per language, with PoS tags and lemmas; 118 MByte).

Credit Suisse Bulletin In Print Corpus

February 2019: This corpus contains around 700 magazines in German and French, around 100 in English and Italian and 19 in Spanish. It is freely available for research. More information in the readme-file.
Download: Credit Suisse Bulletin In Print Corpus Release 3.0 (~ 15 million tokens de/fr, ~ 5 million tokens en/it, ~ 1 million tokens es; with PoS tags and lemmas; 328 MByte).