Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
public:pacoco:credit_suisse [2019-07-18 22:24] – [Table] Johannes Graënpublic:pacoco:credit_suisse [2023-09-15 20:33] (current) – external edit 127.0.0.1
Line 1: Line 1:
 +~~NOTOC~~
 ====== Credit Suisse ====== ====== Credit Suisse ======
  
Line 6: Line 7:
 The corpus consists of three main subcorpora: Credit Suisse News corpus, Credit Suisse PDF Bulletin corpus and Credit Suisse Bulletin In Print corpus. The corpus consists of three main subcorpora: Credit Suisse News corpus, Credit Suisse PDF Bulletin corpus and Credit Suisse Bulletin In Print corpus.
  
-==== Credit Suisse News corpus ====+ 
 +===== Credit Suisse News corpus =====
  
 The Credit Suisse News Corpus is a collection of news articles from the Credit Suisse web page in four languages (English, French, German, Italian). They range from 2001 to 2017. The Credit Suisse News Corpus is a collection of news articles from the Credit Suisse web page in four languages (English, French, German, Italian). They range from 2001 to 2017.
Line 17: Line 19:
 ^ Total  ^  7883458 ^  279456 ^  126461 ^  419562 ^   6756 ^ ^ Total  ^  7883458 ^  279456 ^  126461 ^  419562 ^   6756 ^
  
 +==== Alignment ====
 +The corpus has been aligned on the document and sentence level.
  
-==== Credit Suisse PDF Bulletin corpus ====+ 
 +===== Credit Suisse PDF Bulletin corpus =====
  
 The Credit Suisse PDF Bulletin Corpus is a collection of magazine articles from the Credit Suisse Bulletin in four languages (English, French, German, Italian). They range from 1998 to 2017. The Credit Suisse PDF Bulletin Corpus is a collection of magazine articles from the Credit Suisse Bulletin in four languages (English, French, German, Italian). They range from 1998 to 2017.
Line 29: Line 34:
 ^ Total  ^  13240987 ^  514928 ^  209723 ^  878098 ^   9050 ^ ^ Total  ^  13240987 ^  514928 ^  209723 ^  878098 ^   9050 ^
  
-==== Credit Suisse Bulletin In Print corpus ====+==== Alignment ==== 
 +The corpus has been aligned on the document and sentence level. 
 + 
 + 
 +===== Credit Suisse Bulletin In Print corpus =====
  
 The Credit Suisse Bulletin In Print Corpus is a collection of magazine articles from the Credit Suisse Bulletin in five languages (English, French, German, Italian, Spanish). They range from 1895 to 1997. The Credit Suisse Bulletin In Print Corpus is a collection of magazine articles from the Credit Suisse Bulletin in five languages (English, French, German, Italian, Spanish). They range from 1895 to 1997.
Line 41: Line 50:
 ^ Total  ^  40532276 ^  1018150 ^  282989 ^  3285925 ^   1633 ^ ^ Total  ^  40532276 ^  1018150 ^  282989 ^  3285925 ^   1633 ^
  
----------+==== Alignment ==== 
 +The corpus has not been aligned yet.
  
-=== Relevant links === 
  
 +===== Publications =====
 +
 +  * Building a Parallel Corpus on the World's Oldest Banking Magazine [[https://www.zora.uzh.ch/id/eprint/125746/|Volk et al. 2016]]
 +
 +
 +===== Relevant links =====
 +  * Multilingwis example ‹rentrer chez soi›: [[mlw>[rentrer chez soi] /corpus=cs]]
   *[[https://www.credit-suisse.com/about-us/en/reports-research/studies-publications/bulletin.html?t=940_0.5668645529305478|Credit Suisse Bulletin]]   *[[https://www.credit-suisse.com/about-us/en/reports-research/studies-publications/bulletin.html?t=940_0.5668645529305478|Credit Suisse Bulletin]]
   *[[https://pub.cl.uzh.ch/projects/b4c/de/korpora.php|Project website]]   *[[https://pub.cl.uzh.ch/projects/b4c/de/korpora.php|Project website]]
 +  *[[https://pub.cl.uzh.ch/projects/sparcling/multilingwis2.demo/#%7B%22queryInput%22%3A%22%22%2C%22selectedCorpusId%22%3A%22cs%22%2C%22options%22%3A%7B%22token_lemma%22%3Atrue%2C%22content_words_only%22%3Afalse%2C%22hitlimit%22%3A1000%2C%22all_hits%22%3Afalse%7D%2C%22inputLanguage%22%3Anull%2C%22autoDetectInputLanguage%22%3Atrue%2C%22searchStarted%22%3Afalse%2C%22selectedVariants%22%3A%7B%7D%2C%22examplePointer%22%3A0%2C%22metadataFilter%22%3A%7B%7D%7D|Search Credit Suisse in Multilingwis]]
 +
  
-=== Publications === 
-  * Building a Parallel Corpus on the World's Oldest Banking Magazine [[https://www.zora.uzh.ch/id/eprint/125746/|Volk et al. 2016]] 

CL Wiki

Institute of Computational Linguistics – University of Zurich