Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
public:pacoco:rumantsch_grischun [2019-07-18 01:25]
tkew created
public:pacoco:rumantsch_grischun [2019-07-22 09:41]
Johannes Graën
Line 1: Line 1:
-====== Rumantsch-Grischun ======+~~NOTOC~~ 
 +====== Rumantsch Grischun ======
  
-The Rumantsch-Grischun corpus contains legal and press texts from the State Chancellory of the Swiss Canton of Graubünden. The corpus is entirely parallel, containing more than 5000 texts in both Romansh (Rumantsch) and German.+The Rumantsch Grischun corpus contains legal and press texts from the State Chancellory of the Swiss Canton of Graubünden. The corpus is entirely parallel, containing more than 5000 texts in both Romansh (Rumantsch) and German.
  
-This corpus proves to be a valuable resource for the low-resource language Romansh.+<div center round important 60%> 
 +In the currently available version, only the legal texts are available. 
 +</div>
  
-^lang ^ tokens ^ types ^ lemmas ^ sents ^ texts ^ +This corpus proves to be a valuable resource for the low-resource language Romansh.
-|de | 432862 | 23813 | 15003 | 28783 | 5641 | +
-|rm | 543173 | 13868 | 7973 | 28811 | 5570 | +
-^Total ^ 976035 ^ 37681 ^ 22976 ^ 57594 ^ 11211 ^+
  
-*Note: in the currently available version, only the legal texts are available.+^ lang   ^ tokens  ^ types  ^ lemmas  ^ sents  ^ texts  ^ 
 +^ de      432862 |  23813 |   15003 |  28783 |   5641 | 
 +^ rm      543173 |  13868 |    7973 |  28811 |   5570 | 
 +^ Total  ^  976035 ^  37681 ^   22976 ^  57594 ^  11211 ^
  
----------+==== Alignment ==== 
 +The corpus has been aligned on the document, sentence and word level.
  
  
 +===== Relevant links =====
  
 +  * Multilingwis example ‹dumonda›: [[mlw>dumonda /corpus=rumantsch_grischun]]

CL Wiki

Institute of Computational Linguistics – University of Zurich