The Swatchgroup Geschäftsbericht corpus is a parallel corpus of Standard High German and Swiss German dialectal variants.
lang | tokens | types | lemmas | sents | texts |
---|---|---|---|---|---|
de | 77173 | 13561 | 9854 | 5557 | 83 |
gsw | 79628 | 17369 | 17041 | 5557 | 83 |
The corpus has been aligned on the document, sentence and word level.