This is an old revision of the document!


The Text+Berg Corpus

Page from the 1906 SAC yearbook

The Text-Berg corpus is a heritage corpus of alpine and mountaineering texts. Texts have been digitised from the yearbooks of the Swiss Alpine Club, Echo des Alpes and Die Alpen, as well as the British Alpine Club's Alpine Journal. The table below provides an overview of the source material, timespan and languages included in the corpus.

Source Timespan Language(s)
Das Jahrbuch des SAC 1864-1923 de, fr, it, rm, en (mixed)
Das Echo des Alpes 1872-1924 fr
Die Alpen 1925-1956 de, fr, it, rm, en (mixed)
Die Alpen 1957-2011 de, fr (parallel)
The Alpine Journal 1969-2008 en

Being a diarchronic heritage corpus, its development has inspired numerous experiments in order to semantically enrich this corpus as both a historic and a linguistic resource (see below).


Relevant links:

Publications:


CL Wiki

Institute of Computational Linguistics – University of Zurich