User Tools


This page describes the compound alignment gold standard COMPAL GS V1.0, a multilingual (German, English, French, Italian, Spanish) translation spotting resource for German compounds.

The current version contains 362 annotated German compounds (137 different types in total).

Viewing and XML Download


This work was carried out in the context of the SPARCLING project, a project headed by Marianne Hundt and Martin Volk. The resource was created by Stéphanie Lehner (manual data selection and annotation), Johannes Graën (linguistic data), Simon Clematide (alignment/visualization software, guidance).


The corpus is available under the CC BY 4.0 license:

Please cite this work by the publication

	Address = {Amsterdam and Philadephia},
	Author = {Clematide, Simon and Lehner, Stéphanie and Graën, Johannes and Volk, Martin},
	Booktitle = {Multiword Units in Machine Translation and Translation Technology},
	Editor = {Ruslan Mitkov and Johanna Monti and Gloria Corpas Pastor and Violeta Seretan},
	Pages = {125-145},
	Publisher = {John Benjamins},
	Series = {Current Issues in Linguistic Theory},
	Title = {A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish},
	Volume = {341},
	Year = {2018}}


CL Wiki

Institute of Computational Linguistics – University of Zurich