COMPAL GS
This page describes the compound alignment gold standard COMPAL GS V1.0, a multilingual (German, English, French, Italian, Spanish) translation spotting resource for German compounds.
The current version contains 362 annotated German compounds (137 distinct types).
Viewing and XML Download
- Version V1.0: XML version for download. 2015-03-28: The beta version was replaced because the translation lexicon was incomplete on the German side.
Acknowledgment
This work was carried out in the context of the SPARCLING project, headed by Marianne Hundt and Martin Volk. The resource was created by Stéphanie Lehner (manual data selection and annotation), Johannes Graën (linguistic data), and Simon Clematide (alignment/visualization software, guidance).
License
The corpus is available under the CC BY 4.0 license: http://creativecommons.org/licenses/by/4.0/
Please cite this work using the following publication: https://doi.org/10.1075/cilt.341.06cle
@incollection{ClematideLehner:2018,
  Address = {Amsterdam and Philadelphia},
  Author = {Clematide, Simon and Lehner, Stéphanie and Graën, Johannes and Volk, Martin},
  Booktitle = {Multiword Units in Machine Translation and Translation Technology},
  Editor = {Ruslan Mitkov and Johanna Monti and Gloria Corpas Pastor and Violeta Seretan},
  Pages = {125--145},
  Publisher = {John Benjamins},
  Series = {Current Issues in Linguistic Theory},
  Title = {A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish},
  Volume = {341},
  Year = {2018}
}