User Tools


Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Last revisionBoth sides next revision
public:compal_gs:start [2015-03-28 14:10] Simon Clematidepublic:compal_gs:start [2019-01-21 15:47] – [License] Simon Clematide
Line 1: Line 1:
 +====== COMPAL GS ======
 +This page describes the **comp**ound **al**ignment **g**old **s**tandard COMPAL GS V1.0, a multilingual (German, English, French, Italian, Spanish) translation spotting resource for German compounds.
  
 +The current version contains 362 annotated German compounds (137 different types in total).
 +
 +===== Viewing and XML Download =====
 +  * [[http://pub.cl.uzh.ch/corpora/compal_gs/v1.0/svg|Version V1.0 SVG format for manual inspection]]
 +  * [[http://pub.cl.uzh.ch/corpora/compal_gs/v1.0/compal-1.0.xml.zip|Version V1.0 XML version for download]] 2015-03-28: Beta version was replaced because the translation lexicon was not complete on the German side
 +
 +===== Acknowledgment =====
 +This work was carried out in the context of the [[http://www.cl.uzh.ch/research/parallelcorpora/sparcling_en.html|SPARCLING project]], a project headed by [[http://www.es.uzh.ch/Subsites/Personal/mhundt.html|Marianne Hundt]] and [[http://www.cl.uzh.ch/people/team/volk.html|Martin Volk]]. 
 +The resource was created by Stéphanie Lehner (manual data selection and annotation), [[http://www.cl.uzh.ch/people/team/graen.html|Johannes Graën]] (linguistic data), [[http://www.cl.uzh.ch/siclemat|Simon Clematide]] (alignment/visualization software, guidance).
 +
 +
 +===== License =====
 +The corpus is available under the CC BY 4.0 license:
 +http://creativecommons.org/licenses/by/4.0/
 +
 +
 +Please cite this work by the publication
 +https://doi.org/10.1075/cilt.341.06cle
 +
 +
 +<code>
 +@incollection{ClematideLehner:2018,
 + Address = {Amsterdam and Philadephia},
 + Author = {Clematide, Simon and Lehner, Stéphanie and Graën, Johannes and Volk, Martin},
 + Booktitle = {Multiword Units in Machine Translation and Translation Technology},
 + Editor = {Ruslan Mitkov and Johanna Monti and Gloria Corpas Pastor and Violeta Seretan},
 + Pages = {125-145},
 + Publisher = {John Benjamins},
 + Series = {Current Issues in Linguistic Theory},
 + Title = {A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish},
 + Volume = {341},
 + Year = {2018}}
 +</code>
 +===== Contact =====
 +<simon.clematide@uzh.ch>

CL Wiki

Institute of Computational Linguistics – University of Zurich