User Tools


Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
public:compal_gs:start [2015-03-09 21:51] – [License] Simon Clematidepublic:compal_gs:start [2023-09-15 20:33] (current) – external edit 127.0.0.1
Line 1: Line 1:
 +====== COMPAL GS ======
 +This page describes the **comp**ound **al**ignment **g**old **s**tandard COMPAL GS V1.0, a multilingual (German, English, French, Italian, Spanish) translation spotting resource for German compounds.
  
 +The current version contains 362 annotated German compounds (137 different types in total).
 +
 +===== Viewing and XML Download =====
 +  * [[http://pub.cl.uzh.ch/corpora/compal_gs/v1.0/svg|Version V1.0 SVG format for manual inspection]]
 +  * [[http://pub.cl.uzh.ch/corpora/compal_gs/v1.0/compal-1.0.xml.zip|Version V1.0 XML version for download]] 2015-03-28: Beta version was replaced because the translation lexicon was not complete on the German side
 +
 +===== Acknowledgment =====
 +This work was carried out in the context of the [[http://www.cl.uzh.ch/research/parallelcorpora/sparcling_en.html|SPARCLING project]], a project headed by [[http://www.es.uzh.ch/Subsites/Personal/mhundt.html|Marianne Hundt]] and [[http://www.cl.uzh.ch/people/team/volk.html|Martin Volk]]. 
 +The resource was created by Stéphanie Lehner (manual data selection and annotation), [[http://www.cl.uzh.ch/people/team/graen.html|Johannes Graën]] (linguistic data), [[http://www.cl.uzh.ch/siclemat|Simon Clematide]] (alignment/visualization software, guidance).
 +
 +
 +===== License =====
 +The corpus is available under the CC BY 4.0 license:
 +http://creativecommons.org/licenses/by/4.0/
 +
 +
 +Please cite this work by the publication
 +https://doi.org/10.1075/cilt.341.06cle
 +
 +
 +<code>
 +@incollection{ClematideLehner:2018,
 + Address = {Amsterdam and Philadephia},
 + Author = {Clematide, Simon and Lehner, Stéphanie and Graën, Johannes and Volk, Martin},
 + Booktitle = {Multiword Units in Machine Translation and Translation Technology},
 + Editor = {Ruslan Mitkov and Johanna Monti and Gloria Corpas Pastor and Violeta Seretan},
 + Pages = {125-145},
 + Publisher = {John Benjamins},
 + Series = {Current Issues in Linguistic Theory},
 + Title = {A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish},
 + Volume = {341},
 + Year = {2018}}
 +</code>
 +===== Contact =====
 +<simon.clematide@uzh.ch>

CL Wiki

Institute of Computational Linguistics – University of Zurich