Zmorge - The Zurich Morphological Analyzer for German
Description
Zmorge is a morphology tool that combines a lexicon that is automatically extracted from Wiktionary, and a modified version of the finite-state morphological grammar SMOR.
The extraction script is open source, so that new versions of the lexicon can be extracted from future, expanded versions of Wiktionary.
Modifications to SMOR grammar
- the lexicon, grammar and transducer all use UTF-8 encoding.
-
the output is no longer a derivational analysis, but defines the following as the base form:
- nouns: Nom. Sg. (or Nom. Pl. for plural-only nouns)
- verbs: infinitive
- adjectives: Pos. Adv./Pred.
-
morpheme boundaries are still explicity marked, but using different labels:
- <TRUNC>: marks hyphenation (same as original SMOR)
- <#>: marks compound boundary
- <->: marks joining element (Fugenelement) in compounds
- <~>: marks other morpheme boundary