1
0
forked from GitHub/gf-rgl
Files
gf-rgl/src/estonian/README.md
2019-01-03 21:38:18 +02:00

38 lines
1.4 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
# Estonian
## Language info
* English name: Estonian
* Autonym: eesti keel
* ISO code: Est
## Authors
* Kaarel Kaljurand, 2011-
* Inari Listenmaa, 2011-
## Publications
* Listenmaa, Inari and Kaljurand, Kaarel (2014) **Computational Estonian Grammar in Grammatical Framework**. Proceedings of the SALTMIL Workshop at LREC2014. [Link](http://ixa2.si.ehu.es/~jipsagak/SALTMIL/LREC_2014_Workshop_Proceedings_Saltmil.pdf)
## Implementation information
### Sources
The noun morphology is based on [Eesti käänamissüsteemi seaduspärasused](http://kjk.eki.ee/ee/issues/2012/6/156) (Kaalep, 2012).
We have automatically constructed a 80k-word lexicon from all the nouns, adjectives and adverbs in EstWN (v67); the verbs of the EstCG lexicon, which provides information on the verb complement and adjunct cases; and the database of multi-word verbs (Kaalep and Muischnek, 2008). Morfessor 2.0 was used for compound word splitting of nouns, and Filosofts morphology tools were used to generate the base forms of nouns and adjectives (6 forms), and verbs (8 forms).
### Testing
Morphology has been tested against Filosoft's morphological analyser.
Additionally, we have ported some of the existing GF ap-
plications (MOLTO Phrasebook (Ranta et al., 2012), ACE-
in-GF (Camilleri et al., 2012)) to Estonian.
## Maintainers
* Kaarel Kaljurand (@kaljurand)
* Inari Listenmaa (@inariksit)