From e35eb5260df25991165ce62582b0a209285fee6f Mon Sep 17 00:00:00 2001
From: Inari Listenmaa
Date: Fri, 15 Feb 2019 22:12:02 +0100
Subject: [PATCH] (Spa) add README

---
 src/spanish/README.md | 80 +++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 80 insertions(+)
 create mode 100644 src/spanish/README.md

diff --git a/src/spanish/README.md b/src/spanish/README.md
new file mode 100644
index 000000000..90feb51a7
--- /dev/null
+++ b/src/spanish/README.md
@@ -0,0 +1,80 @@
+# Spanish
+
+## Language information
+* English name: Spanish
+* Autonym: español
+* ISO code: Spa
+
+## Authors
+
+* Aarne Ranta
+
+With contributions from Inger Andersson, Bruno Cuconato, Inari Listenmaa, and Therese Söderberg.
+
+## Implementation information
+The Spanish Resource Grammar is built using the Romance functor at
+[romance](../romance/), and it mostly follows the Spanish spoken in Spain.
+
+Romance instantiations differ mostly in their morphological modules
+and in the constructions defined in the `Diff*` modules. Some relevant features:
+
+* two copulas, *ser* and *estar*
+* the prepositions that contract with articles are *de* and *a*, called `genitive` and `dative` respectively. When forming new prepositions, e.g. *después de*, use `mkPrep "después" genitive` to get the correct contractions.
+* no inversion in question clauses, except for `QuestIComp` and `QuestIAdv`.
+
+### Known issues
+
+* Contractions with *con* and pronouns: *con* + *mí*/*ti*/… should be *conmigo*/*contigo*/…, but the contraction is not implemented.
+
+* Clitic pronouns and their combinations:
+
+```haskell
+-- PredVP (UsePron i_Pron) (ComplSlash (SlashVV want_VV (Slash2V3 give_V3 (UsePron he_Pron))) (UsePron it_Pron))
+LangEng: i want to give him it
+LangSpa: yo lo quiero dar &+ le -- should be dárselo
+-- PredVP (UsePron i_Pron) (ComplVV want_VV (ComplSlash (Slash3V3 give_V3 (UsePron it_Pron)) (UsePron he_Pron)))
+LangEng: i want to give him it
+LangSpa: yo quiero dar &+ lo le -- should be dárselo
+```
+
+* Accents in imperatives and infinitives with clitics:
+
+```haskell
+-- ImpVP (ComplSlash (Slash2V3 give_V3 (UsePron i_Pron)) (UsePron it_Pron))
+LangEng: give me it
+LangSpa: da &+ lo me
+```
+
+Even ignoring the clitic combination (which should be *dámelo*), the verb form *da* should get an accent (*dá*) to mark the correct syllable stress.
+
+## Data
+
+### Besch & Irreg
+The [BeschSpa](BeschSpa.gf) and [IrregSpa](IrregSpa.gf) modules contain the complete inflection tables for all the different
+Spanish verb types. They are based on the Functional Morphology [implementation](http://www.cse.chalmers.se/alumni/markus/FM/download/FM_SPA_1.1.tgz) by Inger Andersson and Therese Söderberg.
+
+
+### DictSpa
+This module contains nouns, adjectives and verbs taken from
+[Open Multilingual Wordnet](http://compling.hss.ntu.edu.sg/omw/) and [DictionarySpa](https://github.com/GrammaticalFramework/wide-coverage/blob/master/translator/DictionarySpa.gf) (mostly from Wiktionary).
+
+#### Multiwords
+Several thousand multiwords have been manually checked. As of February 2019, around 4000 remain unchecked. Those are marked in [DictSpa](DictSpa.gf) with `-- to-check segmentation`.
+
+#### Morphology
+Work in progress.
+* Where the old resources used a paradigm with more than one argument, that paradigm is retained.
+* All verbs that overlap with IrregSpa use the IrregSpa definition.
+* Most of the manually checked multiwords have also been checked for morphology, but errors may remain.
+
+## History and applications
+
+Some applications include:
+
+* [MOLTO phrasebook](http://www.molto-project.eu/sites/default/files/everyday.pdf)
+* GF Mathematical Grammar Library ([paper](http://www.molto-project.eu/sites/default/files/gf-mgl.pdf), [demo](http://cloud.grammaticalframework.org/minibar/minibar.html))
+* GF [Wide-Coverage translator](http://cloud.grammaticalframework.org/wc.html)
+
+## Maintainers
+* Inari Listenmaa ([@inariksit](https://github.com/inariksit))
+* Bruno Cuconato ([@odanoburu](https://github.com/odanoburu))