1
0
forked from GitHub/gf-core

gf-english.html added (to be web reachable)

This commit is contained in:
aarne
2013-08-29 09:06:08 +00:00
parent b37fed3d2d
commit 010e97fc47
3 changed files with 1700 additions and 42 deletions

File diff suppressed because it is too large Load Diff

View File

@@ -41,7 +41,7 @@ addition, a little bit like the French gender, they have a **classifier** (间 /
This document provides a tour of the digital grammars in the RGL. It is intended to serve at least three kinds of readers.
In the decreasing order of the number of potential readers,
- those who are intested to learn the grammar of a particular language,
- those who want to learn the grammar of some language in a precise way,
- those who want to use the RGL for a programming task,
- those who want to write an RGL grammar for a new language.
@@ -100,6 +100,38 @@ of French nouns: //maison// ("house") //is// feminine, inherently, and there is
indication of an inflectional gender.)
++Syntactic implications++
The features given in the table are rough indications for what one can expect in different languages. Thus,
for instance, some languages have no gender at all, and hence their nouns and adjectives won't have
genders either. But the table is a rather good generalization from the 28 language of the RGL: we can
safely say that, if a language //does// have gender, then nouns have an inherent gender and adjectives have
a variable gender. This is not a coincidence but has to do with **syntax**, that is, the combination of words
into complex expressions. Thus, for instance, nouns are combined with adjectives that modify them, so that
#BECE
//blue// + //house// = //blue house//
#ENCE
Now, adjectives have to be combinable with all nouns, independently of the gender of the noun: there are no
separate classes of masculine and feminine adjectives (again, with some apparent exceptions, such as //pregnant//,
but even these adjectives have at least grammatically correct metaphoric uses with nouns of other genders).
This means that we must be able to pick the gender of the adjective in agreement with the gender of the noun
that it modifies, which means that the gender of adjectives must be inflectional. Thus in French the adjective
for "blue" is //bleu//, with the feminine form //bleue//, and works as follows:
#BECE
//bleu// + //maison// = //maison bleue// ("blue house", feminine)
//bleu// + //livre// = //livre bleu// ("blue book", masculine)
#ENCE
French also provides examples of adjectives with different **positions**: //bleu// is put after the noun
it modifies, whereas //vieux// ("old") is put before the noun: //vieux livre// ("old book").
We will return to syntax later. At this point, it is sufficient to say that the morphological features of
words are not there just for nothing, but they play an important role in how words are combined in syntax.
In particular, they determine to a great extent how **agreement** works, that is, how the features of
words depend on each other in combinations.
++Semantics of the categories++
//Notice: this section, and all "semantics" columns can be safely skipped, because//
@@ -149,38 +181,6 @@ The following table summarizes the most important semantic types that will be us
++Syntactic implications++
The features given in the table are rough indications for what one can expect in different languages. Thus,
for instance, some languages have no gender at all, and hence their nouns and adjectives won't have
genders either. But the table is a rather good generalization from the 28 language of the RGL: we can
safely say that, if a language //does// have gender, then nouns have an inherent gender and adjectives have
a variable gender. This is not a coincidence but has to do with **syntax**, that is, the combination of words
into complex expressions. Thus, for instance, nouns are combined with adjectives that modify them, so that
#BECE
//blue// + //house// = //blue house//
#ENCE
Now, adjectives have to be combinable with all nouns, independently of the gender of the noun: there are no
separate classes of masculine and feminine adjectives (again, with some apparent exceptions, such as //pregnant//,
but even these adjectives have at least grammatically correct metaphoric uses with nouns of other genders).
This means that we must be able to pick the gender of the adjective in agreement with the gender of the noun
that it modifies, which means that the gender of adjectives must be inflectional. Thus in French the adjective
for "blue" is //bleu//, with the feminine form //bleue//, and works as follows:
#BECE
//bleu// + //maison// = //maison bleue// ("blue house", feminine)
//bleu// + //livre// = //livre bleu// ("blue book", masculine)
#ENCE
French also provides examples of adjectives with different **positions**: //bleu// is put after the noun
it modifies, whereas //vieux// ("old") is put before the noun: //vieux livre// ("old book").
We will return to syntax later. At this point, it is sufficient to say that the morphological features of
words are not there just for nothing, but they play an important role in how words are combined in syntax.
In particular, they determine to a great extent how **agreement** works, that is, how the features of
words depend on each other in combinations.
++Subcategorization++
In addition to the features needed for inflection and agreement, the lexicon must give information about //what//
@@ -213,9 +213,9 @@ dative, ablative, and so on).
===Table: subcategories of nouns and adjectives===
|| GF name | text name | example | inherent complement features | semantics ||
| ``N2`` | two-place noun | //brother// (//of someone// | case or preposition | ``e -> n``
| ``N3`` | three-place noun | //distance// (//from some place to some place// | case or preposition | ``e -> e -> n``
| ``A2`` | two-place adjective | //similar// (//to something// | case or preposition | ``e -> e -> t``
| ``N2`` | two-place noun | //brother// (//of someone//) | case or preposition | ``e -> n``
| ``N3`` | three-place noun | //distance// (//from some place to some place//) | case or preposition | ``e -> e -> n``
| ``A2`` | two-place adjective | //similar// (//to something//) | case or preposition | ``e -> e -> t``
Verbs show a particularly rich variation in subcategorization. The most familiar distinction is the one between

View File

@@ -105,8 +105,8 @@ two-place adjective-complement verbs was explained in Chapter 1.
|| GF name | text name | example | inflection features | inherent features | parts | semantics ||
| ``Cl`` | clause | //he paints it blue// | temporal, polarity | (none) | one | ``t``
| ``VP`` | verb phrase | //paints it blue// | temporal, polarity, agreement | subject case | verb, complement | ``e -> t``
| ``VPSlash`` | slash verb phrase | //paints - blue// | temporal, polarity, agreement | subject and complement case | verb, complement | ``e -> e -> t``
| ``VP`` | verb phrase | //paints it blue// | temporal, polarity, agreement | subject case | verb, complements | ``e -> t``
| ``VPSlash`` | slash verb phrase | //paints - blue// | temporal, polarity, agreement | subject and complement case | verb, complements | ``e -> e -> t``
| ``NP`` | noun phrase | //the house// | case | agreement | one | ``(e -> t) -> t``
| ``AP`` | adjectival phrase | //very blue// | gender, numeber, case | position | one | ``a`` = ``e -> t``
@@ -152,18 +152,40 @@ language. But what we can read out from the category table above is as follows:
|| GF name | text name | linearization type ||
| ``Cl`` | clause | ``{s : Temp => Pol => Str}``
| ``VP`` | verb phrase | ``{s : Temp => Pol => Agr => {verb,compl : Str} ; sc : Case}``
| ``VPSlash`` | slash verb phrase | ``{s : Temp => Pol => Agr => {verb,compl : Str} ; sc, cc : Case}``
| ``VP`` | verb phrase | ``{v : V ; c : Agr => Str ; sc : Case}``
| ``VPSlash`` | slash verb phrase | ``{v : V ; c : Agr => Str ; ; sc, cc : Case}``
| ``NP`` | noun phrase | ``{s : Case => Str ; a : Agr}``
| ``AP`` | adjectival phrase | ``{s : Gender => Number => Case => Str ; isPre : Bool}``
TODO explain these types, in particular the use of ``V``
These types suggest the following linearization rules:
```
PredVP np vp = {s = \\t,p => np.s ! vp.sc ++ vps.verb ++ vps.compl where vps = vp.s ! t ! p ! np.a}
PredVP np vp = {s = \\t,p => np.s ! vp.sc ++ vp.v ! t ! p ! np.a ++ vp.c ! np.a}
ComplSlash vpslash np = {v = vpslash.v ; c = \\a => np.s ! vpslash.cc ++ vpslash.c ! a}
SlashV2A v2a ap = {v = v2a ; c = ap.s ! v2a.ac ; cc = v2a.ap}
```
TODO linearization of the example
TODO explain these rules
The linearization of the example goes in a way analogous to the computation of semantics.
It is in both cases **compositional**, which means that the semantics/linearization only
depends on the semantics/linearization of the immediate arguments, not on the tree structure
of those arguments. Assuming the following linearizations of the words,
```
John* : mkPN "John"
paint* : mkV "paint" ** {cc = Acc ; ca = Nom}
the_house* : mkPN "the house"
yellow* : mkA "yellow"
```
we get the linearization of the clause as follows:
```
(PredVP John (ComplSlash (SlashV2A paint yellow) the-house))*
= "John" ++ vp.v ! SgP3 ++ vp.c ! SgP3
where vp = (ComplSlash (SlashV2A paint yellow) the_house)*
= {v = mkV "paint" ; c = \\_ => "the house yellow"}
= "John paints the house yellow"
```
Similar rules as to ``V2A`` apply to all subcategories of verbs. The ``V2`` verbs are first made into ``VPSlash``
by giving the non-NP complement. ``V3`` verbs can take their two NP complements in either order, which