From a28eeb58bd2ce2bb1cd198a11acc0800ef0ddddf Mon Sep 17 00:00:00 2001
From: aarne Resource grammar writing HOWTO
Author: Aarne Ranta <aarne (at) cs.chalmers.se>
-Last update: Wed Jan 25 15:04:36 2006
+Last update: Thu Feb 2 00:04:00 2006
+Another reduced API is the +toy Latin grammar +which will be used as a reference when discussing the details. +It is not so usable in practice as the Tutorial API, but it goes +deeper in explaining what parameters and dependencies the principal categories +and rules have. +
BasicGer gives a good set of examples for
those who want to build new lexica.
++Among all categories and functions, there is is a handful of the +most important and distinct ones, of which the others are can be +seen as variations. The categories are +
++ Cl ; VP ; V2 ; NP ; CN ; Det ; AP ; ++
+The functions are +
++ PredVP : NP -> VP -> Cl ; -- predication + ComplV2 : V2 -> NP -> VP ; -- complementization + DetCN : Det -> CN -> NP ; -- determination + ModCN : AP -> CN -> CN ; -- modification ++
+This toy Latin grammar shows in a nutshell how these +rules relate the categories to each other. It is intended to be a +first approximation when designing the parameter system of a new +language. We will refer to the implementations contained in it +when discussing the modules in more detail. +
+So far we just give links to the implementations of each API. -More explanation iś to follow - but many detail implementation tricks -are only found in the cooments of the modules. +More explanations follow - but many detail implementation tricks +are only found in the comments of the modules.
- +
It may be handy to provide a separate module of irregular
@@ -551,7 +586,7 @@ few hundred perhaps. Building such a lexicon separately also
makes it less important to cover everything by the
worst-case paradigms (mkV etc).
You can often find resources such as lists of @@ -586,7 +621,7 @@ When using ready-made word lists, you should think about coyright issues. Ideally, all resource grammar material should be provided under GNU General Public License.
- +This is a cheap technique to build a lexicon of thousands @@ -594,7 +629,7 @@ of words, if text data is available in digital format. See the Functional Morphology homepage for details.
- +Sooner or later it will happen that the resource grammar API @@ -603,7 +638,7 @@ that it does not include idiomatic expressions in a given language. The solution then is in the first place to build language-specific extension modules. This chapter will deal with this issue.
- +Above we have looked at how a resource implementation is built by @@ -621,7 +656,7 @@ use parametrized modules. The advantages are In this chapter, we will look at an example: adding Italian to the Romance family.
- +This is the most demanding form of resource grammar writing. @@ -637,6 +672,6 @@ This chapter will work out an example of how an Estonian grammar is constructed from the Finnish grammar through parametrization.
- +