working on CLT sem

aarne
2006-03-07 21:04:12 +00:00
parent aad2498741
commit 1710d0c995
4 changed files with 97 additions and 45 deletions

@@ -7,7 +7,7 @@
<P ALIGN="center"><CENTER><H1>The GF Resource Grammar Library Version 1.0</H1>
<FONT SIZE="4">
<I>Author: Aarne Ranta &lt;aarne (at) cs.chalmers.se&gt;</I><BR>
Last update: Tue Mar 7 16:01:46 2006
Last update: Tue Mar 7 21:50:28 2006
</FONT></CENTER>
<P>
@@ -145,6 +145,7 @@ Linguistics
</P>
<UL>
<LI>definition of linguistic ontology
<LI>describing language on this level of abstraction
<LI>coping with different problems in different languages
<LI>sharing concrete-syntax code between languages
<LI>creating a resource for other NLP applications
@@ -207,6 +208,8 @@ Computer science
<P>
Janna Khegai (Russian modules, forthcoming),
Bjorn Bringert (many Swadesh lexica),
Inger Andersson and Therese Söderberg (Spanish morphology),
Ludmilla Bogavac (Russian morphology),
Carlos Gonzalia (Spanish cardinals),
Partik Jansson (Swedish cardinals),
Aarne Ranta.
@@ -242,6 +245,10 @@ CLE (Core Language Engine,
<LI>grammar specialization via explanation-based learning
</UL>
<P>
<!-- NEW -->
</P>
<H3>Slightly less related work</H3>
<P>
<A HREF="http://www.delph-in.net/matrix/">LinGO Grammar Matrix</A>
</P>
@@ -315,7 +322,7 @@ Basic lexicon
</P>
<UL>
<LI>100 structural words
<LI>350 content words, mainly for testing
<LI>340 content words, mainly for testing
<LI>these include the 207 <A HREF="http://en.wiktionary.org/wiki/Swadesh_List">Swadesh words</A>
</UL>
@@ -341,7 +348,7 @@ declaratives, questions, imperatives, vocatives
</P>
<P>
Tense, mood, and polarity:
present, past, future, conditional ; similtaneous, anterior ; positive, negative
present, past, future, conditional ; simultaneous, anterior ; positive, negative
</P>
<P>
Questions:
@@ -373,10 +380,10 @@ proper names, pronouns, determiners, possessives, cardinals and ordinals
100 structural words
</P>
<P>
350 content words in a test lexicon
340 content words in a test lexicon
</P>
<P>
Lines of source code (4/3/2006):
35 kLines of source code (4/3/2006):
</P>
<PRE>
abstract 1131
@@ -407,20 +414,20 @@ Lines of source code (4/3/2006):
<PRE>
John walks.
TFullStop : Phr -&gt; Text -&gt; Text
(PhrUtt : PConj -&gt; Utt -&gt; Voc -&gt; Phr
NoPConj
(UttS : S -&gt; Utt
TFullStop : Phr -&gt; Text -&gt; Text | TQuestMark, TExclMark
(PhrUtt : PConj -&gt; Utt -&gt; Voc -&gt; Phr | PhrYes, PhrNo, ...
NoPConj | but_PConj, ...
(UttS : S -&gt; Utt | UttQS, UttImp, UttNP, ...
(UseCl : Tense -&gt; Anter -&gt; Pol -&gt; Cl -&gt; S
TPres
ASimul
PPos
(PredVP : NP -&gt; VP -&gt; Cl
(PredVP : NP -&gt; VP -&gt; Cl | ImpersNP, ExistNP, ...
(UsePN : PN -&gt; NP
john_PN)
(UseV : V -&gt; VP
(UseV : V -&gt; VP | ComplV2, UseComp, ...
walk_V))))
NoVoc)
NoVoc) | VocNP, please_Voc, ...
TEmpty
</PRE>
<P></P>
@@ -519,10 +526,6 @@ Goal: eliminate the user's need of worst-case functions.
</P>
<H3>Language-dependent syntax extensions</H3>
<P>
<!-- NEW -->
</P>
<H3>Special-purpose APIs</H3>
<P>
Syntactic structures that are not shared by all languages.
</P>
<P>
@@ -539,10 +542,26 @@ Candidates:
<P>
<!-- NEW -->
</P>
<H3>How to use as top-level grammar</H3>
<H3>Special-purpose APIs</H3>
<P>
Mathematical
</P>
<P>
Multimodal
</P>
<P>
Present
</P>
<P>
Minimal
</P>
<P>
Shallow
</P>
<P>
<!-- NEW -->
</P>
<H3>How to use the resource as top-level grammar</H3>
<H3>Compiling</H3>
<P>
It is a good idea to compile the library, so that it can be opened faster
@@ -580,9 +599,6 @@ The MCFG parser works in some languages, after waiting appr. 20 seconds
p -mcfg -lang=LangEng -cat=S "I would see her"
p -mcfg -lang=LangSwe -cat=S "jag skulle se henne"
p -mcfg -lang=LangNor -cat=S "jeg ville se henne"
</PRE>
<P>
Parsing in <CODE>present/</CODE> versions is quicker.
@@ -727,6 +743,6 @@ We start a demo by
</P>
<H3>Extend old modules or add a new one?</H3>
<!-- html code generated by txt2tags 2.0 (http://txt2tags.sf.net) -->
<!-- html code generated by txt2tags 2.3 (http://txt2tags.sf.net) -->
<!-- cmdline: txt2tags clt2006.txt -->
</BODY></HTML>