Commit Graph

9845 Commits

Author SHA1 Message Date
kr.angelov be922d09a1 added the file treebanks/PennTreebank/ParseEngAbs3.probs which is used by the statistical parser for robust chunking 2013-03-25 10:28:53 +00:00
kr.angelov 72556ad1ae a long list of prepositions from Wikipedia is now imported in DictEng; in addition, there are a number of other small changes in the dictionary. The statistical model is updated and is now moved to treebanks/PennTreebank/ParseEngAbs.probs 2013-03-25 10:24:24 +00:00
kr.angelov 8b40d4974b added configuration file which defines the heads for all syntactic functions in ParseEng 2013-03-21 13:39:24 +00:00
kr.angelov 650e1cfa43 the calculation of lexical_prob in the statistical parser doesn't work properly. It should be fixed but for now I just disabled the optimization 2013-03-20 12:28:52 +00:00
kr.angelov fec34e7622 replace #if with #ifdef when checking for the optional bottom up filtering in the C runtime 2013-03-20 10:47:47 +00:00
kr.angelov 466813f1e8 fix in ParseHin which made it impossible to load the grammar with the C runtime 2013-03-20 10:34:37 +00:00
virk.shafqat 648b3c53a9 pathbugfixParseHin 2013-03-20 10:21:12 +00:00
virk.shafqat af4ac56b35 smallfixDictHin 2013-03-20 09:17:55 +00:00
kr.angelov 1ddcfc219e the bottom up filtering in the C runtime is temporarily disabled. It takes too much memory and even makes it impossible to load the Finnish and the German parsing grammars. 2013-03-19 10:59:44 +00:00
kr.angelov 8041999405 the ParseFin grammar now excludes ComplVV from VerbFin since this function has a more general type in the parsing grammar 2013-03-19 10:49:13 +00:00
aarne ed8611c94b Makefile entries for ParseHin and ParseGer 2013-03-19 07:59:04 +00:00
aarne 6660a47404 Makefile entries for pgf's for robust parsing 2013-03-19 07:26:06 +00:00
kr.angelov c775d0c5c5 filter out all adjectives and adverbs which could be derived morphologically 2013-03-18 17:31:20 +00:00
virk.shafqat 4fd9c860f3 factorUrdandAddDictGer 2013-03-18 14:42:10 +00:00
hallgren c823b7fd91 Fix a problem with pattern macros in pre { } expressions
The old partial evaluator has special rules to convert pattern macros in
pre { } expressions. These rules were missing in the new partial evaluator.
2013-03-16 13:36:23 +00:00
kr.angelov 34fddf669f some of the newly added nouns in DictEng were actually variations of already existing lexical entries. Those are removed now. 2013-03-15 23:23:06 +00:00
kr.angelov e5913189db massive extensions in DictEng and DictEngBul. This includes all new nouns imported from WordNet by Shafqat, phrasal verbs that I collected from internet and the PennTreebank, plus various other small additions. 2013-03-15 20:18:22 +00:00
aarne ff879641c8 uncomment a VerbFin function left commented by mistake 2013-03-15 17:40:28 +00:00
aarne 23a310be57 a stemmed version of DictEngFin and ParseFin - now works without trouble! 2013-03-15 15:07:14 +00:00
inari.listenmaa 2295f491c7 Corrections in LexiconSpa
Corrected many verbs in LexiconSpa. Also added in BeschSpa oir_51 and corrected morder_50b. In MorphoSpa added words ending in é and á to a regexp for regular nouns.
2013-03-15 17:13:25 +00:00
aarne 87d465594b stemming in Fin also for verbs now 2013-03-15 11:13:49 +00:00
aarne 3b66a870d1 stemmed version of verbs: from 102 forms to 13 "technical stems" in Fin 2013-03-15 08:55:14 +00:00
aarne 330a2ae094 Stemmed Finnish for N and A. Cuts down the number of forms stored in the lexicon to 30%, requiring more often -bind in linearization. Stored as alternative files in finnish/stemmed; perhaps to be functorized later. 2013-03-14 20:55:59 +00:00
aarne 59cbd2867f experimental stem-based morphology for Finnish, to minimize lexicon size, started 2013-03-14 15:13:44 +00:00
kr.angelov cb37254882 bug fix in the linearizer in the C runtime 2013-03-14 12:31:49 +00:00
kr.angelov f1a42ad78e update the pgf-service tool from the C runtime after the changes in the API 2013-03-14 10:37:01 +00:00
hallgren 86127d03d0 Update shell reference manual 2013-03-14 08:08:51 +00:00
aarne 54324d5de3 added PassVPSlash to ExtraRomance; it is a generalization of PassV2 already implemented for some languages. As a variant in Spa and Cat, added PassVPSlash_ser with another copula. 2013-03-13 17:07:57 +00:00
hallgren 6045ddae19 PGF web API: add 'to' parameter to the 'alignment' command
Word alignment diagrams can now be restricted to a subset of the languages
supported by the grammar.
2013-03-13 15:56:03 +00:00
hallgren e2bb437943 PGF web API: generalize the 'to' parameter to accept a list of languages
Some commands (linearize, linearizeAll, random, generate, translate and
translategroup) by default produce output in all languages supported by the
grammar and the 'to' parameter could be used to restrict output to a single
language. Now you can restrict the output to a list of languages. Languages
should be separated by spaces.

Also removed an unnecessary LANGUAGE pragma and reduced code verbosity.
2013-03-13 15:25:28 +00:00
aarne 18324e5e04 Poss,Part,Count NP implemented in Finnish: tehtaan johtaja, lasi viiniä, kolme pojista 2013-03-13 14:48:23 +00:00
aarne b3e1908c78 Poss,Part,Count NP in Scandinavian 2013-03-13 14:21:26 +00:00
aarne 0dc182c216 pt -nub to remove duplicate trees from a list returned e.g. by a parser 2013-03-13 13:43:30 +00:00
aarne 31b4b13b65 constructors PossNP, PartNP, CountNP to separate "of" constructions 2013-03-13 13:42:41 +00:00
aarne 05c9934af0 added the nominal possessive form NPNomPoss to NPCase in Eng (covering mine, yours, etc) 2013-03-13 12:34:22 +00:00
aarne f61a013a40 case endings in Fin symbols: Nom is a special case, since there is no ending 2013-03-12 20:29:31 +00:00
aarne 367123c81b the proper place of verb particle in Eng passives 2013-03-12 20:06:51 +00:00
aarne d14589804e worst case of Fin mkA; default case endings for Fin symbol as PN 2013-03-12 20:01:07 +00:00
aarne 77171607a9 some additions to DictEngGer, e.g. said_Quant 2013-03-12 16:59:38 +00:00
hallgren 7710dc42db partial evaluator: push predefined functions inside variants
This should prevent errors like

Internal error in Compute.ConcreteNew:
    Applying Predef.drop: Expected a value of type String, got VFV [VString "gewandt",VString "gewendet"]
2013-03-12 16:36:58 +00:00
aarne c5d03231d8 removed variants from IrregGer, introducing extra fun's for irregular variants, but none for regular ones. 2013-03-12 16:21:37 +00:00
aarne 9d1be48e0f command pt -subtrees that analyses a tree into the set of subtrees. Using pt -subtrees <bigtree> | l -treebank for debugging the lin of a big tree 2013-03-12 14:58:06 +00:00
aarne 9cd632497b AdvVPSlash in Fin 2013-03-12 10:00:15 +00:00
aarne cae80a3074 fixed DictEngGer to changed lincats 2013-03-12 09:07:15 +00:00
aarne 3eab279825 verb particles in place in DictEng 2013-03-11 19:20:35 +00:00
kr.angelov 2893397fbb bugfix in the statistical parser 2013-03-11 14:47:43 +00:00
hallgren cd8cbda3d4 Additional changes for GHC 7.4 & 7.6 compatibility 2013-03-11 12:57:09 +00:00
Sergei Trofimovich 05e5895134 ghc-7.6: allow directory-1.2
Get rid of old-time depend (and ClockTime in favour of UTCTime).
time-compat helps to retain backward compatibility with directory-1.1
and lower.
2013-03-09 21:38:43 +00:00
Sergei Trofimovich 6898804a11 ghc-7.6: fix src/server/PGFService.hs (use catch from base-4 Control.Exception) 2013-03-09 21:38:14 +00:00
Sergei Trofimovich 8d00819790 ghc-7.6: add missing Num instance for Bits
Fixes the following build failure:
    src/runtime/haskell/Data/Binary/IEEE754.lhs:256:17:
        Could not deduce (Num a) arising from a use of `mask'
        from the context (Bits a)
          bound by the type signature for
                     clamp :: Bits a => BitCount -> a -> a
2013-03-09 21:19:53 +00:00