Commit Graph

253 Commits

Author SHA1 Message Date
aarne
996e0141e2 three new uses of "self" in Idion(Eng,Fin,Ger,Swe): SelfAdvVP (he is singing himself), SelfAdVVP (he is himself singing), SelfNP (he himself is singing; I know the president himself) 2013-12-03 18:54:56 +00:00
aarne
938587c159 top-level access to contracted auxiliaries in ExtraEng. Introduced them as variants in ParseEng. 2013-12-03 14:52:12 +00:00
aarne
302e2d3ebd introduced contracted auxiliaries in English. So far only available internally in the tables created from Cl. TODO: make them accessible on top level in Extra; use them as variants in Parse. Also the use in QCl needs some more work: we can say "he's here", "I'm here", "where's he" but not "'s he here", "where'm I". 2013-12-03 13:54:57 +00:00
kr.angelov
5d93c1d4de split teacher_N into two genders 2013-12-03 10:39:45 +00:00
kr.angelov
ff5dcf0f6a FeatN and FeatN2 in English now show the gender if it is masculine or feminine 2013-12-03 09:05:20 +00:00
kr.angelov
41642e8d30 in DictEng all hyphenated words are again merged into a single token without spaces. This is necessary because there is no lexer anymore 2013-12-03 08:31:04 +00:00
kr.angelov
63c779f844 forgoten entry in DictEngAbs.gf 2013-12-03 08:28:09 +00:00
kr.angelov
2356aa7e65 one big patch with several changes in the different dictionaries
1. the duplicated variants in the French entries are merged
2. ~150 abstract words are split into masculine and feminine entries.
   The actual defininitions for English, Bulgarian, French and German differ.
   For the other languages they are just copied.
3. fan_N, intelligency_N, character_N and letter_N are split into
   different senses. The entries for Bulgarian, Swedish, German and French
   are sense disambiguated. For English they are just copied since they
   are ambiguous. For the other languages they are copied since I don't
   know how to translate them.
2013-12-03 08:14:05 +00:00
kr.angelov
724b2e04e1 fixes in the dictionaries 2013-11-29 13:56:28 +00:00
kr.angelov
99ac055997 fixes in the dictionaries 2013-11-28 13:16:03 +00:00
kr.angelov
f49a8df60c changes in the parallel dictionaries 2013-11-27 22:18:22 +00:00
kr.angelov
9de4812482 better dictionary visualization in the Android App 2013-11-27 22:16:05 +00:00
aarne
848573afdc 555 new words checked in DictEngFin; latin1 flags in some esoteric files 2013-11-27 19:22:37 +00:00
kr.angelov
4874165cea two senses of ring_V2 which have different morphology in English 2013-11-14 21:55:40 +00:00
kr.angelov
58b47c2766 added date_3_N for the sense of "meeting". also added comments for the meanings of a few sense disambiguated words in the abstract syntax. The format of the comments follows the Haddock convention 2013-11-14 20:15:02 +00:00
kr.angelov
4b919ba26f a number of changes in the dictionaries. marvellous_A is merged with marvelous_A. organise as alternative spelling of organize_V2. A few new Bulgarian words. DictEngSwe.gf is now disconnected from DictSwe.gf and follows the same convention as the English, Bulgarian and German dictionaries. 2013-11-14 18:03:33 +00:00
kr.angelov
bfd555b452 ring was not marked as irregular verb in the dictionary 2013-11-13 21:00:08 +00:00
kr.angelov
644c2f40d4 fix the valencies of vote and spread_out in the dictionary 2013-11-13 20:59:28 +00:00
kr.angelov
90e2f11045 now ago_Prep is postposition in English 2013-11-13 20:58:08 +00:00
kr.angelov
041630e8b1 split degree_N into three senses and add licentiate_N and phd_N in Bulgarian 2013-11-13 16:39:04 +00:00
kr.angelov
d1a0e6ca9d split fire_N into two senses fire_1_N and fire_2_N 2013-11-13 08:59:03 +00:00
kr.angelov
6449bbad9c use BIND to glue the digits in IIDig for the Bulgarian, English and Swedish grammmars 2013-11-12 10:28:46 +00:00
kr.angelov
1d7b6f44b6 added Predef.SOFT_BIND. This special token allows zero or more spaces between ordinary tokens. It is also used in the English RGL to attach the commas to the previous word. 2013-11-12 09:54:57 +00:00
kr.angelov
d0c06ae2b5 fix in DictEng and DictEngBul 2013-11-07 06:36:03 +00:00
kr.angelov
eb30f5308b fixes in DictEng, DictEngBul and DictEngSwe 2013-11-06 21:17:21 +00:00
kr.angelov
475f213c99 the content of ParseEngAbs3.probs is now merged with ParseEngAbs.probs. The later is now retrained. Once the grammar is compiled with the .probs file now it doesn't need anything more to do robust parsing. The robustness itself is controlled by the flags 'heuristic_search_factor', 'meta_prob' and 'meta_token_prob' in ParseEngAbs.gf 2013-11-06 10:21:46 +00:00
aarne
3e159f7d53 some missing Swe functions; ParseEng.who_RP to enable the parsing of "who" as RP, which is now blocked since Eng nouns don't have the human feature marked 2013-11-05 13:53:12 +00:00
aarne
fcb5b173f7 changed whichSg/Pl_IDet to which_IQuant in DictEngAbs and Eng, Fin, Swe, Chi implementations 2013-11-01 13:49:21 +00:00
kr.angelov
9b0f5ea449 ExtraEng.MkVPS should add the extra position field 2013-11-01 10:58:52 +00:00
kr.angelov
40f0ceb2ef use Simul instead of Anter in linref ClSlash 2013-10-31 08:35:39 +00:00
kr.angelov
7a618204c4 fix the lindefs for verbs and VPs in CatEng 2013-10-30 21:42:39 +00:00
kr.angelov
e50e0de6a1 more lindef and linref definitions in CatEng.gf 2013-10-30 15:23:09 +00:00
kr.angelov
36a9806d30 added linref for VP in CatEng.gf 2013-10-30 14:44:19 +00:00
kr.angelov
453190fcb6 remove very_Adv from Dict* 2013-10-28 13:07:20 +00:00
kr.angelov
ec6ef1aff4 minor fixes in DictEng and DictEngBul 2013-10-28 12:41:03 +00:00
aarne
6711201e0d added ext field to Eng.VP for extraposed sentences etc, which should appear after adverb: "she said yesterday that he was ill" 2013-10-25 13:06:21 +00:00
kr.angelov
2bff7d8c21 more in DictEngBul 2013-10-24 08:04:59 +00:00
kr.angelov
5d993bbdad put was used as regular verb in few places in DictEng 2013-10-23 15:33:33 +00:00
kr.angelov
eb8e830e66 a few fixes in DictEngBul and DictEng 2013-10-23 15:07:43 +00:00
aarne
075796dbe5 fixed the case of substantival possessives in Eng 2013-10-19 10:07:26 +00:00
aarne
fdfc8d4dc2 removed that-less variants of ComplVS and SlashV2S from VerbEng, and added constructors ComplBareVS and SlashBareV2S to Extra, so that the library user can control the presence of "that". 2013-10-18 08:45:56 +00:00
kr.angelov
72fdb2dc7d more words in DictEngBul 2013-10-11 08:18:00 +00:00
aarne
506c5a5b4e added interjection "hi" to Eng and Chi 2013-10-09 16:16:17 +00:00
aarne
0736a9a2ea new extra Chi constructions for HSK 2013-10-09 09:52:42 +00:00
kr.angelov
dfb5a78a8d changes in the English-Bulgarian grammar 2013-10-08 11:17:15 +00:00
kr.angelov
688796a40d work on the translation dictionaries and the statistical model 2013-09-20 12:09:54 +00:00
kr.angelov
662d98a354 decrease the heuristic factor for parsing to 0.80 from 0.95. Otherwise we get the wrong result for compound verbs like "look after", "look forward", etc. 2013-09-20 12:07:47 +00:00
kr.angelov
88844d693a ProgrVP in English should add the verb particle if there is any 2013-09-20 12:07:12 +00:00
kr.angelov
7bafc5653c added a number of interjections and greetings in DictEng and DictEngBul 2013-09-19 12:02:44 +00:00
kr.angelov
9810817f4d fixes in DictEng & DictEngBul 2013-09-17 12:01:05 +00:00