forked from GitHub/gf-core
Commit Graph

3836 Commits

Author SHA1 Message Date
aarne 4b34c51e4c more automatic creation of files in CheckDict ; words for translation in DictionaryFre 2014-04-10 07:06:57 +00:00
aarne 1397bb4528 restored passives in Extensions for Bul,Fre,Ita,Spa ; PassAgentVPSlash in ExtraBul to be verified 2014-04-09 16:16:32 +00:00
aarne 6d6d641b73 more explicit inheritance list for Extensions in Translate 2014-04-09 13:06:14 +00:00
aarne 6fc7271950 adjusted the contents of Translate.gf: (1) inherit everything in Idiom since it is useful and cheap ; (2) inherit ComplVV and SlashV2V from Verb rather than Extensions, since it is more efficient and already available for all languages. Actually the previous version didn't have these functions at all, which affected the quality quite a bit. 2014-04-09 09:40:43 +00:00
aarne edcb328b70 revised TopDict Ger,Ita,Spa with the new Dictionary improvements 2014-04-08 21:18:39 +00:00
aarne 2ca8024ba4 smart paradigm for refl verbs ending "se" in Spa 2014-04-08 20:28:27 +00:00
aarne 70dddec7e1 further clean-up in DictionaryIta and Spa 2014-04-08 20:21:37 +00:00
aarne d7e25e7bdd DictionarySpa and Ita: retrieved a few thousand unpredictable noun genders from Wiktionary 2014-04-08 19:44:18 +00:00
aarne 8fc7cc5541 some words in DictionaryGer 2014-04-08 16:13:46 +00:00
aarne 0d564c15ff smartened Paradigms in Ita and Spa to recognize some more nouns as feminine 2014-04-08 15:37:43 +00:00
aarne 72b6d8da9c checked 600 entries in DictionaryFre ; CheckDict.hs, functions for dictionary checking 2014-04-08 15:12:34 +00:00
aarne 2198d18d3a TopDictFre checked to 500 2014-04-08 12:53:03 +00:00
aarne feb4ff5dd1 plural of stad in Dut 2014-04-08 09:19:50 +00:00
aarne 04f9a50da0 some new ParadigmsFre and DictionarySwe 2014-04-07 20:23:11 +00:00
aarne 02a0372b41 checked 300+ words in TopDictFre ; not yet merged in DictionaryFre 2014-04-07 20:22:42 +00:00
aarne 2825f9e420 VPI chunks linearized ; nouns with ión in Spanish and zione in Italian marked as feminine 2014-04-07 12:02:52 +00:00
kr.angelov 1671383e1c another fix in DictionaryBul 2014-04-07 09:53:31 +00:00
kr.angelov 024321b520 fixes in DictionaryBul 2014-04-07 09:52:12 +00:00
aarne 0df4d4bef6 restored passives in Translate, generalized IdRP in Eng 2014-04-07 07:57:55 +00:00
aarne b7973f2f5d restored the initial segment of TopDictSwe 2014-04-06 20:19:46 +00:00
aarne 79fe3f2b49 swede in DictionaryEng ; CompoundCN probability in translate.probs tweaked to avoid too aggressive compounding 2014-04-06 19:45:35 +00:00
aarne 37c3afa9b4 added "todo" dictionaries 2014-04-06 19:19:51 +00:00
aarne 82a333c602 normalized Dictionary Fin,Chin,Hin,Fre to a format easier to process automatically; other Dictionary files were already in this format: each rule prefixed by "lin", sorted, checked parts uncommented, unchecked or problematic parts commented, one rule per line 2014-04-06 16:26:16 +00:00
hallgren 5d36c4734d Fixes for compiling Translate10.pgf 2014-04-04 19:04:16 +00:00
hallgren 3fa7b3e04b 149 new words in DictinarySwe.gf (mostly geographical names) 2014-04-04 19:02:42 +00:00
aarne 0aff5f4aa4 type error in finnish revealed 2014-04-04 17:01:06 +00:00
aarne 0577ec19a4 fixed type errors in finnish revealed by improved type checker 2014-04-04 16:38:36 +00:00
aarne c4a45f687f translate10 do the right thing 2014-04-04 14:45:50 +00:00
aarne 81f76ba658 Make for Translate10 2014-04-04 14:32:05 +00:00
aarne 316e473a1e added Spa and Ita to translator/ ; omitted some Extensions functions to double the parsing speed 2014-04-04 14:13:11 +00:00
aarne ab3244fbe5 polarities restored in Hin translation 2014-04-04 12:05:20 +00:00
aarne b5500723c4 the word for time in some Dictionaries 2014-04-04 07:29:42 +00:00
aarne 2ae1392cc1 Chunking and other robust translation facilities in plain RGL 2014-04-02 21:16:03 +00:00
aarne 8fc7add8a8 experimenting with exclusion of some functions to gain speed in Translate 2014-04-02 14:04:48 +00:00
aarne 086085b9a3 chunks for ordinary RGL, defined with a functor 2014-04-02 13:19:34 +00:00
aarne 99c31f406b Dictionary updates in Ger,Swe 2014-04-02 09:58:25 +00:00
aarne 2f06675db1 corrected some prepositions in DictionaryGer 2014-04-02 07:48:18 +00:00
aarne c328a7fd4a next 320 BNC words checked in Swe 2014-04-01 17:23:17 +00:00
hallgren 5d7c894380 translator/DictionarySwe.gf: fix for tasty_A, mkA "god" "gott" 2014-04-01 14:38:16 +00:00
aarne cdd7adef64 split the two senses of can_VV in Dictionary, as they are split in Structural 2014-04-01 13:57:42 +00:00
aarne fb202420dd changed the definition of MassNP in Romance so that subject position doesn't return the partitive but the definite article. Thus "wine is good" becomes "le vin est bon" and "I drink wine" becomes "je bois du vin". Partitive on subject position seems incorrect, and the definite article the best choice when translating mass terms without articles on the subject position. 2014-04-01 13:03:35 +00:00
aarne 0ce50f02b6 vice president and some other words 2014-03-31 19:46:05 +00:00
aarne 8a2ee67ad5 room_N in DictionaryChi 2014-03-31 14:30:31 +00:00
aarne 57c44d2af5 comment on possible bug in PredFin 2014-03-31 12:27:47 +00:00
aarne c77b137c14 instructions for generating lexicon spreadsheets 2014-03-31 07:13:02 +00:00
aarne 154a65cc3e checked top-1000 BNC senses in Swe, with some split senses added to Dictionary and DictionaryEng. Wrote bnc-dict-log.txt to describe the procedure, which should be reproducible to other languages now. 2014-03-30 16:28:40 +00:00
aarne e96d222c41 the top 7828 words in British National Corpus expanded to Dictionary fun's, in frequency order. A natural checklist for every Dictionary. 2014-03-30 09:28:54 +00:00
aarne a28fd18ee2 added ca. 200 new Dictionary fun's from BNC top-6000 list. Seems to be much better quality than Google 1-grams. 2014-03-30 09:25:29 +00:00
aarne 649924352a checked the top-1000 words of Google 1-grams in DictionarySwe, splitting senses of 70 words and adding the split senses to abstract Dictionary and DictionaryEng 2014-03-29 20:15:41 +00:00
aarne b5f4e308a3 around 1000 new abstract functions in Dictionary and DictionaryEng extracted from the top-10000 Google unigrams. Forthcoming in Swe and other languages soon. 2014-03-29 17:39:46 +00:00