aarne
|
b4e08ae60c
|
changed Finnish VP so that the verb is stored as such. This makes compilation and parsing several times faster. TODO: treat passivization correctly; optimize further by only storing the stems.
|
2013-12-07 11:13:17 +00:00 |
|
aarne
|
e75ed90621
|
make ParseFin compile with Construction
|
2013-12-07 08:01:28 +00:00 |
|
aarne
|
d3f21cce99
|
added constructions to Parse grammars
|
2013-12-06 17:13:53 +00:00 |
|
aarne
|
2843fae2af
|
PassAgentVPSlash for Eng, Swe, Fin, Fre
|
2013-12-06 08:32:53 +00:00 |
|
aarne
|
70f8827f75
|
three new uses of "self" in Idiom(Eng,Fin,Ger,Swe): SelfAdvVP (he is singing himself), SelfAdVVP (he is himself singing), SelfNP (he himself is singing; I know the president himself)
|
2013-12-03 18:54:56 +00:00 |
|
aarne
|
384e62ef2c
|
some missing forms in Fin (agent participle, Inf3 instructives); now also in stemmed version.
|
2013-11-29 08:05:55 +00:00 |
|
aarne
|
01a7166bb8
|
added some esoteric verb forms to Fin (not used in resource API but nice to show): potential, passive imperatives, infinitives 2,4,5. Remains to carry out in stemmed/StemFin.
|
2013-11-28 22:01:38 +00:00 |
|
aarne
|
b1f5e0b735
|
555 new words checked in DictEngFin; latin1 flags in some esoteric files
|
2013-11-27 19:22:37 +00:00 |
|
aarne
|
145ccb1079
|
moved DictEngFin from stemmed to main level and checked some critical words for the spoken translator
|
2013-10-27 16:08:18 +00:00 |
|
aarne
|
c1046c74a2
|
finished the refactoring of Fin with unstemmed (for library use) and stemmed (for large-scale parsing use). All differences are located in the file StemFin.
|
2013-08-07 09:52:40 +00:00 |
|
aarne
|
5610b93e30
|
refactored Fin so that the stemmed and the unstemmed versions share all code except StemFin. It is chosen by setting the path; a functor solution would be purer, but it feels like overkill.
|
2013-08-06 21:36:12 +00:00 |
|
aarne
|
31da352105
|
Inari's new batch of words in DictFin: now 4713 checked words
|
2013-04-12 06:42:00 +00:00 |
|
inari.listenmaa
|
530d4f9205
|
fixed Kotus paradigm k42
There was an error in the paradigm for "mies", fixed the s in genitive stem ("miesen") to h ("miehen").
|
2013-04-10 14:15:01 +00:00 |
|
aarne
|
daac51c904
|
500+ checked words in DictFin from Inari: now 4291 words have been checked
|
2013-04-09 19:14:12 +00:00 |
|
aarne
|
ae05473561
|
Inari's DictFin additions
|
2013-04-09 13:42:33 +00:00 |
|
aarne
|
fa722cd295
|
fixes in Fin syntax and lexicon
|
2013-04-07 21:34:11 +00:00 |
|
aarne
|
e91bdb4fba
|
SlashSlashV2V in ParseEng added; new set of checked DictFin words
|
2013-04-07 13:23:23 +00:00 |
|
aarne
|
1a0a4a4a56
|
checked words for wsj-100 experiments
|
2013-04-07 10:06:37 +00:00 |
|
aarne
|
0f3561c1f7
|
Inari's 390 checked DictEngFin words
|
2013-04-05 14:53:12 +00:00 |
|
aarne
|
5ec5f9bf26
|
Inari's additions to DictFin
|
2013-04-05 12:46:37 +00:00 |
|
aarne
|
1f62470333
|
improved entry in DictFin
|
2013-04-04 20:28:23 +00:00 |
|
aarne
|
5e95aec86e
|
checked >100 penn words in DictFin
|
2013-04-04 19:11:17 +00:00 |
|
aarne
|
53559ac0f6
|
better annotations in DictFin: CHECKED
|
2013-04-04 16:04:50 +00:00 |
|
aarne
|
2a723e8fa7
|
changed 200 old-generation words in DictFin
|
2013-04-04 15:39:56 +00:00 |
|
aarne
|
ffc9b8e30b
|
missing ParseFin syntax constructs
|
2013-04-04 13:14:02 +00:00 |
|
aarne
|
88c103a5b7
|
commented out a couple of unfinished lines
|
2013-04-04 06:41:15 +00:00 |
|
aarne
|
627a400dcf
|
some new stemmed/ParadigmsFin
|
2013-04-04 06:39:36 +00:00 |
|
aarne
|
846abf61d5
|
some Fin determiners from Penn
|
2013-04-03 22:47:40 +00:00 |
|
aarne
|
2c916a4e50
|
300 full-penn top words in Dict
|
2013-04-03 20:27:34 +00:00 |
|
aarne
|
1689498e60
|
reported the chunking translator in Finnish log
|
2013-04-02 19:42:19 +00:00 |
|
aarne
|
358f427893
|
scattered DictEngFin improvements
|
2013-04-02 06:32:52 +00:00 |
|
aarne
|
47dd616156
|
another batch of words in DictFin
|
2013-03-31 20:26:56 +00:00 |
|
aarne
|
9539285791
|
weekdays and months in Fin
|
2013-03-31 14:47:46 +00:00 |
|
aarne
|
089a26f896
|
a special form for Finnish verbs needed for oleva/olema; correction to weakGrade of "kt" and the like
|
2013-03-31 13:56:22 +00:00 |
|
aarne
|
4fa1322079
|
Finnish VV inf forms extended with "näyttää tulevan" and "toivoo tulevansa"
|
2013-03-31 10:03:41 +00:00 |
|
aarne
|
94e9ba9262
|
another batch of lexicon corrections in DictFin (see finnish/stemmed/log.txt for details)
|
2013-03-30 23:21:13 +00:00 |
|
aarne
|
f3f235c9a4
|
a separate compound form for Fin nouns, needed e.g. for nainen->nais
|
2013-03-30 21:52:25 +00:00 |
|
aarne
|
35a6424818
|
Finnish passive Impf and Condit fixed (both in rgl and stemmed)
|
2013-03-30 20:44:28 +00:00 |
|
aarne
|
86dd7ca92a
|
an improvement in Fin passive infinitives - but still not perfect
|
2013-03-30 17:39:36 +00:00 |
|
aarne
|
6734fdfb43
|
bug fix in stemmed/NounFin with sp forms of IndefArt
|
2013-03-30 17:07:57 +00:00 |
|
aarne
|
50658443ea
|
revised 200+ most frequent V2's in DictEngFin, leading to changes of patterns for most verbs.
|
2013-03-30 10:49:41 +00:00 |
|
aarne
|
9c17590a85
|
fixed "date" and "force" in DictFin
|
2013-03-29 21:46:18 +00:00 |
|
aarne
|
3b06ca70f4
|
Some more DictFin items. 50 of the 3220 Penn examples remain to be covered.
|
2013-03-29 21:34:50 +00:00 |
|
aarne
|
132caa9414
|
top-ten missing DictEngFin words, now translating 95% of the corpus
|
2013-03-29 20:51:30 +00:00 |
|
aarne
|
8b97b049e0
|
a few more words in DictFin, but these were the most frequent missing ones; now a complete lin for over 90% of the complete trees
|
2013-03-29 17:42:47 +00:00 |
|
aarne
|
66530dec81
|
preliminary implementation of GerundN and GerundAP in ParseFin. More coverage, but some strange translations.
|
2013-03-29 12:45:24 +00:00 |
|
aarne
|
5cffd85923
|
updated log.txt of ParseEngFin experiment
|
2013-03-29 10:20:36 +00:00 |
|
aarne
|
43483cd563
|
some new opers in ParadigmsFin, and 200 more words in DictEngFin: out of 3220 Penn trees now 2721 are completely translated (but mostly not so well...)
|
2013-03-29 10:13:04 +00:00 |
|
aarne
|
a47a9e2dc0
|
second round of completing ParseFin with missing constructs: hundreds more translations
|
2013-03-29 08:03:20 +00:00 |
|
aarne
|
ceb800269c
|
new DictEngFin in finnish/stemmed. Only 9k lemmas missing.
|
2013-03-28 14:22:09 +00:00 |
|