From ed35291768e585301b8b1fb840f0c1ef20094961 Mon Sep 17 00:00:00 2001 From: aarne Date: Fri, 29 Mar 2013 10:20:36 +0000 Subject: [PATCH] updated log.txt of ParseEngFin experiment --- lib/src/finnish/stemmed/log.txt | 14 ++++++++++++++ 1 file changed, 14 insertions(+) diff --git a/lib/src/finnish/stemmed/log.txt b/lib/src/finnish/stemmed/log.txt index d7db5ac5d..8e31bdb7f 100644 --- a/lib/src/finnish/stemmed/log.txt +++ b/lib/src/finnish/stemmed/log.txt @@ -8,4 +8,18 @@ AR 28/3/2013 Designed new paradigms. Filtered problematic/illegal things (PLURNOUN, ILLEGALVERB, POSTPONE, TODO). Just 9035 lemmas missing now. +28/3 +Set up an experiment with 3220 complete trees from Penn prepared by Krasimir. First results: + 561 no linearization + 960 lin with unknowns + +around 20 missing syntax constructions, 230 missing words + +29/3 +Added most missing syntax constructions. +Some new opers in ParadigmsFin, and 230 more words in DictEngFin: out of 3220 Penn trees now 2721 +are completely translated (but mostly not so well...) + 317 no lin + 182 lin with unknowns +