kr.angelov
|
5e3b23325e
|
remove the duplicated definition of PgfProductionIdx in parser.c
|
2012-11-19 14:16:31 +00:00 |
|
kr.angelov
|
954d7a7ff5
|
bugfix for the building of bottom-up filter in the C runtime
|
2012-11-16 13:27:15 +00:00 |
|
kr.angelov
|
5c52eaf0b7
|
revised heuristic in the statistical parser
|
2012-11-14 12:34:22 +00:00 |
|
kr.angelov
|
468464faca
|
bugfix in the statistical parser
|
2012-11-13 09:48:23 +00:00 |
|
kr.angelov
|
d1044b202a
|
two simple heuristics which speed up the statistical parser more than seven times.
|
2012-11-12 22:17:40 +00:00 |
|
kr.angelov
|
182e366f5d
|
a simple refactoring in the statistical parser
|
2012-11-12 21:48:22 +00:00 |
|
kr.angelov
|
7ad4436502
|
more counters in the profiler for the statistical parser
|
2012-11-12 15:36:21 +00:00 |
|
kr.angelov
|
9b2487243e
|
now we store the state instead of the offset for every continuation in the chart for the statistical parser
|
2012-11-12 14:04:52 +00:00 |
|
kr.angelov
|
c28056c4e5
|
in the statistical parser: move the outside probability from the parse items to their continuation. this makes the value slot shared between many items
|
2012-11-12 13:43:43 +00:00 |
|
kr.angelov
|
56f3ff8202
|
small refactoring in the C runtime
|
2012-11-12 13:05:35 +00:00 |
|
kr.angelov
|
cce22a7f7a
|
use size_t consistently as the type for constituent indices in the C runtime
|
2012-11-12 12:51:27 +00:00 |
|
kr.angelov
|
6784a4c76e
|
implemented gu_map_count in runtime/c/gu/map.c
|
2012-11-12 12:42:19 +00:00 |
|
kr.angelov
|
c679b08b38
|
use prob_t instead of float in a few places
|
2012-10-29 08:52:56 +00:00 |
|
kr.angelov
|
118333eee8
|
forgot to add one #ifdef
|
2012-10-25 18:37:22 +00:00 |
|
kr.angelov
|
d185938952
|
a major refactoring in the robust parser: bottom-up filtering and garbage collection for the chart
|
2012-10-25 14:42:53 +00:00 |
|
kr.angelov
|
93e3356d02
|
add teyjus/simulator/builtins/builtins.h
|
2012-10-11 11:10:17 +00:00 |
|
kr.angelov
|
b22075e15a
|
added the forgotten libteyjus.pc.in file in the C runtime
|
2012-10-11 04:22:38 +00:00 |
|
kr.angelov
|
bf49f3c246
|
now the meta probability for a category is explicitly specified in the statistical model instead of computed internally. this avoids rounding errors while computing the sum of a large number of small values.
|
2012-09-24 09:37:21 +00:00 |
|
kr.angelov
|
8b28b89ffc
|
in the robust parser we don't have to care about trees which yield empty strings. this makes the parser a lot faster
|
2012-09-24 09:30:20 +00:00 |
|
kr.angelov
|
a307ed6c75
|
the C runtime now has a type prob_t which is used only for probability values
|
2012-09-18 09:18:48 +00:00 |
|
kr.angelov
|
86b5ec7447
|
bugfix in the C parser
|
2012-09-06 14:52:19 +00:00 |
|
kr.angelov
|
3601b796e5
|
fix some warnings in teyjus
|
2012-08-03 12:30:01 +00:00 |
|
kr.angelov
|
9fc7cac46f
|
the first draft of GF.Compile.Instructions
|
2012-08-01 09:21:59 +00:00 |
|
kr.angelov
|
83b321d862
|
An initial import of the teyjus source code in the C runtime for GF. The two runtimes are still not connected, but the source code compiles.
|
2012-07-31 15:16:04 +00:00 |
|
kr.angelov
|
3ad5493758
|
Use a separate tag for meta productions in the robust parser. This cleans up the code a lot
|
2012-06-13 05:49:30 +00:00 |
|
kr.angelov
|
c9c5675e1d
|
now there is a limit of 2000000 items in the chart of the robust parser. This prevents an explosion in memory use, but it will also prevent us from parsing some sentences.
|
2012-06-12 11:30:01 +00:00 |
|
kr.angelov
|
b27a440ef3
|
now the robust parser is purely top-down and the meta rules compete on a fair basis with the grammar rules
|
2012-06-12 09:29:51 +00:00 |
|
kr.angelov
|
06f9965d27
|
the viterbi probability for the epsilon categories is now updated properly
|
2012-05-25 07:30:35 +00:00 |
|
kr.angelov
|
ea5fe32d89
|
fixed an alignment bug in libgu which leads to crashes on some platforms
|
2012-05-25 07:29:29 +00:00 |
|
kr.angelov
|
92488a6592
|
fixed memory leak in the pgf-translate tool
|
2012-05-25 07:28:18 +00:00 |
|
kr.angelov
|
5312ef23c6
|
added pgf-service.c in the robust parser
|
2012-05-25 07:27:29 +00:00 |
|
kr.angelov
|
f4c17cb7aa
|
another attempt to port the robust parser to MacOS
|
2012-05-16 15:18:44 +00:00 |
|
kr.angelov
|
d2e528c7f4
|
small fix for compiling the robust parser on MacOS
|
2012-05-16 12:18:50 +00:00 |
|
kr.angelov
|
a6800fc0da
|
a new unbiased statistical parser. it is still far from perfect; use it at your own risk.
|
2012-05-08 12:13:28 +00:00 |
|
kr.angelov
|
931066f6fc
|
yet another fix for parsing literals
|
2012-04-18 15:50:55 +00:00 |
|
kr.angelov
|
17bc8e5c89
|
some fixes in the robust parser and a new API for literals
|
2012-04-12 06:55:25 +00:00 |
|
kr.angelov
|
6644d93ec2
|
simple cleanup in the robust parser
|
2012-04-02 19:01:18 +00:00 |
|
kr.angelov
|
230f309317
|
libpgf: a new implementation for literals which also allows custom literals. the same mechanism is now used for the metavariables
|
2012-03-12 14:25:51 +00:00 |
|
kr.angelov
|
1726995921
|
libpgf: added simple lexer
|
2012-03-09 09:14:44 +00:00 |
|
kr.angelov
|
ed5de8335b
|
libpgf: implementation for built-in literal categories
|
2012-03-07 16:39:29 +00:00 |
|
kr.angelov
|
96493c274b
|
libpgf: simple fix in the parser debugger
|
2012-03-07 12:23:07 +00:00 |
|
kr.angelov
|
a96da30489
|
libpgf: two APIs - one for finding all parse results and another for finding the best parse result
|
2012-03-07 11:00:17 +00:00 |
|
kr.angelov
|
0e90d1ba1f
|
libpgf: now all concrete functions and categories are explicitly linked to their abstract counterparts
|
2012-03-05 12:59:31 +00:00 |
|
kr.angelov
|
4d1b0859d0
|
libpgf: preliminary version of the statistical ranking. we use a naive statistical model with random weights for the meta variables.
|
2012-03-02 19:25:01 +00:00 |
|
kr.angelov
|
58b6bbd242
|
libpgf: simple optimization in the implementation for heaps
|
2012-03-02 17:43:46 +00:00 |
|
kr.angelov
|
2775faad7a
|
libpgf: an API for building heaps from buffers in gu/seq.c
|
2012-02-29 18:51:21 +00:00 |
|
kr.angelov
|
e31c883075
|
libpgf: the first prototype for the robust parser
|
2012-02-29 14:43:08 +00:00 |
|
kr.angelov
|
b054549082
|
libpgf: the choice object should remember integers instead of single bytes
|
2012-02-29 13:41:59 +00:00 |
|
kr.angelov
|
5fa1418194
|
libpgf: another fix in the parser debugger
|
2012-02-28 16:37:12 +00:00 |
|
kr.angelov
|
4ddf24429b
|
libpgf: the time measurement in pgf-translate should be made after pgf_parse_result for more accurate timing
|
2012-02-28 13:14:24 +00:00 |
|