1
0
forked from GitHub/gf-core
Commit Graph

155 Commits

Author SHA1 Message Date
kr.angelov
9b2487243e now we store the state instead of the offset for every continuation in the chart for the statistical parser 2012-11-12 14:04:52 +00:00
kr.angelov
c28056c4e5 in the statistical parser: move the outside probability from the parse items to their continuation. this makes the value slot shared between many items 2012-11-12 13:43:43 +00:00
kr.angelov
56f3ff8202 small refactoring in the C runtime 2012-11-12 13:05:35 +00:00
kr.angelov
cce22a7f7a use size_t consistently as the type for constituent indices in the C runtime 2012-11-12 12:51:27 +00:00
kr.angelov
c679b08b38 use prob_t instead of float in a few places 2012-10-29 08:52:56 +00:00
kr.angelov
118333eee8 forgot to add one #ifdef 2012-10-25 18:37:22 +00:00
kr.angelov
d185938952 a major refactoring in the robust parser: bottom-up filtering and garbage collection for the chart 2012-10-25 14:42:53 +00:00
kr.angelov
bf49f3c246 now the meta probability for a category is explicitly specified in the statistical model instead of computed internally. this avoids rounding errors while computing the sum of a large number of small values. 2012-09-24 09:37:21 +00:00
kr.angelov
8b28b89ffc in the robust parser we don't have to care about trees which yeld empty strings. this makes the parser a lot faster 2012-09-24 09:30:20 +00:00
kr.angelov
a307ed6c75 the C runtime now has a type prob_t which is used only for probability values 2012-09-18 09:18:48 +00:00
kr.angelov
86b5ec7447 bugfix in the C parser 2012-09-06 14:52:19 +00:00
kr.angelov
3ad5493758 Use a separated tag for meta productions in the robust parser. This cleans up the code a lot 2012-06-13 05:49:30 +00:00
kr.angelov
c9c5675e1d now there is a limit of 2000000 items in the chart of the robust parser. This prevents from explosion in the memory size but it will also prevent us from parsing some sentences. 2012-06-12 11:30:01 +00:00
kr.angelov
b27a440ef3 now the robust parser is purely top-down and the meta rules compete on a fair basis with the grammar rules 2012-06-12 09:29:51 +00:00
kr.angelov
06f9965d27 the viterbi probability for the epsilon categories is now updated properly 2012-05-25 07:30:35 +00:00
kr.angelov
f4c17cb7aa another attempt to port the robust parser to MacOS 2012-05-16 15:18:44 +00:00
kr.angelov
a6800fc0da a new unbiased statistical parser. it is still far from perfect use it on your own risk. 2012-05-08 12:13:28 +00:00
kr.angelov
931066f6fc yet another fix for parsing literals 2012-04-18 15:50:55 +00:00
kr.angelov
17bc8e5c89 some fixes in the robust parser and a new API for literals 2012-04-12 06:55:25 +00:00
kr.angelov
6644d93ec2 simple cleanup in the robust parser 2012-04-02 19:01:18 +00:00
kr.angelov
230f309317 libpgf: a new implementation for literals which also allows custom literals. the same mechanism is now used for the metavariables 2012-03-12 14:25:51 +00:00
kr.angelov
1726995921 libpgf: added simple lexer 2012-03-09 09:14:44 +00:00
kr.angelov
ed5de8335b libpgf: implementation for built in literal categories 2012-03-07 16:39:29 +00:00
kr.angelov
96493c274b libpgf: simple fix in the parser debugger 2012-03-07 12:23:07 +00:00
kr.angelov
a96da30489 libpgf: two APIs - one for finding all parse results and another for finding the best parse result 2012-03-07 11:00:17 +00:00
kr.angelov
0e90d1ba1f libpgf: now all concrete functions and categories are explicitly linked to their abstract counter parts 2012-03-05 12:59:31 +00:00
kr.angelov
4d1b0859d0 libpgf: preliminary version for the statistical ranking. we use naive statistical model with random weight for the meta variables. 2012-03-02 19:25:01 +00:00
kr.angelov
e31c883075 libpgf: the first prototype for the robust parser 2012-02-29 14:43:08 +00:00
kr.angelov
5fa1418194 libpgf: another fix in the parser debugger 2012-02-28 16:37:12 +00:00
kr.angelov
dcbeb63849 libpgf: fix in the parser debugger 2012-02-28 13:12:38 +00:00
kr.angelov
686fcca9a4 libpgf: pretty printing for expressions with metavaraibles 2012-02-27 13:50:35 +00:00
kr.angelov
667ca8e5f7 libpgf: fix in pgf_read_into_map 2012-02-24 15:15:07 +00:00
kr.angelov
1d239ab567 libpgf: simple fix in the grammar printer and the reader 2012-02-24 13:52:21 +00:00
kr.angelov
b99fa6aa9a libpgf: now we have both complete bottom up index for robust parsing and fast lexical lookup from the same index 2012-02-22 21:27:54 +00:00
kr.angelov
42410f80d2 libpgf: two small fixes in the parser debugger 2012-02-22 14:06:49 +00:00
kr.angelov
786bf883b7 libpgf: the map curr_lindefs must be allocated from a temporary pool 2012-02-22 08:49:08 +00:00
kr.angelov
7ddd0d5f3e libpgf: added index for fast lexicon lookup. Still not perfect 2012-02-21 21:17:50 +00:00
kr.angelov
a55a224dce libpgf: now the debugging mode for the parser is available only with compilation option. 2012-02-18 19:30:16 +00:00
kr.angelov
5259be92e3 libpgf: remove the now redundant field extra_ccats in PgfConcr 2012-02-18 16:25:53 +00:00
kr.angelov
47e5e8c966 libpgf: now the linearization index is created during the grammar loading which also makes the types PgfLzr and PgfParser redundant. 2012-02-18 16:22:40 +00:00
kr.angelov
aed7cc429a libpgf: simplify the loading of PgfCncCat 2012-02-17 14:26:08 +00:00
kr.angelov
4f8ad8c23f libpgf: added printer.c 2012-02-17 14:11:29 +00:00
kr.angelov
6f0795d8a3 libpgf: switch to using callbacks and lazy prediction in the parser. this reduce the parsing time from 11 sec down to 3 sec. 2012-01-26 12:32:26 +00:00
kr.angelov
a2414bc625 libpgf: use a temporal pool for allocating the arrays in the continuation map of the parser 2012-01-26 09:03:08 +00:00
kr.angelov
8256750599 libpgf: fix a warning in reader.c 2012-01-26 08:58:23 +00:00
kr.angelov
58de345f99 libpgf: few fixes to make the loading of grammars with def rules possible 2012-01-24 14:47:11 +00:00
kr.angelov
acc4f352af libpgf: extra_ccat is now redundant and was removed 2012-01-23 19:47:08 +00:00
kr.angelov
5ccd75c8b9 libpgf: debugging framework for the parser 2012-01-23 15:49:29 +00:00
kr.angelov
fcd2c2905e libpgf: the concrete categories were allocated from the temporary pool 2012-01-23 13:43:17 +00:00
kr.angelov
4e4881dba5 libpgf: printing of literals and flags 2012-01-23 10:17:20 +00:00