Commit Graph

30 Commits

Author SHA1 Message Date
kr.angelov
5c9ee467a9 a major reimplementation of the linearizer in the C runtime 2012-12-19 09:07:05 +00:00
kr.angelov
1376df457d started an official API to the C runtime 2012-12-12 11:25:58 +00:00
kr.angelov
52255664be use prob_t instead of float in a few places 2012-10-29 08:52:56 +00:00
kr.angelov
9721833680 a major refactoring in the robust parser: bottom-up filtering and garbage collection for the chart 2012-10-25 14:42:53 +00:00
kr.angelov
18fe8af964 now the meta probability for a category is explicitly specified in the statistical model instead of computed internally. this avoids rounding errors while computing the sum of a large number of small values. 2012-09-24 09:37:21 +00:00
kr.angelov
44df7a33cf the C runtime now has a type prob_t which is used only for probability values 2012-09-18 09:18:48 +00:00
kr.angelov
cd3cca4aa2 bugfix in the C parser 2012-09-06 14:52:19 +00:00
kr.angelov
7549a4876d now there is a limit of 2000000 items in the chart of the robust parser. This prevents from explosion in the memory size but it will also prevent us from parsing some sentences. 2012-06-12 11:30:01 +00:00
kr.angelov
b765b0c054 now the robust parser is purely top-down and the meta rules compete on a fair basis with the grammar rules 2012-06-12 09:29:51 +00:00
kr.angelov
4aca965109 a new unbiased statistical parser. it is still far from perfect use it on your own risk. 2012-05-08 12:13:28 +00:00
kr.angelov
2bf3f22fac libpgf: a new implementation for literals which also allows custom literals. the same mechanism is now used for the metavariables 2012-03-12 14:25:51 +00:00
kr.angelov
1da464a4cc libpgf: implementation for built in literal categories 2012-03-07 16:39:29 +00:00
kr.angelov
791a1a17b0 libpgf: now all concrete functions and categories are explicitly linked to their abstract counter parts 2012-03-05 12:59:31 +00:00
kr.angelov
aca0bd5ee5 libpgf: the first prototype for the robust parser 2012-02-29 14:43:08 +00:00
kr.angelov
695c776065 libpgf: fix in pgf_read_into_map 2012-02-24 15:15:07 +00:00
kr.angelov
0faffc6ffd libpgf: simple fix in the grammar printer and the reader 2012-02-24 13:52:21 +00:00
kr.angelov
f1d2852c4d libpgf: now we have both complete bottom up index for robust parsing and fast lexical lookup from the same index 2012-02-22 21:27:54 +00:00
kr.angelov
831de53573 libpgf: the map curr_lindefs must be allocated from a temporary pool 2012-02-22 08:49:08 +00:00
kr.angelov
dc4c3d3b28 libpgf: added index for fast lexicon lookup. Still not perfect 2012-02-21 21:17:50 +00:00
kr.angelov
0147885e2f libpgf: now the linearization index is created during the grammar loading which also makes the types PgfLzr and PgfParser redundant. 2012-02-18 16:22:40 +00:00
kr.angelov
75b724ab54 libpgf: simplify the loading of PgfCncCat 2012-02-17 14:26:08 +00:00
kr.angelov
469d8cf804 libpgf: fix a warning in reader.c 2012-01-26 08:58:23 +00:00
kr.angelov
b62d57fd30 libpgf: few fixes to make the loading of grammars with def rules possible 2012-01-24 14:47:11 +00:00
kr.angelov
5b96b55184 libpgf: extra_ccat is now redundant and was removed 2012-01-23 19:47:08 +00:00
kr.angelov
21dee01c9d libpgf: debugging framework for the parser 2012-01-23 15:49:29 +00:00
kr.angelov
f2cfa9888e libpgf: the concrete categories were allocated from the temporary pool 2012-01-23 13:43:17 +00:00
kr.angelov
c5b4e5388a libpgf: move the lindefs field from PgfCncCat to PgfCCat. display the list in the grammar printout 2012-01-23 09:46:45 +00:00
kr.angelov
64a00dad48 added an API for printing the PGF to human readable format 2012-01-21 10:27:55 +00:00
kr.angelov
1acb722ba2 fix the crash in the reader when there aren't any productions per category. 2012-01-21 10:25:43 +00:00
kr.angelov
a369d3bdac initial import of the C runtime 2012-01-20 13:41:10 +00:00