Commit Graph

193 Commits

Author SHA1 Message Date
kr.angelov
32905c8363 debugging infrastructure in the reasoner 2012-12-14 21:25:00 +00:00
kr.angelov
5cec2d5a50 bugfix for the reasoner in the C runtime 2012-12-14 21:24:17 +00:00
kr.angelov
8aefd1e072 The first prototype for exhaustive generation in the C runtime. The trees are always listed in decreasing probability order. There is also an API for generation from Python 2012-12-14 15:32:49 +00:00
kr.angelov
e1bab39458 bugfix in the lexer from the C runtime. the input sentence doesn't have to terminate with whitespace 2012-12-13 16:45:44 +00:00
kr.angelov
2ba632dc9f a top-level API for parsing in the C runtime 2012-12-13 14:44:33 +00:00
kr.angelov
60942c440a bugfix: the outside probability of a PgfItemConts must always be initialized to zero 2012-12-13 11:11:45 +00:00
kr.angelov
fe51a7fb98 bugfix: pgf_read_expr no longer requires a semicolon at the end of an abstract expression 2012-12-13 11:09:26 +00:00
kr.angelov
1376df457d started an official API to the C runtime 2012-12-12 11:25:58 +00:00
kr.angelov
3182e382dc bugfix for robust parsing with multi-word units 2012-12-11 12:57:22 +00:00
kr.angelov
1863e4c3d6 added experimental script for chunking in the C runtime 2012-12-03 10:07:54 +00:00
kr.angelov
2da23e9872 added INSTALL file and updated README file for the C runtime 2012-12-03 09:09:08 +00:00
peter.ljunglof
1789272c9a Added C compilation notes for Mac OSX 2012-11-22 08:50:49 +00:00
kr.angelov
f8c302f9ef remove the duplicated definition of PgfProductionIdx in parser.c 2012-11-19 14:16:31 +00:00
kr.angelov
71b7c09ffe bugfix for the building of bottom-up filter in the C runtime 2012-11-16 13:27:15 +00:00
kr.angelov
a3ba1991f4 revised heuristic in the statistical parser 2012-11-14 12:34:22 +00:00
kr.angelov
70c68f0527 bugfix in the statistical parser 2012-11-13 09:48:23 +00:00
kr.angelov
08ee662944 two simple heuristics which speed up the statistical parser more than seven times. 2012-11-12 22:17:40 +00:00
kr.angelov
68170d5b08 a simple refactoring in the statistical parser 2012-11-12 21:48:22 +00:00
kr.angelov
a2771552d6 more counters in the profiler for the statistical parser 2012-11-12 15:36:21 +00:00
kr.angelov
46de62c452 now we store the state instead of the offset for every continuation in the chart for the statistical parser 2012-11-12 14:04:52 +00:00
kr.angelov
9967c3ad04 in the statistical parser: move the outside probability from the parse items to their continuation. this makes the value slot shared between many items 2012-11-12 13:43:43 +00:00
kr.angelov
9d23093492 small refactoring in the C runtime 2012-11-12 13:05:35 +00:00
kr.angelov
a50c7c24b8 use size_t consistently as the type for constituent indices in the C runtime 2012-11-12 12:51:27 +00:00
kr.angelov
1e531e8237 implemented gu_map_count in runtime/c/gu/map.c 2012-11-12 12:42:19 +00:00
kr.angelov
52255664be use prob_t instead of float in a few places 2012-10-29 08:52:56 +00:00
kr.angelov
0ad2405d69 forgot to add one #ifdef 2012-10-25 18:37:22 +00:00
kr.angelov
9721833680 a major refactoring in the robust parser: bottom-up filtering and garbage collection for the chart 2012-10-25 14:42:53 +00:00
kr.angelov
28b58b6267 add teyjus/simulator/builtins/builtins.h 2012-10-11 11:10:17 +00:00
kr.angelov
f0583bfd93 added the forgotten libteyjus.pc.in file in the C runtime 2012-10-11 04:22:38 +00:00
kr.angelov
18fe8af964 now the meta probability for a category is explicitly specified in the statistical model instead of computed internally. this avoids rounding errors while computing the sum of a large number of small values. 2012-09-24 09:37:21 +00:00
kr.angelov
bb15542a85 in the robust parser we don't have to care about trees which yield empty strings. this makes the parser a lot faster 2012-09-24 09:30:20 +00:00
kr.angelov
44df7a33cf the C runtime now has a type prob_t which is used only for probability values 2012-09-18 09:18:48 +00:00
kr.angelov
cd3cca4aa2 bugfix in the C parser 2012-09-06 14:52:19 +00:00
kr.angelov
3ef3108314 fix some warnings in teyjus 2012-08-03 12:30:01 +00:00
kr.angelov
b34232c6dc the first draft of GF.Compile.Instructions 2012-08-01 09:21:59 +00:00
kr.angelov
2844742afd An initial import of the teyjus source code in the C runtime for GF. The two runtime are still not connected but the source code compiles. 2012-07-31 15:16:04 +00:00
kr.angelov
3a352a953f Use a separated tag for meta productions in the robust parser. This cleans up the code a lot 2012-06-13 05:49:30 +00:00
kr.angelov
7549a4876d now there is a limit of 2000000 items in the chart of the robust parser. This prevents from explosion in the memory size but it will also prevent us from parsing some sentences. 2012-06-12 11:30:01 +00:00
kr.angelov
b765b0c054 now the robust parser is purely top-down and the meta rules compete on a fair basis with the grammar rules 2012-06-12 09:29:51 +00:00
kr.angelov
cab4602b62 the viterbi probability for the epsilon categories is now updated properly 2012-05-25 07:30:35 +00:00
kr.angelov
ec00a42ee6 fixed an alignment bug in libgu which leads to crash on some platforms 2012-05-25 07:29:29 +00:00
kr.angelov
bd102d54d9 fixed memory leak in the pgf-translate tool 2012-05-25 07:28:18 +00:00
kr.angelov
fdcd0de336 added pgf-service.c in the robust parser 2012-05-25 07:27:29 +00:00
kr.angelov
bd8046f23d another attempt to port the robust parser to MacOS 2012-05-16 15:18:44 +00:00
kr.angelov
e342e5e7d6 small fix for compiling the robust parser on MacOS 2012-05-16 12:18:50 +00:00
kr.angelov
4aca965109 a new unbiased statistical parser. it is still far from perfect; use it at your own risk. 2012-05-08 12:13:28 +00:00
kr.angelov
ed6a53609b yet another fix for parsing literals 2012-04-18 15:50:55 +00:00
kr.angelov
c6c54f8815 some fixes in the robust parser and a new API for literals 2012-04-12 06:55:25 +00:00
kr.angelov
99cc07ad67 simple cleanup in the robust parser 2012-04-02 19:01:18 +00:00
kr.angelov
2bf3f22fac libpgf: a new implementation for literals which also allows custom literals. the same mechanism is now used for the metavariables 2012-03-12 14:25:51 +00:00