Commit Graph

118 Commits

Author SHA1 Message Date
kr.angelov
f001d40ae3 added gu_buf_flush in seq.c which removes all elements from a buffer 2013-02-26 09:48:09 +00:00
kr.angelov
5a54596fe8 the parser in the C runtime should not crash if the start category is not defined 2013-02-19 12:08:48 +00:00
kr.angelov
f86dcb6572 bugfix in the grammar reader in the C runtime 2013-02-19 12:04:10 +00:00
kr.angelov
ffb17bd26a bugfix in the linearizer for the C runtime 2013-02-13 15:39:01 +00:00
kr.angelov
55203110bb now the beam size for the statistical parser can be configured by using the flag beam_size in the top-level concrete module 2013-02-12 10:53:13 +00:00
kr.angelov
1f77afcfce the statistical parser now uses a baseline lexical estimation of the beam size 2013-02-12 09:41:32 +00:00
kr.angelov
a6b35a9053 the class PgfConcr from the Python binding now has a property name which returns the name of the concrete syntax 2013-02-11 15:51:26 +00:00
kr.angelov
0b7b939aca refactoring: now all named objects in the C runtime have an explicit name field 2013-02-11 14:10:54 +00:00
kr.angelov
56c8f91d19 remove the pgf2yaml tool which was both broken and redundant. The declarations for generic programming from data.c are removed as well 2013-02-11 13:51:12 +00:00
kr.angelov
ff25ba8f90 the grammar reader in the C runtime is completely rewritten and it doesn't use the generic programming API 2013-02-11 10:16:58 +00:00
kr.angelov
e9b5557c6c This patch removes Gregoire's parse_tokens function in the python binding and adds another implementation which builds on the existing API for lexers in the C runtime. Now it is possible to write incremental Lexers in Python 2013-02-01 09:29:43 +00:00
kr.angelov
eca4a28563 implement gu_exn_caught in gu/exn.c. It was missing 2013-02-01 09:26:30 +00:00
gregoire.detrez
459a9a855b Python binding: add a parsing function that accepts directly a list of tokens.
Is allows to define a tokenizer in python (or use an existing one, from nltk for instance.)
2013-01-24 13:31:34 +00:00
kr.angelov
ccc3d6be0d fix warnings in pgf-parse.c 2013-01-08 12:53:49 +00:00
peter.ljunglof
168d39bf6d PGF installation instructions for Mac OSX 2013-01-08 10:23:25 +00:00
peter.ljunglof
88e67b92a4 Made meta probs an optional argument instead of hard-coded in pgf-translate 2013-01-08 10:20:46 +00:00
peter.ljunglof
d2ae113652 New PGF utility: pgf-parse 2013-01-08 10:19:53 +00:00
kr.angelov
3be31c62e9 a new reasoner in the C runtime. It supports tabling which makes it decideable for propositional logic. dependent types and high-order types are not supported yet. The generation is still in decreasing probability order 2013-01-07 12:50:32 +00:00
kr.angelov
0be179d7ff bugfix in the strings library from the C runtime 2012-12-27 21:18:46 +00:00
kr.angelov
bb077b8330 bugfix: the linearizer should not generate extra space at the end of the sentence 2012-12-19 11:18:34 +00:00
kr.angelov
f7eaa8a89a bugfix for linearization of metavariables at the root of a tree 2012-12-19 10:03:05 +00:00
kr.angelov
6201640d7b rename linearize.{h/c} to linearizer.{h/c} which follows the convention used in parser.c and reasoner.c 2012-12-19 09:17:24 +00:00
kr.angelov
5c9ee467a9 a major reimplementation of the linearizer in the C runtime 2012-12-19 09:07:05 +00:00
kr.angelov
008c18a8a7 fixed accidental bug in pgf-parse.c 2012-12-18 15:42:04 +00:00
kr.angelov
dc809da91f the C runtime now can read abstract expressions with literals and meta variables 2012-12-18 12:29:30 +00:00
kr.angelov
32905c8363 debugging infrastructure in the reasoner 2012-12-14 21:25:00 +00:00
kr.angelov
5cec2d5a50 bugfix for the reasoner in the C runtime 2012-12-14 21:24:17 +00:00
kr.angelov
8aefd1e072 The first prototype for exhaustive generation in the C runtime. The trees are always listed in decreasing probability order. There is also an API for generation from Python 2012-12-14 15:32:49 +00:00
kr.angelov
e1bab39458 bugfix in the lexer from the C runtime. the input sentence doesn't have to terminate with whitespace 2012-12-13 16:45:44 +00:00
kr.angelov
2ba632dc9f a top-level API for parsing in the C runtime 2012-12-13 14:44:33 +00:00
kr.angelov
60942c440a bugfix: the outside probability of a PgfItemConts must always be initialized to zero 2012-12-13 11:11:45 +00:00
kr.angelov
fe51a7fb98 bugfix: pgf_read_expr no longer requires a semicolon at the end of an abstract expression 2012-12-13 11:09:26 +00:00
kr.angelov
1376df457d started an official API to the C runtime 2012-12-12 11:25:58 +00:00
kr.angelov
3182e382dc bugfix for robust parsing with multi-word units 2012-12-11 12:57:22 +00:00
kr.angelov
1863e4c3d6 added experimental script for chunking in the C runtime 2012-12-03 10:07:54 +00:00
kr.angelov
2da23e9872 added INSTALL file and updated README file for the C runtime 2012-12-03 09:09:08 +00:00
peter.ljunglof
1789272c9a Added C compilation notes for Mac OSX 2012-11-22 08:50:49 +00:00
kr.angelov
f8c302f9ef remove the duplicated definition of PgfProductionIdx in parser.c 2012-11-19 14:16:31 +00:00
kr.angelov
71b7c09ffe bugfix for the building of bottom-up filter in the C runtime 2012-11-16 13:27:15 +00:00
kr.angelov
a3ba1991f4 revised heuristic in the statistical parser 2012-11-14 12:34:22 +00:00
kr.angelov
70c68f0527 bugfix in the statistical parser 2012-11-13 09:48:23 +00:00
kr.angelov
08ee662944 two simple heuristics which speed up the statistical parser more than seven times. 2012-11-12 22:17:40 +00:00
kr.angelov
68170d5b08 a simple refactoring in the statistical parser 2012-11-12 21:48:22 +00:00
kr.angelov
a2771552d6 more counters in the profiler for the statistical parser 2012-11-12 15:36:21 +00:00
kr.angelov
46de62c452 now we store the state instead of the offset for every continuation in the chart for the statistical parser 2012-11-12 14:04:52 +00:00
kr.angelov
9967c3ad04 in the statistical parser: move the outside probability from the parse items to their continuation. this makes the value slot shared between many items 2012-11-12 13:43:43 +00:00
kr.angelov
9d23093492 small refactoring in the C runtime 2012-11-12 13:05:35 +00:00
kr.angelov
a50c7c24b8 use size_t consistently as the type for constituent indices in the C runtime 2012-11-12 12:51:27 +00:00
kr.angelov
1e531e8237 implemented gu_map_count in runtime/c/gu/map.c 2012-11-12 12:42:19 +00:00
kr.angelov
52255664be use prob_t instead of float in a few places 2012-10-29 08:52:56 +00:00