Commit Graph

445 Commits

Author SHA1 Message Date
krangelov
c482d3466c added != operator removed function null() 2021-08-12 08:49:20 +02:00
krangelov
2c1700776e implemented readExpr & readType 2021-08-11 22:07:01 +02:00
krangelov
a5008c2fe1 implemented functionType and marshalling for types and expressions 2021-08-10 15:07:41 +02:00
krangelov
91f183ca6a move the C sources to the subfolder pgf again for backwards compatibility 2021-08-08 18:29:16 +02:00
krangelov
8f0a1b8fee started a new database-backed runtime from scratch 2021-07-30 12:08:28 +02:00
krangelov
155657709a Merge branch 'master' into c-runtime 2021-07-30 11:20:04 +02:00
krangelov
655173932e fix type signature 2021-01-20 20:38:19 +01:00
krangelov
d4ccd2848c Take a step back and do bottom up prediction only when needed. This solves the problem that some sentences were impossible to parse. 2020-11-17 10:26:00 +01:00
krangelov
90fc1d750e remove the deprecated pgf_print_expr_tuple 2020-08-14 21:03:48 +02:00
krangelov
23edeec5a9 added an API for cloning expressions/types/literals 2020-08-14 20:54:15 +02:00
krangelov
a2d7f1369c filter out empty brackets 2020-07-26 15:55:18 +02:00
krangelov
77a2630ed9 revert to using unconditional probabilities in the different lookup functions 2020-07-11 09:52:43 +02:00
krangelov
33818076ff drop the SG library completely. 2020-07-08 21:12:01 +02:00
krangelov
165c5a6d9d bugfix in parsing literals 2020-05-20 19:57:33 +02:00
krangelov
48d3973daa fix the uggly typo that broke the literals. 2020-05-20 10:15:53 +02:00
krangelov
9a1f982b14 split unknown words by spaces 2020-05-16 08:32:00 +02:00
krangelov
62bc78380e lookupCohorts now detects and reports unknown words. Also:
- added added two filtering functions: filterLongest and filterBest
 - updated the PGF service to work with the new API
2020-05-14 15:03:30 +02:00
krangelov
762d83c1f0 switch off the debugger 2020-03-25 20:03:06 +01:00
krangelov
00e25d0ccb an API to access the names of all fields withing a category 2020-03-06 12:29:08 +01:00
krangelov
88f76ef671 lookup_morpho, lookup_cohorts and fullform_lexicon now report the usual conditional probability. This turns out to be more useful for part of speech tagging 2020-03-05 12:04:42 +01:00
krangelov
f22bd70585 The APIs for callbacks and the bracketed strings now use a string for the analysis intead of an integer. This is now consistent with lookupMorpho and friends 2020-03-05 11:58:21 +01:00
krangelov
14f394c9e9 a version of the parser which returns a chart rather than a list of expressions 2019-12-07 22:00:39 +01:00
krangelov
95b3fb306f forgot that debugging is on 2019-10-09 14:34:04 +02:00
krangelov
5b790b82c5 fix chunk extraction when there are literals 2019-10-09 14:32:20 +02:00
krangelov
26361b3692 fix the parsing for literals after the latest changes in the parser 2019-10-09 14:18:05 +02:00
krangelov
acb70ccc1b cleanup 2019-09-19 22:30:08 +02:00
krangelov
4a71464ca7 Merge with master and drop the Haskell runtime completely 2019-09-19 22:01:57 +02:00
krangelov
e2ddea6c7d first version of a parser which returns chunks in case of failure 2019-08-30 13:31:57 +02:00
krangelov
3e662475ee bugfix in the parser's scanner 2019-08-09 15:15:30 +02:00
krangelov
32379a8d11 fully supported case-insensitive parsing/lookup 2019-06-30 08:48:23 +02:00
krangelov
b56591c6b6 the parser now ensures that all word senses are in the chart 2019-06-25 12:58:28 +02:00
krangelov
2d9478b973 share some code for printing 2019-06-24 12:13:14 +02:00
krangelov
ac039ec74f filter out empty cohorts 2019-05-28 12:42:59 +02:00
krangelov
9f0ea19a1c API for scanning for cohorts in an arbitrary text 2019-05-28 12:26:00 +02:00
krangelov
c32cd7133f fixes in the headers for compilation on Windows 2019-02-18 11:23:01 +01:00
krangelov
409731413e disable the parser debugger which was left enabled by accident 2019-02-14 15:33:21 +01:00
krangelov
8a5e7fa25d fix the reader for patterns in a def rule 2019-02-14 15:32:12 +01:00
Krasimir Angelov
26dabeab9b save the original concrete category in BracketedString 2018-12-20 10:52:45 +01:00
krangelov
eb22112178 a pool where the smallest chunks are memory pages 2018-12-16 07:46:14 +01:00
Krasimir Angelov
5a2b200948 manually copy the "c-runtime" branch from the old repository. 2018-11-02 14:38:44 +01:00
Krasimir Angelov
6c4c22ff7a undo the partial fix for word completion since it breaks normal parsing 2018-06-27 11:53:40 +02:00
Krasimir Angelov
68db7695f5 a simple fix for the linearizer 2018-06-20 22:21:10 +02:00
Krasimir Angelov
b86373438b Revert "when linearizing missing functions with brackets, the name of the function should still be reported"
This reverts commit 18204bdd25bd460904ac475f3ea340daa96589df.
2018-06-20 21:16:19 +02:00
Krasimir Angelov
4cdd4dd8a7 Revert "forgot to switch off the debugger"
This reverts commit 5919dfa3366dfd2f2af8c3ce7749d066a2033f0d.
2018-06-20 21:13:58 +02:00
Krasimir Angelov
427f8d8478 forgot to switch off the debugger 2018-06-20 16:36:45 +02:00
Krasimir Angelov
aaefe4310d when linearizing missing functions with brackets, the name of the function should still be reported 2018-06-20 16:35:28 +02:00
Krasimir Angelov
2d898e78c0 bugfix in the C parser 2018-04-12 14:45:02 +02:00
Krasimir Angelov
bb4218433f - tweak the tokenizer in pgf_lookup_sentence to threat .!?,: as separate tokens
+ bugfix which causes crashes
2018-02-22 11:35:54 +01:00
Krasimir Angelov
a16fe3415a partial fix for word completion in the C runtime 2018-02-04 17:21:15 +01:00
Krasimir Angelov
fa8530add1 silence some warnings on MINGW32 2017-12-09 23:11:04 +01:00