Commit Graph

1152 Commits

Author SHA1 Message Date
bjorn 93e9d60765 SRG category renaming: don't count the external cats when calculating indices. 2008-09-26 09:44:02 +00:00
bjorn d80c71f688 Changed SRG category renaming to use GF cat + index among CFG cats for that GF cat, instead of the old Concrete syntax name + index among all CFG cats. 2008-09-26 09:39:48 +00:00
bjorn c588e8ca2b Oops, I accidentally disabled all CFG simplification and LR-removal with the ENBF printing patch. 2008-09-26 09:39:04 +00:00
bjorn 8c348f963f Change category naming in CFG generation: the number now starts from 1 for each GF category, instead of being the FCFG category number. 2008-09-26 09:28:03 +00:00
bjorn 824d72bf16 Added --output-format=ebnf. 2008-09-26 09:20:39 +00:00
bjorn 84dec7aaa2 In CFG generation: don't add linrow number if the lincat has only a single constituent. 2008-09-26 09:06:52 +00:00
bjorn 367cc0acb9 Added --output-format=nolr. Minor makeRegular refactoring. 2008-09-25 13:41:07 +00:00
bjorn 8001ea8ea0 Added --output-format=regular. 2008-09-25 11:49:00 +00:00
bjorn 4d96eb7e9a Fixe copy-paste error: srgs_abnf_nonrec produced xml, fixed now. 2008-09-25 11:36:04 +00:00
bjorn 716250bcf1 Somewhat nicer FCFG output format. 2008-09-25 08:44:23 +00:00
bjorn 90b2441c34 Added an FCFG output format (--output-format=fcfg). This now lives in GF.Speech.PGFToCFG, but should probably move somewhere else. 2008-09-25 08:41:49 +00:00
bjorn 0d1b2c01b9 Got SRGS ABNF generation to compile. 2008-09-23 12:56:25 +00:00
bjorn 66d4909142 Added srgs_abnf_nonrec grammar printer to GF.Infra.Options. 2008-09-23 12:44:17 +00:00
bjorn c01af6168b Re-added old PrSRGS_ABNF (does not compile yet). 2008-09-23 12:41:44 +00:00
aarne 961261c46e use hex in transliteration table 2008-09-23 09:04:03 +00:00
aarne 2269a5ba41 Telugu transliteration 2008-09-23 08:49:58 +00:00
bjorn 3857f4a6a6 A somewhat better solution to the words/UTF-8 problem: do encoding last, but only on what appears to be string literals. 2008-09-15 13:16:04 +00:00
bjorn a39f8cc5da Temporary fix for the grave accent a encoding problem: change compatPrint to id.
The problem is that lower case a with a grave accent is coded in UTF-8 as \195\160. 
Unicode character \160 is non-breaking space, so Haskell's words function
will break a UTF-8 encoded string at this character.
String literals in the .gfo file are UTF-8 encoded in generateModuleCode,
just before the call to prGrammar (which uses compactPrint, which used words).
The real solution would be to pretty-print the grammar to Unicode, and then
encode as UTF-8. The problem with that is Latin-1 identifers. They are now
kept in Latin-1 in the .gfo file, since Alex can't handle Unicode.
The real solution to that would be to fix Alex to handle Unicode, but 
that is non-trivial. GHC interally uses a very hacky .x file to be
able to lex UTF-8 source files. 

An alternative solution that doesn't address the weirdness of using two different
encodings in the same .gfo as we do now, is to incorporate compactPrint
into the grammar printer, to avoid having to do any postprocessing.
2008-09-15 12:38:37 +00:00
krasimir 2d6e7ad625 print the search path when we can't find a module 2008-09-10 08:55:16 +00:00
aarne 279c0367d8 better error msg for missing record labels 2008-09-09 19:43:33 +00:00
aarne 3da1d10eb9 completed IdiomCat to compile api and math 2008-09-09 16:11:05 +00:00
aarne fa02871525 missing case in param lookup table in GrammartoGFCC 2008-09-09 15:33:56 +00:00
aarne 4f560ea668 debug information in GrammarToGFCC 2008-09-09 14:04:37 +00:00
aarne b3af35a88e show "error" in run interruptibly mode in GFI 2008-09-09 06:58:38 +00:00
aarne 56173a9fdb enable matching of ++ strings with regular patterns 2008-09-09 06:36:36 +00:00
aarne efd1b4fb73 capitals in lextext and unlextext; notice that a sentence starting with a proper name now gets lexed with a small letter if lextext is used 2008-09-05 09:21:48 +00:00
aarne 9884983762 convert all values in shell pipes to strings 2008-09-04 09:03:02 +00:00
bjorn dbe3571fcc Added --haskell-prefix option for changing the constructor prefix in generated Haskell modules. 2008-09-03 15:42:11 +00:00
peb 8952a626d0 new PGF output format: prolog syntax
* output a PGF grammar in prolog readable syntax
* variables in abstract syntax (hypotheses and lambda-abstractions) 
  are translated to unique logical variables
* PGF terms in concrete syntax are translated to more prolog-like terms
2008-09-03 09:04:09 +00:00
aarne 2efc33329e treatment of Int: not always PredefAbs but can be PredefCnc 2008-09-03 09:33:37 +00:00
bjorn f246747496 Completed SISR 1.0 support (untested). 2008-08-24 14:53:21 +00:00
bjorn c42e4c0690 Export modifyFlags and modifyModuleFlags from GF.Infra.Options. 2008-08-24 14:52:49 +00:00
aarne 65e92e6dab infForm added to Catalan ; debugging alternative in PGF generation 2008-08-23 14:26:34 +00:00
bjorn 849c146973 UTF8 encode all generated XML. 2008-08-22 09:47:42 +00:00
bjorn acf1a17268 In GF.Compile.Export, output file name extensions should not have double dots. 2008-08-22 09:27:44 +00:00
bjorn 243bd895e4 Added srgs_xml_nonrec output format. 2008-08-22 08:43:58 +00:00
aarne b962c3a0d3 alternative prtType to help debugging overload resolution errors 2008-08-20 06:44:58 +00:00
bjorn 4925580877 When producing PGF, do UTF-8 encoding of the whole file after pretty printing instead of walking through the PGF structure before pretty-printing.
The old code was brittle, and only converted strings in the lins and opers, not in printnames and parsers. The old code also double encoded some grammars when using gfc to combine PGF files.
2008-08-18 12:34:56 +00:00
aarne 65a25ad181 bronzeage grammar restored except for three ones with issues ; arabic transliteration added 2008-08-16 18:55:28 +00:00
aarne 20fe8a8c20 improved error message in pgf compilation to help debugging 2008-08-16 12:27:59 +00:00
aarne 1302960992 forgiving names in concrete that are not in abstract, with a warning (no warning guaranteed if this is because restricted inheritance) 2008-08-14 20:25:52 +00:00
bjorn b296099441 Include concrete syntax flags in generated JS. 2008-08-13 12:08:11 +00:00
bjorn 428d92eac0 Set charset to utf-8 when printing html code. This is needed to get Safari to display resdemo.html correctly. 2008-06-27 18:14:19 +00:00
krasimir bff0a96b0b the shell now supports cp1251 coding with se command. works with the word completion as well 2008-06-27 10:33:46 +00:00
aarne ae83ab19ed version-related changes in Makefile and help 2008-06-27 09:09:29 +00:00
aarne 4a9bc587a5 moved interruption to GFI level to capture all commands 2008-06-26 21:01:51 +00:00
aarne 9d909f32cf fixed decodeUTF8 for pgf; removed old resources from darcs 2008-06-26 19:41:25 +00:00
aarne 544fc47489 oops, forgot add Coding 2008-06-26 18:48:28 +00:00
aarne 7f9a0cdd34 uniform encoding: gfo and pgf in UTF8, internal in unicode 2008-06-26 16:35:45 +00:00
aarne 712717e461 utf8 adjustments 2008-06-26 06:52:01 +00:00