Commit Graph

1202 Commits

Author SHA1 Message Date
bjorn
93e9d60765 SRG category renaming: don't count the external cats when calculating indices. 2008-09-26 09:44:02 +00:00
bjorn
d80c71f688 Changed SRG category renaming to use GF cat + index among CFG cats for that GF cat, instead of the old Concrete syntax name + index among all CFG cats. 2008-09-26 09:39:48 +00:00
bjorn
c588e8ca2b Oops, I accidentally disabled all CFG simplification and LR-removal with the ENBF printing patch. 2008-09-26 09:39:04 +00:00
bjorn
8c348f963f Change category naming in CFG generation: the number now starts from 1 for each GF category, instead of being the FCFG category number. 2008-09-26 09:28:03 +00:00
bjorn
824d72bf16 Added --output-format=ebnf. 2008-09-26 09:20:39 +00:00
bjorn
84dec7aaa2 In CFG generation: don't add linrow number if the lincat has only a single constituent. 2008-09-26 09:06:52 +00:00
bjorn
367cc0acb9 Added --output-format=nolr. Minor makeRegular refactoring. 2008-09-25 13:41:07 +00:00
bjorn
8001ea8ea0 Added --output-format=regular. 2008-09-25 11:49:00 +00:00
bjorn
4d96eb7e9a Fixe copy-paste error: srgs_abnf_nonrec produced xml, fixed now. 2008-09-25 11:36:04 +00:00
bjorn
716250bcf1 Somewhat nicer FCFG output format. 2008-09-25 08:44:23 +00:00
bjorn
90b2441c34 Added an FCFG output format (--output-format=fcfg). This now lives in GF.Speech.PGFToCFG, but should probably move somewhere else. 2008-09-25 08:41:49 +00:00
bjorn
0d1b2c01b9 Got SRGS ABNF generation to compile. 2008-09-23 12:56:25 +00:00
bjorn
66d4909142 Added srgs_abnf_nonrec grammar printer to GF.Infra.Options. 2008-09-23 12:44:17 +00:00
bjorn
c01af6168b Re-added old PrSRGS_ABNF (does not compile yet). 2008-09-23 12:41:44 +00:00
aarne
961261c46e use hex in transliteration table 2008-09-23 09:04:03 +00:00
aarne
2269a5ba41 Telugu transliteration 2008-09-23 08:49:58 +00:00
bjorn
3857f4a6a6 A somewhat better solution to the words/UTF-8 problem: do encoding last, but only on what appears to be string literals. 2008-09-15 13:16:04 +00:00
bjorn
a39f8cc5da Temporary fix for the grave accent a encoding problem: change compatPrint to id.
The problem is that lower case a with a grave accent is coded in UTF-8 as \195\160. 
Unicode character \160 is non-breaking space, so Haskell's words function
will break a UTF-8 encoded string at this character.
String literals in the .gfo file are UTF-8 encoded in generateModuleCode,
just before the call to prGrammar (which uses compactPrint, which used words).
The real solution would be to pretty-print the grammar to Unicode, and then
encode as UTF-8. The problem with that is Latin-1 identifers. They are now
kept in Latin-1 in the .gfo file, since Alex can't handle Unicode.
The real solution to that would be to fix Alex to handle Unicode, but 
that is non-trivial. GHC interally uses a very hacky .x file to be
able to lex UTF-8 source files. 

An alternative solution that doesn't address the weirdness of using two different
encodings in the same .gfo as we do now, is to incorporate compactPrint
into the grammar printer, to avoid having to do any postprocessing.
2008-09-15 12:38:37 +00:00
krasimir
2d6e7ad625 print the search path when we can't find a module 2008-09-10 08:55:16 +00:00
aarne
279c0367d8 better error msg for missing record labels 2008-09-09 19:43:33 +00:00
aarne
3da1d10eb9 completed IdiomCat to compile api and math 2008-09-09 16:11:05 +00:00
aarne
fa02871525 missing case in param lookup table in GrammartoGFCC 2008-09-09 15:33:56 +00:00
aarne
4f560ea668 debug information in GrammarToGFCC 2008-09-09 14:04:37 +00:00
aarne
b3af35a88e show "error" in run interruptibly mode in GFI 2008-09-09 06:58:38 +00:00
aarne
56173a9fdb enable matching of ++ strings with regular patterns 2008-09-09 06:36:36 +00:00
aarne
efd1b4fb73 capitals in lextext and unlextext; notice that a sentence starting with a proper name now gets lexed with a small letter if lextext is used 2008-09-05 09:21:48 +00:00
aarne
9884983762 convert all values in shell pipes to strings 2008-09-04 09:03:02 +00:00
bjorn
dbe3571fcc Added --haskell-prefix option for changing the constructor prefix in generated Haskell modules. 2008-09-03 15:42:11 +00:00
peb
8952a626d0 new PGF output format: prolog syntax
* output a PGF grammar in prolog readable syntax
* variables in abstract syntax (hypotheses and lambda-abstractions) 
  are translated to unique logical variables
* PGF terms in concrete syntax are translated to more prolog-like terms
2008-09-03 09:04:09 +00:00
aarne
2efc33329e treatment of Int: not always PredefAbs but can be PredefCnc 2008-09-03 09:33:37 +00:00
bjorn
f246747496 Completed SISR 1.0 support (untested). 2008-08-24 14:53:21 +00:00
bjorn
c42e4c0690 Export modifyFlags and modifyModuleFlags from GF.Infra.Options. 2008-08-24 14:52:49 +00:00
aarne
65e92e6dab infForm added to Catalan ; debugging alternative in PGF generation 2008-08-23 14:26:34 +00:00
bjorn
849c146973 UTF8 encode all generated XML. 2008-08-22 09:47:42 +00:00
bjorn
acf1a17268 In GF.Compile.Export, output file name extensions should not have double dots. 2008-08-22 09:27:44 +00:00
bjorn
243bd895e4 Added srgs_xml_nonrec output format. 2008-08-22 08:43:58 +00:00
aarne
b962c3a0d3 alternative prtType to help debugging overload resolution errors 2008-08-20 06:44:58 +00:00
bjorn
4925580877 When producing PGF, do UTF-8 encoding of the whole file after pretty printing instead of walking through the PGF structure before pretty-printing.
The old code was brittle, and only converted strings in the lins and opers, not in printnames and parsers. The old code also double encoded some grammars when using gfc to combine PGF files.
2008-08-18 12:34:56 +00:00
aarne
65a25ad181 bronzeage grammar restored except for three ones with issues ; arabic transliteration added 2008-08-16 18:55:28 +00:00
aarne
20fe8a8c20 improved error message in pgf compilation to help debugging 2008-08-16 12:27:59 +00:00
aarne
1302960992 forgiving names in concrete that are not in abstract, with a warning (no warning guaranteed if this is because restricted inheritance) 2008-08-14 20:25:52 +00:00
bjorn
b296099441 Include concrete syntax flags in generated JS. 2008-08-13 12:08:11 +00:00
bjorn
428d92eac0 Set charset to utf-8 when printing html code. This is needed to get Safari to display resdemo.html correctly. 2008-06-27 18:14:19 +00:00
krasimir
bff0a96b0b the shell now supports cp1251 coding with se command. works with the word completion as well 2008-06-27 10:33:46 +00:00
aarne
ae83ab19ed version-related changes in Makefile and help 2008-06-27 09:09:29 +00:00
aarne
4a9bc587a5 moved interruption to GFI level to capture all commands 2008-06-26 21:01:51 +00:00
aarne
9d909f32cf fixed decodeUTF8 for pgf; removed old resources from darcs 2008-06-26 19:41:25 +00:00
aarne
544fc47489 oops, forgot add Coding 2008-06-26 18:48:28 +00:00
aarne
7f9a0cdd34 uniform encoding: gfo and pgf in UTF8, internal in unicode 2008-06-26 16:35:45 +00:00
aarne
712717e461 utf8 adjustments 2008-06-26 06:52:01 +00:00