bjorn
93e9d60765
SRG category renaming: don't count the external cats when calculating indices.
2008-09-26 09:44:02 +00:00
bjorn
d80c71f688
Changed SRG category renaming to use GF cat + index among CFG cats for that GF cat, instead of the old Concrete syntax name + index among all CFG cats.
2008-09-26 09:39:48 +00:00
bjorn
c588e8ca2b
Oops, I accidentally disabled all CFG simplification and LR-removal with the ENBF printing patch.
2008-09-26 09:39:04 +00:00
bjorn
8c348f963f
Change category naming in CFG generation: the number now starts from 1 for each GF category, instead of being the FCFG category number.
2008-09-26 09:28:03 +00:00
bjorn
824d72bf16
Added --output-format=ebnf.
2008-09-26 09:20:39 +00:00
bjorn
84dec7aaa2
In CFG generation: don't add linrow number if the lincat has only a single constituent.
2008-09-26 09:06:52 +00:00
bjorn
367cc0acb9
Added --output-format=nolr. Minor makeRegular refactoring.
2008-09-25 13:41:07 +00:00
bjorn
8001ea8ea0
Added --output-format=regular.
2008-09-25 11:49:00 +00:00
bjorn
4d96eb7e9a
Fixe copy-paste error: srgs_abnf_nonrec produced xml, fixed now.
2008-09-25 11:36:04 +00:00
bjorn
716250bcf1
Somewhat nicer FCFG output format.
2008-09-25 08:44:23 +00:00
bjorn
90b2441c34
Added an FCFG output format (--output-format=fcfg). This now lives in GF.Speech.PGFToCFG, but should probably move somewhere else.
2008-09-25 08:41:49 +00:00
bjorn
0d1b2c01b9
Got SRGS ABNF generation to compile.
2008-09-23 12:56:25 +00:00
bjorn
66d4909142
Added srgs_abnf_nonrec grammar printer to GF.Infra.Options.
2008-09-23 12:44:17 +00:00
bjorn
c01af6168b
Re-added old PrSRGS_ABNF (does not compile yet).
2008-09-23 12:41:44 +00:00
aarne
961261c46e
use hex in transliteration table
2008-09-23 09:04:03 +00:00
aarne
2269a5ba41
Telugu transliteration
2008-09-23 08:49:58 +00:00
bjorn
3857f4a6a6
A somewhat better solution to the words/UTF-8 problem: do encoding last, but only on what appears to be string literals.
2008-09-15 13:16:04 +00:00
bjorn
a39f8cc5da
Temporary fix for the grave accent a encoding problem: change compatPrint to id.
...
The problem is that lower case a with a grave accent is coded in UTF-8 as \195\160.
Unicode character \160 is non-breaking space, so Haskell's words function
will break a UTF-8 encoded string at this character.
String literals in the .gfo file are UTF-8 encoded in generateModuleCode,
just before the call to prGrammar (which uses compactPrint, which used words).
The real solution would be to pretty-print the grammar to Unicode, and then
encode as UTF-8. The problem with that is Latin-1 identifers. They are now
kept in Latin-1 in the .gfo file, since Alex can't handle Unicode.
The real solution to that would be to fix Alex to handle Unicode, but
that is non-trivial. GHC interally uses a very hacky .x file to be
able to lex UTF-8 source files.
An alternative solution that doesn't address the weirdness of using two different
encodings in the same .gfo as we do now, is to incorporate compactPrint
into the grammar printer, to avoid having to do any postprocessing.
2008-09-15 12:38:37 +00:00
krasimir
2d6e7ad625
print the search path when we can't find a module
2008-09-10 08:55:16 +00:00
aarne
279c0367d8
better error msg for missing record labels
2008-09-09 19:43:33 +00:00
aarne
3da1d10eb9
completed IdiomCat to compile api and math
2008-09-09 16:11:05 +00:00
aarne
fa02871525
missing case in param lookup table in GrammartoGFCC
2008-09-09 15:33:56 +00:00
aarne
4f560ea668
debug information in GrammarToGFCC
2008-09-09 14:04:37 +00:00
aarne
b3af35a88e
show "error" in run interruptibly mode in GFI
2008-09-09 06:58:38 +00:00
aarne
56173a9fdb
enable matching of ++ strings with regular patterns
2008-09-09 06:36:36 +00:00
aarne
efd1b4fb73
capitals in lextext and unlextext; notice that a sentence starting with a proper name now gets lexed with a small letter if lextext is used
2008-09-05 09:21:48 +00:00
aarne
9884983762
convert all values in shell pipes to strings
2008-09-04 09:03:02 +00:00
bjorn
dbe3571fcc
Added --haskell-prefix option for changing the constructor prefix in generated Haskell modules.
2008-09-03 15:42:11 +00:00
peb
8952a626d0
new PGF output format: prolog syntax
...
* output a PGF grammar in prolog readable syntax
* variables in abstract syntax (hypotheses and lambda-abstractions)
are translated to unique logical variables
* PGF terms in concrete syntax are translated to more prolog-like terms
2008-09-03 09:04:09 +00:00
aarne
2efc33329e
treatment of Int: not always PredefAbs but can be PredefCnc
2008-09-03 09:33:37 +00:00
bjorn
f246747496
Completed SISR 1.0 support (untested).
2008-08-24 14:53:21 +00:00
bjorn
c42e4c0690
Export modifyFlags and modifyModuleFlags from GF.Infra.Options.
2008-08-24 14:52:49 +00:00
aarne
65e92e6dab
infForm added to Catalan ; debugging alternative in PGF generation
2008-08-23 14:26:34 +00:00
bjorn
849c146973
UTF8 encode all generated XML.
2008-08-22 09:47:42 +00:00
bjorn
acf1a17268
In GF.Compile.Export, output file name extensions should not have double dots.
2008-08-22 09:27:44 +00:00
bjorn
243bd895e4
Added srgs_xml_nonrec output format.
2008-08-22 08:43:58 +00:00
aarne
b962c3a0d3
alternative prtType to help debugging overload resolution errors
2008-08-20 06:44:58 +00:00
bjorn
4925580877
When producing PGF, do UTF-8 encoding of the whole file after pretty printing instead of walking through the PGF structure before pretty-printing.
...
The old code was brittle, and only converted strings in the lins and opers, not in printnames and parsers. The old code also double encoded some grammars when using gfc to combine PGF files.
2008-08-18 12:34:56 +00:00
aarne
65a25ad181
bronzeage grammar restored except for three ones with issues ; arabic transliteration added
2008-08-16 18:55:28 +00:00
aarne
20fe8a8c20
improved error message in pgf compilation to help debugging
2008-08-16 12:27:59 +00:00
aarne
1302960992
forgiving names in concrete that are not in abstract, with a warning (no warning guaranteed if this is because restricted inheritance)
2008-08-14 20:25:52 +00:00
bjorn
b296099441
Include concrete syntax flags in generated JS.
2008-08-13 12:08:11 +00:00
bjorn
428d92eac0
Set charset to utf-8 when printing html code. This is needed to get Safari to display resdemo.html correctly.
2008-06-27 18:14:19 +00:00
krasimir
bff0a96b0b
the shell now supports cp1251 coding with se command. works with the word completion as well
2008-06-27 10:33:46 +00:00
aarne
ae83ab19ed
version-related changes in Makefile and help
2008-06-27 09:09:29 +00:00
aarne
4a9bc587a5
moved interruption to GFI level to capture all commands
2008-06-26 21:01:51 +00:00
aarne
9d909f32cf
fixed decodeUTF8 for pgf; removed old resources from darcs
2008-06-26 19:41:25 +00:00
aarne
544fc47489
oops, forgot add Coding
2008-06-26 18:48:28 +00:00
aarne
7f9a0cdd34
uniform encoding: gfo and pgf in UTF8, internal in unicode
2008-06-26 16:35:45 +00:00
aarne
712717e461
utf8 adjustments
2008-06-26 06:52:01 +00:00