GitHub/gf-core

Fork 1

mirror of https://github.com/GrammaticalFramework/gf-core.git synced 2026-04-09 04:59:31 -06:00

Files

John J. Camilleri b47eb18f86 Minor additions to LPGF readme

2021-08-27 09:32:43 +02:00

10 KiB

Raw Blame History

LPGF testsuite & benchmark

Testsuite

LPGF must be equivalent to PGF in terms of linearisation output.

Possible exceptions:

No handling of variants (design choice)
Rendering of missing functions

N.B. Phrasebook doesn't compile with RGL after 1131058b68c204a8d1312d2e2a610748eb8032cb

Running

Because Stack insists on rebuilding things all the time, I use separate .stack-work folders for testing and benchmarking.

Assumes treebank in same folder with same abstract name as grammar, e.g. unittests/Params.treebank

stack build --work-dir .stack-work-test --test --no-run-tests
stack test  --work-dir .stack-work-test gf:test:lpgf # all LPGF tests
stack test  --work-dir .stack-work-test gf:test:lpgf --test-arguments="unittests/Params" # specific grammar
stack test  --work-dir .stack-work-test gf:test:lpgf --test-arguments="foods/Foods Fre Ger" # specific grammar and languages
stack test  --work-dir .stack-work-test gf:test:lpgf --test-arguments="phrasebook/Phrasebook"

Set environment variable DEBUG=1 to enable dumping of intermediate formats into DEBUG/ folder.

Benchmark

Compare performance metrics between LPGF and PGF[2]. Note: correctness is not checked here.

Compilation

Comparing PGF, LPGF along following criteria:

Time
Memory
Binary file size

Runtime (linearisation)

Comparing PGF, PGF2, LPGF along following criteria:

Time
Memory

Running

Run each command separately so that memory measurements are isolated. The +RTS -T -RTS is so that GHC can report its own memory usage.

stack build --work-dir .stack-work-bench --bench --no-run-benchmarks &&
stack bench --work-dir .stack-work-bench --benchmark-arguments "compile pgf  testsuite/lpgf/foods/Foods*.gf +RTS -T -RTS" &&
stack bench --work-dir .stack-work-bench --benchmark-arguments "compile lpgf testsuite/lpgf/foods/Foods*.gf +RTS -T -RTS" &&
stack bench --work-dir .stack-work-bench --benchmark-arguments "run pgf  Foods.pgf  testsuite/lpgf/foods/Foods-all.trees +RTS -T -RTS" &&
stack bench --work-dir .stack-work-bench --benchmark-arguments "run pgf2 Foods.pgf  testsuite/lpgf/foods/Foods-all.trees +RTS -T -RTS" &&
stack bench --work-dir .stack-work-bench --benchmark-arguments "run lpgf Foods.lpgf testsuite/lpgf/foods/Foods-all.trees +RTS -T -RTS"

stack build --work-dir .stack-work-bench --bench --no-run-benchmarks &&
stack bench --work-dir .stack-work-bench --benchmark-arguments "compile pgf  testsuite/lpgf/phrasebook/Phrasebook*.gf +RTS -T -RTS" &&
stack bench --work-dir .stack-work-bench --benchmark-arguments "compile lpgf testsuite/lpgf/phrasebook/Phrasebook*.gf +RTS -T -RTS" &&
stack bench --work-dir .stack-work-bench --benchmark-arguments "run pgf  Phrasebook.pgf  testsuite/lpgf/phrasebook/Phrasebook-10000.trees +RTS -T -RTS" &&
stack bench --work-dir .stack-work-bench --benchmark-arguments "run pgf2 Phrasebook.pgf  testsuite/lpgf/phrasebook/Phrasebook-10000.trees +RTS -T -RTS" &&
stack bench --work-dir .stack-work-bench --benchmark-arguments "run lpgf Phrasebook.lpgf testsuite/lpgf/phrasebook/Phrasebook-10000.trees +RTS -T -RTS"

Profiling

stack build --work-dir .stack-work-profile --profile --bench --no-run-benchmarks &&
stack bench --work-dir .stack-work-profile --profile --benchmark-arguments "compile lpgf testsuite/lpgf/phrasebook/PhrasebookFre.gf +RTS -T -p -h -RTS"

Produced files:

lpgf-bench.prof - total time and memory allocation (-p)
lpgf-bench.hp - heap profile (-h)

Open heap profile graph on-the-fly:

stack exec -- hp2ps -c lpgf-bench.hp && open lpgf-bench.ps

Convert and copy timestamped files into PROF/:

TS="$(date +%Y-%m-%d_%H%M)" &&
stack exec -- hp2ps -c lpgf-bench.hp &&
mv lpgf-bench.prof PROF/$TS.prof &&
mv lpgf-bench.ps PROF/$TS.ps &&
mv lpgf-bench.hs PROF/$TS.hp

Resources

Honing in

stack build --test --bench --no-run-tests --no-run-benchmarks &&
stack bench --benchmark-arguments "compile lpgf testsuite/lpgf/phrasebook/PhrasebookFre.gf +RTS -T -RTS"

Baseline PGF

compile: 1.600776s
size: 2.88 MB Phrasebook.pgf Max memory: 328.20 MB

Baseline LPGF = B

compile: 12.401099s
size: 3.01 MB Phrasebook.lpgf Max memory: 1.33 GB

Baseline LPGF String instead of Text

compile: 12.124689s
size: 3.01 MB Phrasebook.lpgf Max memory: 1.34 GB

Baseline LPGF with impossible pruning

compile: 7.406503s
size: 3.01 MB Phrasebook.lpgf Max memory: 1.13 GB

B -extractStrings

compile: 13.822735s
size: 5.78 MB Phrasebook.lpgf Max memory: 1.39 GB

B -cleanupRecordFields

compile: 13.670776s
size: 3.01 MB Phrasebook.lpgf Max memory: 1.48 GB

No generation at all = E

compile: 0.521001s
size: 3.27 KB Phrasebook.lpgf Max memory: 230.69 MB

+ Concat, Literal, Error, Predef, Tuple, Variant, Commented

compile: 1.503594s
size: 3.27 KB Phrasebook.lpgf Max memory: 395.31 MB

+ Var, Pre, Selection

compile: 1.260184s
size: 3.28 KB Phrasebook.lpgf Max memory: 392.17 MB

+ Record

compile: 1.659233s
size: 7.07 KB Phrasebook.lpgf Max memory: 397.41 MB

+ Projection = X

compile: 1.446217s
size: 7.94 KB Phrasebook.lpgf Max memory: 423.62 MB

X + Param

compile: 2.073838s
size: 10.82 KB Phrasebook.lpgf Max memory: 619.71 MB

X + Table

compile: 11.26558s
size: 2.48 MB Phrasebook.lpgf Max memory: 1.15 GB

RawIdents

compile: 5.393466s
size: 3.01 MB Phrasebook.lpgf Max memory: 1.12 GB

Repeated terms in compilation

Param and Table

Concr	Total	Unique	Perc
PhrasebookEng	8673	1724	20%
PhrasebookSwe	14802	2257	15%
PhrasebookFin	526225	4866	1%

Param

Concr	Total	Unique	Perc
PhrasebookEng	3211	78	2%
PhrasebookSwe	7567	69	1%
PhrasebookFin	316355	310	0.1%

Table

Concr	Total	Unique	Perc
PhrasebookEng	5470	1654	30%
PhrasebookSwe	7243	2196	30%
PhrasebookFin	209878	4564	2%

After impelementing state monad for table memoisation

worse!

compile: 12.55848s
size: 3.01 MB Phrasebook.lpgf Max memory: 2.25 GB

Params

Concr	Total	Misses	Perc
PhrasebookEng	3211	72	2%
PhrasebookSwe	7526	61	1%
PhrasebookFin	135268	333	0.2%
PhrasebookFre	337102	76	0.02%

Tables

Concr	Total	Misses	Perc
PhrasebookEng	3719	3170	85%
PhrasebookSwe	4031	3019	75%
PhrasebookFin	36875	21730	59%
PhrasebookFre	41397	32967	80%

Conclusions:

map itself requires more memory than actual compilation
lookup/insert is also as bad as actual compilation

Tried HashMap (deriving Hashable for LinValue), no inprovement. Using show on LinValue for keys is incredibly slow.

Notes on compilation

1 (see unittests/Params4)

param defns P = P1 | P2 Q = Q1 | Q2 R = RP P | RPQ P Q | R0 X = XPQ P Q

translation NB: tuples may be nested, but will be concatted at runtime

P1 = <1> P2 = <2>

Q1 = <1> Q2 = <2>

R P1 = <1,1> R P2 = <1,2> RPQ P1 Q1 = <2,1,1> RPQ P1 Q2 = <2,1,2> RPQ P2 Q1 = <2,2,1> RPQ P2 Q2 = <2,2,2> R0 = <3>

XPQ P1 Q1 = <1,1,1> XPQ P1 Q2 = <1,1,2> XPQ P2 Q1 = <1,2,1> XPQ P2 Q2 = <1,2,2>

P => Str <"P1","P2">

{p:P ; q:Q} => Str <<"P1;Q1","P1;Q2">,<"P2;Q1","P2;Q2">>

{p=P2; q=Q1} <<2>,<1>>

R => Str < <"RP P1","RP P2">, < <"RPQ P1 Q1","RPQ P1 Q2">, <"RPQ P2 Q1","RPQ P2 Q2"> >, "R0"

X => Str <<<"XPQ P1 Q1","XPQ P1 Q2">, <"XPQ P2 Q1","XPQ P2 Q2">>>

{p=P2 ; r=R0} <<2>,<3>>

{p=P2 ; r1=RP P1 ; r2=RPQ P1 Q2 ; r3=R0 } < <2> , <1, 1> , <2, 1, 2> , <3>>

2 (see unittests/Params5)

param defns

P = P1 | PQ Q Q = Q1 | QR R R = R1 | R2

translation

P1 = <1> PQ Q1 = <2,1> PQ QR R1 = <2,2,1> PQ QR R2 = <2,2,2>

Q1 = <1> QR R1 = <2,1> QR R2 = <2,2>

R1 = <1> R2 = <2>

P => Str <"P1",<"PQ Q1",<"PQ (QR R1)","PQ (QR R2)">>>

{q:Q ; p:P} => Str < <"Q1;P1",<"Q1;PQ Q1",<"Q1;PQ (QR R1)","Q1;PQ (QR R2)">>>, < <"QR R1;P1",<"QR R1;PQ Q1",<"QR R1;PQ (QR R1)","QR R1;PQ (QR R2)">>>, <"QR R2;P1",<"QR R2;PQ Q1",<"QR R2;PQ (QR R1)","QR R2;PQ (QR R2)">>>

{q=Q1 ; p=P1} = <<1>,<1>> {q=Q1 ; p=PQ Q1} = <<1>,<2,1>> {q=Q1 ; p=PQ (QR R1)} = <<1>,<2,2,1>> {q=Q1 ; p=PQ (QR R2)} = <<1>,<2,2,2>>

{q=QR R1 ; p=P1} = <<2,1>,<1>> {q=QR R1 ; p=PQ Q1} = <<2,1>,<2,1>> {q=QR R1 ; p=PQ (QR R1)} = <<2,1>,<2,2,1>> {q=QR R1 ; p=PQ (QR R2)} = <<2,1>,<2,2,2>>

{q=QR R2 ; p=P1} = <<2,2>,<1>> {q=QR R2 ; p=PQ Q1} = <<2,2>,<2,1>> {q=QR R2 ; p=PQ (QR R1)} = <<2,2>,<2,2,1>> {q=QR R2 ; p=PQ (QR R2)} = <<2,2>,<2,2,2>>

NOTE: GF will swap q and p in record, as part of record field sorting, resulting in the following:

{p:P ; q:Q} => Str < <"P1;Q1", <"P1;QR R1","P1;QR R2">>, < <"PQ Q1;Q1", <"PQ Q1;QR R1","PQ Q1;QR R2">>, < <"PQ (QR R1);Q1", <"PQ (QR R1);QR R1","PQ (QR R1);QR R2">>, <"PQ (QR R2);Q1", <"PQ (QR R2);QR R1","PQ (QR R2);QR R2">> >

{p=P1 ; q=Q1} = <<1>,<1>> {p=P1 ; q=QR R1} = <<1>,<2,1>> {p=P1 ; q=QR R2} = <<1>,<2,2>>

{p=PQ Q1 ; q=Q1} = <<2,1>,<1>> {p=PQ Q1 ; q=QR R1} = <<2,1>,<2,1>> {p=PQ Q1 ; q=QR R2} = <<2,1>,<2,2>>

{p=PQ (QR R1) ; q=Q1} = <<2,2,1>,<1>> {p=PQ (QR R1) ; q=QR R1} = <<2,2,1>,<2,1>> {p=PQ (QR R1) ; q=QR R2} = <<2,2,1>,<2,2>>

{p=PQ (QR R2) ; q=Q1} = <<2,2,2>,<1>> {p=PQ (QR R2) ; q=QR R1} = <<2,2,2>,<2,1>> {p=PQ (QR R2) ; q=QR R2} = <<2,2,2>,<2,2>>

{pp: {p:P} ; q:Q} => Str

{pp={p=P1} ; q=Q1} = <<<1>>,<1>> {pp={p=P1} ; q=QR R1} = <<<1>>,<2,1>> {pp={p=P1} ; q=QR R2} = <<<1>>,<2,2>>

{pp={p=PQ Q1} ; q=Q1} = <<<2,1>>, <1>> {pp={p=PQ Q1} ; q=QR R1} = <<<2,1>>, <2,1>> {pp={p=PQ Q1} ; q=QR R2} = <<<2,1>>, <2,2>>

{pp={p=PQ (QR R1)} ; q=Q1} = <<<2,2,1>>,<1>> {pp={p=PQ (QR R1)} ; q=QR R1} = <<<2,2,1>>,<2,1>> {pp={p=PQ (QR R1)} ; q=QR R2} = <<<2,2,1>>,<2,2>>

{pp={p=PQ (QR R2)} ; q=Q1} = <<<2,2,2>>,<1>> {pp={p=PQ (QR R2)} ; q=QR R1} = <<<2,2,2>>,<2,1>> {pp={p=PQ (QR R2)} ; q=QR R2} = <<<2,2,2>>,<2,2>>

10 KiB Raw Blame History

LPGF testsuite & benchmark

Testsuite

Running

Benchmark

Compilation

Runtime (linearisation)

Running

Profiling

Honing in

Repeated terms in compilation

After impelementing state monad for table memoisation

Notes on compilation

1 (see unittests/Params4)

2 (see unittests/Params5)

10 KiB

Raw Blame History