mirror of
https://github.com/GrammaticalFramework/gf-core.git
synced 2026-04-09 21:19:31 -06:00
A Russian dictionary generated from a wordlist created by the FreeLing project. The accompanying converter can be used to convert other wordlists in EAGLES format to GF grammars.
24 lines
638 B
Plaintext
24 lines
638 B
Plaintext
How to use:
|
|
|
|
1) Sort the wordlist so it can be split into sublists. It is necessary because
|
|
the converter is quite memory-hungry, and you might not have enough RAM to
|
|
process the whole wordlist at once.
|
|
|
|
./CollectLemmas dicc.src | sort > lemmas.src
|
|
|
|
2) Split the sorted wordlist.
|
|
|
|
split -l 500000 lemmas.src
|
|
|
|
3) Splitting has probably left forms of some lemmas spread across two
|
|
sublists. Manually edit sublists so all forms for a lemma are present in just
|
|
one sublist.
|
|
|
|
4) Run the converter.
|
|
|
|
./run_conv.sh xa*
|
|
|
|
5) The converter has produced abstract and concrete syntaxes for the
|
|
dictionary. You can try them out with GF:
|
|
|
|
gf DictRus.gf |