forked from GitHub/gf-core
A Russian dictionary
A Russian dictionary generated from a wordlist created by the FreeLing project. The accompanying converter can be used to convert other wordlists in EAGLES format to GF grammars.
This commit is contained in:
24
contrib/eaglesconv/README
Normal file
24
contrib/eaglesconv/README
Normal file
@@ -0,0 +1,24 @@
|
||||
How to use:
|
||||
|
||||
1) Sort the wordlist so it can be split into sublists. It is necessary because
|
||||
the converter is quite memory-hungry, and you might not have enough RAM to
|
||||
process the whole wordlist at once.
|
||||
|
||||
./CollectLemmas dicc.src | sort > lemmas.src
|
||||
|
||||
2) Split the sorted wordlist.
|
||||
|
||||
split -l 500000 lemmas.src
|
||||
|
||||
3) Splitting has probably left forms of some lemmas spread across two
|
||||
sublists. Manually edit sublists so all forms for a lemma are present in just
|
||||
one sublist.
|
||||
|
||||
4) Run the converter.
|
||||
|
||||
./run_conv.sh xa*
|
||||
|
||||
5) The converter has produced abstract and concrete syntaxes for the
|
||||
dictionary. You can try them out with GF:
|
||||
|
||||
gf DictRus.gf
|
||||
Reference in New Issue
Block a user