1
0
forked from GitHub/gf-core

A Russian dictionary

A Russian dictionary generated from a wordlist created by the FreeLing
project. The accompanying converter can be used to convert other wordlists in
EAGLES format to GF grammars.
This commit is contained in:
Nick Frolov
2011-12-31 02:36:24 +00:00
parent 4bf634ffcd
commit 2ff1d34c86
10 changed files with 272168 additions and 0 deletions

24
contrib/eaglesconv/README Normal file
View File

@@ -0,0 +1,24 @@
How to use:
1) Sort the wordlist so it can be split into sublists. It is necessary because
the converter is quite memory-hungry, and you might not have enough RAM to
process the whole wordlist at once.
./CollectLemmas dicc.src | sort > lemmas.src
2) Split the sorted wordlist.
split -l 500000 lemmas.src
3) Splitting has probably left forms of some lemmas spread across two
sublists. Manually edit sublists so all forms for a lemma are present in just
one sublist.
4) Run the converter.
./run_conv.sh xa*
5) The converter has produced abstract and concrete syntaxes for the
dictionary. You can try them out with GF:
gf DictRus.gf