revised the description of extra categories in morphodict README

This commit is contained in:
aarneranta
2021-05-31 09:52:12 +02:00
parent 38d852a5bb
commit 4f70384664

View File

@@ -81,9 +81,10 @@ In addition, structural words should appear here with their native lemma names:
- `Subj` - `Subj`
Additional language-specific categories can be included if the reasons are clear. Additional language-specific categories can be included if the reasons are clear.
They must then be defined in the `Ext` module for that language. They must then be importable from the `Paradigms` module for that language, together with `mk` functions.
The `Extend` module may also put them in use in syntax.
Following the model of Universal Tagset, we add a category `X` for unspecified words in `Ext`, with the linearization type `{s : Str}`. Following the model of Universal Tagset, we add a category `X` for unspecified words in `Extend`, with the linearization type `{s : Str}`.
Hence it can only be used for uninflected strings with unclear status. Hence it can only be used for uninflected strings with unclear status.
## Naming ## Naming
@@ -161,10 +162,10 @@ Some things TODO:
## Things to do ## Things to do
To support the construction of a `MorphoDict`, the following should be guaranteed in the RGL: To support the construction of a `MorphoDict`, the following should be provided in `Paradigms`:
- in `Paradigms`, explicit smart paradigms with characteristic forms and inherent features for each category - explicit smart paradigms with characteristic forms and inherent features for each category
- in `Paradigms`, API constants for all inherent features that are needed - API constants for all inherent features that are needed
- in `Paradigms`, compound-constructing functions for all categories that need them - compound-constructing functions for all categories that need them
- in `Extra`, the extra categories that one wants to include in that language - the extra categories that one wants to include in that language