User Dictionary specifications for some languages
Unlike English, where the declension of nouns in cases is made using prepositions, there are many languages in which the declension is made according to other rules, for example, by changing part of the word itself.
For these languages, we add the same word to the dictionary dozens of times, but many words are built and changed according to the same rules. (Russian, Ukrainian, Byelorussian and many Eastern-European languages).
Here is an example (not complete) for two Ukrainian words ("dissertation" and "compilation").
You may notice that the initial parts of these words appear in the dictionary dozens of times, and the endings of these words can be applied to thousands of other words.
What if users are allowed to add to the dictionary not only the whole word, but also in parts?
Then the initial parts of the word would appear only once, as well as the final ones:
Where "|" is a marker of separation of parts of words.
Some words can have not only endings, but also prefixes, therefore there can be more than one marker in a word. And in this case in the dictionary it will begin and end with the "|" sign:
пе~~ре~~| |роб~~лю~~| |ва~~ти
пе~~ре~~| |пи~~су~~| |ва~~ти
пе~~ре~~| |со~~ву~~| |ва~~ти
пе~~ре~~| |див~~ля~~| |тись
пе~~ре~~| |су~~ва~~н~~| |ня
Such an extension of the rules would not complicate the existing algorithm too much. This would greatly reduce the vocabulary.