Age | Commit message | Author |
|
Change-Id: Ic56383e235be27d48358944c9b6588481052297a
|
|
Change-Id: Ib5c77f185abeeaef5045780766514a813794c8e8
|
|
When the class of the word is unambiguous, limit the output to that class
only; this gives more precise and more expected results.
[Seeing the other possibilities is interesting too, but fewer, more focused
choices are probably preferable.]
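A minimal sketch of the rule described above, not the code from this commit. The function name limit_to_class, the (synonym, class) pair layout and the class tags "n" and "v" are assumptions made for illustration only.

```python
def limit_to_class(headword_classes, candidates):
    """Restrict synonym candidates when the headword has exactly one class.

    headword_classes: set of word-class tags known for the headword
                      (hypothetical tags such as "n" or "v").
    candidates:       list of (synonym, word_class) pairs.
    """
    if len(headword_classes) == 1:
        # Unambiguous class: keep only candidates of that same class.
        (only,) = headword_classes
        return [(syn, cls) for syn, cls in candidates if cls == only]
    # Ambiguous or unknown class: keep every candidate.
    return list(candidates)


candidates = [("budova", "n"), ("stavět", "v")]
print(limit_to_class({"n"}, candidates))
print(limit_to_class({"n", "v"}, candidates))
```

With a single class the verb candidate is dropped; with two classes both candidates are kept.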
Change-Id: I2876fbb4fa02c00fc7e65189812365f77b9a5ed6
|
|
Change-Id: I13b60baf14fc90aba6f07ada2fc4423d06db76e8
|
|
Change-Id: I7d75626a37d4f241d8d407a11855325e39c5fa63
|
|
Change-Id: Ie05e0c0ce8b4f9541a5a143ddf9ccf960940a3b7
|
|
This is a completely new, independent thesaurus, generated from an English <->
Czech dictionary.
The data of the dictionary are licensed under GNU Free Documentation License
1.1 or later; consequently the resulting thesaurus is GNU/FDL 1.1 or later
too.
Change-Id: I0136b413d5affd6e45a71bdd579ae196fe48dff5
|
|
Change-Id: Ifb47efe7562ca9ccc2324d4ebd966506cae2bec6
|
|
* word classification (when available)
* word blacklist
* ignore some non-translations (e.g. irregular verbs)
* ignore vulgarisms (when marked); they only add confusion
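The three filtering rules in the list above could be sketched as follows. This is an illustration under stated assumptions, not the script's actual logic: the function name keep_entry, the note argument and the markers "vulg" and "irregular verb" are hypothetical, since the log does not say how the dictionary marks such rows.

```python
def keep_entry(czech, note, blacklist):
    """Return True if a dictionary row should feed the thesaurus.

    czech:     the Czech side of the translation pair.
    note:      free-text annotation of the row (assumed field).
    blacklist: set of Czech words to skip entirely.
    """
    if czech in blacklist:
        return False              # word blacklist
    if "vulg" in note:            # hypothetical vulgarism marker
        return False
    if "irregular verb" in note:  # hypothetical non-translation marker
        return False
    return True


rows = [("dům", ""), ("slovo", "vulg."), ("šel", "irregular verb"), ("to", "")]
print([cz for cz, note in rows if keep_entry(cz, note, blacklist={"to"})])
```

Only rows that pass all three checks survive; word classification is a separate step and is not covered by this sketch.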
|
|
|
|
slovnik_data_utf8.txt is the English <-> Czech dictionary from
http://slovnik.zcu.cz/download.php, licensed under
GNU Free Documentation License 1.1 or later. The data are a snapshot
from 2016-02-24.
dictionary-to-thesaurus.py is a simple script that generates a thesaurus from
this dictionary. The idea to generate our thesaurus from a dictionary comes
from Zdenek Zabokrtsky (UFAL, Faculty of Mathematics & Physics, Charles
University in Prague).
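One plausible way to derive a thesaurus from a bilingual dictionary is to treat Czech words that share an English translation as synonym candidates. The sketch below assumes this approach and works on in-memory (english, czech) pairs; build_thesaurus is a hypothetical name, and neither the file format of slovnik_data_utf8.txt nor the real logic of dictionary-to-thesaurus.py is reproduced here.

```python
from collections import defaultdict


def build_thesaurus(pairs):
    """Group Czech words that share at least one English translation.

    pairs: iterable of (english, czech) translation pairs.
    Returns a dict mapping each Czech word to a sorted list of candidates.
    """
    by_english = defaultdict(set)
    for english, czech in pairs:
        by_english[english].add(czech)

    thesaurus = defaultdict(set)
    for czech_words in by_english.values():
        for word in czech_words:
            # Every other translation of the same English word is a candidate.
            thesaurus[word] |= czech_words - {word}

    # Drop words for which no candidate was found.
    return {word: sorted(syns) for word, syns in thesaurus.items() if syns}


pairs = [
    ("house", "dům"),
    ("house", "budova"),
    ("building", "budova"),
    ("building", "stavba"),
    ("cat", "kočka"),
]
print(build_thesaurus(pairs))
```

In this toy input "budova" picks up both "dům" and "stavba", while "kočka" has no candidates and is left out.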
The results are far better than I would have imagined; I owe Zdenek some
beers :-) Many thanks!
The source data are GNU/FDL 1.1 or later, and so is the resulting thesaurus.
The actual addition of the thesaurus to the build system will be done in a
separate commit later.
|
|
Change-Id: I40ebd1ca223fe7950ed3280c43a51a3dfbd0070e
|
|
Various incompatible stripping characters and conditions
Signed-off-by: Tomáš Chvátal <tchvatal@suse.cz>
|
|
(http://www.liberix.cz/doplnky/slovniky/ooo/dict-cs-2.oxt)
This updates all files, not just thesaurus. Btw, the thesaurus is no
longer licensed under MIT.
Change-Id: I04e93c99aed8bc57b0b5724741842020271b69c2
|
|
Change-Id: I235d23248469b760da69983575dfcd73431757d4
|
|
... adapt dictionaries to that.
|
|
Change-Id: I70388bf6b95d8692cc6f25fc5a9c7baf3a675710
|