summaryrefslogtreecommitdiff
path: root/external/hunspell
AgeCommit message (Collapse)Author
2018-04-03tdf#116586 fix LibreOffice crash by Hungarian personal dictionaryLászló Németh
"AkH. 11. (old orthography)", caused by bad handling of forbidden words of alias compressed dictionaries during run-time dictionary extension. Note: accepting also all – previously forbidden – affixed forms of the modified stem is a removed function: "Grammar by" spell checking (for example, in LibreOffice via SPELLML interface of Hunspell) supports user words with affixation and compounding. The original idea was to add substandard or interfering words as forbidden words – but with flags – to the dictionary, and adding the stem to the personal dictionary could “switch on” the affixed forms of the word, too. Now the suggested method is using "Grammar by" personal dictionaries, as in LibreOffice: https://wiki.documentfoundation.org/ReleaseNotes/6.0#.E2.80.9CGrammar_By.E2.80.9D_spell_checking Change-Id: I8160d3f49a73cb2481d51c336115b6c27e7c9f1d Reviewed-on: https://gerrit.libreoffice.org/51918 Tested-by: Jenkins <ci@libreoffice.org> Reviewed-by: Andras Timar <andras.timar@collabora.com> (cherry picked from commit cb717d293f31ac98d2fd214f792ac5c1441aa444)
2018-01-17Add recent Hunspell fixes and improvementsLászló Németh
from Hunspell repository to give better spell checking and suggestions. Short Hunspell commit descriptions (complete commit descriptions are in the committed Hunspell patch): 4a8921b BREAK tries to break at the second word break 957950b Spelling dictionary should be a real spelling dictionary 0b8a4d8 Use only middle replentries for compound word checking 4e4106f Reduce strange ngram suggestions 89a8ec6 Optimize condition order in walk_hashtable loop e80685c Remove SUBSTANDARD dictionary roots from suggestions. 90cb55f Clean-up ngram suggestions for lowercase words bbf2eb4 word pairs of the dic file get highest suggestion priority 0667049 check dictionary word pairs to filter compound word overgeneration ebdd308 clean-up suggestion 526f600 skip empty ph: field and support character stripping eb97eb7 Dictionary words with COMPOUNDFORBIDFLAG are removed 8912f2a Allow suggestion search for prefix + *two suffixes* caa24d6 Improve ph: usage for capitalization and Unicode 05082b4 BREAK: keep also break-at-first-break-point breaking db142a3 Fix regression in Hungarian "moving rule" 711466a fix compiler warnings 7ba5beb Support dictionary based REP replacements Reviewed-on: https://gerrit.libreoffice.org/45918 Tested-by: Jenkins <ci@libreoffice.org> Reviewed-by: László Németh <nemeth@numbertext.org> (cherry-picked from the commit 721e6eb9899aa4ff6ee943e81caddb1722139adf) Change-Id: I7f7202acf2dccec05ef9c542362b432aa8566a86 Reviewed-on: https://gerrit.libreoffice.org/48048 Reviewed-by: László Németh <nemeth@numbertext.org> Tested-by: László Németh <nemeth@numbertext.org> Tested-by: Jenkins <ci@libreoffice.org> Reviewed-by: Andras Timar <andras.timar@collabora.com>
2017-11-17apply patch for run-time Hunspell dictionary extensionLászló Németh
to use in new "Affixation/Compounding By" feature of language-specific LibreOffice user dictionaries. Change-Id: Icb4181717f0ff1145ac5acc130266b165132cfe9 Reviewed-on: https://gerrit.libreoffice.org/44561 Reviewed-by: László Németh <nemeth@numbertext.org> Tested-by: László Németh <nemeth@numbertext.org>
2017-11-06fix spell checking issues using recent Hunspell patchesLászló Németh
Test: English word "Ian" are "item" are not allowed as "İan", "İtem" now. Patch list with commit ids in Hunspell repository: commit 66badb7449c2053c89456f11a7f71f3f5916b550 Extend dotless i and dotted I rules to Crimean Tatar language commit 88cf975c295e3ec808efb77bb1a2a031d77f0c89 Allow dotted I in dictionary, and disable bad capitalization commit 39b785a6b03b35cc8a27f43f6005dcaa432694e1 FORBIDDENWORD precedes BREAK commit 0f691abe68788d0a58e72ab66877a9f670cd2741 Remove forbidden words from dash suggestion list commit 15b2cde4f01706f0a648518a5cfc57394d015448 tdf#95024 fix compound handling for new Hungarian orthography commit de3ae6844af62300e473f7b7b66a56e54153b4b9 fix compound word part "pa:" Change-Id: Id12b5629b0c975464072b5b144743cbe40fe45a3 Reviewed-on: https://gerrit.libreoffice.org/44200 Tested-by: Jenkins <ci@libreoffice.org> Reviewed-by: Andras Timar <andras.timar@collabora.com>
2017-09-21bump hunspell to 1.6.2Caolán McNamara
Change-Id: I91d4d58f2b8ba69067de1d08476a8cebbb780535 Reviewed-on: https://gerrit.libreoffice.org/42555 Tested-by: Jenkins <ci@libreoffice.org> Reviewed-by: Caolán McNamara <caolanm@redhat.com> Tested-by: Caolán McNamara <caolanm@redhat.com>
2017-06-22--enable-optimized should be orthogonal to --enable-debug/--enable-dbgutilStephan Bergmann
Change-Id: I277f30129560ea9fa76d6439a60bb191358df99d Reviewed-on: https://gerrit.libreoffice.org/39088 Reviewed-by: Stephan Bergmann <sbergman@redhat.com> Tested-by: Stephan Bergmann <sbergman@redhat.com>
2017-06-12iOS, update hunspell "revert commit" patchjan Iversen
Added support for arm64, by modifying the patch that reverts an earlier commit. Change-Id: If0d1920c1a91b3ad44c4ae9c299270b7806db811
2017-06-07Remove gb_LinkTarget_add_generated_cxxobjectsStephan Bergmann
...in favor of gb_LinkTarget_add_generated_exception_objects. The former would have needed any flags to be passed in explicitly (but no call sites did), so e.g. StaticLibrary_graphite didn't have any debug information (when building with --enable-debug). I guess there is no downside to having C++ exception support enabled in these places, and using _add_generated_cxxobjects instead was likely an oversight in the first place (at least in the case of external/graphite/StaticLibrary_graphite.mk, it was that way ever since 1ceb47d96da9e7977c96241f49ad291ff0466970 "graphite: convert to gbuild", but for no apparent reason). Change-Id: I9986a6c5ec30a521095dbe5315e5ca649741a790
2017-03-24tdf#106751 fix regressions in Hungarian spell checkingLászló Németh
using recent fixes of Hunspell code base Change-Id: I180a2ecba924180419c5eb1a0e78b5c84e7242c4 Reviewed-on: https://gerrit.libreoffice.org/35665 Tested-by: Jenkins <ci@libreoffice.org> Reviewed-by: László Németh <nemeth@numbertext.org> Tested-by: László Németh <nemeth@numbertext.org>
2017-02-12use alternative optimizations for buffer creation bottleneckCaolán McNamara
Change-Id: I9f29e8d3e5e97fe403a3e0d7d03c6ac01c7689c4
2017-02-12Resolves: tdf#105426 helpful to actually let the compiler optimize hunspellCaolán McNamara
*facepalm* Change-Id: I5f6d6cb94e1a80d2d7ae96900517aae3c8f39f08 Reviewed-on: https://gerrit.libreoffice.org/34176 Reviewed-by: Caolán McNamara <caolanm@redhat.com> Tested-by: Caolán McNamara <caolanm@redhat.com>
2017-02-10Remove MinGW supportStephan Bergmann
In OOo times, there'd originally been efforts to allow building on Windows with MinGW. Later, in LO times, this has been shifted to an attempt of cross- compiling for Windows on Linux. That attempt can be considered abandoned, and the relevant code rotting. Due to this heritage, there are now three kinds of MinGW-specific code in LO: * Code from the original OOo native Windows effort that is no longer relevant for the LO cross-compilation effort, but has never been removed properly. * Code from the original OOo native Windows effort that is re-purposed for the LO cross-compilation effort. * Code that has been added specifially for the LO cross-compilation effort. All three kinds of code are removed. (An unrelated, remaining use of MinGW is for --enable-build-unowinreg, utilizing --with-mingw-cross-compiler, MINGWCXX, and MINGWSTRIP.) Change-Id: I49daad8669b4cbe49fa923050c4a4a6ff7dda568 Reviewed-on: https://gerrit.libreoffice.org/34127 Tested-by: Jenkins <ci@libreoffice.org> Reviewed-by: Stephan Bergmann <sbergman@redhat.com>
2017-01-24just build the hunspell lib, not the restCaolán McNamara
might hack around link failures of the hunspell tools under arm Change-Id: Ib5db702c2898e909e2bb91f4c4bc6b91bd08f47b
2017-01-24Related: tdf#105426 upgrade to hunspell-1.6.0Caolán McNamara
Change-Id: I275222d1a7d78cbfb1ca046562fc8a3c314b0fae Reviewed-on: https://gerrit.libreoffice.org/33454 Reviewed-by: Caolán McNamara <caolanm@redhat.com> Tested-by: Caolán McNamara <caolanm@redhat.com>
2016-12-22build hunspell with _GLIBCXX_DEBUGCaolán McNamara
Change-Id: Ida2ce221f1e35afd946545c2a36b72dd2386e7dd
2016-12-21external/hunspell: Work around -Werror,-Wundef with clang-clStephan Bergmann
...when lingucomponent/source/spellcheck/spell/sspellimp.cxx includes hunspell.hxx Change-Id: Ibb4b2901a9dfbf7e145281bedeaa9d8daf79aba5
2016-12-14update to latest hunspell 1-5 releaseCaolán McNamara
Change-Id: Ifb412506f2e36878d44d0e4f6360ae8d070ffa15 Reviewed-on: https://gerrit.libreoffice.org/28488 Tested-by: Jenkins <ci@libreoffice.org> Reviewed-by: Caolán McNamara <caolanm@redhat.com> Tested-by: Caolán McNamara <caolanm@redhat.com>
2016-05-03bump to hunspell 1.4.1Caolán McNamara
Change-Id: I20d9abfd73586daf1ed99f0e1a2481840e61f90e
2016-04-18try next hunspell 1.4.0 prerelease milestoneCaolán McNamara
try using reverse iterators here then and remove dictmgr which we don't seem to use or need Change-Id: I981fccc4ca7562d5ab707dc2f605e409169dec07
2016-04-15hunspell: MSVC debug runtime iterators don't want to decrement past begin()Michael Stahl
Change-Id: I258dc551d14ba72113b284be3566ba924434ab4c
2016-04-15test pre 1.4.0 hunspellCaolán McNamara
Change-Id: Ied87b41ec02b993fd08ac13f325b9489b6f07456
2016-04-07external/hunspell: -fsanitize=shiftStephan Bergmann
Change-Id: Ifc78a8f1a0a6fa29ef9becc5432e581f42241467
2016-04-06bump hunspell to 1.3.4Caolán McNamara
Change-Id: If7c4868c4296d1ca0e485dc06fdf3472e2e8653f Reviewed-on: https://gerrit.libreoffice.org/23863 Tested-by: Jenkins <ci@libreoffice.org> Reviewed-by: Caolán McNamara <caolanm@redhat.com> Tested-by: Caolán McNamara <caolanm@redhat.com>
2015-10-14tdf#95024 fix compound word handling for new Hungarian orthographyLászló Németh
This commit contains the recent Hunspell fix for Hungarian compound word handling (commit 42807f970ac2d65f0d13a7c57eb454b210e92240 in Hunspell git repository), changing spell checking only in Hungarian documents. Change-Id: I1c6c3736ecf8c1e2fffcf1c53959b25dc9d27966
2015-09-11Resolves: rhbz#1261421 crash on mashing hangul korean keyboardCaolán McNamara
Change-Id: Ie066c7f83ad15bec198f2091a3b084468c502766
2015-03-03remove more SunStudio cruftMichael Stahl
Change-Id: I1e1ca09e870461990c919938ed0902c8b90c4413
2015-01-13external/hunspell: Work around -fsanitize=shiftStephan Bergmann
Change-Id: I66ac6ec25615698382d065db2b782950cbc154e4
2015-01-12external/hunspell: Work around -fsanitize=shiftStephan Bergmann
Change-Id: Id93e61a1b9893ae026056a9d3e03c88b259f0ff8
2014-09-26Hunspell: fix buffer overflow during morphological analysisLászló Németh
affected: thesaurus usage in a Hungarian document test case: press Ctrl+F7 on the word "művészegyéniség" Change-Id: I024568e81265c4ce3e05f718bf9147229416ab73
2014-08-28upgrade to hunspell 1.3.3Caolán McNamara
Change-Id: Ibf8345885e99ae8eb9dd6d64ccd9b5407d8d610e Reviewed-on: https://gerrit.libreoffice.org/11136 Reviewed-by: Caolán McNamara <caolanm@redhat.com> Tested-by: Caolán McNamara <caolanm@redhat.com>
2014-07-20fdo#80363 _wfullpath in MyThes and HyphenLászló Németh
Change-Id: I4232040d4c62220389ca356797d18b1c87673e64
2014-07-19fdo#80363 add _wfullpath to expand ".." in dictionary pathsLászló Németh
Change-Id: I6c9edd0b4d2c63f7735d090e5d2d72d5fb81b921 Reviewed-on: https://gerrit.libreoffice.org/10400 Reviewed-by: Andras Timar <andras.timar@collabora.com> Tested-by: Andras Timar <andras.timar@collabora.com>
2014-04-25try including windows.h for MultiByteToWideCharCaolán McNamara
Change-Id: Ib737e520ff16f2e467504e85fc3969d525fd742a
2014-04-25fdo#48017 fix WIN32 long path name support of spelling dictionariesLászló Németh
Change-Id: I1ccaae9dba4f82cd50531890e159519a765a0fff
2014-02-27normalize values of CROSS_COMPILINGMichael Stahl
Change-Id: I0cc43cef91e3fcd82a3558a16ab0afbd4d56b141
2013-10-18Fix building from inside modules moved to externalKhaled Hosny
Change-Id: Id6023dc3751fe70984f489682be17d1ab1855f71
2013-10-18fdo#70393: move hunspell to a subdir of externalKhaled Hosny
Change-Id: Id3aafd95cb8c064f84898844a91f350e9aad0ed4 Reviewed-on: https://gerrit.libreoffice.org/6307 Reviewed-by: David Tardon <dtardon@redhat.com> Tested-by: David Tardon <dtardon@redhat.com>