blob: 3679b1be77d14c25def43895976391f08a7b54b1 [file] [log] [blame]
Name: hunspell dictionaries
License: MPL 1.1/GPL 2.0/LGPL 2.1
Security Critical: no
Version: unknown
This folder contains the following:
.dic files
.aff files
.bdic files
.dic_delta files
README_<language>_<region>.txt files
The .bdic files are binary files, generated from the corresponding .dic and .aff
files, using convert_dict (chrome\tools\convert_dict). These binary files are
used by the spellchecker. The .dic_delta files are used to add words which are
not there in the .dic files. Irrespective of the encoding of the corresponding
.dic file, the .dic_delta files are encoded as UTF-8.
The final binary file, .bdic, is generated with words from the .dic and
additional words from the .dic_delta file. In order to get the current-most bdic
file versions, it is a good idea to rebuild them using convert_dict from the
.dic, .aff and .dic_delta files in this folder. convert_dict takes care of
duplicate entries present both in .dic and .dic_delta files.
The README_<language>_<region>.txt files contain information about the
individual dictionaries, including copyright information.
The .bdic files are versioned to force clients to download new versions when
necessary. Use the same version for all the dictionaries that you add at the
same time. Increment the major version number (3) if you're updating either dic
or aff files. Increment the minor version number (0) if you're updating only
dic_delta files. If you add or update dictionaries, make sure to update the
constants in chrome\common\
Note: the encodings for these files are usually only UTF-8 for English
dictionaries. Otherwise, they could be anything. This will lead to
errors when trying to upload so in general, you'll just have to upload
your change and cross your fingers.
The following 39 dictionaries have been appended with or without additional
words using the .dic_delta files (as of December 14th 2012), and are covered
under the existing GPL/LGPL/MPL tri-license in COPYING:
af_ZA: No changes
bg_BG: No changes
ca_ES: Added words
Added NOSUGGEST flag = ! to .aff file
Added two words to .dic file with the ! flag to mark them forbidden/nosuggest.
cs_CZ: Added words
da_DK: Added words
Changed "øvrigt/mk" to "øvrigt/" in da_DK.dic to make convert_dict work.
de_DE_neu: Added words
Removed "Information" from dic_delta in favor of "Information/P" to fix
the spelling of "Informationen"
en_AU: Added words
en_CA: Added words
en_GB: Added words
en_US: Added words
es_ES: Added words
et_EE: No changes
fo_FO: No changes
fr_FR: Added words to 4.8 (modern) downloaded from
he_IL: Added words
hi_IN: Added words
hr_HR: Added words
hu-HU: Added words
id_ID: Added words
it_IT: Added words
Added NOSUGGEST flag = % to .aff file
Added three words to .dic file with the ! flag to mark them forbidden/nosuggest.
ko: Removed a line from dic that was longer than 128 characters
lt_LT: Added words.
Added NOSUGGEST flag = ! to .aff file
Added some words to .dic file with the ! flag to mark them forbidden/nosuggest.
lv_LV: Added words
nb_NO: Added words
nl_NL: Added words
pl_PL: Added words
pt_BR: Added words
pt_PT: Added words
ro_RO: Added words
sh: No changes
sk_SK: Added words
sl_SI: Added words
Added NOSUGGEST flag = ! to .aff file
Added five words to .dic file with the ! flag to mark them forbidden/nosuggest.
Changed "Ÿvrklji/N" to "Ÿvrklji/" in sl_SI.dic to make convert_dict work.
sq: No changes
sr: Added a word
sv_SE: Added words
Changed "ω/r" to "ω/" in sv_SE.dic to make convert_dict work.
ta_IN: Removed the word "சர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர்ர" to make convert_dict work.
tr_TR: No changes
uk_UA: No changes
vi_VN: Added words
The following dictionary has a BSD-like license in README_ru_RU.txt:
ru_RU: Added words
On Jan 20, 2011, en_US.dic_delta was copied to en_AU.dic_delta and
en_GB.dic_delta so these locales would get the same additional words.
On Dec 26, 2012, we reran convert_dict on all dictionaries to add MD5 checksum
to the .bdic files, added sq, ta, and ko dictionaries, and bumped the versions
up to 3-0.
On Jan 8, 2013, we added back et_EE and tr_TR .dic and .aff files to add MD5
checksums to their .bdic files. We used version 3-0 again, because these files
were not updated with the rest of the dictionaries on Dec 26, 2012.
On Jan 9, 2013, we changed the download location of tr_TR from a Firefox website
to a Google code website. The only difference between the two locations is that
the tr_TR affix file from the Google code website includes a "FLAG num"
directive, which prevents Chrome's heapcheck tests from crashing. Given that aff
file changed, we used version 4-0.