diff options
author | Edward Welbourne <edward.welbourne@qt.io> | 2019-10-23 17:17:49 +0200 |
---|---|---|
committer | Edward Welbourne <edward.welbourne@qt.io> | 2019-10-30 17:38:02 +0100 |
commit | c3eb521a0f10112df6b61d2592351c4eef2e1f9b (patch) | |
tree | b881b60a1645acb9796ab8983654fe7e6ff75e28 /util/unicode/README | |
parent | fc3c6cde83ef705565cc8f3b960cc838329049d6 (diff) |
Update UCD data to Unicode 12.1.0's Revision 24
Had to teach the update program to accept category Lm as for
Joining_Transparent, for the sake of a new ArabicShaping.txt entry.
Added three new Unicode versions, several new scripts and a new
word-break class.
Updated UCD's test data for tst_QTextBoundaryFinder. This left 57
tests failing; I have commented out the data rows for those tests,
pending someone with more knowledge addressing this.
Task-number: QTBUG-79631
Task-number: QTBUG-79418
Change-Id: Ic33d3b3551195d47a84d98e84020f57a68f0b201
Reviewed-by: Eskil Abrahamsen Blomfeldt <eskil.abrahamsen-blomfeldt@qt.io>
Diffstat (limited to 'util/unicode/README')
-rw-r--r-- | util/unicode/README | 29 |
1 files changed, 22 insertions, 7 deletions
diff --git a/util/unicode/README b/util/unicode/README index 1f14459d14..29594b6caa 100644 --- a/util/unicode/README +++ b/util/unicode/README @@ -6,24 +6,39 @@ To update: * Unpack the zip file; for each file in data/, replace with the new version; find the *BreakProperty.txt in auxiliary/. (These last are only in the zip, not in the web-space's unpacked versions.) +* In tst_QTextBoundaryFinder's data/ sub-directory, update its files + from the auxiliary/ sub-directory of the UCD data. * If needed, add an entry to enum QChar::UnicodeVersion for the new Unicode version * In that case, also update main.cpp's initAgeMap and DATA_VERSION_S* to match * Build this project. Its binary, unicode, ignores command-line options and assumes it is being run from this directory. When run, - it produces lots of output. Hopefully that doesn't matter. -* Assertions may trigger: if so, study code and understand what's more - complicated about this update; talk to folk named in the git logs, - maybe push a WIP to gerrit to solicit advice. Some bit-field may - need to be expanded, for example. In some cases QChar may need - additions to some of its enums. -* Build with the modified code, fix any compilation issues. + it produces lots of output. If it gets as far as updating + qunicodetables.cpp the output hopefully doesn't matter. +* It'll end prematurely with a qFatal() message if it needs updates, + either in main.cpp or in QChar: + * "unassigned or unhandled age value:" initAgeMap() and + QChar::UnicodeVersion; + * "Unhandled script property value:" initScriptMap(), QChar::Script, + qharfbuzzng.cpp's _qtscript_to_hbscript[] array and + qfontconfigdatabase.cpp's specialLanguages. + * "unassigned word break class:" enum WordBreakClass, + word_break_class_string and initWordBreak(); +* Assertions or other qFatal()s may trigger: if so, study code and + understand what's more complicated about this update; talk to folk + named in the git logs, maybe push a WIP to gerrit to solicit + advice. Some bit-field may need to be expanded, for example. In some + cases QChar may need additions to some of its enums. +* Build with the modified code, fix any compilation issues, make check + in suitable directories, including tst_QTextBoundaryFinder. * That may have updated qtbase/src/corelib/text/qunicodetables.cpp; if so the update matters; be sure to commit the changes to data/ at the same time and update text/qt_attribution.json to match; use the UCD Revision number, rather than the Unicode standard number, as the Version, for all that qunicodetables.cpp uses the latter. +* If you don't normally build in the source tree, remember to delete + qtbase/.qmake.stash while you're cleaning up. The script writingSystems.sh generates a list of writing systems, ostensibly as a the basis for updating QFontDatabase::WritingSystem |