Draft Press Release

(todo: graph at right needs updating)

Unicode CLDR 31 provides an update to the key building blocks for software supporting the world's languages. This data is used by all major software systems for internationalization and localization, adapting software to the conventions of different languages for common software tasks.

Some of the improvements in the release are:

  • Canonical codes
    • The subdivision codes have all been changed to the BCP 47 format.
    • The locales in the language-territory population data are in canonical format.
    • The timezone ID for GMT has been split from UTC.
    • There is a mechanism for identifying hybrid locales, such as Hinglish.
  • Emoji 5.0
    • Short names and keywords have been updated for English. (Data for other languages will be gathered in the next cycle.)
    • Collation (sorting) adds the new Emoji 5.0 characters and sequences, along with some fixes for Emoji 4.0 characters and sequences.
    • For Emoji usage, subdivision names for Scotland, Wales, and England have been added for 65 languages.
For further details and links to documentation, see the CLDR Release Notes.

(below this line is left from CLDR 30, not yet updated)

MARK: My recommendation is:

  1. Just pour the https://sites.google.com/site/cldr/index/downloads/cldr-30?pli=1#TOC-Overview language in above, after finalized. Keeping bullets for clarity.
  2. For the image, it should be a current snapshot of https://sites.google.com/site/cldr/index/downloads/cldr-30?pli=1#TOC-Growth
  3. For the tweet, I suggest:
    1. New CLDR v30! Unicode 9.0, full Emoji 4.0 names, extended, relative times, language names, … http://cldr.unicode.org/index/downloads/cldr-30

(Not generally part of the announcement posting, but here for reference:)

About the Unicode Consortium

The Unicode Consortium is a non-profit organization founded to develop, extend and promote use of the Unicode Standard and related globalization standards.

The membership of the consortium represents a broad spectrum of corporations and organizations, many in the computer and information processing industry. Members include: Adobe, Apple, EmojiXpress, Facebook, Google, Government of Bangladesh, Government of India, Huawei, IBM, Microsoft, Monotype Imaging, Netflix, Sultanate of Oman MARA, Oracle, Rajya Marathi Vikas Sanstha, SAP, Symantec, Tamil Virtual University, The University of California (Berkeley), plus well over a hundred Associate, Liaison, and Individual members. For a complete member list, go to http://www.unicode.org/consortium/members.html.

For more information, please contact the Unicode Consortium at http://www.unicode.org/contacts.html.