CLDR 45 Release Note

No. Date Rel. Note Data Charts Spec Delta GitHub Tag Delta DTD CLDR JSON
45 2024-04-17 v45 CLDR45 Charts45 LDML45 Δ45 release-45 ΔDtd45 45.0.0

Overview

Unicode CLDR provides key building blocks for software supporting the world’s languages. CLDR data is used by all major software systems (including all mobile phones) for their software internationalization and localization, adapting software to the conventions of different languages.

CLDR 45 is a closed release with no submission period, focusing on just a few areas:

Message Format 2.0 (LDML Part 9)

Software needs to construct messages that incorporate various pieces of information. The complexities of the world’s languages make this challenging. The goal for MessageFormat 2.0 is to allow developers and translators to create natural-sounding, grammatically-correct, user interfaces that can appear in any language and support the needs of diverse cultures.

The new MessageFormat defines the data model, syntax, processing, and conformance requirements for the next generation of dynamic messages. It is intended for adoption by programming languages, software libraries, and software localization tooling. It enables the integration of internationalization APIs (such as date or number formats), and grammatical matching (such as plurals or genders). It is extensible, allowing software developers to create formatting or message selection logic that add on to the core capabilities. Its data model provides a means of representing existing syntaxes, thus enabling gradual adoption by users of older formatting systems.

Keyboard 3.0 (LDML Part 7)

Keyboard support for digitally disadvantaged languages is often lacking or inconsistent between platforms. The updated LDML Keyboard 3.0 format specifies an interchange format for keyboard data. This will allow keyboard authors to create a single mapping file for their language, which implementations can use to provide that language’s keyboard mapping on their own platform. This format allows both physical and virtual (that is, on-screen or touch) keyboard layouts for a language to be defined in a single file.

Tooling Changes

Many tooling changes are difficult to accommodate in a data-submission release, including performance work and UI improvements. The changes in v45 improve survey tool performance for linguists during data submission and vetting allowing for higher data quality. They are targeted at the v46 submission period, starting in May, 2024.

Data Changes

DTD Changes

For a full listing, see Delta DTDs.

Supplemental Data Changes

For a full listing, see ¤¤BCP47 Delta and ¤¤Supplemental Delta

Locale Changes

For a full listing, see Delta Data

File Changes

The following files were added:

JSON Data Changes

Specification Changes

Growth

For this release there are no appreciable changes.

Migration

  1. Changes to parentLocales require upgrading implementations that use that element. In particular, they need to support the new nonlikelyScript value, and use the appropriate explicit inheritance for each type of inheritance. The v44 list of locales that inherit directly from root is retained for this release, but will disappear in the future. So implementations should move as quickly as possible to support the new value. See Dtd Changes.
  2. The new SI prefixes are provided for metric units so that implementations can verify that they are providing the correct prefix values, including the new prefixes (see New Prefixes in the SI: ronto, quecto, ronna, quetta).
  3. In last release CLDR 44, Unicode updated its outbound license from the “Unicode, Inc. License - Data Files and Software” to the “Unicode License v3”. All of the substantive terms of the license remain the same. The only changes made were non-substantive technical edits. The new license is OSI-approved and has been assigned the SPDX Identifier Unicode-3.0.
  4. Keyboard has a new DTD (keyboard3.dtd and the <keyboard3> element). There have been some incompatible changes between the tech preview and the stable version now in v44.
  5. Redundant values that inherit “laterally” may be removed in production data: Some data values inherit “sideways” from another element with the same parent, in the same locale. This has been long specified in LDML: Lateral_Inheritance, but some clients may not implement to the specification, and get the wrong answers. For example, consider the following items in the en locale, some added in CLDR 44 to provide clients a way to explicitly select a particular variant across locales (instead of using the default): ```
British Indian Ocean Territory

British Indian Ocean Territory

Chagos Archipelago

``` Both alt forms inherit sideways from the non-alt form. Thus in this case, the “biot” variant is redundant and will be removed in production data. Clients that are trying to select the “biot” variant but find it missing should fall back to the non-alt form. Similar behavior occurs with plural forms for units, where some plural forms may match and thus fall back to the “other” form.

Known Issues

Acknowledgments

Many people have made significant contributions to CLDR and LDML; see the Acknowledgments page for a full listing.

The Unicode Terms of Use apply to CLDR data; in particular, see Exhibit 1.

For web pages with different views of CLDR data, see http://cldr.unicode.org/index/charts.