CLDR 34 Release Note

Overview

Unicode CLDR 34 provides an update to the key building blocks for software supporting the world's languages. CLDR data is used by all major software systems for their software internationalization and localization, adapting software to the conventions of different languages for such common software tasks.

CLDR 34 included a full Survey Tool data collection phase, adding approximately 6M of data overall, resulting in the following language support:

🆕 is for languages reaching the level in this release. Tongan (to), Konkani (kok), Dzongkha (dz), Tatar (tt) were already in ICU, while Sindhi (sd), Maori (mi), Turkmen (tk), Javanese (jv), Interlingua (ia), Kurdish (ku), Xhosa (xh) are being included for the first time in the upcoming ICU 63. The above counts are just for the languages (with multiple entries for multi-script languages such as Serbian or Chinese) — there are many additional regional locales. 

Other notable changes include:

For details, see Detailed Specification Changes, Detailed Structure Changes, Detailed Data Changes, Growth.

Detailed Specification Changes

For detailed specification changes, see LDML34 Modifications.

Detailed Structure Changes

Detailed Data Changes

In addition, the following changes were made. This is not complete: for a full list see the list of bug fixes

Growth

The following summarizes the number of changes (additions + corrections) for languages in the release.

The following shows languages with a larger relative number of changes. For the first line, there are over 20% additions alone, not counting corrections.

TBD: add chart

Migration

Known Issues

(These may addressed in a maintenance update)

Acknowledgments

Many people have made significant contributions to CLDR and LDML; see the Acknowledgments page for a full listing.

Key to Header Links

The Unicode Terms of Use apply to CLDR data; in particular, see Exhibit 1.

For web pages with different views of CLDR data, see http://cldr.unicode.org/index/charts.