Information Hub for Linguists
Prerequisites
If you're new to CLDR, take the CLDR training below.
If you're already experienced with CLDR, read the Critical reminders section (mandatory).
Once you are ready, go to the Survey Tool and log in.
Check the Announcements & Updates for the latest news and Known Issues with the tool.
Disconnect error. If you see a persistent Loading error with a disconnect message or other odd behavior, please empty your cache.
Survey Tool email notification may be going to your spam folder. Check your spam folder regularly.
“Same as code” errors - when translating codes for items such as languages, regions, scripts, and keys, it is normally an error to select the code itself as the translated name. If the error appears under Typography, you can ignore. [CLDR-13552]
CLDR training (for new linguists)
Before getting started to contribute data in CLDR, and jumping in to using the Survey Tool, it is important that you understand the CLDR process & take the CLDR training. It takes about 2-3 hours to complete the training.
Understand the basics about the CLDR process and the community-based model and the Technical Committee. Read: CLDR Process
Read the Getting Started topics on the Information Hub:
*If you (individual or your organization) have not established a connection with the CLDR technical committee, start with Survey Tool Accounts.
Critical reminders (for all linguists)
You're already familiar with the CLDR process, but do keep the following in mind:
Aim at commonly used language - CLDR should reflect common-usage standards not academic /official standards (unless commonly followed). Keep that perspective in mind.
Carefully consider changes to existing standards - any change to an existing CLDR standard should be carefully considered and discussed with your fellow linguists in the CLDR Forum. Remember your change will be reflected across thousands of online products!
Keep consistency across logical groups - ensure that all related entries are consistent. If you change the name of a weekday, make sure it’s reflected across all related items. If you change an abbreviation, make sure it’s updated across your locale, etc.
Tip: The Reports are a great way to validate consistency across related logical groups, e.g. translations of date formats. Use them to proofread your work before submitting.
Avoid voting for English - for items that do not work in your language, don't simply use English. Find a solution that works for your language. For example, if your language doesn't have a concept of calendar "quarters", use a translation that describes the concept "three-month period" rather than “quarter-of-a-year”.
Watch out for complex sections and read the instructions carefully if in doubt:
Tip: The links in the Info Panel will point you to relevant instructions for the entry you’re editing/vetting. Use it if in doubt.
Survey Tool
Once trained and up to speed on Critical reminders (above), log in to the Survey Tool to begin your work.
Announcements & Updates
Tool status
Last updated: 2023-05-26
The survey tool is closed, and will remain closed during the v45 cycle.
The next open cycle will be CLDR 46, which is expected to open for submission in late-May similar to previous general open submission cycles.
V44 is a general submission cycle. For details on the contents, see "What's new in this cycle", below.
Current Survey Tool stage: v44 Resolution
For more information about the current CLDR stage please see the Resolution section in the Survey Tool stages page. Please refer to the Milestone Schedule for the full v44 release schedule.
Known Issues
Last updated: 2023-05-26
Please review this list before getting started to avoid creating duplicate tickets. This list will be updated as fixes are made available in Survey Tool Production. If you hit a problem, please file a ticket.
CLDR-15672 GMT short value not showing up in Basic level
Expanded section in left navigation panel flickers if clicked [CLDR-14750]
Images for the plain symbols. Non-emoji such as €, √, », ¹, §, ... do not have images in the info pane. [CLDR-13477]
Workaround: Look at the Code column; unlike the new emoji, your browser should display them there.
Resolved Issues
Last updated: XXXX-XX-XX
LOREM IPSUM
Recent Changes
Last updated: 2023-05-26
2023-06-21
There was a problem importing votes for certain items, causing too many items to show up as Abstained. That has been fixed, and it will automatically import old votes again when you log in. Votes that you made during this cycle will not be affected.
There is a small update to examples for the auxiliary Alphabetic information, such as in https://st.unicode.org/cldr-apps/v#/ru/Alphabetic_Information/2703e9d07ab2ef3a
The key (🗝️) has some characters that come from other locales using your script. If some of them might occur in foreign names in newspapers, etc. in your language, you might consider adding them to your auxiliary set. Don't add any that you wouldn't normally see.
Quick summary of v43
A new data item: exemplar city 'Ciudad Juárez' has been added under Timezones > North America. Updates have been made to the Person Names guide.
See CLDR v43 release page for a summary of other data changes.
New locales and locale upgrades
The following locales are added at a Core level, aiming for Basic level by end of release: Mi'kmaq (mic), [TBD]
The follow locales have higher coverage targets: [TBD]
New Data Fields
Emoji
There are new 20 new emoji that need short names and search keywords, plus 2 new labels.
The labels are for use in constructing emoji that face to the right or left.
PersonNames: There are three new fields, and 3 new modifiers.
Please read the section "Version 44 Changes" at the top of the Person Names Guide before doing any work on the Person Name Formats page.
Units:
The International Bureau of Weights and Measures (BIPM) has added 4 new prefixes.
There are a few new units, mostly locale-specific, such as units of area used in Japan [NOTE: not all of these will be present at start of Submission]
Revisited Data Fields
We are using a new process in this release, to make sure that certain types of fields are reviewed. The fields have been reset to Provisional, and you'll see them marked as P in the Dashboard. The number of fields to revisit varies by locale: for example, German has around 50 revisited fields while Amharic has around 100.
PersonNames
Some locales had patterns missing spaces, insufficient variation between requested lengths, and bad sample names
Please read the section "Version 44 Changes" at the top of the Person Names Guide BEFORE doing any work on the Person Name Formats page.
Remember that Sample Person Names, like Minimal Pairs, are NOT translations. Do not simply transliterate or translate the English. For more information, see the Person Names Guide.
Inconsistent Units
For example, for the same unit, Dutch had "dots" as the long display name and "pixels" as the short display name; German has "Tassen" as long display name and "Cups" as the short. When you do these, please scan the Units pages for other inconsistencies.
Changes to English
These are cases of substantial changes, where just a warning is insufficient. Examples:
The name of the Islamic calendar is being changed to indicate the specific association with the Hijrah.
See https://st.unicode.org/cldr-apps/v#/fr/Keys/3df52b1e4d65309a and the following 3 related calendar names.
The name of the British Indian Ocean Territories is being change to allow for two variants plus a default. This allows implementations to make a choice among them.
For the 'biot' variant, use a name corresponding to "British Indian Ocean Territories"
For the 'chagos' variant, use a name corresponding to "Chagos Archipelago" or "Chagos Islands"
For the main variant, use whichever is the most commonly used form in your language.
Survey Tool Changes
Last updated: 2023-05-26
Announcements. There is a new announcement mechanism that lets the TC and managers communicate more effectively with you.
Forum. Improvements have been make to the forum to make it easier to filter posts to only those which you have yet to respond to in CLDR-14380
Needing action has been updated to only show open posts that you have not responded or have voted against and the forum poster has since responded to.
Open discussions has been split into three categories:
Open discussion - all open forum posts
Open requests by you - all requests you have made that are currently open
Open requests by others - all requests made by others that are currently open
Info panel. Improvements include:
Bidi Examples. If you are working with a bi-directional languages, be aware of the Right-to-Left and Neutral context. The Survey Tool now shows examples with a strong RL context and neutral, and we have seen issues where vetters removed the ALM bidi marks or modified the patterns without considering the neutral context. There are additional examples for number formats and numeric date/time patterns showing the results in different contexts, with additional examples for currency formats showing currency symbols with different directionality. Please review existing data with the new tooling and fix any directionality issues.
Inheritance. The Survey Tool shows detail information about how values inherit from other places with a new following icon in the Info Panel. This is for development use, and visible to Managers and above.
Alphabetic Information Page. Exemplars and parseLenients have a new format that should help prevent errors. You no longer have to worry about using \ in front of characters like [ or $, or about {...} around grouped characters. In the examples in the Info panel, you will also see the differences from the Winning item when you click on any Other item (non-Winning).
Please read Unicode Sets before doing any work on that page.