Information Hub for Linguists
2023-07-24 Vetting phase is now closed. Survey Tool is now in resolution mode.
2023-07-06 Submission phase is now closed. Survey Tool is now in vetting mode.
2023-06-21 The Survey tool has been updated: see Recent Changes
Summary
This page and the pages in this section provide guidelines for translation of CLDR strings.
Please read this page completely before starting. Visit this page (Information Hub for Linguists) every other day and check for news at the top. Also check the Known Issues for any new problems.
The information on this page will be updated at least weekly on Wednesdays if there are changes. New information will be highlighted by blue italicized text. Please bookmark it!
If you are new to the online tool (or just want to refresh your memory), please read the Survey Tool Guide before starting. Some basic topics include:
Pick your locale
Working in the Voting view
Once you are ready, go to the Survey Tool and log in.
Scope for CLDR v44
V44 is a general submission cycle. For details on the contents, see "What's new in this cycle", below.
Current Survey Tool stage: v44 Resolution
Thank you for contributing to CLDR!
For more information about the current CLDR stage please see the Resolution section in the Survey Tool stages page.
Please refer to the Milestone Schedule for the full v44 release schedule.
Prerequisites
Know Data stability expectations
Know the topics under @Getting Started to ensure familiarity with what you may encounter while working in the Survey Tool.
@General translation guides are the customary expectations for all the vetting work.
Disconnect error. If you see a persistent Loading error with a disconnect message or other odd behavior, please empty your cache.
Survey Tool email notification may be going to your spam folder. Check your spam folder regularly.
Notation
💡 — marks important translation tips
bolded text — marks items that need special attention
blue italicized text — marks latest updates
What's new in this cycle
If you are new to CLDR contribution, please read the prerequisites above first.
If you have contributed to CLDR in the past, below is the information that's new or changed since the last release.
New locales and locale upgrades
The following locales are added at a Core level, aiming for Basic level by end of release: Mi'kmaq (mic), [TBD]
The following locales have higher coverage targets: [TBD]
New Data Fields
Emoji:
There are 20 new emoji that need short names and search keywords, plus 2 new labels.
The labels are for use in constructing emoji that face to the right or left.
PersonNames: There are 3 new fields and 3 new modifiers.
Please read the section "Version 44 Changes" at the top of the Person Names Guide before doing any work on the Person Name Formats page.
Units:
The International Bureau of Weights and Measures (BIPM) has added 4 new prefixes.
There are a few new units, mostly locale-specific, such as units of area used in Japan [NOTE: not all of these will be present at start of Submission]
Revisited Data Fields
We are using a new process in this release, to make sure that certain types of fields are reviewed. The fields have been reset to Provisional, and you'll see them marked as P in the Dashboard. The number of fields to revisit varies by locale: for example, German has around 50 revisited fields while Amharic has around 100.
PersonNames: Some locales had patterns missing spaces, insufficient variation between requested lengths, and bad sample names
Please read the section "Version 44 Changes" at the top of the Person Names Guide BEFORE doing any work on the Person Name Formats page.
Remember that Sample Person Names, like Minimal Pairs, are NOT translations. Do not simply transliterate or translate the English. For more information, see the Person Names Guide.
Inconsistent Units: For example, for the same unit, Dutch had "dots" as the long display name and "pixels" as the short display name; German had "Tassen" as the long display name and "Cups" as the short display name. When you review these, please scan the Units pages for other inconsistencies.
Changes to English: These are cases of substantial changes, where just a warning is insufficient. Examples:
The name of the Islamic calendar is being changed to indicate the specific association with the Hijrah.
See https://st.unicode.org/cldr-apps/v#/fr/Keys/3df52b1e4d65309a and the following 3 related calendar names.
The name of the British Indian Ocean Territories is being changed to allow for two variants plus a default. This allows implementations to choose among them.
For the 'biot' variant, use a name corresponding to "British Indian Ocean Territories"
For the 'chagos' variant, use a name corresponding to "Chagos Archipelago" or "Chagos Islands"
For the main variant, use whichever is the most commonly used form in your language.
Survey Tool Changes
Announcements. There is a new announcement mechanism that lets the TC and managers communicate more effectively with you.
Forum. Improvements have been made to the forum to make it easier to filter posts to only those which you have yet to respond to (CLDR-14380).
Needing action has been updated to show only open posts that you have not responded to, or that you have voted against and the forum poster has since responded to.
Open discussions has been split into three categories:
Open discussion - all open forum posts
Open requests by you - all requests you have made that are currently open
Open requests by others - all requests made by others that are currently open
Info panel. Improvements include:
Bidi Examples. If you are working with a bidirectional language, be aware of the Right-to-Left and Neutral contexts. The Survey Tool now shows examples in both a strong right-to-left context and a neutral context, because we have seen issues where vetters removed the ALM bidi marks or modified the patterns without considering the neutral context. There are additional examples for number formats and numeric date/time patterns showing the results in different contexts, and additional examples for currency formats showing currency symbols with different directionality. Please review existing data with the new tooling and fix any directionality issues.
Inheritance. The Survey Tool shows detailed information about how values inherit from other places, using a new icon in the Info Panel. This is for development use, and is visible to Managers and above.
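For the Bidi Examples item above, the following is a minimal sketch of how you might check whether an edited pattern has lost a bidi mark that the previous value contained. It is not part of the Survey Tool. The function name and the sample pattern strings are hypothetical and chosen only for illustration.

```python
from typing import List

# Invisible bidi marks that are easy to delete by accident when editing a pattern.
BIDI_MARKS = {
    "\u061C": "ALM (ARABIC LETTER MARK)",
    "\u200E": "LRM (LEFT-TO-RIGHT MARK)",
    "\u200F": "RLM (RIGHT-TO-LEFT MARK)",
}


def dropped_bidi_marks(old_value: str, new_value: str) -> List[str]:
    """Return the names of bidi marks that occur less often in new_value than in old_value."""
    dropped = []
    for mark, name in BIDI_MARKS.items():
        if new_value.count(mark) < old_value.count(mark):
            dropped.append(name)
    return dropped


# Hypothetical example values: the old pattern starts with an ALM, the edited one does not.
old_pattern = "\u061C#,##0.00 ¤"
new_pattern = "#,##0.00 ¤"

print(dropped_bidi_marks(old_pattern, new_pattern))
```

A non-empty result does not mean the edit is wrong. It is only a prompt to re-check the examples in both the right-to-left and the neutral context before voting.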
Alphabetic Information Page. Exemplars and parseLenients have a new format that should help prevent errors. You no longer have to worry about using \ in front of characters like [ or $, or about {...} around grouped characters. In the examples in the Info panel, you will also see the differences from the Winning item when you click on any Other item (non-Winning).
Please read Unicode Sets before doing any work on that page.
Recent Changes
2023-07-06 - Survey Tool now in Vetting mode
2023-06-21
There was a problem importing votes for certain items, causing too many items to show up as Abstained. That has been fixed, and it will automatically import old votes again when you log in. Votes that you made during this cycle will not be affected.
There is a small update to examples for the auxiliary Alphabetic information, such as in https://st.unicode.org/cldr-apps/v#/ru/Alphabetic_Information/2703e9d07ab2ef3a
The key (🗝️) has some characters that come from other locales using your script. If some of them might occur in foreign names in newspapers, etc. in your language, you might consider adding them to your auxiliary set. Don't add any that you wouldn't normally see.
Quick summary of v43
A new data item: exemplar city 'Ciudad Juárez' has been added under Timezones > North America. Updates have been made to the Person Names guide.
See CLDR v43 release page for a summary of other data changes.
Translation quality
The following are areas where we have seen data quality issues or that need closer attention.
Avoiding voting for English
For items that do not work in your language, please don't simply use English. Find a solution that works for your language. For example, if your language doesn't have a concept of "quarters", use a translation that describes the concept "three-month period" rather than “quarter-of-a-year”.
Dealing with “Same as code” errors:
Since v37, if you vote for the code, a Same as Code error is raised.
When translating codes for items such as languages, regions, scripts, and keys, it is normally an error to select the code itself as the translated name (such as “en” as the translated name for code “en” English), except for some specific cases including certain script codes (for example, code “Thai” is also the name for script Thai in several languages).
If the error appears under Typography, you can ignore it. [CLDR-13552]
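The "Same as code" rule described above can be sketched roughly as follows. This is an illustration only, not the Survey Tool's actual check. The function name, the exact-match comparison, and the contents of the exception set are assumptions.

```python
# Hypothetical set of codes whose value may legitimately equal the code itself,
# such as the script code "Thai" mentioned above.
ALLOWED_SAME_AS_CODE = {"Thai"}


def is_same_as_code(code: str, translated_name: str) -> bool:
    """Return True when the translated name merely repeats the code (assumed exact match)."""
    if code in ALLOWED_SAME_AS_CODE:
        return False
    return translated_name.strip() == code


print(is_same_as_code("en", "en"))        # the name repeats the code
print(is_same_as_code("en", "Englisch"))  # a real translated name
print(is_same_as_code("Thai", "Thai"))    # covered by the exception set
```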
Handling Display name menu variants
Translation guides
If you are new to CLDR, use the @Getting Started topics to get started and review the left Table of Contents under Translation Guides.
Known Issues
Last updated: 2023-05-26
Please review this list before getting started to avoid creating duplicate tickets. This list will be updated as fixes are made available in Survey Tool Production. If you hit a problem, please file a ticket.
GMT short value not showing up in Basic level [CLDR-15672]
Expanded section in left navigation panel flickers if clicked [CLDR-14750]
Images for the plain symbols. Non-emoji such as €, √, », ¹, §, ... do not have images in the info pane. [CLDR-13477]
Workaround: Look at the Code column; unlike the new emoji, your browser should display them there.
Resolved Issues
Last updated: XXXX-XX-XX