Languages vary in how they handle plurals of nouns or unit expressions ("hour" vs "hours", and so on). Some languages have two forms, like English; some languages have only a single form; and some languages have multiple forms. CLDR uses short, mnemonic tags for these plural categories:
See Language
Plural Rules for the categories for each language in CLDR.
These categories are used to provide localized units, with
a more natural ways of expressing phrases that vary in plural form,
such as "1 hour" vs "2 hours". While they cannot express all the
intricacies of natural languages, they allow for more natural phrasing
than constructions like "1 hour(s)".
Reporting Defects
When you find errors or omissions in this
data, please report the information with a bug report. Please give examples of how the forms
may differ. You don't have to give the exact rules, but it is
extremely helpful! Here's an example:
Determining Plural CategoriesThe CLDR plural categories do not necessarily match the traditional grammatical categories. Instead, the categories are determined by changes required in a phrase or sentence if a numeric placeholder changes value. Minimal pairsThe categories are can be investigated by looking a minimal pairs: where a change in value forces a change in the other words. For example, the following is a minimal pair for English, establishing a difference in category between "1" and "2".
Some languages, like Bengali, do not change the form of the following noun when the numeric value changes. However, they do change the form of referents. So a minimal pair in Bengali looks something like:
Even where nouns are invariant, other parts of a sentence might change. That is sufficient to establish a minimal pair. For example, even if all nouns in English were invariant (like 'fish' or 'sheep'), the verb changes are sufficient to establish a minimal pair.
In many cases, a single noun doesn't exhibit all the numeric forms. For example, in Welsh the following is a minimal pair that separates 0 and 1:
That word does not differ for 1 and 2, but a different word establishes a minimal pair for those two numbers:
So the above two minimal pairs establish that Welsh requires at least 3 forms. The minimal pairs are those that are required for correct grammar. So because 0 and 1 don't have to form a minimal pair (it is ok—even though often not optimal—to say "0 people") , 0 doesn't establish a separate category. However, implementations are encouraged to provide the ability to have special plural messages for 0 in particular, so that more natural language can be used:
So the first step is to see how many different case are required, based on minimal pairs, such as: {0} год 1 год {0} лет 5 лет {0} года 2 года In some languages, fractions require a separate category. For example, Ukrainian in the example above. The next step is to determine the rules: which numbers go into which categories. IntegersTest a variety of integers. Look for cases where the 'teens' (11-19) behave differently. Many languages only care about the last 2 digits only, or the last digit only. FractionsFractions are often a bit tricky to determine: languages have very different behavior for them. In some languages the fraction is ignored (when selecting the category), in some languages the final digits of the fraction are important, in some languages a number changes category just if there are visible trailing zeros. Make sure to try out a range of fractions to make sure how the numbers behave: values like 1 vs 1.0 may behave differently, as may numbers like 1.1 vs 1.2 vs 1.21, and so on. Choosing Plural Category NamesIn some sense, the names for the categories are somewhat arbitrary. Yet for consistency across languages, the following guidelines should be used when selecting the plural
category names.
Important NotesThese categories are only mnemonics -- the names don't necessarily imply the exact contents of the category. For example, for both English and French the number 1 has the category one (singular). In English, every other number has a plural form, and is given the category other. French is similar, except that the number 0 also has the category one and not other or zero, because the form of units qualified by 0 is also singular.
Note that these categories may be different from the forms used for pronouns or other parts of speech. In particular, they are solely concerned with changes that would need to be made if different numbers, expressed with decimal digits, are used with a sentence. If there is a dual form in the language, but it isn't used with decimal numbers, it should not be reflected in the categories. That is, the key feature to look for is: If you were to substitute a different number for "1" in a sentence or phrase, would the rest of the text be required to change? For example, in a caption for a video: "Duration: 1 hour" → "Duration: 3.2 hours" Plural rule syntax |

