CLDR - Unicode Common Locale Data Repository

CLDR - Unicode Common Locale Data Repository

Search this site

    Navigation

    • Unicode CLDR Project
    • CLDR Releases/Downloads
    • CLDR Survey Tool
    • CLDR Change Requests
    • CLDR Charts
    • CLDR Process
    • CLDR Specifications
    • Information Hub for Linguists
    • Unicode Extensions for BCP 47
    • Implementer’s FAQ
    • Message Format Subcommittee
    • Keyboard workgroup
    • ULI (disbanded)

    Milestone Schedule

    See Current CLDR Cycle.

    Internal Development

    • CLDR Development Site
      • New CLDR Developers
      • Handling Tickets (bugs/enhancements)
    • CLDR: Big Red Switch
    • Messages
    • Design Proposals
    • Direct Modifications to CLDR Data
    • Updating Codes
    • Updating DTDs
    • Editing the CLDR Spec
    • Sitemap

    Access to Copyright and terms of use

    Unicode Utilities

    The Unicode Utilities provide a number of different utilities that use and demonstrate features of the Unicode encoding and locale data. These include the following:

    • bidi - Unicode Bidi Algorithm (UBA) Demo
    • bnf-regex - Simplified regex generation
    • breaks - Unicode Segmentation
    • Changes
    • character - Unicode Character Properties
    • confusables - Visually Similar Characters
    • idna - Internationalized Domain Names (IDN)
    • languageid - BCP47 Language Tags
    • list-unicodeset - Manipulate sets of Unicode characters
    • properties - Unicode Properties and Values
    • regex - Generate corrected regex
    • transform - Transform strings using CLDR
    • unicodeset - Compare sets of characters

    Common to many of the utilities is the use of UnicodeSet. For more information, see list-unicodeset - Manipulate sets of Unicode characters
    Subpages (13): bidi - Unicode Bidi Algorithm (UBA) Demo bnf-regex - Simplified regex generation breaks - Unicode Segmentation Changes character - Unicode Character Properties confusables - Visually Similar Characters idna - Internationalized Domain Names (IDN) languageid - BCP47 Language Tags list-unicodeset - Manipulate sets of Unicode characters properties - Unicode Properties and Values regex - Generate corrected regex transform - Transform strings using CLDR unicodeset - Compare sets of characters
    Comments

    Sign in|Recent Site Activity|Report Abuse|Print Page|Powered By Google Sites