Block: Combining Diacritical Marks Supplement
Range: U+1DC0–U+1DFF
Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character "Combining Grapheme Joiner" (U+034F), which prevents canonical reordering of the combining characters on either side of it. Despite its name, it does not join anything: it separates characters that would otherwise be considered a single grapheme in a given context.
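The effect of the Combining Grapheme Joiner on canonical reordering can be observed with Python's standard `unicodedata` module. The following is a minimal sketch, not a reference implementation: the choice of the two marks (acute accent and dot below) and the helper name `code_points` are illustrative assumptions.

```python
import unicodedata

ACUTE = "\u0301"      # COMBINING ACUTE ACCENT, canonical combining class 230
DOT_BELOW = "\u0323"  # COMBINING DOT BELOW, canonical combining class 220
CGJ = "\u034f"        # COMBINING GRAPHEME JOINER, canonical combining class 0


def code_points(text):
    """Return the code points of `text` in U+XXXX notation (hypothetical helper)."""
    return [f"U+{ord(ch):04X}" for ch in text]


# Without the joiner, normalization sorts adjacent marks by combining class,
# so the dot below (220) is moved in front of the acute accent (230).
plain = "a" + ACUTE + DOT_BELOW
plain_nfd = unicodedata.normalize("NFD", plain)

# With the joiner (class 0) between them, the two marks are no longer
# adjacent for the purposes of reordering and keep their original order.
with_cgj = "a" + ACUTE + CGJ + DOT_BELOW
cgj_nfd = unicodedata.normalize("NFD", with_cgj)

print(code_points(plain_nfd))
print(code_points(cgj_nfd))
```

NFD is used here rather than NFC so that the result shows only the reordering step, without any recomposition into precomposed letters.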
The following Unicode-related documents record the purpose and process of defining specific characters in the Combining Diacritical Marks block:
Possibly the greatest number of combining marks required to compose a valid character in any script encoded in Unicode is 8, for the "well-known grapheme cluster in Tibetan and Ranjana scripts", ཧྐྵྨླྺྼྻྂ, or HAKṢHMALAWARAYAṀ.
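It is encoded as a sequence of nine code points (one base letter followed by eight combining marks), U+0F67 U+0F90 U+0FB5 U+0FA8 U+0FB3 U+0FBA U+0FBC U+0FBB U+0F82, or by character name: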
TIBETAN LETTER HA + TIBETAN SUBJOINED LETTER KA + TIBETAN SUBJOINED LETTER SSA + TIBETAN SUBJOINED LETTER MA + TIBETAN SUBJOINED LETTER LA + TIBETAN SUBJOINED LETTER FIXED-FORM WA + TIBETAN SUBJOINED LETTER FIXED-FORM RA + TIBETAN SUBJOINED LETTER FIXED-FORM YA + TIBETAN SIGN NYI ZLA NAA DA.
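The breakdown above can be checked against the Unicode Character Database that ships with Python. This is a minimal sketch; the helper names `describe` and `count_marks` are hypothetical, and it assumes that "combining" is tested through the general category (Mn, Mc, Me) rather than the canonical combining class, since Tibetan subjoined letters have combining class 0.

```python
import unicodedata

# The nine code points listed above, written as escapes for clarity.
CLUSTER = "\u0f67\u0f90\u0fb5\u0fa8\u0fb3\u0fba\u0fbc\u0fbb\u0f82"


def describe(text):
    """Return (code point, general category, character name) for each character."""
    return [
        (f"U+{ord(ch):04X}", unicodedata.category(ch), unicodedata.name(ch))
        for ch in text
    ]


def count_marks(text):
    """Count characters whose general category is a mark (Mn, Mc or Me)."""
    return sum(1 for ch in text if unicodedata.category(ch).startswith("M"))


for code_point, category, name in describe(CLUSTER):
    print(code_point, category, name)

print("code points:", len(CLUSTER))
print("combining marks:", count_marks(CLUSTER))
```

The first character is the base letter, and the remaining ones are reported as marks, which is where the figure of 8 comes from.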
Some users have explored the limits of rendering in web browsers and other software by "decorating" words with multiple nonsensical diacritics per character.
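This kind of "decoration" amounts to appending several combining marks after each base character. The sketch below illustrates the idea and one way to undo it. The function names `decorate` and `strip_marks`, the particular marks chosen, and the default of three marks per character are all illustrative assumptions, not a description of any specific tool.

```python
import unicodedata

# A small, arbitrary selection of combining marks (an assumption for this sketch).
MARKS = ("\u0301", "\u0323", "\u0308", "\u0330")


def decorate(word, per_char=3, marks=MARKS):
    """Append `per_char` combining marks after every letter or digit in `word`."""
    out = []
    for ch in word:
        out.append(ch)
        if ch.isalnum():
            # Cycle through the available marks deterministically.
            out.extend(marks[i % len(marks)] for i in range(per_char))
    return "".join(out)


def strip_marks(text):
    """Remove every combining mark after canonical decomposition.

    Note that this also removes legitimate accents, not only decoration.
    """
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(
        ch for ch in decomposed if not unicodedata.category(ch).startswith("M")
    )


decorated = decorate("text")
print(decorated)
print(len(decorated))
print(strip_marks(decorated))
```

How the decorated string looks depends entirely on the font and rendering engine, which is what makes such strings a stress test for software.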