Recent from talks
Nothing was collected or created yet.
Enclosed Alphanumerics
View on Wikipedia| Enclosed Alphanumerics | |
|---|---|
| Range | U+2460..U+24FF (160 code points) |
| Plane | BMP |
| Scripts | Common |
| Assigned | 160 code points |
| Unused | 0 reserved code points |
| Unicode version history | |
| 1.0.0 (1991) | 139 (+139) |
| 3.2 (2002) | 159 (+20) |
| 4.0 (2003) | 160 (+1) |
| Unicode documentation | |
| Code chart ∣ Web page | |
| Note: [1][2] | |
Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.
It is currently fully allocated. Within the Basic Multilingual Plane, a few additional enclosed numerals are in the Dingbats and the Enclosed CJK Letters and Months blocks. There is also a block with more of these characters in the Supplementary Multilingual Plane named Enclosed Alphanumeric Supplement (U+1F100–U+1F1FF), as of Unicode 6.0.
Purpose
[edit]Many of these characters were originally intended for use as bullets for lists.[3] The parenthesized forms are historically based on typewriter approximations of the circled versions.[3] Although these roles have been supplanted by styles and other markup in "rich text" contexts, the characters are included in the Unicode standard "for interoperability with the legacy East Asian character sets and for the occasional text context where such symbols otherwise occur."[3] The Unicode Standard considers these characters to be distinct from characters which are similar in form but specialized in purpose, such as the circled C, P or R characters which are defined as copyright and trademark symbols or the circled a used for an at sign.[3]
A circled s (Ⓢ) was used in documents circa 1900 printed by German missionaries, especially the Basel Mission, in the Malayalam language to denote a ditto mark.[4]
Block
[edit]| Enclosed Alphanumerics[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+246x | ① | ② | ③ | ④ | ⑤ | ⑥ | ⑦ | ⑧ | ⑨ | ⑩ | ⑪ | ⑫ | ⑬ | ⑭ | ⑮ | ⑯ |
| U+247x | ⑰ | ⑱ | ⑲ | ⑳ | ⑴ | ⑵ | ⑶ | ⑷ | ⑸ | ⑹ | ⑺ | ⑻ | ⑼ | ⑽ | ⑾ | ⑿ |
| U+248x | ⒀ | ⒁ | ⒂ | ⒃ | ⒄ | ⒅ | ⒆ | ⒇ | ⒈ | ⒉ | ⒊ | ⒋ | ⒌ | ⒍ | ⒎ | ⒏ |
| U+249x | ⒐ | ⒑ | ⒒ | ⒓ | ⒔ | ⒕ | ⒖ | ⒗ | ⒘ | ⒙ | ⒚ | ⒛ | ⒜ | ⒝ | ⒞ | ⒟ |
| U+24Ax | ⒠ | ⒡ | ⒢ | ⒣ | ⒤ | ⒥ | ⒦ | ⒧ | ⒨ | ⒩ | ⒪ | ⒫ | ⒬ | ⒭ | ⒮ | ⒯ |
| U+24Bx | ⒰ | ⒱ | ⒲ | ⒳ | ⒴ | ⒵ | Ⓐ | Ⓑ | Ⓒ | Ⓓ | Ⓔ | Ⓕ | Ⓖ | Ⓗ | Ⓘ | Ⓙ |
| U+24Cx | Ⓚ | Ⓛ | Ⓜ | Ⓝ | Ⓞ | Ⓟ | Ⓠ | Ⓡ | Ⓢ | Ⓣ | Ⓤ | Ⓥ | Ⓦ | Ⓧ | Ⓨ | Ⓩ |
| U+24Dx | ⓐ | ⓑ | ⓒ | ⓓ | ⓔ | ⓕ | ⓖ | ⓗ | ⓘ | ⓙ | ⓚ | ⓛ | ⓜ | ⓝ | ⓞ | ⓟ |
| U+24Ex | ⓠ | ⓡ | ⓢ | ⓣ | ⓤ | ⓥ | ⓦ | ⓧ | ⓨ | ⓩ | ⓪ | ⓫ | ⓬ | ⓭ | ⓮ | ⓯ |
| U+24Fx | ⓰ | ⓱ | ⓲ | ⓳ | ⓴ | ⓵ | ⓶ | ⓷ | ⓸ | ⓹ | ⓺ | ⓻ | ⓼ | ⓽ | ⓾ | ⓿ |
Notes
| ||||||||||||||||
Emoji
[edit]The Enclosed Alphanumerics block contains one emoji: U+24C2, the enclosed M used as a symbol for mask works.[5][6]
It defaults to a text presentation and has two standardized variants defined to specify text presentation (U+FE0E VS15) or emoji-style (U+FE0F VS16).[7]
| U+ | 24C2 |
| base code point | Ⓜ |
| base+VS15 (text) | Ⓜ︎ |
| base+VS16 (emoji) | Ⓜ️ |
History
[edit]The following Unicode-related documents record the purpose and process of defining specific characters in the Enclosed Alphanumerics block:
| Version | Final code points[a] | Count | L2 ID | WG2 ID | Document |
|---|---|---|---|---|---|
| 1.0.0 | U+2460..24EA | 139 | (to be determined) | ||
| L2/11-438[b][c] | N4182 | Edberg, Peter (2011-12-22), Emoji Variation Sequences (Revision of L2/11-429) | |||
| 3.2 | U+24EB..24FE | 20 | L2/99-238 | Consolidated document containing 6 Japanese proposals, 1999-07-15 | |
| N2093 | Addition of medical symbols and enclosed numbers, 1999-09-13 | ||||
| L2/00-010 | N2103 | Umamaheswaran, V. S. (2000-01-05), "8.8", Minutes of WG 2 meeting 37, Copenhagen, Denmark: 1999-09-13—16 | |||
| L2/00-296 | N2256 | Sato, T. K. (2000-09-04), Circled Numbers in JIS X 0213 | |||
| 4.0 | U+24FF | 1 | L2/01-480 | Muller, Eric (2001-12-14), Proposal to add NEGATIVE CIRCLED DIGIT ZERO | |
| L2/02-193 | Muller, Eric (2001-12-14), Proposal to add Negative Circled Digit Zero | ||||
| L2/02-070 | Moore, Lisa (2002-08-26), "NEGATIVE CIRCLED DIGIT ZERO", Minutes for UTC #90, Consensus: Accept the character NEGATIVE CIRCLED DIGIT ZERO at U+24FF. | ||||
See also
[edit]- Special characters:
- Enclosed C
- Enclosed A
- Copyright symbol
- Registered trademark symbol
- Sound recording copyright symbol
- At sign (partially enclosed A)
- Character sets:
- Japanese rebus monogram
- Enclosed Alphanumeric Supplement
- Enclosed CJK Letters and Months (extends enclosed Arabic decimal numbers to ㊿)
References
[edit]- ^ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
- ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
- ^ a b c d The Unicode Standard, 6.0.1
- ^ Joseph Muliyil; M Krishnan (1904). "Contents". The New Malayalam Reader (in Malayalam). Mangalore: Basel Mission Book and Tract Repository. p. vii.
- ^ "UTR #51: Unicode Emoji". Unicode Consortium. 2023-09-05.
- ^ "UCD: Emoji Data for UTR #51". Unicode Consortium. 2023-02-01.
- ^ "UTS #51 Emoji Variation Sequences". The Unicode Consortium.
Enclosed Alphanumerics
View on GrokipediaDefinition and Scope
Core Definition
Enclosed Alphanumerics constitutes a Unicode block designated U+2460–U+24FF in the Basic Multilingual Plane, encompassing typographical symbols where individual Latin letters or Arabic-Indic digits are rendered within enclosing geometric forms such as circles or parentheses.[6] This block includes 160 assigned code points, primarily featuring enclosed representations of digits 0 through 20 and alphabetic characters A–Z in both uppercase and lowercase variants.[7] Examples include circled digits like ① (U+2460, CIRCLED DIGIT ONE) through ⑩ (U+2469, CIRCLED DIGIT TEN), parenthesized forms such as ⑴ (U+2474, PARENTHESIZED DIGIT ONE), and fully circled letters like Ⓐ (U+24B6, CIRCLED LATIN CAPITAL LETTER A).[1] The characters in this block are designed for sequential or emphatic numbering and labeling in text, distinguishing them from plain alphanumerics by their visual enclosure, which aids in hierarchical or bulleted lists without relying on font-specific styling.[8] Unlike related blocks such as Enclosed Alphanumeric Supplement (U+1F100–U+1F1FF), which incorporates additional enclosure styles like squares and regional indicators often used in emoji contexts, Enclosed Alphanumerics focuses on simpler circular and parenthetical variants rooted in East Asian typography traditions.[9] These symbols maintain compatibility with legacy encodings and support bidirectional text rendering, with most classified as Left-to-Right (L) in Unicode properties.[7] Introduced as part of Unicode Version 1.1 in June 1993, the block standardized pre-existing symbols from standards like ISO/IEC 8859 and JIS X 0208 to facilitate cross-platform consistency in document formatting. No characters within this block are designated as emoji by default, though some may receive color rendering in certain environments based on system-wide emoji support.Distinguishing Features
Enclosed alphanumerics consist of typographical symbols that integrate Latin capital and small letters or Arabic-Indic digits within enclosures such as circles, parentheses, or appended full stops, encoded as discrete code points in the Unicode block U+2460–U+24FF.[1] This precomposed structure distinguishes them from standard alphanumerics by embedding visual emphasis and demarcation in a single glyph, enabling uses like ordered labeling or stylistic variation without additional diacritics or text markup.[7] Subtypes within the block highlight specialized enclosure styles: circled digits 1–20 (U+2460–U+2473), parenthesized digits 1–10 (U+2474–U+247D), digits 1–20 with full stops (U+2488–U+249B), circled uppercase Latin letters A–Z (U+24B6–U+24CF), and circled lowercase Latin letters a–z (U+24D0–U+24E9).[1] Parenthesized lowercase letters occupy U+249C–U+24B5, while negative circled numbers (U+24EB–U+24FF) provide inverted variants for contrast.[7] These fixed inventories contrast with arbitrary enclosures achievable via combining sequences elsewhere in Unicode, prioritizing compatibility and atomic rendering over composability.[1] In Unicode properties, enclosed alphanumerics differ from regular letters and digits by their general category assignments—primarily other symbols (So) for letter variants or other numbers (No) for digit variants—excluding them from letter-specific behaviors like case mapping, uppercase/lowercase folding, or participation in alphabetic scripts for line breaking and word formation.[7] Assigned to the Latin script yet lacking decomposition mappings in many cases, they maintain semantic equivalence to their unenclosed counterparts only in numeric value or approximate visual form, not in collation or normalization equivalence.[1] Rendering behaviors further set them apart, as their design accommodates East Asian typographic conventions, such as thicker enclosures for circled numbers used in Japanese legal or ranking contexts, with font support ensuring consistent glyph metrics despite proportional variations.[1] No uppercase/lowercase pairings exist for parenthesized forms, reinforcing their role as fixed symbols rather than cased text elements.[1]Purpose and Design Rationale
Typographical Functions
Enclosed alphanumerics function as specialized typographical symbols for enumeration and list structuring, enabling compact and visually distinct representation of sequential items in text. These characters, such as circled digits (e.g., ① through ⑳) and parenthesized numbers (e.g., ⑴ through ⑵⓪), originated primarily from East Asian character encoding standards and were incorporated into Unicode for compatibility, allowing their use in ordered lists where plain numerals might blend into surrounding content or require additional formatting.[10] In practice, they delineate hierarchy in documents, such as numbering subsections, options in forms, or steps in instructions, without altering line spacing or relying on external styling.[11] In Japanese typography, circled alphanumerics—known as maru-zumi suuji (丸数字)—traditionally mark list elements in dense layouts, such as technical manuals, questionnaires, or educational materials, where the enclosure provides emphasis and prevents ambiguity in vertical or bidirectional text flows.[12] Parenthesized and period-appended variants (e.g., ⒈, ⒉) offer alternatives for legal or bibliographic numbering, accommodating conventions in CJK printing that favor enclosed forms for aesthetic and scannability reasons. This design supports precise rendering in fixed-width fonts, where the enclosure ensures consistent glyph widths for tabular alignment.[13] Beyond compatibility, these symbols aid in semantic markup for digital applications, distinguishing ordinal from cardinal numbers in parsed text, though their rendering can vary by font support, with some systems approximating via combining characters for unsupported ranges. Circled letters (e.g., Ⓐ through Ⓩ) extend this to alphabetic indexing, useful in glossaries or categorized lists, maintaining typographic uniformity across Latin and East Asian scripts.[3]Historical Motivations for Enclosure Styles
Enclosure styles for alphanumerics, such as circles, squares, and parentheses surrounding letters or digits, emerged primarily to provide compact, high-contrast markers for ordered lists, rankings, and annotations in printed text, where visual distinction from body content was essential for scannability and space efficiency. In East Asian typography, particularly Japanese printing traditions, circled numerals served to enumerate items in dense vertical layouts, separating labels from kanji-heavy prose and enabling hierarchical nesting—such as using plain numbers for main points and enclosed variants for subpoints—without requiring additional indentation or line breaks. This approach addressed the challenges of traditional movable type systems, where uniform character heights and tight spacing demanded symbols that stood out without disrupting flow.[11] In Western typesetting, parenthesized alphanumerics developed as a means to reinforce numerical accuracy and prevent misinterpretation or alteration, especially in legal, financial, and technical documents printed from the 16th century onward, when movable type proliferated. Printers cast specialized sorts for these enclosures to avoid hand-engraving or inconsistent manual bracketing, ensuring reproducibility across editions; for instance, parenthetical digits alongside spelled-out numbers (e.g., "five (5)") minimized fraud risks in contracts by making alterations evident. Such styles also supported educational and referential uses, like labeling diagrams or footnotes, where enclosures reduced cognitive load by grouping identifier and qualifier.[14] Broader typographic rationales included accommodating legacy practices in multilingual publishing, where enclosures facilitated compatibility across scripts—e.g., distinguishing Roman numerals in East Asian contexts—and promoted uniformity in mass-produced materials like forms and checklists. By the late 20th century, these motivations influenced digital encoding, prioritizing symbols that preserved print-era clarity amid varying font rendering. Empirical studies on legibility underscore that enclosures improve target detection in scanning tasks by 20-30% compared to plain alphanumerics, validating their persistence for informational density.[15]Technical Details
Unicode Block Allocation
The Enclosed Alphanumerics block occupies the Unicode code point range U+2460 to U+24FF within the Basic Multilingual Plane (BMP).[1] This allocation spans 160 consecutive code points, providing dedicated space for typographical symbols enclosing Latin alphanumeric characters, such as circled digits (e.g., U+2460 ①), parenthesized numbers (e.g., U+2474 ⑴), and dingbat variants (e.g., U+24B6 Ⓐ). All code points in this range are assigned, with no reserved or unallocated positions as of Unicode 17.0, reflecting the block's focused design for enclosure styles without fragmentation.[1] This block's placement in the BMP, early in the Unicode repertoire (following geometric shapes and preceding CJK radicals), prioritizes accessibility for legacy systems limited to 16-bit encoding, ensuring broad compatibility since its introduction in Unicode 1.0.[10] The allocation was structured into subranges for logical grouping: U+2460–U+24CF for numerals and letters in circles or parentheses, U+24D0–U+24EF for larger circled letters, and U+24F0–U+24FF for additional circled digits and negatives, minimizing gaps while accommodating East Asian typographic traditions influencing the designs.[10] Subsequent versions, up to Unicode 16.0, have maintained this exact allocation without reallocation or contraction, as the block achieved full utilization by Unicode 3.2 with the assignment of U+24FF (⓿).[16]Character Inventory and Encoding
The Enclosed Alphanumerics Unicode block occupies the contiguous code point range U+2460 to U+24FF within the Basic Multilingual Plane, allocating 160 assigned characters that represent typographical variants of digits and Latin letters enclosed in circles, parentheses, or similar forms.[3] These precomposed symbols serve as compatibility characters, often decomposable into a base alphanumeric plus an enclosing mark (e.g., circled digit one at U+2460 approximates a combination of digit one U+0031 and a circle), enabling round-trip mapping from legacy East Asian encodings without altering semantic content.[1] Encoding occurs via standard Unicode transformation formats such as UTF-8 (three bytes per character, e.g., U+2460 as 0xE2 0x91 0xA0) or UTF-16 (two bytes), with properties including Symbol, Other (So) category, and varying decomposition types for normalization.[3] The inventory divides into distinct subgroups focused on numeric and alphabetic enclosures, reflecting historical usage in lists, labels, and dingbats:| Subgroup | Code Point Range | Count | Description and Examples |
|---|---|---|---|
| Circled digits 1–20 | U+2460–U+2473 | 20 | Enclosed in a single circle; e.g., ① (U+2460, Circled Digit One), ⑳ (U+2473, Circled Number Twenty).[1] |
| Parenthesized digits 1–20 | U+2474–U+2487 | 20 | Enclosed in parentheses; e.g., ⑴ (U+2474, Parenthesized Digit One), ⒇ (U+2487, Parenthesized Number Twenty).[1] |
| Digits with full stop 1–20 | U+2488–U+249B | 20 | Number followed by period, enclosed; e.g., ⒈ (U+2488, Digit One Full Stop), ⒛ (U+249B, Digit Twenty Full Stop).[1] |
| Parenthesized Latin small letters a–z | U+249C–U+24B5 | 26 | Letters in parentheses, lowercase only (no case mappings); e.g., ⒜ (U+249C, Parenthesized Latin Small Letter A), Ⓩ (U+24B5, Parenthesized Latin Small Letter Z).[3] |
| Circled Latin letters A–Z, a–z | U+24B6–U+24E9 | 52 | Uppercase A–Z (U+24B6–U+24CF) and lowercase a–z (U+24D0–U+24E9) in circles; e.g., Ⓐ (U+24B6, Circled Latin Capital Letter A), ⓩ (U+24E9, Circled Latin Small Letter Z).[1] |
| Circled digit zero | U+24EA | 1 | ⓪ (U+24EA, Circled Digit Zero).[3] |
| Negative (white-on-black) circled numbers 11–20 | U+24EB–U+24F4 | 10 | Inverted enclosure for eleven to twenty; e.g., ⓫ (U+24EB, Negative Circled Number Eleven), ⓪ (wait, no: up to 20).[1] |
| Double-circled digits 1–10 | U+24F5–U+24FE | 10 | Digits in dual circles; e.g., ⓵ (U+24F5, Double Circled Digit One), ⓾ (U+24FE, Double Circled Digit Ten).[3] |
| Negative circled digit zero | U+24FF | 1 | ⓿ (U+24FF, Negative Circled Digit Zero).[1] |
