Recent from talks
Nothing was collected or created yet.
KS X 1001
View on WikipediaThis article includes a list of general references, but it lacks sufficient corresponding inline citations. (November 2017) |
| MIME / IANA | ks_c_5601-1987 |
|---|---|
| Alias(es) | KS C 5601 |
| Languages | Partial support: |
| Standard | KS X 1001 |
| Classification | ISO-2022-compatible DBCS, CJK encoding |
| Encoding formats | |
| Preceded by | N-byte Hangul code (KS C 5601-1974) |
| Other related encodings | Associated supplements: KS X 1002 Other Hangul ISO 2022 DBCSes: Other CJK ISO 2022 DBCSes: |
KS X 1001, "Code for Information Interchange (Hangul and Hanja)",[d][1] formerly called KS C 5601, is a South Korean coded character set standard to represent Hangul and Hanja characters on a computer.
KS X 1001 is encoded by the most common legacy (pre-Unicode) character encodings for Korean, including EUC-KR and Microsoft's Unified Hangul Code (UHC). It contains Korean Hangul syllables, CJK ideographs (Hanja), Greek, Cyrillic, Japanese (Hiragana and Katakana) and some other characters.
KS X 1001 is arranged as a 94×94 table, following the structure of 2-byte code words in ISO 2022 and EUC. Therefore, its code points are pairs of integers 1–94. However, some encodings (UHC and Johab), in addition to providing codes for every code point, provide additional codes for characters otherwise representable only as code point sequences.
History
[edit]This standard was previously known as KS C 5601. There have been several revisions of this standard. For example, there were revisions in 1987, 1992, 1998 and 2002.
The present, double-byte, Wansung (완성; Wanseong; lit. precomposing)[1] character set was standardised by the third edition of KS C 5601,[2] which was published in 1986.[3] It is an ISO 2022 compatible encoding, typically used in EUC form, which assigns double-byte codes for non-Hangul, Hangul jamo, and the most common Hangul syllables, in contrast to Johab (조합; Johap; lit. combining)[1] which is not compatible with ISO 2022, but assigns double-byte codes to all Hangul syllables using modern jamo.[2] Wansung is technically a variable-length encoding, allowing other syllables to be represented with eight-byte sequences (using the jamo and Hangul Filler character), but this feature is not always implemented.[4]
The earliest edition of KS C 5601, published in 1974,[2] defined a variable-length[2] 7-bit character set which assigned single-byte code points to 51[3] basic Hangul jamo, somewhat analogously to JIS C 6220, in an encoding known as "N-byte Hangul".[5] The second edition, published in 1982, retained the main character set from the 1974 edition but defined two supplementary sets, including a version of Johab. Neither edition was adopted as widely as intended.[2]
Wansung was kept unchanged in the 1987 and 1992 editions. In the 1992 edition, additional annex material was added,[3] including the definition of the Johab encoding[6] in annex 3, and the older N-byte Hangul encoding in annex 4.[1][5] It was published in response to industry use of Johab as a competing encoding to Wansung, being used at the time by Hangul Word Processor. Following the introduction of Unified Hangul Code by Microsoft in Windows 95, and Hangul Word Processor abandoning Johab in favour of Unicode in 2000, Johab ceased to be commonly used.[2]
Encodings
[edit]Encoding schemes of KS X 1001 include EUC-KR (in both ASCII and ISO 646-KR based variants, the latter of which includes a won currency sign (₩) at byte 0x5C rather than a backslash) and ISO-2022-KR,[7] as well as ISO-2022-JP-2 (which also encodes JIS X 0208 and JIS X 0212). These all have the drawback that they only assign codes for the 2350 precomposed Hangul syllables which have their own KS X 1001 codepoints (out of 11172 in total, not counting those using obsolete jamo), and require others to use eight-byte composition sequences, which are not supported by some partial implementations of the standard.[4]
The Johab encoding (stipulated in annex 3 of the 1992 version of the standard) and the EUC-KR superset known as Unified Hangul Code (UHC, also called Windows-949) provide single codes for all 11172 Hangul syllables.[7][6] ISO-2022-KR and Johab are rarely used. Some operating systems extend this standard in other non-uniform ways, e.g. the EUC-KR extensions MacKorean on the classic Mac OS, and IBM-949 by IBM.
Hangul Filler
[edit]The Hangul Filler character is used to introduce eight-byte Hangul composition sequences[8][9] and to stand in for an absent element (usually an empty final) in such a sequence.[9]
Unicode includes the Wansung code Hangul Filler in the Hangul Compatibility Jamo block for round-trip compatibility, but uses its own system (with its own, differently used, filler characters) for composing Hangul. The KS X 1001 Hangul composition system is not used in Unicode, and the filler renders merely as an empty space; KS X 1001 composition sequences using modern jamo may be mapped to precomposed characters in Unicode.[9] This is not usually done with Unified Hangul Code.
For round-trip compatibility, Unicode also includes the N-byte Hangul code Hangul Filler separately in the Halfwidth and Fullwidth Forms block, named the "Halfwidth Hangul Filler".
Wansung code charts
[edit]Following are the code charts for KS X 1001 in Wansung layout. Where a pair of hexadecimal numbers is given, the smaller is used when encoded over GL (0x21-0x7E), as in ISO-2022-KR when the Korean set has been shifted to, and the larger is used in the more typical case of it being encoded over GR (0xA1-0xFE), as in EUC-KR or UHC. Johab changes the arrangement to encode all 11172 Hangul clusters separately and in order.
To illustrate vendor differences in implementation, multiple Unicode mappings are shown for some characters. Apple's HangulTalk extensions to the Wansung plane (i.e. where both bytes are in the 0xA1-0xFE range) are shown, but other HangulTalk extension ranges are not. The additional codes for composed syllables in Unified Hangul Code, and IBM's extensions in IBM-949, are also not shown, since both fall outside of the Wansung plane.
Lead bytes
[edit]| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | SP[e] | 1-_ | 2-_ | 3-_ | 4-_ | 5-_ | 6-_ | 7-_ | 8-_ | 9-_ | 10-_ | 11-_ | 12-_ | 13-_ | 14-_ | 15-_ |
| 3x/Bx | 16-_ | 17-_ | 18-_ | 19-_ | 20-_ | 21-_ | 22-_ | 23-_ | 24-_ | 25-_ | 26-_ | 27-_ | 28-_ | 29-_ | 30-_ | 31-_ |
| 4x/Cx | 32-_ | 33-_ | 34-_ | 35-_ | 36-_ | 37-_ | 38-_ | 39-_ | 40-_ | 41-_ | 42-_ | 43-_ | 44-_ | 45-_ | 46-_ | 47-_ |
| 5x/Dx | 48-_ | 49-_ | 50-_ | 51-_ | 52-_ | 53-_ | 54-_ | 55-_ | 56-_ | 57-_ | 58-_ | 59-_ | 60-_ | 61-_ | 62-_ | 63-_ |
| 6x/Ex | 64-_ | 65-_ | 66-_ | 67-_ | 68-_ | 69-_ | 70-_ | 71-_ | 72-_ | 73-_ | 74-_ | 75-_ | 76-_ | 77-_ | 78-_ | 79-_ |
| 7x/Fx | 80-_ | 81-_ | 82-_ | 83-_ | 84-_ | 85-_ | 86-_ | 87-_ | 88-_ | 89-_ | 90-_ | 91-_ | 92-_ | 93-_ | 94-_ | DEL[e] |
Non-Hanja non-precomposed sets
[edit]The rows 41 and 94 may be used for user-defined purposes.[10]
Character set 0x21 / 0xA1 (row number 1, special characters)
[edit]This set contains punctuation and other symbols, excluding punctuation present in KS X 1003 (which is included in row 3). Encodings which combine KS X 1001 with single-byte ASCII may use alternative Unicode mapping to the Halfwidth and Fullwidth Forms block for the backslash. Unicode mapping of the wave dash (tilde dash) also differs between vendors, and may be U+301C (favoured by IBM and Apple)[11][12][13] or U+223C (favoured by Microsoft).[14][15] Compare the similar but not identical handling of the JIS wave dash, and the handling of the tilde in the next row.
Except for the backslash, if two mappings are shown below, the first is used by Apple and the second is used by Microsoft.[13][15]
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | IDSP | 、 | 。 | · | ‥ | … | ¨ | 〃 | –/SHY | —/― | ‖/∥ | \/\ | 〜/∼ | ‘ | ’ | |
| 3x/Bx | “ | ” | 〔 | 〕 | 〈 | 〉 | 《 | 》 | 「 | 」 | 『 | 』 | 【 | 】 | ± | × |
| 4x/Cx | ÷ | ≠ | ≤ | ≥ | ∞ | ∴ | ° | ′ | ″ | ℃ | Å | ¢/¢ | £/£ | ¥/¥ | ♂ | ♀ |
| 5x/Dx | ∠ | ⊥ | ⌒ | ∂ | ∇ | ≡ | ≒ | § | ※ | ☆ | ★ | ○ | ● | ◎ | ◇ | ◆ |
| 6x/Ex | □ | ■ | △ | ▲ | ▽ | ▼ | → | ← | ↑ | ↓ | ↔ | 〓 | ≪ | ≫ | √ | ∽ |
| 7x/Fx | ∝ | ∵ | ∫ | ∬ | ∈ | ∋ | ⊆ | ⊇ | ⊂ | ⊃ | ∪ | ∩ | ∧ | ∨ | ¬/¬ |
Character set 0x22 / 0xA2 (row number 2, special characters)
[edit]This set contains additional punctuation and symbols. Similarly to the tilde character in the previous row, different mappings are used by Apple and Microsoft for the tilde character in this row (U+02DC by Apple, FF5E by Microsoft),[13][15] which is intended to be shown as a raised tilde, whereas the tilde in the previous row is intended to be shown in-line at dash height.[10] Mapping of the circled dot also differs.[13][15]
The euro and registered trademark sign were added to the standard in 1998, while the Korean postal mark (㉾) was added in 2002.[1] These three code points, as with the still-unused code points, have been put to use for other, non-standard, purposes by vendors, e.g. for boxed list markers by Apple.[16] Microsoft updated its Unified Hangul Code implementation to add the 1998 additions including the euro sign, but did not add the Korean postal mark when it was added to the standard.[17]
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | ⇒ | ⇔ | ∀ | ∃ | ´ | ˜/~ | ˇ | ˘ | ˝ | ˚ | ˙ | ¸ | ˛ | ¡ | ¿ | |
| 3x/Bx | ː | ∮ | ∑ | ∏ | ¤ | ℉ | ‰ | ◁ | ◀ | ▷ | ▶ | ♤ | ♠ | ♡ | ♥ | ♧ |
| 4x/Cx | ♣ | ◉/⊙ | ◈ | ▣ | ◐ | ◑ | ▒ | ▤ | ▥ | ▨ | ▧ | ▦ | ▩ | ♨ | ☏ | ☎ |
| 5x/Dx | ☜ | ☞ | ¶ | † | ‡ | ↕ | ↗ | ↙ | ↖ | ↘ | ♭ | ♩ | ♪ | ♬ | ㉿ | ㈜ |
| 6x/Ex | № | ㏇ | ™ | ㏂ | ㏘ | ℡ | €/1⃞[f] | ®/2⃞[f] | ㉾/3⃞[f] | 4⃞[f] | 5⃞[f] | 6⃞[f] | 7⃞[f] | 8⃞[f] | 9⃞[f] | [10][g] |
| 7x/Fx | [11][g] | [12][g] | [13][g] | [14][g] | [15][g] | [16][g] | [17][g] | [18][g] | [19][g] | [20][g] | ¶[h] | 𝄂[i] | 𝄃[j] | ⋮ | ∷ | |
Additions by Apple Later standard additions colliding with Apple additions | ||||||||||||||||
Character set 0x23 / 0xA3 (row number 3, basic Latin / ISO 646-KR)
[edit]This set corresponds to KS X 1003 (the ISO 646 variant for Korean, a similar set to ASCII), but as two-byte codes preceded by 0x23 (or 0xA3 in GR-invoked (EUC) form). It includes the English alphabet / Basic Latin alphabet, western Arabic numerals and punctuation.
Compare the Roman set of JIS X 0201, which differs by including a Yen sign rather than a Won sign. Contrast the third rows of KPS 9566 and of JIS X 0208, which follow the ISO 646 layout but only include letters and digits.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / | |
| 3x/Bx | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
| 4x/Cx | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
| 5x/Dx | P | Q | R | S | T | U | V | W | X | Y | Z | [ | ₩ | ] | ^ | _ |
| 6x/Ex | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
| 7x/Fx | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ‾ |
Encodings such as EUC-KR and UHC combine KS X 1001 with single-byte ASCII or KS X 1003, and hence use alternative Unicode mappings to the Halfwidth and Fullwidth Forms block for the double-byte representations of these characters.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / | |
| 3x/Bx | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
| 4x/Cx | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
| 5x/Dx | P | Q | R | S | T | U | V | W | X | Y | Z | [ | ₩ | ] | ^ | _ |
| 6x/Ex | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
| 7x/Fx | p | q | r | s | t | u | v | w | x | y | z | { | | | } |  ̄ |
Character set 0x24 / 0xA4 (row number 4, Hangul jamo)
[edit]This set includes modern Hangul consonants, followed by vowels, both ordered by South Korean collation customs, followed by obsolete consonants. When used individually, these characters map to the Unicode Hangul Compatibility Jamo block, and do not have a one-to-one mapping with the position-specific characters in the Hangul Jamo block. Compare with row 4 of the North Korean KPS 9566. Character 04-52 is a Hangul Filler (see above), used in combining sequences.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | ㄱ 3131 |
ㄲ 3132 |
ㄳ 3133 |
ㄴ 3134 |
ㄵ 3135 |
ㄶ 3136 |
ㄷ 3137 |
ㄸ 3138 |
ㄹ 3139 |
ㄺ 313A |
ㄻ 313B |
ㄼ 313C |
ㄽ 313D |
ㄾ 313E |
ㄿ 313F | |
| 3x/Bx | ㅀ 3140 |
ㅁ 3141 |
ㅂ 3142 |
ㅃ 3143 |
ㅄ 3144 |
ㅅ 3145 |
ㅆ 3146 |
ㅇ 3147 |
ㅈ 3148 |
ㅉ 3149 |
ㅊ 314A |
ㅋ 314B |
ㅌ 314C |
ㅍ 314D |
ㅎ 314E |
ㅏ 314F |
| 4x/Cx | ㅐ 3150 |
ㅑ 3151 |
ㅒ 3152 |
ㅓ 3153 |
ㅔ 3154 |
ㅕ 3155 |
ㅖ 3156 |
ㅗ 3157 |
ㅘ 3158 |
ㅙ 3159 |
ㅚ 315A |
ㅛ 315B |
ㅜ 315C |
ㅝ 315D |
ㅞ 315E |
ㅟ 315F |
| 5x/Dx | ㅠ 3160 |
ㅡ 3161 |
ㅢ 3162 |
ㅣ 3163 |
HF 3164 |
ㅥ 3165 |
ㅦ 3166 |
ㅧ 3167 |
ㅨ 3168 |
ㅩ 3169 |
ㅪ 316A |
ㅫ 316B |
ㅬ 316C |
ㅭ 316D |
ㅮ 316E |
ㅯ 316F |
| 6x/Ex | ㅰ 3170 |
ㅱ 3171 |
ㅲ 3172 |
ㅳ 3173 |
ㅴ 3174 |
ㅵ 3175 |
ㅶ 3176 |
ㅷ 3177 |
ㅸ 3178 |
ㅹ 3179 |
ㅺ 317A |
ㅻ 317B |
ㅼ 317C |
ㅽ 317D |
ㅾ 317E |
ㅿ 317F |
| 7x/Fx | ㆀ 3180 |
ㆁ 3181 |
ㆂ 3182 |
ㆃ 3183 |
ㆄ 3184 |
ㆅ 3185 |
ㆆ 3186 |
ㆇ 3187 |
ㆈ 3188 |
ㆉ 3189 |
ㆊ 318A |
ㆋ 318B |
ㆌ 318C |
ㆍ 318D |
ㆎ 318E |
Character set 0x25 / 0xA5 (row number 5, Roman numerals and Greek)
[edit]This set contains Roman numerals and basic support for the Greek alphabet, without diacritics or the final sigma. Apple includes some additional punctuation in this row, as well as some black circled list markers continuing from those in row 6.[16]
Contrast row 6 of KPS 9566, which includes the same characters but in a different layout.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | ⅰ | ⅱ | ⅲ | ⅳ | ⅴ | ⅵ | ⅶ | ⅷ | ⅸ | ⅹ | ||||||
| 3x/Bx | Ⅰ | Ⅱ | Ⅲ | Ⅳ | Ⅴ | Ⅵ | Ⅶ | Ⅷ | Ⅸ | Ⅹ | ||||||
| 4x/Cx | Α | Β | Γ | Δ | Ε | Ζ | Η | Θ | Ι | Κ | Λ | Μ | Ν | Ξ | Ο | |
| 5x/Dx | Π | Ρ | Σ | Τ | Υ | Φ | Χ | Ψ | Ω | !︀[k] | 。︀[l] | ′[m] | ″[m] | ‴ | ||
| 6x/Ex | α | β | γ | δ | ε | ζ | η | θ | ι | κ | λ | μ | ν | ξ | ο | |
| 7x/Fx | π | ρ | σ | τ | υ | φ | χ | ψ | ω | (27)[n] | (28)[o] | (29)[p] | (30)[q] | |||
Additions by Apple | ||||||||||||||||
Character set 0x26 / 0xA6 (row number 6, box drawing)
[edit]This row contains characters for drawing boxes in a semigraphic context. Apple also includes some black circled list markers.[16]
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | ─ | │ | ┌ | ┐ | ┘ | └ | ├ | ┬ | ┤ | ┴ | ┼ | ━ | ┃ | ┏ | ┓ | |
| 3x/Bx | ┛ | ┗ | ┣ | ┳ | ┫ | ┻ | ╋ | ┠ | ┯ | ┨ | ┷ | ┿ | ┝ | ┰ | ┥ | ┸ |
| 4x/Cx | ╂ | ┒ | ┑ | ┚ | ┙ | ┖ | ┕ | ┎ | ┍ | ┞ | ┟ | ┡ | ┢ | ┦ | ┧ | ┩ |
| 5x/Dx | ┪ | ┭ | ┮ | ┱ | ┲ | ┵ | ┶ | ┹ | ┺ | ┽ | ┾ | ╀ | ╁ | ╃ | ╄ | ╅ |
| 6x/Ex | ╆ | ╇ | ╈ | ╉ | ╊ | ❶ | ❷ | ❸ | ❹ | ❺ | ❻ | ❼ | ❽ | ❾ | ❿ | ⓫ |
| 7x/Fx | ⓬ | ⓭ | ⓮ | ⓯ | ⓰ | ⓱ | ⓲ | ⓳ | ⓴ | (21)[r] | (22)[s] | (23)[t] | (24)[u] | (25)[v] | (26)[w] | |
Additions by Apple | ||||||||||||||||
Character set 0x27 / 0xA7 (row number 7, unit symbols)
[edit]This row contains unit symbols as single characters, including those which consist of multiple letters. Apple also includes some circled list markers continuing from those in row 8.[16]
Compare and contrast with the repertoire of unit symbols included in row 8 of KPS 9566.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | ㎕ | ㎖ | ㎗ | ℓ | ㎘ | ㏄ | ㎣ | ㎤ | ㎥ | ㎦ | ㎙ | ㎚ | ㎛ | ㎜ | ㎝ | |
| 3x/Bx | ㎞ | ㎟ | ㎠ | ㎡ | ㎢ | ㏊ | ㎍ | ㎎ | ㎏ | ㏏ | ㎈ | ㎉ | ㏈ | ㎧ | ㎨ | ㎰ |
| 4x/Cx | ㎱ | ㎲ | ㎳ | ㎴ | ㎵ | ㎶ | ㎷ | ㎸ | ㎹ | ㎀ | ㎁ | ㎂ | ㎃ | ㎄ | ㎺ | ㎻ |
| 5x/Dx | ㎼ | ㎽ | ㎾ | ㎿ | ㎐ | ㎑ | ㎒ | ㎓ | ㎔ | Ω | ㏀ | ㏁ | ㎊ | ㎋ | ㎌ | ㏖ |
| 6x/Ex | ㏅ | ㎭ | ㎮ | ㎯ | ㏛ | ㎩ | ㎪ | ㎫ | ㎬ | ㏝ | ㏐ | ㏓ | ㏃ | ㏉ | ㏜ | ㏆ |
| 7x/Fx | ⑯ | ⑰ | ⑱ | ⑲ | ⑳ | ㉑ | ㉒ | ㉓ | ㉔ | ㉕ | ㉖ | ㉗ | ㉘ | ㉙ | ㉚ | |
Additions by Apple | ||||||||||||||||
Character set 0x28 / 0xA8 (row number 8, extended Latin, encircled, fractions)
[edit]| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | Æ | Ð | ª | Ħ | IJ | Ŀ | Ł | Ø | Œ | º | Þ | Ŧ | Ŋ | |||
| 3x/Bx | ㉠ | ㉡ | ㉢ | ㉣ | ㉤ | ㉥ | ㉦ | ㉧ | ㉨ | ㉩ | ㉪ | ㉫ | ㉬ | ㉭ | ㉮ | |
| 4x/Cx | ㉯ | ㉰ | ㉱ | ㉲ | ㉳ | ㉴ | ㉵ | ㉶ | ㉷ | ㉸ | ㉹ | ㉺ | ㉻ | ⓐ | ⓑ | ⓒ |
| 5x/Dx | ⓓ | ⓔ | ⓕ | ⓖ | ⓗ | ⓘ | ⓙ | ⓚ | ⓛ | ⓜ | ⓝ | ⓞ | ⓟ | ⓠ | ⓡ | ⓢ |
| 6x/Ex | ⓣ | ⓤ | ⓥ | ⓦ | ⓧ | ⓨ | ⓩ | ① | ② | ③ | ④ | ⑤ | ⑥ | ⑦ | ⑧ | ⑨ |
| 7x/Fx | ⑩ | ⑪ | ⑫ | ⑬ | ⑭ | ⑮ | ½ | ⅓ | ⅔ | ¼ | ¾ | ⅛ | ⅜ | ⅝ | ⅞ |
Character set 0x29 / 0xA9 (row number 9, extended Latin, encircled, superscript and subscript)
[edit]| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | æ | đ | ð | ħ | ı | ij | ĸ | ŀ | ł | ø | œ | ß | þ | ŧ | ŋ | |
| 3x/Bx | ʼn | ㈀ | ㈁ | ㈂ | ㈃ | ㈄ | ㈅ | ㈆ | ㈇ | ㈈ | ㈉ | ㈊ | ㈋ | ㈌ | ㈍ | ㈎ |
| 4x/Cx | ㈏ | ㈐ | ㈑ | ㈒ | ㈓ | ㈔ | ㈕ | ㈖ | ㈗ | ㈘ | ㈙ | ㈚ | ㈛ | ⒜ | ⒝ | ⒞ |
| 5x/Dx | ⒟ | ⒠ | ⒡ | ⒢ | ⒣ | ⒤ | ⒥ | ⒦ | ⒧ | ⒨ | ⒩ | ⒪ | ⒫ | ⒬ | ⒭ | ⒮ |
| 6x/Ex | ⒯ | ⒰ | ⒱ | ⒲ | ⒳ | ⒴ | ⒵ | ⑴ | ⑵ | ⑶ | ⑷ | ⑸ | ⑹ | ⑺ | ⑻ | ⑼ |
| 7x/Fx | ⑽ | ⑾ | ⑿ | ⒀ | ⒁ | ⒂ | ¹ | ² | ³ | ⁴ | ⁿ | ₁ | ₂ | ₃ | ₄ |
Character set 0x2A / 0xAA (row number 10, Hiragana)
[edit]This set contains Hiragana for writing the Japanese language. Apple also includes some bracketed list markers continuing from those in row 9.[16]
Compare row 10 of KPS 9566, which uses the same layout. Compare and contrast row 4 of JIS X 0208, which also uses the same layout, but in a different row.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | ぁ | あ | ぃ | い | ぅ | う | ぇ | え | ぉ | お | か | が | き | ぎ | く | |
| 3x/Bx | ぐ | け | げ | こ | ご | さ | ざ | し | じ | す | ず | せ | ぜ | そ | ぞ | た |
| 4x/Cx | だ | ち | ぢ | っ | つ | づ | て | で | と | ど | な | に | ぬ | ね | の | は |
| 5x/Dx | ば | ぱ | ひ | び | ぴ | ふ | ぶ | ぷ | へ | べ | ぺ | ほ | ぼ | ぽ | ま | み |
| 6x/Ex | む | め | も | ゃ | や | ゅ | ゆ | ょ | よ | ら | り | る | れ | ろ | ゎ | わ |
| 7x/Fx | ゐ | ゑ | を | ん | ⒃ | ⒄ | ⒅ | ⒆ | ⒇ | (21)[x] | (22)[x] | (23)[x] | (24)[x] | (25)[x] | (26)[x] | |
Additions by Apple | ||||||||||||||||
Character set 0x2B / 0xAB (row number 11, Katakana)
[edit]This set contains katakana for writing the Japanese language. However, the Japanese long vowel mark, which is used in katakana text and included in row 1 of JIS X 0208, is not included.[20] Apple also includes some bracketed list markers continuing from those in rows 9 and 10.[16]
Compare row 11 of KPS 9566, which uses the same layout. Compare and contrast row 5 of JIS X 0208, which also uses the same layout, but in a different row.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | ァ | ア | ィ | イ | ゥ | ウ | ェ | エ | ォ | オ | カ | ガ | キ | ギ | ク | |
| 3x/Bx | グ | ケ | ゲ | コ | ゴ | サ | ザ | シ | ジ | ス | ズ | セ | ゼ | ソ | ゾ | タ |
| 4x/Cx | ダ | チ | ヂ | ッ | ツ | ヅ | テ | デ | ト | ド | ナ | ニ | ヌ | ネ | ノ | ハ |
| 5x/Dx | バ | パ | ヒ | ビ | ピ | フ | ブ | プ | ヘ | ベ | ペ | ホ | ボ | ポ | マ | ミ |
| 6x/Ex | ム | メ | モ | ャ | ヤ | ュ | ユ | ョ | ヨ | ラ | リ | ル | レ | ロ | ヮ | ワ |
| 7x/Fx | ヰ | ヱ | ヲ | ン | ヴ | ヵ | ヶ | (27)[x] | (28)[x] | (29)[x] | (30)[x] | |||||
Additions by Apple | ||||||||||||||||
Character set 0x2C / 0xAC (row number 12, Cyrillic)
[edit]This set contains the modern Russian alphabet, and is not necessarily sufficient to represent other forms of the Cyrillic script. Apple also includes some black boxed list markers.[16]
Compare row 5 of KPS 9566 and row 7 of JIS X 0208, which use the same layout (but in a different row).
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | А | Б | В | Г | Д | Е | Ё | Ж | З | И | Й | К | Л | М | Н | |
| 3x/Bx | О | П | Р | С | Т | У | Ф | Х | Ц | Ч | Ш | Щ | Ъ | Ы | Ь | Э |
| 4x/Cx | Ю | Я | 1⃞[y] | 2⃞[y] | 3⃞[y] | 4⃞[y] | 5⃞[y] | 6⃞[y] | 7⃞[y] | 8⃞[y] | 9⃞[y] | [10][z] | [11][z] | [12][z] | [13][z] | [14][z] |
| 5x/Dx | [15][z] | а | б | в | г | д | е | ё | ж | з | и | й | к | л | м | н |
| 6x/Ex | о | п | р | с | т | у | ф | х | ц | ч | ш | щ | ъ | ы | ь | э |
| 7x/Fx | ю | я | [16][z] | [17][z] | [18][z] | [19][z] | [20][z] | |||||||||
Additions by Apple | ||||||||||||||||
Extended character set 0x2D / 0xAD (row number 13, Apple additional punctuation)
[edit]| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 2x/Ax | 「[aa] | 」[aa] | 『[aa] | 』[aa] | ⇨[ab] | ⇦[ab] | ⇧[ab] | ⇩[ab] | 〞 | 〟 | ‶ | ″[ac] | ‵ | ′[ac] | 🠩[ad] | |
| 3x/Bx | ![ae] |
Precomposed Hangul sets (rows number 16 through 40)
[edit]Code points for precomposed Hangul are included in a continuous sorted block between code points 16-01 and 40-94 inclusive. Not all possible syllable clusters are included in this range. Compare the different ordering and availability in KPS 9566.
Initial+vowel+final syllables 뢨, 썅, 쏀, 쓩, and 쭁 are included but their initial+vowel counterparts 뢔, 쌰, 쎼, 쓔, and 쬬 are not. This can cause a problem with inputting, because input methods have to go through an initial+vowel syllable first in order to get to an initial+vowel+final syllable (e.g. ㅎ → 하 → 한).
Those which are not listed here may be represented using eight-byte composition sequences. All other modern-jamo clusters are assigned codes elsewhere by UHC. All possible modern-jamo clusters are assigned codes by Johab.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 302x/B0Ax | 가 AC00 |
각 AC01 |
간 AC04 |
갇 AC07 |
갈 AC08 |
갉 AC09 |
갊 AC0A |
감 AC10 |
갑 AC11 |
값 AC12 |
갓 AC13 |
갔 AC14 |
강 AC15 |
갖 AC16 |
갗 AC17 | |
| 303x/B0Bx | 같 AC19 |
갚 AC1A |
갛 AC1B |
개 AC1C |
객 AC1D |
갠 AC20 |
갤 AC24 |
갬 AC2C |
갭 AC2D |
갯 AC2F |
갰 AC30 |
갱 AC31 |
갸 AC38 |
갹 AC39 |
갼 AC3C |
걀 AC40 |
| 304x/B0Cx | 걋 AC4B |
걍 AC4D |
걔 AC54 |
걘 AC58 |
걜 AC5C |
거 AC70 |
걱 AC71 |
건 AC74 |
걷 AC77 |
걸 AC78 |
걺 AC7A |
검 AC80 |
겁 AC81 |
것 AC83 |
겄 AC84 |
겅 AC85 |
| 305x/B0Dx | 겆 AC86 |
겉 AC89 |
겊 AC8A |
겋 AC8B |
게 AC8C |
겐 AC90 |
겔 AC94 |
겜 AC9C |
겝 AC9D |
겟 AC9F |
겠 ACA0 |
겡 ACA1 |
겨 ACA8 |
격 ACA9 |
겪 ACAA |
견 ACAC |
| 306x/B0Ex | 겯 ACAF |
결 ACB0 |
겸 ACB8 |
겹 ACB9 |
겻 ACBB |
겼 ACBC |
경 ACBD |
곁 ACC1 |
계 ACC4 |
곈 ACC8 |
곌 ACCC |
곕 ACD5 |
곗 ACD7 |
고 ACE0 |
곡 ACE1 |
곤 ACE4 |
| 307x/B0Fx | 곧 ACE7 |
골 ACE8 |
곪 ACEA |
곬 ACEC |
곯 ACEF |
곰 ACF0 |
곱 ACF1 |
곳 ACF3 |
공 ACF5 |
곶 ACF6 |
과 ACFC |
곽 ACFD |
관 AD00 |
괄 AD04 |
괆 AD06 |
|
| 312x/B1Ax | 괌 AD0C |
괍 AD0D |
괏 AD0F |
광 AD11 |
괘 AD18 |
괜 AD1C |
괠 AD20 |
괩 AD29 |
괬 AD2C |
괭 AD2D |
괴 AD34 |
괵 AD35 |
괸 AD38 |
괼 AD3C |
굄 AD44 | |
| 313x/B1Bx | 굅 AD45 |
굇 AD47 |
굉 AD49 |
교 AD50 |
굔 AD54 |
굘 AD58 |
굡 AD61 |
굣 AD63 |
구 AD6C |
국 AD6D |
군 AD70 |
굳 AD73 |
굴 AD74 |
굵 AD75 |
굶 AD76 |
굻 AD7B |
| 314x/B1Cx | 굼 AD7C |
굽 AD7D |
굿 AD7F |
궁 AD81 |
궂 AD82 |
궈 AD88 |
궉 AD89 |
권 AD8C |
궐 AD90 |
궜 AD9C |
궝 AD9D |
궤 ADA4 |
궷 ADB7 |
귀 ADC0 |
귁 ADC1 |
귄 ADC4 |
| 315x/B1Dx | 귈 ADC8 |
귐 ADD0 |
귑 ADD1 |
귓 ADD3 |
규 ADDC |
균 ADE0 |
귤 ADE4 |
그 ADF8 |
극 ADF9 |
근 ADFC |
귿 ADFF |
글 AE00 |
긁 AE01 |
금 AE08 |
급 AE09 |
긋 AE0B |
| 316x/B1Ex | 긍 AE0D |
긔 AE14 |
기 AE30 |
긱 AE31 |
긴 AE34 |
긷 AE37 |
길 AE38 |
긺 AE3A |
김 AE40 |
깁 AE41 |
깃 AE43 |
깅 AE45 |
깆 AE46 |
깊 AE4A |
까 AE4C |
깍 AE4D |
| 317x/B1Fx | 깎 AE4E |
깐 AE50 |
깔 AE54 |
깖 AE56 |
깜 AE5C |
깝 AE5D |
깟 AE5F |
깠 AE60 |
깡 AE61 |
깥 AE65 |
깨 AE68 |
깩 AE69 |
깬 AE6C |
깰 AE70 |
깸 AE78 |
|
| 322x/B2Ax | 깹 AE79 |
깻 AE7B |
깼 AE7C |
깽 AE7D |
꺄 AE84 |
꺅 AE85 |
꺌 AE8C |
꺼 AEBC |
꺽 AEBD |
꺾 AEBE |
껀 AEC0 |
껄 AEC4 |
껌 AECC |
껍 AECD |
껏 AECF | |
| 323x/B2Bx | 껐 AED0 |
껑 AED1 |
께 AED8 |
껙 AED9 |
껜 AEDC |
껨 AEE8 |
껫 AEEB |
껭 AEED |
껴 AEF4 |
껸 AEF8 |
껼 AEFC |
꼇 AF07 |
꼈 AF08 |
꼍 AF0D |
꼐 AF10 |
꼬 AF2C |
| 324x/B2Cx | 꼭 AF2D |
꼰 AF30 |
꼲 AF32 |
꼴 AF34 |
꼼 AF3C |
꼽 AF3D |
꼿 AF3F |
꽁 AF41 |
꽂 AF42 |
꽃 AF43 |
꽈 AF48 |
꽉 AF49 |
꽐 AF50 |
꽜 AF5C |
꽝 AF5D |
꽤 AF64 |
| 325x/B2Dx | 꽥 AF65 |
꽹 AF79 |
꾀 AF80 |
꾄 AF84 |
꾈 AF88 |
꾐 AF90 |
꾑 AF91 |
꾕 AF95 |
꾜 AF9C |
꾸 AFB8 |
꾹 AFB9 |
꾼 AFBC |
꿀 AFC0 |
꿇 AFC7 |
꿈 AFC8 |
꿉 AFC9 |
| 326x/B2Ex | 꿋 AFCB |
꿍 AFCD |
꿎 AFCE |
꿔 AFD4 |
꿜 AFDC |
꿨 AFE8 |
꿩 AFE9 |
꿰 AFF0 |
꿱 AFF1 |
꿴 AFF4 |
꿸 AFF8 |
뀀 B000 |
뀁 B001 |
뀄 B004 |
뀌 B00C |
뀐 B010 |
| 327x/B2Fx | 뀔 B014 |
뀜 B01C |
뀝 B01D |
뀨 B028 |
끄 B044 |
끅 B045 |
끈 B048 |
끊 B04A |
끌 B04C |
끎 B04E |
끓 B053 |
끔 B054 |
끕 B055 |
끗 B057 |
끙 B059 |
|
| 332x/B3Ax | 끝 B05D |
끼 B07C |
끽 B07D |
낀 B080 |
낄 B084 |
낌 B08C |
낍 B08D |
낏 B08F |
낑 B091 |
나 B098 |
낙 B099 |
낚 B09A |
난 B09C |
낟 B09F |
날 B0A0 | |
| 333x/B3Bx | 낡 B0A1 |
낢 B0A2 |
남 B0A8 |
납 B0A9 |
낫 B0AB |
났 B0AC |
낭 B0AD |
낮 B0AE |
낯 B0AF |
낱 B0B1 |
낳 B0B3 |
내 B0B4 |
낵 B0B5 |
낸 B0B8 |
낼 B0BC |
냄 B0C4 |
| 334x/B3Cx | 냅 B0C5 |
냇 B0C7 |
냈 B0C8 |
냉 B0C9 |
냐 B0D0 |
냑 B0D1 |
냔 B0D4 |
냘 B0D8 |
냠 B0E0 |
냥 B0E5 |
너 B108 |
넉 B109 |
넋 B10B |
넌 B10C |
널 B110 |
넒 B112 |
| 335x/B3Dx | 넓 B113 |
넘 B118 |
넙 B119 |
넛 B11B |
넜 B11C |
넝 B11D |
넣 B123 |
네 B124 |
넥 B125 |
넨 B128 |
넬 B12C |
넴 B134 |
넵 B135 |
넷 B137 |
넸 B138 |
넹 B139 |
| 336x/B3Ex | 녀 B140 |
녁 B141 |
년 B144 |
녈 B148 |
념 B150 |
녑 B151 |
녔 B154 |
녕 B155 |
녘 B158 |
녜 B15C |
녠 B160 |
노 B178 |
녹 B179 |
논 B17C |
놀 B180 |
놂 B182 |
| 337x/B3Fx | 놈 B188 |
놉 B189 |
놋 B18B |
농 B18D |
높 B192 |
놓 B193 |
놔 B194 |
놘 B198 |
놜 B19C |
놨 B1A8 |
뇌 B1CC |
뇐 B1D0 |
뇔 B1D4 |
뇜 B1DC |
뇝 B1DD |
|
| 342x/B4Ax | 뇟 B1DF |
뇨 B1E8 |
뇩 B1E9 |
뇬 B1EC |
뇰 B1F0 |
뇹 B1F9 |
뇻 B1FB |
뇽 B1FD |
누 B204 |
눅 B205 |
눈 B208 |
눋 B20B |
눌 B20C |
눔 B214 |
눕 B215 | |
| 343x/B4Bx | 눗 B217 |
눙 B219 |
눠 B220 |
눴 B234 |
눼 B23C |
뉘 B258 |
뉜 B25C |
뉠 B260 |
뉨 B268 |
뉩 B269 |
뉴 B274 |
뉵 B275 |
뉼 B27C |
늄 B284 |
늅 B285 |
늉 B289 |
| 344x/B4Cx | 느 B290 |
늑 B291 |
는 B294 |
늘 B298 |
늙 B299 |
늚 B29A |
늠 B2A0 |
늡 B2A1 |
늣 B2A3 |
능 B2A5 |
늦 B2A6 |
늪 B2AA |
늬 B2AC |
늰 B2B0 |
늴 B2B4 |
니 B2C8 |
| 345x/B4Dx | 닉 B2C9 |
닌 B2CC |
닐 B2D0 |
닒 B2D2 |
님 B2D8 |
닙 B2D9 |
닛 B2DB |
닝 B2DD |
닢 B2E2 |
다 B2E4 |
닥 B2E5 |
닦 B2E6 |
단 B2E8 |
닫 B2EB |
달 B2EC |
닭 B2ED |
| 346x/B4Ex | 닮 B2EE |
닯 B2EF |
닳 B2F3 |
담 B2F4 |
답 B2F5 |
닷 B2F7 |
닸 B2F8 |
당 B2F9 |
닺 B2FA |
닻 B2FB |
닿 B2FF |
대 B300 |
댁 B301 |
댄 B304 |
댈 B308 |
댐 B310 |
| 347x/B4Fx | 댑 B311 |
댓 B313 |
댔 B314 |
댕 B315 |
댜 B31C |
더 B354 |
덕 B355 |
덖 B356 |
던 B358 |
덛 B35B |
덜 B35C |
덞 B35E |
덟 B35F |
덤 B364 |
덥 B365 |
|
| 352x/B5Ax | 덧 B367 |
덩 B369 |
덫 B36B |
덮 B36E |
데 B370 |
덱 B371 |
덴 B374 |
델 B378 |
뎀 B380 |
뎁 B381 |
뎃 B383 |
뎄 B384 |
뎅 B385 |
뎌 B38C |
뎐 B390 | |
| 353x/B5Bx | 뎔 B394 |
뎠 B3A0 |
뎡 B3A1 |
뎨 B3A8 |
뎬 B3AC |
도 B3C4 |
독 B3C5 |
돈 B3C8 |
돋 B3CB |
돌 B3CC |
돎 B3CE |
돐 B3D0 |
돔 B3D4 |
돕 B3D5 |
돗 B3D7 |
동 B3D9 |
| 354x/B5Cx | 돛 B3DB |
돝 B3DD |
돠 B3E0 |
돤 B3E4 |
돨 B3E8 |
돼 B3FC |
됐 B410 |
되 B418 |
된 B41C |
될 B420 |
됨 B428 |
됩 B429 |
됫 B42B |
됴 B434 |
두 B450 |
둑 B451 |
| 355x/B5Dx | 둔 B454 |
둘 B458 |
둠 B460 |
둡 B461 |
둣 B463 |
둥 B465 |
둬 B46C |
뒀 B480 |
뒈 B488 |
뒝 B49D |
뒤 B4A4 |
뒨 B4A8 |
뒬 B4AC |
뒵 B4B5 |
뒷 B4B7 |
뒹 B4B9 |
| 356x/B5Ex | 듀 B4C0 |
듄 B4C4 |
듈 B4C8 |
듐 B4D0 |
듕 B4D5 |
드 B4DC |
득 B4DD |
든 B4E0 |
듣 B4E3 |
들 B4E4 |
듦 B4E6 |
듬 B4EC |
듭 B4ED |
듯 B4EF |
등 B4F1 |
듸 B4F8 |
| 357x/B5Fx | 디 B514 |
딕 B515 |
딘 B518 |
딛 B51B |
딜 B51C |
딤 B524 |
딥 B525 |
딧 B527 |
딨 B528 |
딩 B529 |
딪 B52A |
따 B530 |
딱 B531 |
딴 B534 |
딸 B538 |
|
| 362x/B6Ax | 땀 B540 |
땁 B541 |
땃 B543 |
땄 B544 |
땅 B545 |
땋 B54B |
때 B54C |
땍 B54D |
땐 B550 |
땔 B554 |
땜 B55C |
땝 B55D |
땟 B55F |
땠 B560 |
땡 B561 | |
| 363x/B6Bx | 떠 B5A0 |
떡 B5A1 |
떤 B5A4 |
떨 B5A8 |
떪 B5AA |
떫 B5AB |
떰 B5B0 |
떱 B5B1 |
떳 B5B3 |
떴 B5B4 |
떵 B5B5 |
떻 B5BB |
떼 B5BC |
떽 B5BD |
뗀 B5C0 |
뗄 B5C4 |
| 364x/B6Cx | 뗌 B5CC |
뗍 B5CD |
뗏 B5CF |
뗐 B5D0 |
뗑 B5D1 |
뗘 B5D8 |
뗬 B5EC |
또 B610 |
똑 B611 |
똔 B614 |
똘 B618 |
똥 B625 |
똬 B62C |
똴 B634 |
뙈 B648 |
뙤 B664 |
| 365x/B6Dx | 뙨 B668 |
뚜 B69C |
뚝 B69D |
뚠 B6A0 |
뚤 B6A4 |
뚫 B6AB |
뚬 B6AC |
뚱 B6B1 |
뛔 B6D4 |
뛰 B6F0 |
뛴 B6F4 |
뛸 B6F8 |
뜀 B700 |
뜁 B701 |
뜅 B705 |
뜨 B728 |
| 366x/B6Ex | 뜩 B729 |
뜬 B72C |
뜯 B72F |
뜰 B730 |
뜸 B738 |
뜹 B739 |
뜻 B73B |
띄 B744 |
띈 B748 |
띌 B74C |
띔 B754 |
띕 B755 |
띠 B760 |
띤 B764 |
띨 B768 |
띰 B770 |
| 367x/B6Fx | 띱 B771 |
띳 B773 |
띵 B775 |
라 B77C |
락 B77D |
란 B780 |
랄 B784 |
람 B78C |
랍 B78D |
랏 B78F |
랐 B790 |
랑 B791 |
랒 B792 |
랖 B796 |
랗 B797 |
|
| 372x/B7Ax | 래 B798 |
랙 B799 |
랜 B79C |
랠 B7A0 |
램 B7A8 |
랩 B7A9 |
랫 B7AB |
랬 B7AC |
랭 B7AD |
랴 B7B4 |
략 B7B5 |
랸 B7B8 |
럇 B7C7 |
량 B7C9 |
러 B7EC | |
| 373x/B7Bx | 럭 B7ED |
런 B7F0 |
럴 B7F4 |
럼 B7FC |
럽 B7FD |
럿 B7FF |
렀 B800 |
렁 B801 |
렇 B807 |
레 B808 |
렉 B809 |
렌 B80C |
렐 B810 |
렘 B818 |
렙 B819 |
렛 B81B |
| 374x/B7Cx | 렝 B81D |
려 B824 |
력 B825 |
련 B828 |
렬 B82C |
렴 B834 |
렵 B835 |
렷 B837 |
렸 B838 |
령 B839 |
례 B840 |
롄 B844 |
롑 B851 |
롓 B853 |
로 B85C |
록 B85D |
| 375x/B7Dx | 론 B860 |
롤 B864 |
롬 B86C |
롭 B86D |
롯 B86F |
롱 B871 |
롸 B878 |
롼 B87C |
뢍 B88D |
뢨 B8A8 |
뢰 B8B0 |
뢴 B8B4 |
뢸 B8B8 |
룀 B8C0 |
룁 B8C1 |
룃 B8C3 |
| 376x/B7Ex | 룅 B8C5 |
료 B8CC |
룐 B8D0 |
룔 B8D4 |
룝 B8DD |
룟 B8DF |
룡 B8E1 |
루 B8E8 |
룩 B8E9 |
룬 B8EC |
룰 B8F0 |
룸 B8F8 |
룹 B8F9 |
룻 B8FB |
룽 B8FD |
뤄 B904 |
| 377x/B7Fx | 뤘 B918 |
뤠 B920 |
뤼 B93C |
뤽 B93D |
륀 B940 |
륄 B944 |
륌 B94C |
륏 B94F |
륑 B951 |
류 B958 |
륙 B959 |
륜 B95C |
률 B960 |
륨 B968 |
륩 B969 |
|
| 382x/B8Ax | 륫 B96B |
륭 B96D |
르 B974 |
륵 B975 |
른 B978 |
를 B97C |
름 B984 |
릅 B985 |
릇 B987 |
릉 B989 |
릊 B98A |
릍 B98D |
릎 B98E |
리 B9AC |
릭 B9AD | |
| 383x/B8Bx | 린 B9B0 |
릴 B9B4 |
림 B9BC |
립 B9BD |
릿 B9BF |
링 B9C1 |
마 B9C8 |
막 B9C9 |
만 B9CC |
많 B9CE |
맏 B9CF |
말 B9D0 |
맑 B9D1 |
맒 B9D2 |
맘 B9D8 |
맙 B9D9 |
| 384x/B8Cx | 맛 B9DB |
망 B9DD |
맞 B9DE |
맡 B9E1 |
맣 B9E3 |
매 B9E4 |
맥 B9E5 |
맨 B9E8 |
맬 B9EC |
맴 B9F4 |
맵 B9F5 |
맷 B9F7 |
맸 B9F8 |
맹 B9F9 |
맺 B9FA |
먀 BA00 |
| 385x/B8Dx | 먁 BA01 |
먈 BA08 |
먕 BA15 |
머 BA38 |
먹 BA39 |
먼 BA3C |
멀 BA40 |
멂 BA42 |
멈 BA48 |
멉 BA49 |
멋 BA4B |
멍 BA4D |
멎 BA4E |
멓 BA53 |
메 BA54 |
멕 BA55 |
| 386x/B8Ex | 멘 BA58 |
멜 BA5C |
멤 BA64 |
멥 BA65 |
멧 BA67 |
멨 BA68 |
멩 BA69 |
며 BA70 |
멱 BA71 |
면 BA74 |
멸 BA78 |
몃 BA83 |
몄 BA84 |
명 BA85 |
몇 BA87 |
몌 BA8C |
| 387x/B8Fx | 모 BAA8 |
목 BAA9 |
몫 BAAB |
몬 BAAC |
몰 BAB0 |
몲 BAB2 |
몸 BAB8 |
몹 BAB9 |
못 BABB |
몽 BABD |
뫄 BAC4 |
뫈 BAC8 |
뫘 BAD8 |
뫙 BAD9 |
뫼 BAFC |
|
| 392x/B9Ax | 묀 BB00 |
묄 BB04 |
묍 BB0D |
묏 BB0F |
묑 BB11 |
묘 BB18 |
묜 BB1C |
묠 BB20 |
묩 BB29 |
묫 BB2B |
무 BB34 |
묵 BB35 |
묶 BB36 |
문 BB38 |
묻 BB3B | |
| 393x/B9Bx | 물 BB3C |
묽 BB3D |
묾 BB3E |
뭄 BB44 |
뭅 BB45 |
뭇 BB47 |
뭉 BB49 |
뭍 BB4D |
뭏 BB4F |
뭐 BB50 |
뭔 BB54 |
뭘 BB58 |
뭡 BB61 |
뭣 BB63 |
뭬 BB6C |
뮈 BB88 |
| 394x/B9Cx | 뮌 BB8C |
뮐 BB90 |
뮤 BBA4 |
뮨 BBA8 |
뮬 BBAC |
뮴 BBB4 |
뮷 BBB7 |
므 BBC0 |
믄 BBC4 |
믈 BBC8 |
믐 BBD0 |
믓 BBD3 |
미 BBF8 |
믹 BBF9 |
민 BBFC |
믿 BBFF |
| 395x/B9Dx | 밀 BC00 |
밂 BC02 |
밈 BC08 |
밉 BC09 |
밋 BC0B |
밌 BC0C |
밍 BC0D |
및 BC0F |
밑 BC11 |
바 BC14 |
박 BC15 |
밖 BC16 |
밗 BC17 |
반 BC18 |
받 BC1B |
발 BC1C |
| 396x/B9Ex | 밝 BC1D |
밞 BC1E |
밟 BC1F |
밤 BC24 |
밥 BC25 |
밧 BC27 |
방 BC29 |
밭 BC2D |
배 BC30 |
백 BC31 |
밴 BC34 |
밸 BC38 |
뱀 BC40 |
뱁 BC41 |
뱃 BC43 |
뱄 BC44 |
| 397x/B9Fx | 뱅 BC45 |
뱉 BC49 |
뱌 BC4C |
뱍 BC4D |
뱐 BC50 |
뱝 BC5D |
버 BC84 |
벅 BC85 |
번 BC88 |
벋 BC8B |
벌 BC8C |
벎 BC8E |
범 BC94 |
법 BC95 |
벗 BC97 |
|
| 3A2x/BAAx | 벙 BC99 |
벚 BC9A |
베 BCA0 |
벡 BCA1 |
벤 BCA4 |
벧 BCA7 |
벨 BCA8 |
벰 BCB0 |
벱 BCB1 |
벳 BCB3 |
벴 BCB4 |
벵 BCB5 |
벼 BCBC |
벽 BCBD |
변 BCC0 | |
| 3A3x/BABx | 별 BCC4 |
볍 BCCD |
볏 BCCF |
볐 BCD0 |
병 BCD1 |
볕 BCD5 |
볘 BCD8 |
볜 BCDC |
보 BCF4 |
복 BCF5 |
볶 BCF6 |
본 BCF8 |
볼 BCFC |
봄 BD04 |
봅 BD05 |
봇 BD07 |
| 3A4x/BACx | 봉 BD09 |
봐 BD10 |
봔 BD14 |
봤 BD24 |
봬 BD2C |
뵀 BD40 |
뵈 BD48 |
뵉 BD49 |
뵌 BD4C |
뵐 BD50 |
뵘 BD58 |
뵙 BD59 |
뵤 BD64 |
뵨 BD68 |
부 BD80 |
북 BD81 |
| 3A5x/BADx | 분 BD84 |
붇 BD87 |
불 BD88 |
붉 BD89 |
붊 BD8A |
붐 BD90 |
붑 BD91 |
붓 BD93 |
붕 BD95 |
붙 BD99 |
붚 BD9A |
붜 BD9C |
붤 BDA4 |
붰 BDB0 |
붸 BDB8 |
뷔 BDD4 |
| 3A6x/BAEx | 뷕 BDD5 |
뷘 BDD8 |
뷜 BDDC |
뷩 BDE9 |
뷰 BDF0 |
뷴 BDF4 |
뷸 BDF8 |
븀 BE00 |
븃 BE03 |
븅 BE05 |
브 BE0C |
븍 BE0D |
븐 BE10 |
블 BE14 |
븜 BE1C |
븝 BE1D |
| 3A7x/BAFx | 븟 BE1F |
비 BE44 |
빅 BE45 |
빈 BE48 |
빌 BE4C |
빎 BE4E |
빔 BE54 |
빕 BE55 |
빗 BE57 |
빙 BE59 |
빚 BE5A |
빛 BE5B |
빠 BE60 |
빡 BE61 |
빤 BE64 |
|
| 3B2x/BBAx | 빨 BE68 |
빪 BE6A |
빰 BE70 |
빱 BE71 |
빳 BE73 |
빴 BE74 |
빵 BE75 |
빻 BE7B |
빼 BE7C |
빽 BE7D |
뺀 BE80 |
뺄 BE84 |
뺌 BE8C |
뺍 BE8D |
뺏 BE8F | |
| 3B3x/BBBx | 뺐 BE90 |
뺑 BE91 |
뺘 BE98 |
뺙 BE99 |
뺨 BEA8 |
뻐 BED0 |
뻑 BED1 |
뻔 BED4 |
뻗 BED7 |
뻘 BED8 |
뻠 BEE0 |
뻣 BEE3 |
뻤 BEE4 |
뻥 BEE5 |
뻬 BEEC |
뼁 BF01 |
| 3B4x/BBCx | 뼈 BF08 |
뼉 BF09 |
뼘 BF18 |
뼙 BF19 |
뼛 BF1B |
뼜 BF1C |
뼝 BF1D |
뽀 BF40 |
뽁 BF41 |
뽄 BF44 |
뽈 BF48 |
뽐 BF50 |
뽑 BF51 |
뽕 BF55 |
뾔 BF94 |
뾰 BFB0 |
| 3B5x/BBDx | 뿅 BFC5 |
뿌 BFCC |
뿍 BFCD |
뿐 BFD0 |
뿔 BFD4 |
뿜 BFDC |
뿟 BFDF |
뿡 BFE1 |
쀼 C03C |
쁑 C051 |
쁘 C058 |
쁜 C05C |
쁠 C060 |
쁨 C068 |
쁩 C069 |
삐 C090 |
| 3B6x/BBEx | 삑 C091 |
삔 C094 |
삘 C098 |
삠 C0A0 |
삡 C0A1 |
삣 C0A3 |
삥 C0A5 |
사 C0AC |
삭 C0AD |
삯 C0AF |
산 C0B0 |
삳 C0B3 |
살 C0B4 |
삵 C0B5 |
삶 C0B6 |
삼 C0BC |
| 3B7x/BBFx | 삽 C0BD |
삿 C0BF |
샀 C0C0 |
상 C0C1 |
샅 C0C5 |
새 C0C8 |
색 C0C9 |
샌 C0CC |
샐 C0D0 |
샘 C0D8 |
샙 C0D9 |
샛 C0DB |
샜 C0DC |
생 C0DD |
샤 C0E4 |
|
| 3C2x/BCAx | 샥 C0E5 |
샨 C0E8 |
샬 C0EC |
샴 C0F4 |
샵 C0F5 |
샷 C0F7 |
샹 C0F9 |
섀 C100 |
섄 C104 |
섈 C108 |
섐 C110 |
섕 C115 |
서 C11C |
석 C11D |
섞 C11E | |
| 3C3x/BCBx | 섟 C11F |
선 C120 |
섣 C123 |
설 C124 |
섦 C126 |
섧 C127 |
섬 C12C |
섭 C12D |
섯 C12F |
섰 C130 |
성 C131 |
섶 C136 |
세 C138 |
섹 C139 |
센 C13C |
셀 C140 |
| 3C4x/BCCx | 셈 C148 |
셉 C149 |
셋 C14B |
셌 C14C |
셍 C14D |
셔 C154 |
셕 C155 |
션 C158 |
셜 C15C |
셤 C164 |
셥 C165 |
셧 C167 |
셨 C168 |
셩 C169 |
셰 C170 |
셴 C174 |
| 3C5x/BCDx | 셸 C178 |
솅 C185 |
소 C18C |
속 C18D |
솎 C18E |
손 C190 |
솔 C194 |
솖 C196 |
솜 C19C |
솝 C19D |
솟 C19F |
송 C1A1 |
솥 C1A5 |
솨 C1A8 |
솩 C1A9 |
솬 C1AC |
| 3C6x/BCEx | 솰 C1B0 |
솽 C1BD |
쇄 C1C4 |
쇈 C1C8 |
쇌 C1CC |
쇔 C1D4 |
쇗 C1D7 |
쇘 C1D8 |
쇠 C1E0 |
쇤 C1E4 |
쇨 C1E8 |
쇰 C1F0 |
쇱 C1F1 |
쇳 C1F3 |
쇼 C1FC |
쇽 C1FD |
| 3C7x/BCFx | 숀 C200 |
숄 C204 |
숌 C20C |
숍 C20D |
숏 C20F |
숑 C211 |
수 C218 |
숙 C219 |
순 C21C |
숟 C21F |
술 C220 |
숨 C228 |
숩 C229 |
숫 C22B |
숭 C22D |
|
| 3D2x/BDAx | 숯 C22F |
숱 C231 |
숲 C232 |
숴 C234 |
쉈 C248 |
쉐 C250 |
쉑 C251 |
쉔 C254 |
쉘 C258 |
쉠 C260 |
쉥 C265 |
쉬 C26C |
쉭 C26D |
쉰 C270 |
쉴 C274 | |
| 3D3x/BDBx | 쉼 C27C |
쉽 C27D |
쉿 C27F |
슁 C281 |
슈 C288 |
슉 C289 |
슐 C290 |
슘 C298 |
슛 C29B |
슝 C29D |
스 C2A4 |
슥 C2A5 |
슨 C2A8 |
슬 C2AC |
슭 C2AD |
슴 C2B4 |
| 3D4x/BDCx | 습 C2B5 |
슷 C2B7 |
승 C2B9 |
시 C2DC |
식 C2DD |
신 C2E0 |
싣 C2E3 |
실 C2E4 |
싫 C2EB |
심 C2EC |
십 C2ED |
싯 C2EF |
싱 C2F1 |
싶 C2F6 |
싸 C2F8 |
싹 C2F9 |
| 3D5x/BDDx | 싻 C2FB |
싼 C2FC |
쌀 C300 |
쌈 C308 |
쌉 C309 |
쌌 C30C |
쌍 C30D |
쌓 C313 |
쌔 C314 |
쌕 C315 |
쌘 C318 |
쌜 C31C |
쌤 C324 |
쌥 C325 |
쌨 C328 |
쌩 C329 |
| 3D6x/BDEx | 썅 C345 |
써 C368 |
썩 C369 |
썬 C36C |
썰 C370 |
썲 C372 |
썸 C378 |
썹 C379 |
썼 C37C |
썽 C37D |
쎄 C384 |
쎈 C388 |
쎌 C38C |
쏀 C3C0 |
쏘 C3D8 |
쏙 C3D9 |
| 3D7x/BDFx | 쏜 C3DC |
쏟 C3DF |
쏠 C3E0 |
쏢 C3E2 |
쏨 C3E8 |
쏩 C3E9 |
쏭 C3ED |
쏴 C3F4 |
쏵 C3F5 |
쏸 C3F8 |
쐈 C408 |
쐐 C410 |
쐤 C424 |
쐬 C42C |
쐰 C430 |
|
| 3E2x/BEAx | 쐴 C434 |
쐼 C43C |
쐽 C43D |
쑈 C448 |
쑤 C464 |
쑥 C465 |
쑨 C468 |
쑬 C46C |
쑴 C474 |
쑵 C475 |
쑹 C479 |
쒀 C480 |
쒔 C494 |
쒜 C49C |
쒸 C4B8 | |
| 3E3x/BEBx | 쒼 C4BC |
쓩 C4E9 |
쓰 C4F0 |
쓱 C4F1 |
쓴 C4F4 |
쓸 C4F8 |
쓺 C4FA |
쓿 C4FF |
씀 C500 |
씁 C501 |
씌 C50C |
씐 C510 |
씔 C514 |
씜 C51C |
씨 C528 |
씩 C529 |
| 3E4x/BECx | 씬 C52C |
씰 C530 |
씸 C538 |
씹 C539 |
씻 C53B |
씽 C53D |
아 C544 |
악 C545 |
안 C548 |
앉 C549 |
않 C54A |
알 C54C |
앍 C54D |
앎 C54E |
앓 C553 |
암 C554 |
| 3E5x/BEDx | 압 C555 |
앗 C557 |
았 C558 |
앙 C559 |
앝 C55D |
앞 C55E |
애 C560 |
액 C561 |
앤 C564 |
앨 C568 |
앰 C570 |
앱 C571 |
앳 C573 |
앴 C574 |
앵 C575 |
야 C57C |
| 3E6x/BEEx | 약 C57D |
얀 C580 |
얄 C584 |
얇 C587 |
얌 C58C |
얍 C58D |
얏 C58F |
양 C591 |
얕 C595 |
얗 C597 |
얘 C598 |
얜 C59C |
얠 C5A0 |
얩 C5A9 |
어 C5B4 |
억 C5B5 |
| 3E7x/BEFx | 언 C5B8 |
얹 C5B9 |
얻 C5BB |
얼 C5BC |
얽 C5BD |
얾 C5BE |
엄 C5C4 |
업 C5C5 |
없 C5C6 |
엇 C5C7 |
었 C5C8 |
엉 C5C9 |
엊 C5CA |
엌 C5CC |
엎 C5CE |
|
| 3F2x/BFAx | 에 C5D0 |
엑 C5D1 |
엔 C5D4 |
엘 C5D8 |
엠 C5E0 |
엡 C5E1 |
엣 C5E3 |
엥 C5E5 |
여 C5EC |
역 C5ED |
엮 C5EE |
연 C5F0 |
열 C5F4 |
엶 C5F6 |
엷 C5F7 | |
| 3F3x/BFBx | 염 C5FC |
엽 C5FD |
엾 C5FE |
엿 C5FF |
였 C600 |
영 C601 |
옅 C605 |
옆 C606 |
옇 C607 |
예 C608 |
옌 C60C |
옐 C610 |
옘 C618 |
옙 C619 |
옛 C61B |
옜 C61C |
| 3F4x/BFCx | 오 C624 |
옥 C625 |
온 C628 |
올 C62C |
옭 C62D |
옮 C62E |
옰 C630 |
옳 C633 |
옴 C634 |
옵 C635 |
옷 C637 |
옹 C639 |
옻 C63B |
와 C640 |
왁 C641 |
완 C644 |
| 3F5x/BFDx | 왈 C648 |
왐 C650 |
왑 C651 |
왓 C653 |
왔 C654 |
왕 C655 |
왜 C65C |
왝 C65D |
왠 C660 |
왬 C66C |
왯 C66F |
왱 C671 |
외 C678 |
왹 C679 |
왼 C67C |
욀 C680 |
| 3F6x/BFEx | 욈 C688 |
욉 C689 |
욋 C68B |
욍 C68D |
요 C694 |
욕 C695 |
욘 C698 |
욜 C69C |
욤 C6A4 |
욥 C6A5 |
욧 C6A7 |
용 C6A9 |
우 C6B0 |
욱 C6B1 |
운 C6B4 |
울 C6B8 |
| 3F7x/BFFx | 욹 C6B9 |
욺 C6BA |
움 C6C0 |
웁 C6C1 |
웃 C6C3 |
웅 C6C5 |
워 C6CC |
웍 C6CD |
원 C6D0 |
월 C6D4 |
웜 C6DC |
웝 C6DD |
웠 C6E0 |
웡 C6E1 |
웨 C6E8 |
|
| 402x/C0Ax | 웩 C6E9 |
웬 C6EC |
웰 C6F0 |
웸 C6F8 |
웹 C6F9 |
웽 C6FD |
위 C704 |
윅 C705 |
윈 C708 |
윌 C70C |
윔 C714 |
윕 C715 |
윗 C717 |
윙 C719 |
유 C720 | |
| 403x/C0Bx | 육 C721 |
윤 C724 |
율 C728 |
윰 C730 |
윱 C731 |
윳 C733 |
융 C735 |
윷 C737 |
으 C73C |
윽 C73D |
은 C740 |
을 C744 |
읊 C74A |
음 C74C |
읍 C74D |
읏 C74F |
| 404x/C0Cx | 응 C751 |
읒 C752 |
읓 C753 |
읔 C754 |
읕 C755 |
읖 C756 |
읗 C757 |
의 C758 |
읜 C75C |
읠 C760 |
읨 C768 |
읫 C76B |
이 C774 |
익 C775 |
인 C778 |
일 C77C |
| 405x/C0Dx | 읽 C77D |
읾 C77E |
잃 C783 |
임 C784 |
입 C785 |
잇 C787 |
있 C788 |
잉 C789 |
잊 C78A |
잎 C78E |
자 C790 |
작 C791 |
잔 C794 |
잖 C796 |
잗 C797 |
잘 C798 |
| 406x/C0Ex | 잚 C79A |
잠 C7A0 |
잡 C7A1 |
잣 C7A3 |
잤 C7A4 |
장 C7A5 |
잦 C7A6 |
재 C7AC |
잭 C7AD |
잰 C7B0 |
잴 C7B4 |
잼 C7BC |
잽 C7BD |
잿 C7BF |
쟀 C7C0 |
쟁 C7C1 |
| 407x/C0Fx | 쟈 C7C8 |
쟉 C7C9 |
쟌 C7CC |
쟎 C7CE |
쟐 C7D0 |
쟘 C7D8 |
쟝 C7DD |
쟤 C7E4 |
쟨 C7E8 |
쟬 C7EC |
저 C800 |
적 C801 |
전 C804 |
절 C808 |
젊 C80A |
|
| 412x/C1Ax | 점 C810 |
접 C811 |
젓 C813 |
정 C815 |
젖 C816 |
제 C81C |
젝 C81D |
젠 C820 |
젤 C824 |
젬 C82C |
젭 C82D |
젯 C82F |
젱 C831 |
져 C838 |
젼 C83C | |
| 413x/C1Bx | 졀 C840 |
졈 C848 |
졉 C849 |
졌 C84C |
졍 C84D |
졔 C854 |
조 C870 |
족 C871 |
존 C874 |
졸 C878 |
졺 C87A |
좀 C880 |
좁 C881 |
좃 C883 |
종 C885 |
좆 C886 |
| 414x/C1Cx | 좇 C887 |
좋 C88B |
좌 C88C |
좍 C88D |
좔 C894 |
좝 C89D |
좟 C89F |
좡 C8A1 |
좨 C8A8 |
좼 C8BC |
좽 C8BD |
죄 C8C4 |
죈 C8C8 |
죌 C8CC |
죔 C8D4 |
죕 C8D5 |
| 415x/C1Dx | 죗 C8D7 |
죙 C8D9 |
죠 C8E0 |
죡 C8E1 |
죤 C8E4 |
죵 C8F5 |
주 C8FC |
죽 C8FD |
준 C900 |
줄 C904 |
줅 C905 |
줆 C906 |
줌 C90C |
줍 C90D |
줏 C90F |
중 C911 |
| 416x/C1Ex | 줘 C918 |
줬 C92C |
줴 C934 |
쥐 C950 |
쥑 C951 |
쥔 C954 |
쥘 C958 |
쥠 C960 |
쥡 C961 |
쥣 C963 |
쥬 C96C |
쥰 C970 |
쥴 C974 |
쥼 C97C |
즈 C988 |
즉 C989 |
| 417x/C1Fx | 즌 C98C |
즐 C990 |
즘 C998 |
즙 C999 |
즛 C99B |
증 C99D |
지 C9C0 |
직 C9C1 |
진 C9C4 |
짇 C9C7 |
질 C9C8 |
짊 C9CA |
짐 C9D0 |
집 C9D1 |
짓 C9D3 |
|
| 422x/C2Ax | 징 C9D5 |
짖 C9D6 |
짙 C9D9 |
짚 C9DA |
짜 C9DC |
짝 C9DD |
짠 C9E0 |
짢 C9E2 |
짤 C9E4 |
짧 C9E7 |
짬 C9EC |
짭 C9ED |
짯 C9EF |
짰 C9F0 |
짱 C9F1 | |
| 423x/C2Bx | 째 C9F8 |
짹 C9F9 |
짼 C9FC |
쨀 CA00 |
쨈 CA08 |
쨉 CA09 |
쨋 CA0B |
쨌 CA0C |
쨍 CA0D |
쨔 CA14 |
쨘 CA18 |
쨩 CA29 |
쩌 CA4C |
쩍 CA4D |
쩐 CA50 |
쩔 CA54 |
| 424x/C2Cx | 쩜 CA5C |
쩝 CA5D |
쩟 CA5F |
쩠 CA60 |
쩡 CA61 |
쩨 CA68 |
쩽 CA7D |
쪄 CA84 |
쪘 CA98 |
쪼 CABC |
쪽 CABD |
쫀 CAC0 |
쫄 CAC4 |
쫌 CACC |
쫍 CACD |
쫏 CACF |
| 425x/C2Dx | 쫑 CAD1 |
쫓 CAD3 |
쫘 CAD8 |
쫙 CAD9 |
쫠 CAE0 |
쫬 CAEC |
쫴 CAF4 |
쬈 CB08 |
쬐 CB10 |
쬔 CB14 |
쬘 CB18 |
쬠 CB20 |
쬡 CB21 |
쭁 CB41 |
쭈 CB48 |
쭉 CB49 |
| 426x/C2Ex | 쭌 CB4C |
쭐 CB50 |
쭘 CB58 |
쭙 CB59 |
쭝 CB5D |
쭤 CB64 |
쭸 CB78 |
쭹 CB79 |
쮜 CB9C |
쮸 CBB8 |
쯔 CBD4 |
쯤 CBE4 |
쯧 CBE7 |
쯩 CBE9 |
찌 CC0C |
찍 CC0D |
| 427x/C2Fx | 찐 CC10 |
찔 CC14 |
찜 CC1C |
찝 CC1D |
찡 CC21 |
찢 CC22 |
찧 CC27 |
차 CC28 |
착 CC29 |
찬 CC2C |
찮 CC2E |
찰 CC30 |
참 CC38 |
찹 CC39 |
찻 CC3B |
|
| 432x/C3Ax | 찼 CC3C |
창 CC3D |
찾 CC3E |
채 CC44 |
책 CC45 |
챈 CC48 |
챌 CC4C |
챔 CC54 |
챕 CC55 |
챗 CC57 |
챘 CC58 |
챙 CC59 |
챠 CC60 |
챤 CC64 |
챦 CC66 | |
| 433x/C3Bx | 챨 CC68 |
챰 CC70 |
챵 CC75 |
처 CC98 |
척 CC99 |
천 CC9C |
철 CCA0 |
첨 CCA8 |
첩 CCA9 |
첫 CCAB |
첬 CCAC |
청 CCAD |
체 CCB4 |
첵 CCB5 |
첸 CCB8 |
첼 CCBC |
| 434x/C3Cx | 쳄 CCC4 |
쳅 CCC5 |
쳇 CCC7 |
쳉 CCC9 |
쳐 CCD0 |
쳔 CCD4 |
쳤 CCE4 |
쳬 CCEC |
쳰 CCF0 |
촁 CD01 |
초 CD08 |
촉 CD09 |
촌 CD0C |
촐 CD10 |
촘 CD18 |
촙 CD19 |
| 435x/C3Dx | 촛 CD1B |
총 CD1D |
촤 CD24 |
촨 CD28 |
촬 CD2C |
촹 CD39 |
최 CD5C |
쵠 CD60 |
쵤 CD64 |
쵬 CD6C |
쵭 CD6D |
쵯 CD6F |
쵱 CD71 |
쵸 CD78 |
춈 CD88 |
추 CD94 |
| 436x/C3Ex | 축 CD95 |
춘 CD98 |
출 CD9C |
춤 CDA4 |
춥 CDA5 |
춧 CDA7 |
충 CDA9 |
춰 CDB0 |
췄 CDC4 |
췌 CDCC |
췐 CDD0 |
취 CDE8 |
췬 CDEC |
췰 CDF0 |
췸 CDF8 |
췹 CDF9 |
| 437x/C3Fx | 췻 CDFB |
췽 CDFD |
츄 CE04 |
츈 CE08 |
츌 CE0C |
츔 CE14 |
츙 CE19 |
츠 CE20 |
측 CE21 |
츤 CE24 |
츨 CE28 |
츰 CE30 |
츱 CE31 |
츳 CE33 |
층 CE35 |
|
| 442x/C4Ax | 치 CE58 |
칙 CE59 |
친 CE5C |
칟 CE5F |
칠 CE60 |
칡 CE61 |
침 CE68 |
칩 CE69 |
칫 CE6B |
칭 CE6D |
카 CE74 |
칵 CE75 |
칸 CE78 |
칼 CE7C |
캄 CE84 | |
| 443x/C4Bx | 캅 CE85 |
캇 CE87 |
캉 CE89 |
캐 CE90 |
캑 CE91 |
캔 CE94 |
캘 CE98 |
캠 CEA0 |
캡 CEA1 |
캣 CEA3 |
캤 CEA4 |
캥 CEA5 |
캬 CEAC |
캭 CEAD |
컁 CEC1 |
커 CEE4 |
| 444x/C4Cx | 컥 CEE5 |
컨 CEE8 |
컫 CEEB |
컬 CEEC |
컴 CEF4 |
컵 CEF5 |
컷 CEF7 |
컸 CEF8 |
컹 CEF9 |
케 CF00 |
켁 CF01 |
켄 CF04 |
켈 CF08 |
켐 CF10 |
켑 CF11 |
켓 CF13 |
| 445x/C4Dx | 켕 CF15 |
켜 CF1C |
켠 CF20 |
켤 CF24 |
켬 CF2C |
켭 CF2D |
켯 CF2F |
켰 CF30 |
켱 CF31 |
켸 CF38 |
코 CF54 |
콕 CF55 |
콘 CF58 |
콜 CF5C |
콤 CF64 |
콥 CF65 |
| 446x/C4Ex | 콧 CF67 |
콩 CF69 |
콰 CF70 |
콱 CF71 |
콴 CF74 |
콸 CF78 |
쾀 CF80 |
쾅 CF85 |
쾌 CF8C |
쾡 CFA1 |
쾨 CFA8 |
쾰 CFB0 |
쿄 CFC4 |
쿠 CFE0 |
쿡 CFE1 |
쿤 CFE4 |
| 447x/C4Fx | 쿨 CFE8 |
쿰 CFF0 |
쿱 CFF1 |
쿳 CFF3 |
쿵 CFF5 |
쿼 CFFC |
퀀 D000 |
퀄 D004 |
퀑 D011 |
퀘 D018 |
퀭 D02D |
퀴 D034 |
퀵 D035 |
퀸 D038 |
퀼 D03C |
|
| 452x/C5Ax | 큄 D044 |
큅 D045 |
큇 D047 |
큉 D049 |
큐 D050 |
큔 D054 |
큘 D058 |
큠 D060 |
크 D06C |
큭 D06D |
큰 D070 |
클 D074 |
큼 D07C |
큽 D07D |
킁 D081 | |
| 453x/C5Bx | 키 D0A4 |
킥 D0A5 |
킨 D0A8 |
킬 D0AC |
킴 D0B4 |
킵 D0B5 |
킷 D0B7 |
킹 D0B9 |
타 D0C0 |
탁 D0C1 |
탄 D0C4 |
탈 D0C8 |
탉 D0C9 |
탐 D0D0 |
탑 D0D1 |
탓 D0D3 |
| 454x/C5Cx | 탔 D0D4 |
탕 D0D5 |
태 D0DC |
택 D0DD |
탠 D0E0 |
탤 D0E4 |
탬 D0EC |
탭 D0ED |
탯 D0EF |
탰 D0F0 |
탱 D0F1 |
탸 D0F8 |
턍 D10D |
터 D130 |
턱 D131 |
턴 D134 |
| 455x/C5Dx | 털 D138 |
턺 D13A |
텀 D140 |
텁 D141 |
텃 D143 |
텄 D144 |
텅 D145 |
테 D14C |
텍 D14D |
텐 D150 |
텔 D154 |
템 D15C |
텝 D15D |
텟 D15F |
텡 D161 |
텨 D168 |
| 456x/C5Ex | 텬 D16C |
텼 D17C |
톄 D184 |
톈 D188 |
토 D1A0 |
톡 D1A1 |
톤 D1A4 |
톨 D1A8 |
톰 D1B0 |
톱 D1B1 |
톳 D1B3 |
통 D1B5 |
톺 D1BA |
톼 D1BC |
퇀 D1C0 |
퇘 D1D8 |
| 457x/C5Fx | 퇴 D1F4 |
퇸 D1F8 |
툇 D207 |
툉 D209 |
툐 D210 |
투 D22C |
툭 D22D |
툰 D230 |
툴 D234 |
툼 D23C |
툽 D23D |
툿 D23F |
퉁 D241 |
퉈 D248 |
퉜 D25C |
|
| 462x/C6Ax | 퉤 D264 |
튀 D280 |
튁 D281 |
튄 D284 |
튈 D288 |
튐 D290 |
튑 D291 |
튕 D295 |
튜 D29C |
튠 D2A0 |
튤 D2A4 |
튬 D2AC |
튱 D2B1 |
트 D2B8 |
특 D2B9 | |
| 463x/C6Bx | 튼 D2BC |
튿 D2BF |
틀 D2C0 |
틂 D2C2 |
틈 D2C8 |
틉 D2C9 |
틋 D2CB |
틔 D2D4 |
틘 D2D8 |
틜 D2DC |
틤 D2E4 |
틥 D2E5 |
티 D2F0 |
틱 D2F1 |
틴 D2F4 |
틸 D2F8 |
| 464x/C6Cx | 팀 D300 |
팁 D301 |
팃 D303 |
팅 D305 |
파 D30C |
팍 D30D |
팎 D30E |
판 D310 |
팔 D314 |
팖 D316 |
팜 D31C |
팝 D31D |
팟 D31F |
팠 D320 |
팡 D321 |
팥 D325 |
| 465x/C6Dx | 패 D328 |
팩 D329 |
팬 D32C |
팰 D330 |
팸 D338 |
팹 D339 |
팻 D33B |
팼 D33C |
팽 D33D |
퍄 D344 |
퍅 D345 |
퍼 D37C |
퍽 D37D |
펀 D380 |
펄 D384 |
펌 D38C |
| 466x/C6Ex | 펍 D38D |
펏 D38F |
펐 D390 |
펑 D391 |
페 D398 |
펙 D399 |
펜 D39C |
펠 D3A0 |
펨 D3A8 |
펩 D3A9 |
펫 D3AB |
펭 D3AD |
펴 D3B4 |
편 D3B8 |
펼 D3BC |
폄 D3C4 |
| 467x/C6Fx | 폅 D3C5 |
폈 D3C8 |
평 D3C9 |
폐 D3D0 |
폘 D3D8 |
폡 D3E1 |
폣 D3E3 |
포 D3EC |
폭 D3ED |
폰 D3F0 |
폴 D3F4 |
폼 D3FC |
폽 D3FD |
폿 D3FF |
퐁 D401 |
|
| 472x/C7Ax | 퐈 D408 |
퐝 D41D |
푀 D440 |
푄 D444 |
표 D45C |
푠 D460 |
푤 D464 |
푭 D46D |
푯 D46F |
푸 D478 |
푹 D479 |
푼 D47C |
푿 D47F |
풀 D480 |
풂 D482 | |
| 473x/C7Bx | 품 D488 |
풉 D489 |
풋 D48B |
풍 D48D |
풔 D494 |
풩 D4A9 |
퓌 D4CC |
퓐 D4D0 |
퓔 D4D4 |
퓜 D4DC |
퓟 D4DF |
퓨 D4E8 |
퓬 D4EC |
퓰 D4F0 |
퓸 D4F8 |
퓻 D4FB |
| 474x/C7Cx | 퓽 D4FD |
프 D504 |
픈 D508 |
플 D50C |
픔 D514 |
픕 D515 |
픗 D517 |
피 D53C |
픽 D53D |
핀 D540 |
필 D544 |
핌 D54C |
핍 D54D |
핏 D54F |
핑 D551 |
하 D558 |
| 475x/C7Dx | 학 D559 |
한 D55C |
할 D560 |
핥 D565 |
함 D568 |
합 D569 |
핫 D56B |
항 D56D |
해 D574 |
핵 D575 |
핸 D578 |
핼 D57C |
햄 D584 |
햅 D585 |
햇 D587 |
했 D588 |
| 476x/C7Ex | 행 D589 |
햐 D590 |
향 D5A5 |
허 D5C8 |
헉 D5C9 |
헌 D5CC |
헐 D5D0 |
헒 D5D2 |
험 D5D8 |
헙 D5D9 |
헛 D5DB |
헝 D5DD |
헤 D5E4 |
헥 D5E5 |
헨 D5E8 |
헬 D5EC |
| 477x/C7Fx | 헴 D5F4 |
헵 D5F5 |
헷 D5F7 |
헹 D5F9 |
혀 D600 |
혁 D601 |
현 D604 |
혈 D608 |
혐 D610 |
협 D611 |
혓 D613 |
혔 D614 |
형 D615 |
혜 D61C |
혠 D620 |
|
| 482x/C8Ax | 혤 D624 |
혭 D62D |
호 D638 |
혹 D639 |
혼 D63C |
홀 D640 |
홅 D645 |
홈 D648 |
홉 D649 |
홋 D64B |
홍 D64D |
홑 D651 |
화 D654 |
확 D655 |
환 D658 | |
| 483x/C8Bx | 활 D65C |
홧 D667 |
황 D669 |
홰 D670 |
홱 D671 |
홴 D674 |
횃 D683 |
횅 D685 |
회 D68C |
획 D68D |
횐 D690 |
횔 D694 |
횝 D69D |
횟 D69F |
횡 D6A1 |
효 D6A8 |
| 484x/C8Cx | 횬 D6AC |
횰 D6B0 |
횹 D6B9 |
횻 D6BB |
후 D6C4 |
훅 D6C5 |
훈 D6C8 |
훌 D6CC |
훑 D6D1 |
훔 D6D4 |
훗 D6D7 |
훙 D6D9 |
훠 D6E0 |
훤 D6E4 |
훨 D6E8 |
훰 D6F0 |
| 485x/C8Dx | 훵 D6F5 |
훼 D6FC |
훽 D6FD |
휀 D700 |
휄 D704 |
휑 D711 |
휘 D718 |
휙 D719 |
휜 D71C |
휠 D720 |
휨 D728 |
휩 D729 |
휫 D72B |
휭 D72D |
휴 D734 |
휵 D735 |
| 486x/C8Ex | 휸 D738 |
휼 D73C |
흄 D744 |
흇 D747 |
흉 D749 |
흐 D750 |
흑 D751 |
흔 D754 |
흖 D756 |
흗 D757 |
흘 D758 |
흙 D759 |
흠 D760 |
흡 D761 |
흣 D763 |
흥 D765 |
| 487x/C8Fx | 흩 D769 |
희 D76C |
흰 D770 |
흴 D774 |
흼 D77C |
흽 D77D |
힁 D781 |
히 D788 |
힉 D789 |
힌 D78C |
힐 D790 |
힘 D798 |
힙 D799 |
힛 D79B |
힝 D79D |
Statistics by jamo
[edit]
|
|
|
Hanja sets (rows number 42 through 93)
[edit]KS X 1001 encodes hanja with multiple pronunciations multiple times, once for each pronunciation. (Some pronunciations are inherited from Middle Chinese, and others are an effect of the initial sound rule.) One character, 樂, is encoded four times. The first 268 characters (U+F900–U+FA0B) in the CJK Compatibility Ideographs block correspond to these duplicates.
In the table below, the first row-cell value (and reading) for each Hanja maps to the CJK Unified Ideographs block; others map to the CJK Compatibility Ideographs block.
| Hanja | Locations and readings |
|---|---|
| 賈 | 42-25 (가 ga), 45-47 (고 go) |
| 降 | 43-29 (강 gang), 90-02 (항 hang) |
| 見 | 44-24 (견 gyeon), 90-70 (현 hyeon) |
| 更 | 44-58 (경 gyeong), 43-54 (갱 gaeng) |
| 契 | 44-88 (계 gye), 48-48 (글 geul) |
| 串 | 45-90 (곶 got), 46-13 (관 gwan) |
| 廓 | 46-09 (곽 gwak), 92-09 (확 hwak) |
| 句 | 47-03 (구 gu), 47-91 (귀 gwi) |
| 龜 | 47-47 (구 gu), 48-02 (귀 gwi), 48-24 (균 gyun) |
| 豈 | 49-34 (기 gi), 43-48 (개 gae) |
| 金 | 49-49 (김 gim), 48-61 (금 geum) |
| 拏 | 49-57 (나 na), 52-92 (라 ra) |
| 諾 | 49-71 (낙 nak), 53-11 (락 rak) |
| 奈 | 50-15 (내 nae), 49-53 (나 na) |
| 女 | 50-19 (녀 nyeo), 69-92 (여 yeo) |
| 年 | 50-20 (년 nyeon), 70-36 (연 yeon) |
| 撚 | 50-21 (년 nyeon), 70-42 (연 yeon) |
| 秊 | 50-22 (년 nyeon), 70-60 (연 yeon) |
| 念 | 50-23 (념 nyeom), 70-86 (염 yeom) |
| 捻 | 50-26 (념 nyeom), 70-87 (염 yeom) |
| 寧 | 50-27 (녕 nyeong), 54-24 (령 ryeong), 71-12 (영 yeong) |
| 怒 | 50-33 (노 no), 54-45 (로 ro) |
| 尿 | 50-67 (뇨 nyo), 72-81 (요 yo) |
| 杻 | 50-78 (뉴 nyu), 74-84 (유 yu) |
| 紐 | 50-79 (뉴 nyu), 75-10 (유 yu) |
| 泥 | 50-90 (니 ni), 76-18 (이 i) |
| 匿 | 50-91 (닉 nik), 76-41 (익 ik) |
| 溺 | 50-92 (닉 nik), 76-42 (익 ik) |
| 茶 | 50-94 (다 da), 83-17 (차 cha) |
| 丹 | 51-01 (단 dan), 53-14 (란 ran) |
| 糖 | 51-56 (당 dang), 87-24 (탕 tang) |
| 宅 | 51-75 (댁 daek), 87-40 (택 taek) |
| 度 | 51-88 (도 do), 86-84 (탁 tak) |
| 讀 | 52-33 (독 dok), 52-70 (두 du) |
| 洞 | 52-55 (동 dong), 87-51 (통 tong) |
| 喇 | 52-90 (라 ra), 49-52 (나 na) |
| 懶 | 52-91 (라 ra), 49-56 (나 na) |
| 癩 | 52-93 (라 ra), 49-59 (나 na) |
| 羅 | 52-94 (라 ra), 49-60 (나 na) |
| 蘿 | 53-01 (라 ra), 49-61 (나 na) |
| 螺 | 53-02 (라 ra), 49-62 (나 na) |
| 裸 | 53-03 (라 ra), 49-63 (나 na) |
| 邏 | 53-04 (라 ra), 49-64 (나 na) |
| 洛 | 53-06 (락 rak), 49-67 (낙 nak) |
| 烙 | 53-07 (락 rak), 49-68 (낙 nak) |
| 珞 | 53-08 (락 rak), 49-69 (낙 nak) |
| 落 | 53-10 (락 rak), 49-70 (낙 nak) |
| 酪 | 53-12 (락 rak), 49-72 (낙 nak) |
| 駱 | 53-13 (락 rak), 49-73 (낙 nak) |
| 亂 | 53-15 (란 ran), 49-74 (난 nan) |
| 卵 | 53-16 (란 ran), 49-75 (난 nan) |
| 欄 | 53-17 (란 ran), 49-77 (난 nan) |
| 爛 | 53-20 (란 ran), 49-79 (난 nan) |
| 蘭 | 53-21 (란 ran), 49-80 (난 nan) |
| 鸞 | 53-22 (란 ran), 49-82 (난 nan) |
| 嵐 | 53-25 (람 ram), 49-86 (남 nam) |
| 濫 | 53-29 (람 ram), 49-90 (남 nam) |
| 藍 | 53-32 (람 ram), 49-92 (남 nam) |
| 襤 | 53-33 (람 ram), 49-93 (남 nam) |
| 拉 | 53-35 (랍 rap), 49-94 (납 nap) |
| 臘 | 53-36 (랍 rap), 50-02 (납 nap) |
| 蠟 | 53-37 (랍 rap), 50-03 (납 nap) |
| 廊 | 53-38 (랑 rang), 50-07 (낭 nang) |
| 朗 | 53-39 (랑 rang), 50-08 (낭 nang) |
| 浪 | 53-40 (랑 rang), 50-09 (낭 nang) |
| 狼 | 53-41 (랑 rang), 50-10 (낭 nang) |
| 郞 | 53-45 (랑 rang), 50-11 (낭 nang) |
| 來 | 53-46 (래 rae), 50-13 (내 nae) |
| 冷 | 53-50 (랭 raeng), 50-18 (냉 naeng) |
| 掠 | 53-51 (략 ryak), 69-17 (약 yak) |
| 略 | 53-52 (략 ryak), 69-18 (약 yak) |
| 亮 | 53-53 (량 ryang), 69-25 (양 yang) |
| 兩 | 53-55 (량 ryang), 69-27 (양 yang) |
| 凉 | 53-56 (량 ryang), 69-28 (양 yang) |
| 梁 | 53-57 (량 ryang), 69-36 (양 yang) |
| 糧 | 53-61 (량 ryang), 69-46 (양 yang) |
| 良 | 53-62 (량 ryang), 69-48 (양 yang) |
| 諒 | 53-63 (량 ryang), 69-50 (양 yang) |
| 量 | 53-65 (량 ryang), 69-54 (양 yang) |
| 勵 | 53-68 (려 ryeo), 69-90 (여 yeo) |
| 呂 | 53-69 (려 ryeo), 69-91 (여 yeo) |
| 廬 | 53-70 (려 ryeo), 69-94 (여 yeo) |
| 旅 | 53-73 (려 ryeo), 70-01 (여 yeo) |
| 濾 | 53-75 (려 ryeo), 70-04 (여 yeo) |
| 礪 | 53-76 (려 ryeo), 70-07 (여 yeo) |
| 閭 | 53-79 (려 ryeo), 70-13 (여 yeo) |
| 驪 | 53-81 (려 ryeo), 70-15 (여 yeo) |
| 麗 | 53-82 (려 ryeo), 70-16 (여 yeo) |
| 黎 | 53-83 (려 ryeo), 70-17 (여 yeo) |
| 力 | 53-84 (력 ryeok), 70-19 (역 yeok) |
| 曆 | 53-85 (력 ryeok), 70-23 (역 yeok) |
| 歷 | 53-86 (력 ryeok), 70-24 (역 yeok) |
| 轢 | 53-89 (력 ryeok), 70-28 (역 yeok) |
| 憐 | 53-91 (련 ryeon), 70-38 (연 yeon) |
| 戀 | 53-92 (련 ryeon), 70-39 (연 yeon) |
| 漣 | 53-94 (련 ryeon), 70-50 (연 yeon) |
| 煉 | 54-01 (련 ryeon), 70-54 (연 yeon) |
| 璉 | 54-02 (련 ryeon), 70-57 (연 yeon) |
| 練 | 54-03 (련 ryeon), 70-63 (연 yeon) |
| 聯 | 54-04 (련 ryeon), 70-65 (연 yeon) |
| 蓮 | 54-05 (련 ryeon), 70-69 (연 yeon) |
| 輦 | 54-06 (련 ryeon), 70-68 (연 yeon) |
| 連 | 54-07 (련 ryeon), 70-70 (연 yeon) |
| 鍊 | 54-08 (련 ryeon), 70-72 (연 yeon) |
| 列 | 54-10 (렬 ryeol), 70-74 (열 yeol) |
| 劣 | 54-11 (렬 ryeol), 70-75 (열 yeol) |
| 烈 | 54-13 (렬 ryeol), 70-79 (열 yeol) |
| 裂 | 54-14 (렬 ryeol), 70-81 (열 yeol) |
| 廉 | 54-15 (렴 ryeom), 70-85 (염 yeom) |
| 殮 | 54-17 (렴 ryeom), 70-89 (염 yeom) |
| 簾 | 54-19 (렴 ryeom), 71-01 (염 yeom) |
| 獵 | 54-20 (렵 ryeop), 71-06 (엽 yeop) |
| 令 | 54-21 (령 ryeong), 71-09 (영 yeong) |
| 囹 | 54-23 (령 ryeong), 71-10 (영 yeong) |
| 嶺 | 54-26 (령 ryeong), 71-13 (영 yeong) |
| 怜 | 54-27 (령 ryeong), 71-16 (영 yeong) |
| 玲 | 54-28 (령 ryeong), 71-31 (영 yeong) |
| 羚 | 54-30 (령 ryeong), 71-38 (영 yeong) |
| 聆 | 54-32 (령 ryeong), 71-39 (영 yeong) |
| 鈴 | 54-34 (령 ryeong), 71-43 (영 yeong) |
| 零 | 54-35 (령 ryeong), 71-45 (영 yeong) |
| 靈 | 54-36 (령 ryeong), 71-47 (영 yeong) |
| 領 | 54-37 (령 ryeong), 71-48 (영 yeong) |
| 例 | 54-39 (례 rye), 71-51 (예 ye) |
| 禮 | 54-41 (례 rye), 71-63 (예 ye) |
| 醴 | 54-42 (례 rye), 71-68 (예 ye) |
| 隷 | 54-43 (례 rye), 71-70 (예 ye) |
| 勞 | 54-44 (로 ro), 50-30 (노 no) |
| 擄 | 54-47 (로 ro), 50-34 (노 no) |
| 櫓 | 54-48 (로 ro), 50-35 (노 no) |
| 爐 | 54-51 (로 ro), 50-36 (노 no) |
| 盧 | 54-52 (로 ro), 50-38 (노 no) |
| 老 | 54-53 (로 ro), 50-39 (노 no) |
| 蘆 | 54-54 (로 ro), 50-40 (노 no) |
| 虜 | 54-55 (로 ro), 50-41 (노 no) |
| 路 | 54-56 (로 ro), 50-42 (노 no) |
| 露 | 54-58 (로 ro), 50-43 (노 no) |
| 魯 | 54-59 (로 ro), 50-45 (노 no) |
| 鷺 | 54-60 (로 ro), 50-46 (노 no) |
| 碌 | 54-62 (록 rok), 50-47 (녹 nok) |
| 祿 | 54-63 (록 rok), 50-48 (녹 nok) |
| 綠 | 54-64 (록 rok), 50-49 (녹 nok) |
| 菉 | 54-65 (록 rok), 50-50 (녹 nok) |
| 錄 | 54-66 (록 rok), 50-51 (녹 nok) |
| 鹿 | 54-67 (록 rok), 50-52 (녹 nok) |
| 論 | 54-69 (론 ron), 50-53 (논 non) |
| 壟 | 54-70 (롱 rong), 50-54 (농 nong) |
| 弄 | 54-71 (롱 rong), 50-55 (농 nong) |
| 籠 | 54-75 (롱 rong), 50-57 (농 nong) |
| 聾 | 54-76 (롱 rong), 50-58 (농 nong) |
| 牢 | 54-79 (뢰 roe), 50-62 (뇌 noe) |
| 磊 | 54-80 (뢰 roe), 50-63 (뇌 noe) |
| 賂 | 54-81 (뢰 roe), 50-65 (뇌 noe) |
| 雷 | 54-84 (뢰 roe), 50-66 (뇌 noe) |
| 了 | 54-85 (료 ryo), 72-71 (요 yo) |
| 僚 | 54-86 (료 ryo), 72-72 (요 yo) |
| 寮 | 54-87 (료 ryo), 72-80 (요 yo) |
| 料 | 54-89 (료 ryo), 72-87 (요 yo) |
| 燎 | 54-90 (료 ryo), 72-91 (요 yo) |
| 療 | 54-91 (료 ryo), 72-94 (요 yo) |
| 蓼 | 54-94 (료 ryo), 73-07 (요 yo) |
| 遼 | 55-01 (료 ryo), 73-12 (요 yo) |
| 龍 | 55-03 (룡 ryong), 73-44 (용 yong) |
| 壘 | 55-04 (루 ru), 50-68 (누 nu) |
| 屢 | 55-06 (루 ru), 50-69 (누 nu) |
| 樓 | 55-07 (루 ru), 50-70 (누 nu) |
| 淚 | 55-08 (루 ru), 50-71 (누 nu) |
| 漏 | 55-09 (루 ru), 50-72 (누 nu) |
| 累 | 55-11 (루 ru), 50-73 (누 nu) |
| 縷 | 55-12 (루 ru), 50-74 (누 nu) |
| 陋 | 55-16 (루 ru), 50-75 (누 nu) |
| 劉 | 55-17 (류 ryu), 74-69 (유 yu) |
| 柳 | 55-19 (류 ryu), 74-87 (유 yu) |
| 流 | 55-21 (류 ryu), 74-92 (유 yu) |
| 溜 | 55-22 (류 ryu), 74-94 (유 yu) |
| 琉 | 55-24 (류 ryu), 75-04 (유 yu) |
| 留 | 55-26 (류 ryu), 75-07 (유 yu) |
| 硫 | 55-28 (류 ryu), 75-09 (유 yu) |
| 類 | 55-30 (류 ryu), 75-26 (유 yu) |
| 六 | 55-31 (륙 ryuk), 75-27 (육 yuk) |
| 戮 | 55-32 (륙 ryuk), 75-29 (육 yuk) |
| 陸 | 55-33 (륙 ryuk), 75-33 (육 yuk) |
| 倫 | 55-35 (륜 ryun), 75-34 (윤 yun) |
| 崙 | 55-36 (륜 ryun), 75-38 (윤 yun) |
| 淪 | 55-37 (륜 ryun), 75-39 (윤 yun) |
| 輪 | 55-39 (륜 ryun), 75-44 (윤 yun) |
| 律 | 55-40 (률 ryul), 75-47 (율 yul) |
| 慄 | 55-41 (률 ryul), 75-48 (율 yul) |
| 栗 | 55-42 (률 ryul), 75-49 (율 yul) |
| 隆 | 55-44 (륭 ryung), 75-56 (융 yung) |
| 勒 | 55-45 (륵 reuk), 50-80 (늑 neuk) |
| 肋 | 55-46 (륵 reuk), 50-81 (늑 neuk) |
| 凜 | 55-47 (름 reum), 50-82 (늠 neum) |
| 凌 | 55-48 (릉 reung), 50-83 (능 neung) |
| 稜 | 55-50 (릉 reung), 50-84 (능 neung) |
| 綾 | 55-51 (릉 reung), 50-85 (능 neung) |
| 菱 | 55-52 (릉 reung), 50-87 (능 neung) |
| 陵 | 55-53 (릉 reung), 50-88 (능 neung) |
| 利 | 55-55 (리 ri), 76-06 (이 i) |
| 吏 | 55-57 (리 ri), 76-07 (이 i) |
| 履 | 55-59 (리 ri), 76-10 (이 i) |
| 李 | 55-61 (리 ri), 76-16 (이 i) |
| 梨 | 55-62 (리 ri), 76-17 (이 i) |
| 理 | 55-66 (리 ri), 76-21 (이 i) |
| 痢 | 55-69 (리 ri), 76-24 (이 i) |
| 罹 | 55-71 (리 ri), 76-26 (이 i) |
| 裏 | 55-74 (리 ri), 76-32 (이 i) |
| 裡 | 55-75 (리 ri), 76-33 (이 i) |
| 里 | 55-76 (리 ri), 76-37 (이 i) |
| 離 | 55-78 (리 ri), 76-38 (이 i) |
| 吝 | 55-80 (린 rin), 76-53 (인 in) |
| 燐 | 55-82 (린 rin), 76-61 (인 in) |
| 璘 | 55-83 (린 rin), 76-62 (인 in) |
| 藺 | 55-84 (린 rin), 76-65 (인 in) |
| 隣 | 55-86 (린 rin), 76-68 (인 in) |
| 鱗 | 55-87 (린 rin), 76-71 (인 in) |
| 麟 | 55-88 (린 rin), 76-72 (인 in) |
| 林 | 55-89 (림 rim), 76-87 (임 im) |
| 淋 | 55-90 (림 rim), 76-88 (임 im) |
| 臨 | 55-92 (림 rim), 76-90 (임 im) |
| 立 | 56-01 (립 rip), 77-01 (입 ip) |
| 笠 | 56-02 (립 rip), 77-02 (입 ip) |
| 粒 | 56-03 (립 rip), 77-03 (입 ip) |
| 磻 | 58-82 (반 ban), 59-68 (번 beon) |
| 復 | 60-54 (복 bok), 61-05 (부 bu) |
| 輻 | 60-63 (복 bok), 88-80 (폭 pok) |
| 不 | 60-84 (부 bu), 61-53 (불 bul) |
| 北 | 61-33 (북 buk), 59-37 (배 bae) |
| 殺 | 63-15 (살 sal), 65-77 (쇄 swae) |
| 狀 | 63-50 (상 sang), 77-78 (장 jang) |
| 塞 | 63-61 (새 sae), 63-65 (색 saek) |
| 索 | 63-67 (색 saek), 62-94 (삭 sak) |
| 說 | 64-67 (설 seol), 65-13 (세 se), 70-82 (열 yeol) |
| 省 | 64-93 (성 seong), 63-72 (생 saeng) |
| 率 | 65-67 (솔 sol), 55-43 (률 ryul), 75-50 (율 yul) |
| 數 | 66-06 (수 su), 62-92 (삭 sak) |
| 拾 | 67-06 (습 seup), 68-09 (십 sip) |
| 識 | 67-59 (식 sik), 82-29 (지 ji) |
| 什 | 68-07 (십 sip), 82-90 (집 jip) |
| 惡 | 68-34 (악 ak), 71-87 (오 o) |
| 樂 | 68-37 (악 ak), 49-66 (낙 nak), 53-05 (락 rak), 72-89 (요 yo) |
| 若 | 69-20 (약 yak), 69-14 (야 ya) |
| 易 | 70-22 (역 yeok), 76-15 (이 i) |
| 葉 | 71-08 (엽 yeop), 64-81 (섭 seop) |
| 阮 | 72-54 (완 wan), 74-33 (원 won) |
| 異 | 76-22 (이 i), 55-68 (리 ri) |
| 咽 | 76-54 (인 in), 70-76 (열 yeol) |
| 刺 | 77-09 (자 ja), 84-07 (척 cheok) |
| 炙 | 77-19 (자 ja), 78-59 (적 jeok) |
| 切 | 79-23 (절 jeol), 84-78 (체 che) |
| 辰 | 82-67 (진 jin), 67-85 (신 sin) |
| 車 | 83-19 (차 cha), 43-71 (거 geo) |
| 參 | 83-49 (참 cham), 63-19 (삼 sam) |
| 拓 | 84-12 (척 cheok), 86-86 (탁 tak) |
| 沈 | 86-56 (침 chim), 68-01 (심 sim) |
| 便 | 88-21 (편 pyeon), 60-05 (변 byeon) |
| 暴 | 88-76 (폭 pok), 88-59 (포 po) |
| 泌 | 89-18 (필 pil), 61-84 (비 bi) |
| 行 | 90-28 (행 haeng), 90-01 (항 hang) |
| 瑩 | 91-09 (형 hyeong), 71-33 (영 yeong) |
| 滑 | 92-33 (활 hwal), 45-72 (골 gol) |
| 暈 | 93-27 (훈 hun), 73-87 (운 un) |
Johab encoding
[edit]
- 한글: Hangul
- 한자: Hanja
- 특수문자: special characters (non-Hangul and non-Hanja characters)
KS X 1001, since 1992, also defines an alternative encoding known as Johab. This represents a Hangul syllable as the sequence of three five-bit values, split across two 8-bit bytes, most significant bit first. The most significant bit of the lead byte is always set (allowing combination with single-byte ASCII or KS X 1003). This encoding is also used for the modern jamo from row 4 of KS X 1001, by using the filler values for the other components. The Johab encoding for Hangul is shown in the table below.[22]
Johab encodes the remainder of KS X 1001 using lead bytes which do not correspond to an initial jamo (0xE0–0xF9 for Hanja and 0xD9–0xDE[23] for non-Hanja, excluding Hangul syllables and modern jamo), with trail bytes in the ranges 0x31–0x7E and 0x91–0xFE.[22] These codes are algorithmically mapped from the characters' KS X 1001 code points,[23] with two KS X 1001 rows per lead byte (compare and contrast Shift JIS).

The ASCII-based Johab encoding is numbered Code page 1361 by Microsoft.[24] Other, vendor-defined, Johab variants also exist; for example, IBM defines one for use as a Shift Out set with EBCDIC. That variant uses shift in and shift out to switch between a single-byte EBCDIC page and Johab, uses a different encoding for the non-Hangul characters (using lead bytes 0x40–6C with a different layout), and uses lead bytes 0xD4–DD as a user-defined region, but uses the same Johab layout as the 1992 standard for the Hangul characters when in shift-out state.[25] IBM number the EBCDIC-based, stateful Johab encoding Code page 1364,[25] and also define a subset of that encoding, including fewer Hangul characters but in the same layout, as Code page 933.[26]
Some other vendors such as Samsung or GoldStar (now LG) used other "Johab" encodings where the mappings of five-bit codes to jamo differ from the below, consequently not being compatible with the 1992 standard Johab.[af] The table below corresponds to the 1992 standard and also to IBM usage.
| Five-bit sequence | As initial | As vowel | As final |
|---|---|---|---|
| 00000 | not used | not used[ag] | not used |
| 00001 | filler | not used[ah] | filler (empty final) |
| 00010 | ㄱ | filler | ㄱ |
| 00011 | ㄲ | ㅏ | ㄲ |
| 00100 | ㄴ | ㅐ | ㄳ |
| 00101 | ㄷ | ㅑ | ㄴ |
| 00110 | ㄸ | ㅒ | ㄵ |
| 00111 | ㄹ | ㅓ | ㄶ |
| 01000 | ㅁ | not used[ag] | ㄷ |
| 01001 | ㅂ | not used[ah] | ㄹ |
| 01010 | ㅃ | ㅔ | ㄺ |
| 01011 | ㅅ | ㅕ | ㄻ |
| 01100 | ㅆ | ㅖ | ㄼ |
| 01101 | ㅇ | ㅗ | ㄽ |
| 01110 | ㅈ | ㅘ | ㄾ |
| 01111 | ㅉ | ㅙ | ㄿ |
| 10000 | ㅊ | not used[ag] | ㅀ |
| 10001 | ㅋ | not used[ah] | ㅁ |
| 10010 | ㅌ | ㅚ | not used |
| 10011 | ㅍ | ㅛ | ㅂ |
| 10100 | ㅎ | ㅜ | ㅄ |
| 10101 | not used | ㅝ | ㅅ |
| 10110 | non-Hangul lead bytes | ㅞ | ㅆ |
| 10111 | ㅟ | ㅇ | |
| 11000 | not used[ag] | ㅈ | |
| 11001 | not used[ah] | ㅊ | |
| 11010 | ㅠ | ㅋ | |
| 11011 | ㅡ | ㅌ | |
| 11100 | ㅢ | ㅍ | |
| 11101 | ㅣ | ㅎ | |
| 11110 | not used | not used | |
| 11111 | not used | not used | not used |
N-byte Hangul code
[edit]This is the N-byte Hangul code,[5] as specified by KS C 5601-1974 and by annex 4 of KS C 5601–1992. The second half of IBM's Code page 1040[27] is a superset of this, assigning the characters ¢¬\~ (although not £) to the same locations as in Code page 1041, while the unextended N-Byte Hangul (besides C0 control code replacement graphics in some usage contexts, shared with IBM-1040) is Code page 891.[28] Character 0x40/0xC0 is a Hangul Filler (see above), used in combining sequences.
Similarly to its Japanese counterpart JIS C 6220 (JIS X 0201), N-byte Hangul code could be used as a 7-bit encoding, with character allocations over the range 0x40 through 0x7C.[5] The chart below shows the code in an 8-bit environment with the high bit set (i.e. over 0xC0 through 0xFC), as it is used in e.g. code page 891 or 1040.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| 8x | ||||||||||||||||
| 9x | ||||||||||||||||
| Ax | ||||||||||||||||
| Bx | ||||||||||||||||
| Cx | HWHF | ᄀ | ᄁ | ᆪ | ᄂ | ᆬ | ᆭ | ᄃ | ᄄ | ᄅ | ᆰ | ᆱ | ᆲ | ᆳ | ᆴ | ᆵ |
| Dx | ᄚ | ᄆ | ᄇ | ᄈ | ᄡ | ᄉ | ᄊ | ᄋ | ᄌ | ᄍ | ᄎ | ᄏ | ᄐ | ᄑ | ᄒ | |
| Ex | ᅡ | ᅢ | ᅣ | ᅤ | ᅥ | ᅦ | ᅧ | ᅨ | ᅩ | ᅪ | ᅫ | ᅬ | ||||
| Fx | ᅭ | ᅮ | ᅯ | ᅰ | ᅱ | ᅲ | ᅳ | ᅴ | ᅵ |
Footnotes
[edit]- ^ Missing Greek diacritics and final sigma.
- ^ Missing chōonpu, shinjitai, kokuji.
- ^ a b Missing lowercase å, although the uppercase U+212B Å ANGSTROM SIGN is included, as are both cases of Ø and Æ.
- ^ Korean: 정보 교환용 부호계 (한글 및 한자); RR: Jeongbo Gyohwannyong Buhogye (Hangeul mit Hanja)
- ^ a b As a ISO 2022 compatible 94n-character set, the plain space and delete character are always available as single-byte codes at 0x20 and 0x7F (not 0xA0 and 0xFF) respectively.
- ^ a b c d e f g h i Mapped by Apple to a sequence of the ASCII digit, the combining square U+20DE, and the private-use character U+F87C.[16] Appearance shown here simulated.
- ^ a b c d e f g h i j k Mapped by Apple to an ASCII sequence with the number in square brackets, prefixed by the private-use character U+F863.[16] Appearance shown here simulated.
- ^ Narrower than 0xA2D2. Apple appends a private-use character U+F87F for round trip purposes.
- ^ Two vertical lines with right-hand line bolder. Apple maps this to the ordinary double vertical U+2016 (‖) plus a private-use character U+F87B for round trip purposes; U+1D102 is shown here to reflect the intended appearance.
- ^ Two vertical lines with left-hand line bolder. Apple maps this to the ordinary double vertical U+2016 (‖) plus a private-use character U+F87C for round trip purposes; U+1D103 is shown here to reflect the intended appearance.
- ^ Variant aligned to the bottom-left of the character cell, for horizontal use. Apple maps this to U+FF01+F874, where U+F874 is a private-use character used by Apple to tag the character for round-tripping purposes.[16] More recently,[18] a standardised variation sequence has been added for this form, using an appended Variation Selector 1 (U+FE00).[19]
- ^ Variant aligned to the bottom-left of the character cell, for horizontal use. Apple maps this to U+3002+F87D, where U+F87D is a private-use character used by Apple to tag the character for round-tripping purposes.[16] More recently,[18] a standardised variation sequence has been added for this form, using an appended Variation Selector 1 (U+FE00).[19]
- ^ a b Duplicate, mapped by Apple with an appended private-use character U+F87F for round tripping.[16]
- ^ An exact match does note exist in Unicode, shown simulated. Apple maps to U+3257+F87A, where U+F87A is a private-use character, and U+3257 is the unfilled circled character.[16]
- ^ An exact match does note exist in Unicode, shown simulated. Apple maps to U+3258+F87A, where U+F87A is a private-use character, and U+3258 is the unfilled circled character.[16]
- ^ An exact match does note exist in Unicode, shown simulated. Apple maps to U+3259+F87A, where U+F87A is a private-use character, and U+3259 is the unfilled circled character.[16]
- ^ An exact match does note exist in Unicode, shown simulated. Apple maps to U+325A+F87A, where U+F87A is a private-use character, and U+325A is the unfilled circled character.[16]
- ^ An exact match does note exist in Unicode, shown simulated. Apple maps to U+3251+F87A, where U+F87A is a private-use character, and U+3251 is the unfilled circled character.[16]
- ^ An exact match does note exist in Unicode, shown simulated. Apple maps to U+3252+F87A, where U+F87A is a private-use character, and U+3252 is the unfilled circled character.[16]
- ^ An exact match does note exist in Unicode, shown simulated. Apple maps to U+3253+F87A, where U+F87A is a private-use character, and U+3253 is the unfilled circled character.[16]
- ^ An exact match does note exist in Unicode, shown simulated. Apple maps to U+3254+F87A, where U+F87A is a private-use character, and U+3254 is the unfilled circled character.[16]
- ^ An exact match does note exist in Unicode, shown simulated. Apple maps to U+3255+F87A, where U+F87A is a private-use character, and U+3255 is the unfilled circled character.[16]
- ^ An exact match does note exist in Unicode, shown simulated. Apple maps to U+3256+F87A, where U+F87A is a private-use character, and U+3256 is the unfilled circled character.[16]
- ^ a b c d e f g h i j Mapped by Apple to an ASCII sequence, prefixed by the private-use character U+F862.[16]
- ^ a b c d e f g h i Mapped by Apple to a sequence of the ASCII digit, the combining square U+20DE, and the private-use character U+F875.[16] Appearance shown here simulated.
- ^ a b c d e f g h i j k Mapped by Apple to an ASCII sequence with the number in square brackets, prefixed by the private-use character U+F866.[16] Appearance shown here simulated.
- ^ a b c d These largely duplicate characters in row 1. Apple describes them as "long" versions, and appends the private-use character U+F879 for round-tripping purposes.[16]
- ^ a b c d The MacKorean encoding also includes several sets of differently styled arrows (including white arrows) outside of the KS X 1001 plane ranges, with lead bytes 0xA8 and 0xAC, and trail bytes between 0x41 and 0xA0. Apple appends the private-use character U+F878 to this particular set of white arrows for round-tripping purposes.[16]
- ^ a b Unlike those in row 1, these two marks are intended to display in low position (similarly to U+301F). Apple appends the private-use character U+F873 to distinguish them.[16]
- ^ Apple maps this to U+21E7+F87F, where U+21E7 is the white up arrow[16] and U+F87F is a private-use character, although this character is a black arrow.[21] The character shown is a Unicode character which postdates Apple's mapping table, and resembles the glyph[21] for this MacKorean character.
- ^ Apple maps this to a sequence of the ASCII exclamation mark followed by the private-use character U+F87F.[16]
- ^ Some mappings for these encodings are available here.
- ^ a b c d Were this one used, it would result in a trail byte in the C0 control codes range.
- ^ a b c d Were this one used, it would result in trail bytes in the 0x2_ and 0x3_ rows of ASCII. Johab does not use the 0x2_ row for trail bytes, similarly to most common legacy CJK encodings (compare Shift JIS, GBK, Big5). The EBCDIC-based version of Johab does not use trail bytes from either row, due to them being in the EBCDIC control code range.
References
[edit]- ^ a b c d e Lunde, Ken (2009). "Chapter 3: Character Set Standards". CJKV Information Processing. "O'Reilly Media, Inc.". pp. 143–148. ISBN 978-0596514471.
- ^ a b c d e f Hwang, Jinsang (2005). The Social Shaping of ICTs Standards: A Case of National Coded Character Set Standards Controversy in Korea (PDF). University of Edinburgh.
- ^ a b c Lunde, Ken (1995-12-18). "2.4.6: Obsolete Standards". CJK.INF Version 1.9.
- ^ a b Shin, Jungshik. "What are KS X 1001(KS C 5601) and other Hangul codes?". Hangul & Internet in Korea FAQ.
- ^ a b c d Lunde, Ken (1995-12-18). "3.3.6: N-byte Hangul". CJK.INF Version 1.9.
- ^ a b "INFO: Hangul (Korean) Character Sets", Microsoft Support, Microsoft
- ^ a b Zsigri, Gyula (2002-06-18). "KSC and UHC".
- ^ Chang, Hye-Shik (28 November 2021). "cpython/Modules/cjkcodecs/_codecs_kr.c (revision d3faf43)". cPython source tree. Python Software Foundation.
- ^ a b c Chung, Jaemin (2017-03-30). Proposal to add an informative note to U+3164 HANGUL FILLER (PDF). Unicode Consortium. UTC L2/17-081.
- ^ a b Korea Bureau of Standards (1988-10-01). Korean Graphic Character Set for Information Interchange (PDF). ITSCJ/IPSJ. ISO-IR-149.
- ^ "ibm-1363_P110-1997 (lead byte A1)". ICU Demonstration - Converter Explorer. International Components for Unicode / Unicode Consortium.
- ^ "euc-kr (lead byte A1)". ICU Demonstration - Converter Explorer. International Components for Unicode.
- ^ a b c d "Map (external version) from Mac OS Korean encoding to Unicode 3.2 and later". Apple.
- ^ "windows-949-2000 (lead byte A1)". ICU Demonstration - Converter Explorer. International Components for Unicode / Unicode Consortium.
- ^ a b c d "Lead Byte A1-A2 (Code page 949)". MSDN. Microsoft. 6 February 2008.
- ^ a b c d e f g h i j k l m n o p q r s t u v w x y z aa ab ac ad ae Apple (2005-04-05). "Map (external version) from Mac OS Korean encoding to Unicode 3.2 and later". Unicode Consortium.
- ^ "windows-949-2000 (lead byte A2)". ICU Demonstration - Converter Explorer. International Components for Unicode / Unicode Consortium.
- ^ a b Lunde, Ken (2018-01-21). "Proposal to add standardized variation sequences for fullwidth East Asian punctuation" (PDF). UTC L2/17-436.
- ^ a b "StandardizedVariants.txt: Standardized Variation Sequences". Unicode Character Database. Unicode Consortium.
- ^ Lunde, Ken (2009). "Seemingly Missing Characters". CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. p. 180. ISBN 978-0-596-51447-1.
- ^ a b Lunde, Ken (2009). "Appendix E: Vendor Character Set Standards" (PDF). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. ISBN 978-0-596-51447-1.
- ^ a b c Lunde, Ken (2008). "Chapter 4: Encoding Methods (§ Johab Encoding—KS X 1001:2004)". CJKV Information Processing (2nd ed.). Sebastopol, California: O'Reilly Media. pp. 268–273. ISBN 978-0-596-51447-1.
- ^ a b Shin, Jungshik (2011-10-14) [1999-08-16]. Johab to Unicode table. Unicode Consortium.
- ^ "Code Page Identifiers". Windows Dev Center. Microsoft. 7 January 2021.
- ^ a b "ibm-1364_P110-2007". International Components for Unicode. Unicode Consortium.
- ^ "ibm-933_P110-1995". International Components for Unicode. Unicode Consortium.
- ^ "Code Page 01040" (PDF). IBM. Archived (PDF) from the original on 2015-07-08.
- ^ "Code Page 00891" (PDF). IBM. Archived (PDF) from the original on 2015-07-08.
- ^ "KSRI-87-37-IR: 한글·한자 코드 표준화에 관한 연구: A Study on Standardization of Hangul and Hanja Codes" (PDF) (in Korean). Ministry of Science and Technology. 1987. p. 68. Archived from the original (PDF) on 2019-03-01.
External links
[edit]- What are KS X 1001(KS C 5601) and other Hangul codes?
- Implementing Cross-Locale CJKV Code Conversion by Ken Lunde
- Unicode mapping tables for Wansung and Johab encodings:
KS X 1001
View on GrokipediaOverview
Description and Purpose
KS X 1001, formally titled the "Code for Information Interchange (Hangul and Hanja)," is a character encoding standard developed by the Korean Agency for Technology and Standards (KATS).[6][5] Its primary purpose is to standardize the digital representation of Hangul syllables, Hanja (Chinese characters used in Korean), and supplementary symbols to facilitate computing, information processing, and data exchange in Korean-language systems.[5] The standard utilizes a 94×94 grid structure, yielding 8,836 code points to encode these elements efficiently in double-byte format.[3] It supports 2,350 precomposed Hangul syllables for common usage, alongside 4,888 Hanja characters, and incorporates additional sets such as Latin letters, Greek and Cyrillic scripts, and various symbols to ensure compatibility with international data interchange.[7][3] Serving as the successor to the earlier KS C 5601, KS X 1001 evolved from prior 7-bit encoding approaches to a double-byte system, enabling fuller support for the phonetic and logographic aspects of Korean script in modern computing environments.[6]Scope and Character Coverage
KS X 1001 defines characters within a 94×94 matrix structure, comprising 8,836 code points, where the lead byte ranges from 0x21 to 0x7E (rows 1–32) and 0xA1 to 0xFE (rows 33–94), and the trail byte follows the same ranges. Of these, 8,224 code points are assigned characters in the 1992 revision, focusing on elements essential for Korean text representation.[3][8] The matrix allocates rows 16 through 40 (25 rows × 94 positions) to 2,350 modern precomposed Hangul syllables, covering the most commonly used forms in contemporary Korean writing. Rows 42 through 93 (52 rows × 94 positions) are dedicated to 4,888 Hanja (Chinese characters used in Korean), selected for frequency in literature, education, and terminology, though this set includes 268 duplicates representing variant forms for different pronunciations or contexts. Rows 1 through 15 house 986 non-Hanja, non-precomposed characters, including compatibility Hangul jamo assigned in row 4, as well as special symbols such as box-drawing elements, measurement units, and representations from other scripts like Latin, Greek, Cyrillic, Hiragana, and Katakana.[3][9] The standard excludes support for obsolete Hangul forms and extended Hanja beyond its defined repertoire, prioritizing modern usage over historical completeness. Later revisions expanded coverage with supplementary characters, including the Euro sign (€) and registered trademark sign (®) added in 1998 and the Korean postal mark (㉾) in 2002, integrated into existing rows without altering the core matrix. In encoding variants such as Wansung, these characters are represented as two-byte sequences within the defined positions.[10][11]History
Early Development
The development of Korean character encoding standards in the 1970s was driven by the growing need to integrate Hangul into computing systems dominated by ASCII, which supported only Latin characters and posed compatibility issues for non-Latin scripts. Influenced by Japanese standards like JIS C 6220, which used a 7-bit framework for katakana, early Korean efforts sought to create analogous solutions for Hangul jamo while adhering to international norms such as ISO 646. These initiatives aimed to enable basic text processing in Korean but were constrained by the era's hardware limitations, including limited memory and the prevalence of 7-bit environments that conflicted with more complex scripts.[12][13] In 1974, the first edition of KS C 5601 was published as a variable-length 7-bit coded character set, assigning single-byte code points to 51 basic Hangul jamo elements to facilitate initial digital representation of Korean text. This approach prioritized efficiency in resource-scarce systems but covered only elemental components, requiring user assembly for full syllables, which limited its practicality for everyday use. The standard reflected the experimental nature of the time, focusing on core Hangul without extensive Hanja support, amid challenges like inadequate hardware for rendering composed characters.[12] The 1982 second edition retained the core 7-bit main set from 1974 but introduced supplementary sets to expand coverage, including an early combining method akin to later Johab approaches for generating additional Hangul syllables. These additions aimed to address gaps in character repertoire, such as more pre-combined forms and Hanja, but the increased complexity— involving variable-length sequences and manual composition—hindered widespread adoption. Persistent issues with 7-bit compatibility and the scarcity of supporting hardware and software further stalled progress, underscoring the need for a more robust framework.[12] This paved the way for the shift to double-byte encodings in subsequent standards.Standardization and Name Change
The third edition of KS C 5601, published in 1986, introduced the double-byte Wansung encoding as the primary character set, utilizing precomposed Hangul syllables and ensuring compatibility with ISO 2022 standards, commonly implemented in the EUC-KR form for practical applications.[12] This shift marked a significant advancement in handling Korean text digitally, moving away from earlier variable-length approaches to a more efficient fixed double-byte structure for the 94×94 grid.[12] A subsequent revision in 1987 formalized the Wansung format without modifications to the core character set, solidifying its role as the national standard for Korean information interchange.[12] This edition addressed implementation details while maintaining the 2,350 precomposed Hangul syllables and supporting Hanja characters, enabling broader adoption in computing environments.[12] In August 1997, as part of a comprehensive overhaul of the Korean Industrial Standards numbering scheme, the standard was redesignated from KS C 5601 to KS X 1001, with the updated edition published in 1998 to reflect the new KS X classification for information technology standards.[14] The change aimed to streamline and modernize the categorization of standards, assigning sequential numbers like 1001 for the Hangul and Hanja code set.[14] Following its establishment, KS X 1001 (formerly KS C 5601) emerged as the de facto standard for encoding Korean text across DOS, Windows, and Unix systems, facilitating widespread use in software and data processing until the widespread adoption of Unicode in the late 1990s and early 2000s.[12]Major Revisions
The 1992 revision of KS X 1001, then known as KS C 5601-1992, expanded the standard by incorporating annexes that defined alternative encoding schemes for Hangul syllables to accommodate varying implementation needs in Korean computing environments. Annex 3 formalized the Johab encoding, a direct compositional method that maps the 11,172 possible modern Hangul syllables using five-bit values for leading and vowel jamo followed by a trailing jamo, enabling full coverage beyond the precomposed syllables in the main Wansung set.[12] Annex 4 similarly documented the legacy N-byte Hangul encoding, an earlier variable-length approach for representing Hangul components. These additions addressed practical demands from software like word processors that had adopted Johab-like schemes prior to formal standardization, promoting interoperability without altering the core 94×94 grid structure.[15] In the 1998 revision, KS X 1001 underwent updates to incorporate contemporary symbols and align with the broader KS X series nomenclature, reflecting the Korean Industrial Standards Committee's shift from the KS C prefix. Key additions included the euro sign (U+20AC €) at position 2-A4 and the registered trademark symbol (U+00AE ®) at 2-B0, enhancing support for international financial and legal notations in Korean text processing. These changes ensured compatibility with emerging global standards while maintaining backward compatibility for existing implementations, such as those in EUC-KR.[10][16] The 2002 revision marked the final major update to KS X 1001, introducing the Korean postal code mark (U+327E ㉾) at row 2, column 72, to denote postal services in official documents and envelopes. This addition responded to requests from the Korea Post for a standardized symbol in digital formats, with the character proposed for inclusion in the Unicode Standard following its adoption in the Korean norm.[11] No further substantive changes occurred, as evidenced by the minor 2004 edition that primarily assigned formal names to existing characters without expanding the repertoire, with the 2004 edition last confirmed without changes in 2014.[17][1] By 2025, KS X 1001 has seen no significant revisions since 2002, preserved primarily for legacy system compatibility in South Korean software and data archives. Its mappings, including those from the 2004 edition, continue to be referenced in the Unicode Standard for compatibility ideographs and Hangul syllables, facilitating conversion from older Korean encodings.[18] The standard's active development declined around 2000 with the widespread adoption of Unicode, which subsumed KS X 1001's character repertoire into more universal blocks like Hangul Syllables and CJK Unified Ideographs, reducing the need for proprietary national updates.[19]Technical Specifications
Code Structure and ISO Compatibility
KS X 1001 organizes its characters within a 94×94 grid, yielding 8,836 assignable code points arranged as two-byte sequences comprising a lead byte and a trail byte. The grid positions are logical, with actual byte values depending on the encoding variant; certain grid positions serve as fillers rather than defined characters.[20][21] This structure aligns with ISO/IEC 2022 conventions for multi-byte encodings, facilitating integration into internationalized text streams. In ISO-2022-KR, KS X 1001 occupies the GR (graphics right) plane for double-byte Korean characters, designated via escape sequences such as ESC $ ) C, while the GL (graphics left) plane handles single-byte ASCII; shifts between planes enable mixed-language content without fixed widths. EUC-KR, conversely, applies KS X 1001 in a strictly double-byte format, using lead and trail bytes of 0xA1–0xFE (94 values each) for compatibility with Unix-like systems.[21][20] The mapping from EUC-KR bytes to KS X 1001 code points uses the formula for the table pointer: , which locates the corresponding character in the standard's index (0-based; add 1 for row and column numbers).[20] While predominantly double-byte, the ISO 2022 compatibility of KS X 1001 permits variable-length sequences, including up to 8-byte combinations of jamo elements and the Hangul filler for encoding rare syllables, although such extensions are seldom implemented.[21]Hangul Filler and Composition
In KS X 1001, the Hangul Filler character, located at row 4, column 52 (0xA4D4 in the Wansung encoding), serves as a placeholder to enable the representation of Hangul syllables not included among the standard's 2,350 precomposed forms.[22] This filler initiates and pads multi-component sequences, allowing for the decomposition and recomposition of syllables using individual jamo elements from row 4 of the code table. The composition method employs fixed-length 8-byte sequences (four 2-byte codes) consisting of the Hangul Filler followed by an initial consonant jamo, a vowel jamo, and either a final consonant jamo or another instance of the filler if no final consonant is present.[22] For example, the syllable 믜 (U+B95C) is encoded as 0xA4D4 0xA4B1 0xA4D2 0xA4D4, corresponding to filler + ㅁ (initial) + ㅢ (vowel) + filler. Similarly, the syllable 눾 (U+B23E) uses 0xA4D4 0xA4A4 0xA4CE 0xA4A3 for filler + ㄴ (initial) + ㅞ (vowel) + ㄳ (final). These sequences intersperse the filler to maintain structural consistency, drawing exclusively from the compatibility jamo in row 4. This mechanism addresses a key limitation of KS X 1001, which precomposes only 2,350 of the 11,172 possible modern Hangul syllables, thereby enabling full coverage through decomposition while extending encoded length for non-precomposed forms.[22] However, the approach increases storage requirements compared to single-code precomposed syllables and is primarily a legacy feature for compatibility. In Unicode, the Hangul Filler maps directly to U+3164, but the composition system is non-standard and not natively supported; sequences are typically normalized to precomposed Hangul syllables in the range U+AC00–U+D7A3 during conversion for round-trip compatibility.[22] This normalization ensures that filler-based representations render correctly without preserving the original decomposed structure unless explicitly required for legacy KS X 1001 processing.Encoding Variants
Wansung Encoding
Wansung Encoding, also known as the completion or precomposed form, is the primary variant of KS X 1001 (formerly KS C 5601) that employs a double-byte fixed-length structure for representing common Korean characters. Standardized in the 1987 edition of KS C 5601, it defines a 94×94 grid containing 2,350 precomposed Hangul syllables, 4,888 Hanja characters, and 986 additional symbols and punctuation marks, enabling efficient encoding of standard Korean text. For rare Hangul syllables not included in the precomposed set, the encoding allows variable-length representation through a composition mechanism using a dedicated Hangul filler character followed by individual jamo components, resulting in an 8-byte sequence.[23][24] In its implementation within the EUC-KR coded character set, Wansung uses leading and trailing bytes in the range 0xA1 to 0xFE for multibyte characters, with single-byte ASCII (0x00-0x7F) for compatibility, directly mapping to the KS X 1001 grid without additional offsets. This structure aligns with the ISO 2022 framework, where the 94-character subsets (shifted by adding 0x80 to ISO 2022 codes) facilitate 7-bit and 8-bit data transmission over networks. The Hangul filler, corresponding to Unicode U+3164, initiates the composition sequence for uncommon syllables by preceding the leading consonant, vowel, and optional trailing consonant jamo, though this feature is primarily for legacy compatibility and rarely utilized in practice.[24][23] The advantages of Wansung Encoding lie in its simplicity and efficiency for prevalent Korean content, where most text requires only two bytes per character, and its adherence to ISO 2022 ensures seamless integration in international environments. It became the de facto standard for Korean text in Unix-like operating systems via EUC-KR and was adopted in early Microsoft Windows platforms through Code Page 949 (CP949), an extension that adds support for more Hangul syllables while preserving the core Wansung mapping. However, its drawbacks include restriction to precomposed forms, limiting coverage to frequently used syllables and necessitating the cumbersome 8-byte filler sequences for others, which many software implementations do not fully support, leading to rendering issues or fallback to separate jamo display.[24][23]Johab Encoding
Johab encoding, also known as 조합 (johap, meaning "combining"), is a supplementary fixed double-byte character encoding defined in Annex 3 of the 1992 revision of KS X 1001.[25] It systematically assigns 16-bit code points to all 11,172 possible modern Hangul syllables by decomposing them into their constituent jamo elements: 19 initial consonants (indices 0–18), 21 vowels (indices 0–20), and 28 final consonants or none (indices 0–27).[4] This approach enables complete coverage of modern Hangul without relying on precomposed characters or filler bytes, unlike the primary Wansung encoding in KS X 1001.[26] The structure of Johab is non-compliant with ISO 2022 standards, employing a fixed double-byte format where Hangul syllables occupy the code range from 0x8440 to 0xF9FC. The code point for a syllable is computed using the formula:code = 0x8440 + (initial × 588 + vowel × 28 + final)
code = 0x8440 + (initial × 588 + vowel × 28 + final)
