Katakana
View on Wikipedia
| Katakana 片仮名 カタカナ | |
|---|---|
| Script type | |
Period | c. 800 – present |
| Direction | Vertical right-to-left, left-to-right |
| Languages | Japanese, Ryukyuan, Hachijō, Ainu[1] Taiwanese Hokkien, Hakka, Palauan (formerly) |
| Related scripts | |
Parent systems | |
Sister systems | Hiragana |
| ISO 15924 | |
| ISO 15924 | Kana (411), Katakana |
| Unicode | |
Unicode alias | Katakana |
| |
| Japanese writing |
|---|
| Components |
| Uses |
| Transliteration |
Katakana (片仮名、カタカナ; IPA: [katakaꜜna, kataꜜkana]) is a Japanese syllabary, one component of the Japanese writing system along with hiragana,[2] kanji and in some cases the Latin script (known as rōmaji).
The word katakana means "fragmentary kana", as the katakana characters are derived from components or fragments of more complex kanji. Katakana and hiragana are both kana systems. With one or two minor exceptions, each syllable (strictly mora) in the Japanese language is represented by one character or kana in each system. Each kana represents either a vowel such as "a" (katakana ア); a consonant followed by a vowel such as "ka" (katakana カ); or "n" (katakana ン), a nasal sonorant which, depending on the context, sounds like English m, n or ng ([ŋ]) or like the nasal vowels of Portuguese or Galician.[3]
In contrast to the hiragana syllabary, which is used for Japanese words not covered by kanji and for grammatical inflections, the katakana syllabary usage is comparable to italics in English; specifically, it is used for transcription of foreign-language words into Japanese and the writing of loan words (collectively gairaigo); for emphasis; to represent onomatopoeia; for technical and scientific terms; and for names of plants, animals, minerals and often Japanese companies.
Katakana evolved from Japanese Buddhist monks transliterating Chinese texts into Japanese.[4]
Writing system
[edit]Overview
[edit]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The complete katakana script consists of 48 characters, not counting functional and diacritic marks:
- 5 nucleus vowels
- 42 core or body (onset-nucleus) syllabograms, consisting of nine consonants in combination with each of the five vowels, of which three possible combinations (yi, ye, wu) are not canonical
- 1 coda consonant
These are conceived as a 5×10 grid (gojūon, 五十音, literally "fifty sounds"), as shown in the adjacent table, read ア (a), イ (i), ウ (u), エ (e), オ (o), カ (ka), キ (ki), ク (ku), ケ (ke), コ (ko) and so on. The gojūon inherits its vowel and consonant order from Sanskrit practice. In vertical text contexts, which used to be the default case, the grid is usually presented as 10 columns by 5 rows, with vowels on the right hand side and ア (a) on top. Katakana glyphs in the same row or column do not share common graphic characteristics. Three of the syllabograms to be expected, yi, ye and wu, may have been used idiosyncratically with varying glyphs, but never became conventional in any language and are not present at all in modern Japanese.
The 50-sound table is often amended with an extra character, the nasal ン (n). This can appear in several positions, most often next to the N signs or, because it developed from one of many mu hentaigana, below the u column. It may also be appended to the vowel row or the a column. Here, it is shown in a table of its own.
The script includes two diacritic marks placed at the upper right of the base character that change the initial sound of a syllabogram. A double dot, called dakuten, indicates a primary alteration; most often it voices the consonant: k→g, s→z, t→d and h→b; for example, カ (ka) becomes ガ (ga). Secondary alteration, where possible, is shown by a circular handakuten: h→p; For example; ハ (ha) becomes パ (pa). Diacritics, though used for over a thousand years, only became mandatory in the Japanese writing system in the second half of the 20th century. Their application is strictly limited in proper writing systems,[clarification needed] but may be more extensive in academic transcriptions.
Furthermore, some characters have different readings when printed at a smaller size after a full-sized kana or kanji (see below), but this is not a case of being bicameral.
The layout of the gojūon table promotes a systematic view of kana syllabograms as being always pronounced with the same single consonant followed by a vowel, but this is not exactly the case (and never has been). Existing schemes for the romanization of Japanese either are based on the systematic nature of the script, e.g. nihon-shiki チ ti, or they apply some Western graphotactics, usually the English one, to the common Japanese pronunciation of the kana signs, e.g. Hepburn-shiki チ chi. Both approaches conceal the fact, though, that many consonant-based katakana signs, especially those canonically ending in u, can be used in coda position, too, where the vowel is unvoiced and therefore barely perceptible.
Japanese
[edit]Syllabary and orthography
[edit]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Unused, extinct, or obsolete | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Of the 48 katakana syllabograms described above, only 46 are used in modern Japanese, and one of these is preserved for only a single use:
- wi and we are pronounced as vowels in modern Japanese and are therefore obsolete, having been supplanted by i and e, respectively.
- wo is now used only as a particle, and is normally pronounced the same as vowel オ o. As a particle, it is usually written in hiragana (を) and the katakana form, ヲ, is almost obsolete.
A small version of the katakana for ya, yu or yo (ャ, ュ or ョ, respectively) may be added to katakana ending in i. This changes the i vowel sound to a glide (palatalization) to a, u or o, e.g. キャ (ki + ya) /kja/. Addition of the small y kana is called yōon.
A character called a sokuon, which is visually identical to a small tsu ッ, indicates that the following consonant is geminated (doubled). This is represented in rōmaji by doubling the consonant that follows the sokuon. In Japanese this is an important distinction in pronunciation; for example, compare サカ saka "hill" with サッカ sakka "author". Geminated consonants are common in transliterations of foreign loanwords; for example, English "bed" is represented as ベッド (beddo). The sokuon also sometimes appears at the end of utterances, where it denotes a glottal stop. However, it cannot be used to double the na, ni, nu, ne, no syllables' consonants; to double these, the singular n (ン) is added in front of the syllable. The sokuon may also be used to approximate a non-native sound: Bach is written バッハ (Bahha); Mach as マッハ (Mahha).
Both katakana and hiragana usually spell native long vowels with the addition of a second vowel kana. However, in foreign loanwords, katakana instead uses a vowel extender mark, called a chōonpu ("long vowel mark"). This is a short line (ー) following the direction of the text, horizontal for yokogaki (horizontal text), and vertical for tategaki (vertical text). For example, メール mēru is the gairaigo for e-mail taken from the English word "mail"; the ー lengthens the e. There are some exceptions, such as ローソク (rōsoku (蝋燭, "candle")) or ケータイ(kētai (携帯, "mobile phone")), where Japanese words written in katakana use the elongation mark, too.
Standard and voiced iteration marks are written in katakana as ヽ and ヾ, respectively.
Extensions
[edit]Small versions of the five vowel kana are sometimes used to represent trailing off sounds (ハァ haa, ネェ nee). More often, they are used in katakana as Yōon-like extended digraphs, for transcribing into Japanese a syllable (or mora) that cannot be written with standard katakana. This is especially common for transcribing loanwords such as チェ (che) in チェンジ chenji ("change"), ファ (fa) in ファミリー famirī ("family") and ウィ (wi) and ディ (di) in ウィキペディア Wikipedia; see below for the full list.
Usage
[edit]
In modern Japanese, katakana is most often used for transcription of words from foreign languages or loanwords (other than words historically imported from Chinese), called gairaigo.[6] For example, "ice cream" is written アイスクリーム (aisukurīmu). Similarly, katakana is usually used for country names, foreign places, and foreign personal names. For example, the United States is usually referred to as アメリカ (Amerika), rather than in its ateji kanji spelling of 亜米利加 (Amerika).
Katakana are also used for onomatopoeia,[6] words used to represent sounds – for example, ピンポン (pinpon), the "ding-dong" sound of a doorbell.
Technical and scientific terms, such as the names of animal and plant species and minerals, are also commonly written in katakana.[7] Homo sapiens, as a species, is written ヒト (hito), rather than its kanji 人.
Katakana are often (but not always) used for transcription of Japanese company names. For example, Suzuki is written スズキ, and Toyota is written トヨタ. As these are common family names, Suzuki being the second most common in Japan,[8] using katakana helps distinguish company names from surnames in writing. Katakana are commonly used on signs, advertisements, and hoardings (i.e., billboards), for example, ココ (koko, "here"), ゴミ (gomi, "trash"), or メガネ (megane, "glasses"). Words the writer wishes to emphasize in a sentence are also sometimes written in katakana, mirroring the usage of italics in European languages.[6]
Pre–World War II official documents mix katakana and kanji in the same way that hiragana and kanji are mixed in modern Japanese texts, that is, katakana were used for okurigana and particles such as wa or o.
Katakana was also used for telegrams in Japan before 1988, and for computer systems – before the introduction of multibyte characters – in the 1980s. Most computers of that era used katakana instead of kanji or hiragana for output.
Although words borrowed from ancient Chinese are usually written in kanji, loanwords from modern Chinese varieties that are borrowed directly use katakana instead.
| Japanese | Hepburn | Meaning | Chinese | Pinyin/Yale | Source language |
|---|---|---|---|---|---|
| マージャン | mājan | mahjong | 麻將 | májiàng | Mandarin |
| ウーロン茶 | ūroncha | Oolong tea | 烏龍茶 | wūlóngchá | |
| チャーハン | chāhan | fried rice | 炒飯 | chǎofàn | |
| チャーシュー | chāshū | barbecued pork | 叉燒 | chā sīu | Cantonese |
| シューマイ | shūmai | shumai | 燒賣 | sīu máai |
The very common Chinese loanword rāmen, written in katakana as ラーメン, is rarely written with its kanji (拉麺).
There are rare instances where the opposite has occurred, with kanji forms created from words originally written in katakana. An example of this is コーヒー kōhī, ("coffee"), which can alternatively be written as 珈琲. This kanji usage is occasionally employed by coffee manufacturers or coffee shops for novelty.
Katakana is used to indicate the on'yomi (Chinese-derived readings) of a kanji in a kanji dictionary. For instance, the kanji 人 has a Japanese pronunciation, written in hiragana as ひと hito (person), as well as a Chinese derived pronunciation, written in katakana as ジン jin (used to denote groups of people). Katakana is sometimes used instead of hiragana as furigana to give the pronunciation of a word written in Roman characters, or for a foreign word, which is written as kanji for the meaning, but intended to be pronounced as the original.

Katakana are also sometimes used to indicate words being spoken in a foreign or otherwise unusual accent. For example, in a manga, the speech of a foreign character or a robot may be represented by コンニチワ konnichiwa ("hello") instead of the more typical hiragana こんにちは. Some Japanese personal names are written in katakana. This was more common in the past, hence elderly women often have katakana names. This was particularly common among women in the Meiji and Taishō periods, when many poor, illiterate parents were unwilling to pay a scholar to give their daughters names in kanji.[9] Katakana is also used to denote the fact that a character is speaking a foreign language, and what is displayed in katakana is only the Japanese "translation" of their words.
Some frequently used words may also be written in katakana in dialogs to convey an informal, conversational tone. Some examples include マンガ ("manga"), アイツ aitsu ("that guy or girl; he/him; she/her"), バカ baka ("fool"), etc.
Words with difficult-to-read kanji are sometimes written in katakana (hiragana is also used for this purpose). This phenomenon is often seen with medical terminology. For example, in the word 皮膚科 hifuka ("dermatology"), the second kanji, 膚, is considered difficult to read, and thus the word hifuka is commonly written 皮フ科 or ヒフ科, mixing kanji and katakana. Similarly, difficult-to-read kanji such as 癌 gan ("cancer") are often written in katakana or hiragana.
Katakana is also used for traditional musical notations, as in the Tozan-ryū of shakuhachi, and in sankyoku ensembles with koto, shamisen and shakuhachi.
Some instructors teaching Japanese as a foreign language "introduce katakana after the students have learned to read and write sentences in hiragana without difficulty and know the rules."[10] Most students who have learned hiragana "do not have great difficulty in memorizing" katakana as well.[11] Other instructors introduce katakana first, because these are used with loanwords. This gives students a chance to practice reading and writing kana with meaningful words. This was the approach taken by the influential American linguistics scholar Eleanor Harz Jorden in Japanese: The Written Language (parallel to Japanese: The Spoken Language).[12]

Ainu
[edit]Katakana is commonly used by Japanese linguists to write the Ainu language. In Ainu katakana usage, the consonant that comes at the end of a syllable is represented by a small version of a katakana that corresponds to that final consonant followed by a vowel (for details of which vowel, please see the table at Ainu language § Special katakana for the Ainu language). For instance, the Ainu word up is represented by ウㇷ゚ (ウプ [u followed by small pu]). Ainu also uses three handakuten modified katakana: セ゚ ([tse]) and either ツ゚ or ト゚ ([tu̜]). In Unicode, the Katakana Phonetic Extensions block (U+31F0–U+31FF) exists for Ainu language support. These characters are used for the Ainu language only.
Taiwanese
[edit]Taiwanese kana (タイ
ヲァヌ
ギイ
カア
ビェン
) is a katakana-based writing system once used to write Taiwanese Hokkien while Taiwan was under Japanese rule. It functioned as a phonetic guide for Chinese characters, much like furigana in Japanese or pinyin in Mandarin. There were similar systems for other languages in Taiwan as well, including Hakka and Formosan languages.
Unlike Japanese or Ainu, Taiwanese kana uses a methodology similar to bopomofo, with kana serving as initials, vowel medials and consonant finals, marked with tonal marks. A dot below the initial kana represents aspirated consonants, and チ, ツ, サ, セ, ソ, ウ and オ with a superpositional bar represent sounds found only in Taiwanese.
Okinawan
[edit]Katakana is used as a phonetic guide for the Okinawan language, unlike the various other systems to represent Okinawan, which use hiragana with extensions. The system was devised by the Okinawa Center of Language Study of the University of the Ryukyus. It uses many extensions and yōon to show the many non-Japanese sounds of Okinawan.
Table of katakana
[edit]This is a table of katakana together with their Hepburn romanization and rough IPA transcription for their use in Japanese. Katakana with dakuten or handakuten follow the gojūon kana without them.
Characters shi シ, tsu ツ, so ソ, and n ン look very similar in print except for the slant and stroke shape. These differences in slant and shape are more prominent when written with an ink brush.
| Monographs (gojūon) | Digraphs (yōon) | |||||||
|---|---|---|---|---|---|---|---|---|
| a | i | u | e | o | ya | yu | yo | |
| ∅ | ア a [a] |
イ i [i] |
ウ u [ɯ] |
エ e [e][n 1] |
オ o [o] |
|||
| K | カ ka [ka] |
キ ki [ki] |
ク ku [kɯ] |
ケ ke [ke] |
コ ko [ko] |
キャ kya [kʲa] |
キュ kyu [kʲɯ] |
キョ kyo [kʲo] |
| S | サ sa [sa] |
シ shi [ɕi] |
ス su [sɯ] |
セ se [se] |
ソ so [so] |
シャ sha [ɕa] |
シュ shu [ɕɯ] |
ショ sho [ɕo] |
| T | タ ta [ta] |
チ chi [t͡ɕi] |
ツ tsu [t͡sɯ] |
テ te [te] |
ト to [to] |
チャ cha [t͡ɕa] |
チュ chu [t͡ɕɯ] |
チョ cho [t͡ɕo] |
| N | ナ na [na] |
ニ ni [ɲi] |
ヌ nu [nɯ] |
ネ ne [ne] |
ノ no [no] |
ニャ nya [ɲa] |
ニュ nyu [ɲɯ] |
ニョ nyo [ɲo] |
| H | ハ ha [ha] |
ヒ hi [çi] |
フ fu [ɸɯ] |
ヘ he [he] |
ホ ho [ho] |
ヒャ hya [ça] |
ヒュ hyu [çɯ] |
ヒョ hyo [ço] |
| M | マ ma [ma] |
ミ mi [mi] |
ム mu [mɯ] |
メ me [me] |
モ mo [mo] |
ミャ mya [mʲa] |
ミュ myu [mʲɯ] |
ミョ myo [mʲo] |
| Y | ヤ ya [ja] |
𛄠 yi ([i])[n 2] |
ユ yu [jɯ] |
エ ye [je] / [e][n 3] |
ヨ yo [jo] |
|||
| R | ラ ra [ɾa] |
リ ri [ɾi] |
ル ru [ɾɯ] |
レ re [ɾe] |
ロ ro [ɾo] |
リャ rya [ɾʲa] |
リュ ryu [ɾʲɯ] |
リョ ryo [ɾʲo] |
| W | ワ wa [wa] |
ヰ wi [wi] / [i][n 4] |
𛄢 wu ([ɯ])[n 2] |
ヱ we [we] / [e][n 4] |
ヲ wo [wo] / [o][n 4] |
|||
| Monographs with diacritics: gojūon with (han)dakuten | Digraphs with diacritics: yōon with (han)dakuten | |||||||
| a | i | u | e | o | ya | yu | yo | |
| G | ガ ga [ɡa] |
ギ gi [ɡi] |
グ gu [ɡɯ] |
ゲ ge [ɡe] |
ゴ go [ɡo] |
ギャ gya [ɡʲa] |
ギュ gyu [ɡʲɯ] |
ギョ gyo [ɡʲo] |
| Z | ザ za [(d)za] |
ジ ji [(d)ʑi] |
ズ zu [(d)zɯ] |
ゼ ze [(d)ze] |
ゾ zo [(d)zo] |
ジャ ja [(d)ʑa] |
ジュ ju [(d)ʑɯ] |
ジョ jo [(d)ʑo] |
| D | ダ da [da] |
ヂ ji [(d)ʑi][n 5] |
ヅ zu [(d)zɯ][n 5] |
デ de [de] |
ド do [do] |
ヂャ ja [(d)ʑa][n 5] |
ヂュ ju [(d)ʑɯ][n 5] |
ヂョ jo [(d)ʑo][n 5] |
| B | バ ba [ba] |
ビ bi [bi] |
ブ bu [bɯ] |
ベ be [be] |
ボ bo [bo] |
ビャ bya [bʲa] |
ビュ byu [bʲɯ] |
ビョ byo [bʲo] |
| P | パ pa [pa] |
ピ pi [pi] |
プ pu [pɯ] |
ペ pe [pe] |
ポ po [po] |
ピャ pya [pʲa] |
ピュ pyu [pʲɯ] |
ピョ pyo [pʲo] |
| Final nasal monograph | Polysyllabic monographs | |||||||
| n | iu | koto | shite | toki | tomo | nari | ||
| * | ン n [m n ɲ ŋ ɴ ɰ̃] |
iu [jɯː] |
ヿ koto [koto] |
shite [ɕite] |
toki [toki] |
tomo [tomo] |
nari [naɾi] | |
| * | domo [domo] |
|||||||
| Functional graphemes | ||||||||
| sokuonfu | chōonpu | odoriji (monosyllable) | odoriji (polysyllable) | |||||
| * | ッ (indicates a geminate consonant) |
ー (indicates a long vowel) |
ヽ (reduplicates and unvoices syllable) |
〱 (reduplicates and unvoices syllable) | ||||
| * | ヾ (reduplicates and voices syllable) |
〱゙ (reduplicates and voices syllable) | ||||||
| * | ヽ゚ (reduplicates and voices syllable) |
〱゚ (reduplicates and voices syllable) | ||||||
Notes
- ^ Prior to the e/ye merger in the mid-Heian period, a different character (𛀀) was used in position e.
- ^ a b Theoretical combinations yi and wu are unused . Some katakana were invented for them by linguists in the Edo and Meiji periods in order to fill out the table, but they were never actually used in normal writing.
- ^ The combination ye existed prior to the mid-Heian period and was represented in very early katakana, but has been extinct for over a thousand years, having merged with e in the 10th century. The ye katakana (エ) was adopted for e (displacing 𛀀, the character originally used for e); the alternate katakana 𛄡 was invented for ye in the Meiji period for use in representations of Old and Early Classical Japanese so as to avoid confusion with the modern use of エ for e.
- ^ a b c The characters in positions wi and we are obsolete in modern Japanese, and have been replaced by イ (i) and エ (e). The character wo, in practice normally pronounced o, is preserved in only one use: as a particle. This is normally written in hiragana (を), so katakana ヲ sees only limited use. See Gojūon and the articles on each character for details.
- ^ a b c d e The ヂ (di) and ヅ (du) kana (often romanised as ji and zu) are primarily used for etymological spelling , when the unvoiced equivalents チ (ti) and ツ (tu) (usually romanised as chi and tsu) undergo a sound change (rendaku) and become voiced when they occur in the middle of a compound word. In other cases, the identically-pronounced ジ (ji) and ズ (zu) are used instead. ヂ (di) and ヅ (du) can never begin a word, and they are not common in katakana, since the concept of rendaku does not apply to transcribed foreign words, one of the major uses of katakana.
Extended katakana
[edit]Using small versions of the five vowel kana, many digraphs have been devised, mainly to represent the sounds in words of other languages.
Digraphs with orange backgrounds are the general ones used for loanwords or foreign places or names, and those with blue backgrounds are used for more accurate transliterations of foreign sounds, both suggested by the Cabinet of Japan's Ministry of Education, Culture, Sports, Science and Technology.[13] Katakana combinations with beige backgrounds are suggested by the American National Standards Institute[14] and the British Standards Institution as possible uses.[15] Ones with purple backgrounds appear on the 1974 version of the Hyōjun-shiki formatting.[16]
Pronunciations are shown in Hepburn romanization.
| -a | -ya | -i | -u | -yu | -e | -o | -yo | ||
| i- | イィ yi | イェ ye | |||||||
| u- | ウァ wa[a] | ウャ wya | ウィ wi | ウゥ wu[a] | ウュ wyu | ウェ we | ウォ wo | ウョ wyo | |
| ウィェ wye | |||||||||
| ヴ vu[b] | ヴァ va | ヴャ vya | ヴィ vi | - | ヴュ vyu | ヴェ ve | ヴォ vo | ヴョ vyo[c] | |
| ヴィェ vye | |||||||||
| ウ゚ n[d] | |||||||||
| ki- | キェ kye[c] | ||||||||
| gi- | ギェ gye | ||||||||
| ku- | クァ kwa | クィ kwi[e] | クェ kwe | クォ kwo | |||||
| クヮ kwa | |||||||||
| gu- | グァ gwa | グィ gwi[e][c] | グェ gwe[c] | グォ gwo[c] | |||||
| グヮ gwa | |||||||||
| k- | カ゚ nga[f] | キ゚ ngi[f] | ク゚ ngu[f] | ケ゚ nge[f] | コ゚ ngo[f] | ||||
| shi- | シェ she | ||||||||
| chi- | チェ che | ||||||||
| ji- | ジェ je | ||||||||
| dji- | ヂェ je | ||||||||
| su- | スィ si[c] | ||||||||
| tsu- | ツァ tsa | ツャ tsya | ツィ tsi | ツュ tsyu | ツェ tse | ツォ tso | ツョ tsyo | ||
| ツィェ tsye | |||||||||
| zu- | ズィ zi[c] | ||||||||
| dzu- | ヅァ za | ヅャ zya | ヅィ zi | ヅュ zyu | ヅェ ze | ヅォ zo | ヅョ zyo | ||
| ヅィェ zye | |||||||||
| te- | ティ ti | テュ tyu | |||||||
| to- | トィ twi[e] | トゥ tu | |||||||
| de- | ディ di | デュ dyu | |||||||
| do- | ドィ dwi[e] | ドゥ du | |||||||
| ni-
nu- |
ヌィ nwi[e] | ニェ nye[c] | |||||||
| hi- | ヒェ hye[c] | ||||||||
| ビェ bye | |||||||||
| ピェ pye | |||||||||
| fu- | ファ fa | フャ fya | フィ fi | フュ fyu | フェ fe | フォ fo | フョ fyo[c] | ||
| フィェ fye | |||||||||
| ブィ bwi[e] | |||||||||
| プィ pwi[e] | |||||||||
| ホゥ hu | |||||||||
| mi-
mu- |
ムィ mwi[e] | ミェ mye | |||||||
| ri-
ru- |
ルィ rwi[e] | リェ rye | |||||||
| r- | ラ゚ la[g] | リ゚ li[g] | ル゚ lu[g] | レ゚ le[g] | ロ゚ lo[g] | ||||
| ri- | リ゚ャ lya[g] | リ゚ュ lyu[g] | リ゚ェ lye[g] | リ゚ョ lyo[g] | |||||
| w- | ヷ va[h] | ヸ vi[h] | ヹ ve[h] | ヺ vo[h] | |||||
- ^ a b The use of ウ in these two cases to represent w is rare in modern Japanese except for Internet slang and customary transcription of the Latin sound [w] into katakana. E.g.: ミネルウァ (Mineruwa "Minerva", from Latin MINERVA [mɪˈnɛrwa]); ウゥルカーヌス (Wurukānusu "Vulcan", from Latin VVLCANVS, Vulcānus [wʊlˈkaːnʊs]). The wa-type of foreign sounds (as in watt or white) is usually transcribed to ワ (wa), while the wu-type (as in wood or woman) is usually to ウ (u) or ウー (ū).
- ^ ヴ has a rarely-used hiragana form in ゔ that is also vu in Hepburn romanization systems.
- ^ a b c d e f g h i j Mentioned in Note 1 「留意事項その1」 of 1991 Cabinet Order No. 2 as examples of other "usage unregulated" by the guidelines.
- ^ Prescribed by 「文部省制定 發音符號」, published by the Japanese Ministry of Education in 1944, as a way to transcribe initial ウ (u) when pronounced nasally. For example, ウ゚マイ (nmai) as a transcription of うまい (umai). Usually, ン (n) is used for this purpose instead, as in ンマイ (nmai).
- ^ a b c d e f g h i Employed in the transcription of hard consonants + i in Slavic languages.
- ^ a b c d e Sometimes used in Japanese linguistics to transcribe the bidakuon [ŋ] initial.
- ^ a b c d e f g h i Sometimes used in Japanese linguistics to mark an explicit [l] sound.
- ^ a b c d Obsolete in modern Japanese and very rarely used.[17][18]
History
[edit]

Katakana was developed in the 9th century (during the early Heian period) by Buddhist monks in Nara in order to transliterate texts and works of arts from India, by taking parts of man'yōgana characters as a form of shorthand, hence this kana is so-called kata (片; "partial, fragmented"). For example, ka (カ) comes from the left side of ka (加; lit. "increase", but the original meaning is no longer applicable to kana). The adjacent table shows the origins of each katakana: the red markings of the original Chinese character (used as man'yōgana) eventually became each corresponding symbol.[19]
Official documents of the Empire of Japan were written exclusively with kyūjitai and katakana.
Obsolete kana
[edit]Variant forms
[edit]Katakana have variant forms. For example,
(ネ) and
(ヰ).[20] However, katakana's variant forms are fewer than hiragana's. Katakana's choices of man'yōgana segments had stabilized early on and established – with few exceptions – an unambiguous phonemic orthography (one symbol per sound) long before the 1900 script regularization.[21]
Polysyllabic kana
[edit]Yi, Ye and Wu
[edit]Stroke order
[edit]The following table shows the method for writing each katakana character. It is arranged in a traditional manner, where characters are organized by the sounds that make them up. The numbers and arrows indicate the stroke order and direction, respectively.

Computer encoding
[edit]In addition to fonts intended for Japanese text and Unicode catch-all fonts (like Arial Unicode MS), many fonts intended for Chinese (such as MS Song) and Korean (such as Batang) also include katakana.
Hiragana and katakana
[edit]In addition to the usual full-width (全角, zenkaku) display forms of characters, katakana has a second form, half-width (半角, hankaku). The half-width forms were originally associated with the JIS X 0201 encoding. Although their display form is not specified in the standard, in practice they were designed to fit into the same rectangle of pixels as Roman letters to enable easy implementation on the computer equipment of the day. This space is narrower than the square space traditionally occupied by Japanese characters, hence the name "half-width". In this scheme, diacritics (dakuten and handakuten) are separate characters. When originally devised, the half-width katakana were represented by a single byte each, as in JIS X 0201, again in line with the capabilities of contemporary computer technology.
In the late 1970s, two-byte character sets such as JIS X 0208 were introduced to support the full range of Japanese characters, including katakana, hiragana and kanji. Their display forms were designed to fit into an approximately square array of pixels, hence the name "full-width". For backward compatibility, separate support for half-width katakana has continued to be available in modern multi-byte encoding schemes such as Unicode, by having two separate blocks of characters – one displayed as usual (full-width) katakana, the other displayed as half-width katakana.
Although often said to be obsolete, the half-width katakana are still used in many systems and encodings. For example, the titles of mini discs can only be entered in ASCII or half-width katakana, and half-width katakana are commonly used in computerized cash register displays, on shop receipts, and Japanese digital television and DVD subtitles. Several popular Japanese encodings such as EUC-JP, Unicode and Shift JIS have half-width katakana code as well as full-width. By contrast, ISO-2022-JP has no half-width katakana, and is mainly used over SMTP and NNTP.
Unicode
[edit]Katakana was added to the Unicode Standard in October, 1991 with the release of version 1.0.
The Unicode block for (full-width) katakana is U+30A0–U+30FF.
Encoded in this block along with the katakana are the nakaguro word-separation middle dot, the chōon vowel extender, the katakana iteration marks, and a ligature of コト sometimes used in vertical writing.
| Katakana[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+30Ax | ゠ | ァ | ア | ィ | イ | ゥ | ウ | ェ | エ | ォ | オ | カ | ガ | キ | ギ | ク |
| U+30Bx | グ | ケ | ゲ | コ | ゴ | サ | ザ | シ | ジ | ス | ズ | セ | ゼ | ソ | ゾ | タ |
| U+30Cx | ダ | チ | ヂ | ッ | ツ | ヅ | テ | デ | ト | ド | ナ | ニ | ヌ | ネ | ノ | ハ |
| U+30Dx | バ | パ | ヒ | ビ | ピ | フ | ブ | プ | ヘ | ベ | ペ | ホ | ボ | ポ | マ | ミ |
| U+30Ex | ム | メ | モ | ャ | ヤ | ュ | ユ | ョ | ヨ | ラ | リ | ル | レ | ロ | ヮ | ワ |
| U+30Fx | ヰ | ヱ | ヲ | ン | ヴ | ヵ | ヶ | ヷ | ヸ | ヹ | ヺ | ・ | ー | ヽ | ヾ | ヿ |
Notes
| ||||||||||||||||
Half-width equivalents to the usual full-width katakana also exist in Unicode. These are encoded within the Halfwidth and Fullwidth Forms block (U+FF00–U+FFEF) (which also includes full-width forms of Latin characters, for instance), starting at U+FF65 and ending at U+FF9F (characters U+FF61–U+FF64 are half-width punctuation marks). This block also includes the half-width dakuten and handakuten. The full-width versions of these characters are found in the Hiragana block.
| Katakana subset of Halfwidth and Fullwidth Forms[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| ... | (U+FF00–U+FF64 omitted) | |||||||||||||||
| U+FF6x | ・ | ヲ | ァ | ィ | ゥ | ェ | ォ | ャ | ュ | ョ | ッ | |||||
| U+FF7x | ー | ア | イ | ウ | エ | オ | カ | キ | ク | ケ | コ | サ | シ | ス | セ | ソ |
| U+FF8x | タ | チ | ツ | テ | ト | ナ | ニ | ヌ | ネ | ノ | ハ | ヒ | フ | ヘ | ホ | マ |
| U+FF9x | ミ | ム | メ | モ | ヤ | ユ | ヨ | ラ | リ | ル | レ | ロ | ワ | ン | ゙ | ゚ |
| ... | (U+FFA0–U+FFEF omitted) | |||||||||||||||
Notes
| ||||||||||||||||
Circled katakana are code points U+32D0–U+32FE in the Enclosed CJK Letters and Months block (U+3200–U+32FF). A circled ン (n) is not included.
| Katakana subset of Enclosed CJK Letters and Months[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| ... | (U+3200–U+32CF omitted) | |||||||||||||||
| U+32Dx | ㋐ | ㋑ | ㋒ | ㋓ | ㋔ | ㋕ | ㋖ | ㋗ | ㋘ | ㋙ | ㋚ | ㋛ | ㋜ | ㋝ | ㋞ | ㋟ |
| U+32Ex | ㋠ | ㋡ | ㋢ | ㋣ | ㋤ | ㋥ | ㋦ | ㋧ | ㋨ | ㋩ | ㋪ | ㋫ | ㋬ | ㋭ | ㋮ | ㋯ |
| U+32Fx | ㋰ | ㋱ | ㋲ | ㋳ | ㋴ | ㋵ | ㋶ | ㋷ | ㋸ | ㋹ | ㋺ | ㋻ | ㋼ | ㋽ | ㋾ | ㋿ |
Notes
| ||||||||||||||||
Extensions to Katakana for phonetic transcription of Ainu and other languages were added to the Unicode standard in March 2002 with the release of version 3.2.
The Unicode block for Katakana Phonetic Extensions is U+31F0–U+31FF:
| Katakana Phonetic Extensions[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+31Fx | ㇰ | ㇱ | ㇲ | ㇳ | ㇴ | ㇵ | ㇶ | ㇷ | ㇸ | ㇹ | ㇺ | ㇻ | ㇼ | ㇽ | ㇾ | ㇿ |
Notes
| ||||||||||||||||
Historic and variant forms of Japanese kana characters were added to the Unicode standard in October 2010 with the release of version 6.0.
The Unicode block for Kana Supplement is U+1B000–U+1B0FF:
| Kana Supplement[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+1B00x | 𛀀 | 𛀁 | 𛀂 | 𛀃 | 𛀄 | 𛀅 | 𛀆 | 𛀇 | 𛀈 | 𛀉 | 𛀊 | 𛀋 | 𛀌 | 𛀍 | 𛀎 | 𛀏 |
| U+1B01x | 𛀐 | 𛀑 | 𛀒 | 𛀓 | 𛀔 | 𛀕 | 𛀖 | 𛀗 | 𛀘 | 𛀙 | 𛀚 | 𛀛 | 𛀜 | 𛀝 | 𛀞 | 𛀟 |
| U+1B02x | 𛀠 | 𛀡 | 𛀢 | 𛀣 | 𛀤 | 𛀥 | 𛀦 | 𛀧 | 𛀨 | 𛀩 | 𛀪 | 𛀫 | 𛀬 | 𛀭 | 𛀮 | 𛀯 |
| U+1B03x | 𛀰 | 𛀱 | 𛀲 | 𛀳 | 𛀴 | 𛀵 | 𛀶 | 𛀷 | 𛀸 | 𛀹 | 𛀺 | 𛀻 | 𛀼 | 𛀽 | 𛀾 | 𛀿 |
| U+1B04x | 𛁀 | 𛁁 | 𛁂 | 𛁃 | 𛁄 | 𛁅 | 𛁆 | 𛁇 | 𛁈 | 𛁉 | 𛁊 | 𛁋 | 𛁌 | 𛁍 | 𛁎 | 𛁏 |
| U+1B05x | 𛁐 | 𛁑 | 𛁒 | 𛁓 | 𛁔 | 𛁕 | 𛁖 | 𛁗 | 𛁘 | 𛁙 | 𛁚 | 𛁛 | 𛁜 | 𛁝 | 𛁞 | 𛁟 |
| U+1B06x | 𛁠 | 𛁡 | 𛁢 | 𛁣 | 𛁤 | 𛁥 | 𛁦 | 𛁧 | 𛁨 | 𛁩 | 𛁪 | 𛁫 | 𛁬 | 𛁭 | 𛁮 | 𛁯 |
| U+1B07x | 𛁰 | 𛁱 | 𛁲 | 𛁳 | 𛁴 | 𛁵 | 𛁶 | 𛁷 | 𛁸 | 𛁹 | 𛁺 | 𛁻 | 𛁼 | 𛁽 | 𛁾 | 𛁿 |
| U+1B08x | 𛂀 | 𛂁 | 𛂂 | 𛂃 | 𛂄 | 𛂅 | 𛂆 | 𛂇 | 𛂈 | 𛂉 | 𛂊 | 𛂋 | 𛂌 | 𛂍 | 𛂎 | 𛂏 |
| U+1B09x | 𛂐 | 𛂑 | 𛂒 | 𛂓 | 𛂔 | 𛂕 | 𛂖 | 𛂗 | 𛂘 | 𛂙 | 𛂚 | 𛂛 | 𛂜 | 𛂝 | 𛂞 | 𛂟 |
| U+1B0Ax | 𛂠 | 𛂡 | 𛂢 | 𛂣 | 𛂤 | 𛂥 | 𛂦 | 𛂧 | 𛂨 | 𛂩 | 𛂪 | 𛂫 | 𛂬 | 𛂭 | 𛂮 | 𛂯 |
| U+1B0Bx | 𛂰 | 𛂱 | 𛂲 | 𛂳 | 𛂴 | 𛂵 | 𛂶 | 𛂷 | 𛂸 | 𛂹 | 𛂺 | 𛂻 | 𛂼 | 𛂽 | 𛂾 | 𛂿 |
| U+1B0Cx | 𛃀 | 𛃁 | 𛃂 | 𛃃 | 𛃄 | 𛃅 | 𛃆 | 𛃇 | 𛃈 | 𛃉 | 𛃊 | 𛃋 | 𛃌 | 𛃍 | 𛃎 | 𛃏 |
| U+1B0Dx | 𛃐 | 𛃑 | 𛃒 | 𛃓 | 𛃔 | 𛃕 | 𛃖 | 𛃗 | 𛃘 | 𛃙 | 𛃚 | 𛃛 | 𛃜 | 𛃝 | 𛃞 | 𛃟 |
| U+1B0Ex | 𛃠 | 𛃡 | 𛃢 | 𛃣 | 𛃤 | 𛃥 | 𛃦 | 𛃧 | 𛃨 | 𛃩 | 𛃪 | 𛃫 | 𛃬 | 𛃭 | 𛃮 | 𛃯 |
| U+1B0Fx | 𛃰 | 𛃱 | 𛃲 | 𛃳 | 𛃴 | 𛃵 | 𛃶 | 𛃷 | 𛃸 | 𛃹 | 𛃺 | 𛃻 | 𛃼 | 𛃽 | 𛃾 | 𛃿 |
Notes
| ||||||||||||||||
The Unicode block for Small Kana Extension is U+1B130–U+1B16F:
| Small Kana Extension[1][2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+1B13x | 𛄲 | |||||||||||||||
| U+1B14x | ||||||||||||||||
| U+1B15x | 𛅐 | 𛅑 | 𛅒 | 𛅕 | ||||||||||||
| U+1B16x | 𛅤 | 𛅥 | 𛅦 | 𛅧 | ||||||||||||
| Notes | ||||||||||||||||
The Kana Extended-A Unicode block is U+1B100–1B12F. It contains hentaigana (non-standard hiragana) and historic kana characters.
| Kana Extended-A[1][2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+1B10x | 𛄀 | 𛄁 | 𛄂 | 𛄃 | 𛄄 | 𛄅 | 𛄆 | 𛄇 | 𛄈 | 𛄉 | 𛄊 | 𛄋 | 𛄌 | 𛄍 | 𛄎 | 𛄏 |
| U+1B11x | 𛄐 | 𛄑 | 𛄒 | 𛄓 | 𛄔 | 𛄕 | 𛄖 | 𛄗 | 𛄘 | 𛄙 | 𛄚 | 𛄛 | 𛄜 | 𛄝 | 𛄞 | 𛄟 |
| U+1B12x | 𛄠 | 𛄡 | 𛄢 | |||||||||||||
| Notes | ||||||||||||||||
The Kana Extended-B Unicode block is U+1AFF0–1AFFF. It contains kana originally created by Japanese linguists to write Taiwanese Hokkien known as Taiwanese kana.
| Kana Extended-B[1][2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+1AFFx | 𚿰 | 𚿱 | 𚿲 | 𚿳 | 𚿵 | 𚿶 | 𚿷 | 𚿸 | 𚿹 | 𚿺 | 𚿻 | 𚿽 | 𚿾 | |||
| Notes | ||||||||||||||||
The CJK Compatibility block contains in the range U+3330-3357 square versions of katakana words, usually measure units or abbreviations of loanwords:
| CJK Compatibility[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+330x | ㌀ | ㌁ | ㌂ | ㌃ | ㌄ | ㌅ | ㌆ | ㌇ | ㌈ | ㌉ | ㌊ | ㌋ | ㌌ | ㌍ | ㌎ | ㌏ |
| U+331x | ㌐ | ㌑ | ㌒ | ㌓ | ㌔ | ㌕ | ㌖ | ㌗ | ㌘ | ㌙ | ㌚ | ㌛ | ㌜ | ㌝ | ㌞ | ㌟ |
| U+332x | ㌠ | ㌡ | ㌢ | ㌣ | ㌤ | ㌥ | ㌦ | ㌧ | ㌨ | ㌩ | ㌪ | ㌫ | ㌬ | ㌭ | ㌮ | ㌯ |
| U+333x | ㌰ | ㌱ | ㌲ | ㌳ | ㌴ | ㌵ | ㌶ | ㌷ | ㌸ | ㌹ | ㌺ | ㌻ | ㌼ | ㌽ | ㌾ | ㌿ |
| U+334x | ㍀ | ㍁ | ㍂ | ㍃ | ㍄ | ㍅ | ㍆ | ㍇ | ㍈ | ㍉ | ㍊ | ㍋ | ㍌ | ㍍ | ㍎ | ㍏ |
| U+335x | ㍐ | ㍑ | ㍒ | ㍓ | ㍔ | ㍕ | ㍖ | ㍗ | ㍘ | ㍙ | ㍚ | ㍛ | ㍜ | ㍝ | ㍞ | ㍟ |
| U+336x | ㍠ | ㍡ | ㍢ | ㍣ | ㍤ | ㍥ | ㍦ | ㍧ | ㍨ | ㍩ | ㍪ | ㍫ | ㍬ | ㍭ | ㍮ | ㍯ |
| U+337x | ㍰ | ㍱ | ㍲ | ㍳ | ㍴ | ㍵ | ㍶ | ㍷ | ㍸ | ㍹ | ㍺ | ㍻ | ㍼ | ㍽ | ㍾ | ㍿ |
| U+338x | ㎀ | ㎁ | ㎂ | ㎃ | ㎄ | ㎅ | ㎆ | ㎇ | ㎈ | ㎉ | ㎊ | ㎋ | ㎌ | ㎍ | ㎎ | ㎏ |
| U+339x | ㎐ | ㎑ | ㎒ | ㎓ | ㎔ | ㎕ | ㎖ | ㎗ | ㎘ | ㎙ | ㎚ | ㎛ | ㎜ | ㎝ | ㎞ | ㎟ |
| U+33Ax | ㎠ | ㎡ | ㎢ | ㎣ | ㎤ | ㎥ | ㎦ | ㎧ | ㎨ | ㎩ | ㎪ | ㎫ | ㎬ | ㎭ | ㎮ | ㎯ |
| U+33Bx | ㎰ | ㎱ | ㎲ | ㎳ | ㎴ | ㎵ | ㎶ | ㎷ | ㎸ | ㎹ | ㎺ | ㎻ | ㎼ | ㎽ | ㎾ | ㎿ |
| U+33Cx | ㏀ | ㏁ | ㏂ | ㏃ | ㏄ | ㏅ | ㏆ | ㏇ | ㏈ | ㏉ | ㏊ | ㏋ | ㏌ | ㏍ | ㏎ | ㏏ |
| U+33Dx | ㏐ | ㏑ | ㏒ | ㏓ | ㏔ | ㏕ | ㏖ | ㏗ | ㏘ | ㏙ | ㏚ | ㏛ | ㏜ | ㏝ | ㏞ | ㏟ |
| U+33Ex | ㏠ | ㏡ | ㏢ | ㏣ | ㏤ | ㏥ | ㏦ | ㏧ | ㏨ | ㏩ | ㏪ | ㏫ | ㏬ | ㏭ | ㏮ | ㏯ |
| U+33Fx | ㏰ | ㏱ | ㏲ | ㏳ | ㏴ | ㏵ | ㏶ | ㏷ | ㏸ | ㏹ | ㏺ | ㏻ | ㏼ | ㏽ | ㏾ | ㏿ |
Notes
| ||||||||||||||||
Katakana in other Unicode blocks:
- Dakuten and handakuten diacritics are located in the Hiragana block:
- U+3099 COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK (non-spacing dakuten): ゙
- U+309A COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK (non-spacing handakuten): ゚
- U+309B KATAKANA-HIRAGANA VOICED SOUND MARK (spacing dakuten): ゛
- U+309C KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK (spacing handakuten): ゜
- Two katakana-based emoji are in the Enclosed Ideographic Supplement block:
- U+1F201 SQUARED KATAKANA KOKO ('here' sign): 🈁
- U+1F202 SQUARED KATAKANA SA ('service' sign): 🈂
- A katakana-based Japanese TV symbol from the ARIB STD-B24 standard is in the Enclosed Ideographic Supplement block:
- U+1F213 SQUARED KATAKANA DE ('data broadcasting service linked with a main program' symbol): 🈓
Furthermore, as of Unicode 17.0, the following combinatory sequences have been explicitly named, despite having no precomposed symbols in the katakana block. Font designers may want to optimize the display of these composed glyphs. Some of them are mostly used for writing the Ainu language, the others are called bidakuon in Japanese. Other, arbitrary combinations with U+309A handakuten are also possible.
| Katakana named sequences Unicode Named Character Sequences Database | ||||||||||||||||
| Sequence name | Codepoints | Glyph | ||||||||||||||
| KATAKANA LETTER BIDAKUON NGA | U+30AB | U+309A | カ゚ | |||||||||||||
| KATAKANA LETTER BIDAKUON NGI | U+30AD | U+309A | キ゚ | |||||||||||||
| KATAKANA LETTER BIDAKUON NGU | U+30AF | U+309A | ク゚ | |||||||||||||
| KATAKANA LETTER BIDAKUON NGE | U+30B1 | U+309A | ケ゚ | |||||||||||||
| KATAKANA LETTER BIDAKUON NGO | U+30B3 | U+309A | コ゚ | |||||||||||||
| KATAKANA LETTER AINU CE | U+30BB | U+309A | セ゚ | |||||||||||||
| KATAKANA LETTER AINU TU | U+30C4 | U+309A | ツ゚ | |||||||||||||
| KATAKANA LETTER AINU TO | U+30C8 | U+309A | ト゚ | |||||||||||||
| KATAKANA LETTER AINU P | U+31F7 | U+309A | ㇷ゚ | |||||||||||||
See also
[edit]- Japanese phonology
- Hiragana
- Historical kana usage
- Rōmaji
- Gugyeol
- Tōdaiji Fujumonkō, oldest example of kanji text with katakana annotations
- File:Beschrijving van Japan - ABC (cropped).jpg for the kana as described by Engelbert Kaempfer in 1727
References
[edit]- ^ McAuley, Thomas E. (2001). Language change in East Asia. Routledge. p. 90. ISBN 0700713778.
- ^ Roy Andrew Miller (1966) A Japanese Reader: Graded Lessons in the Modern Language, Rutland, Vermont: Charles E. Tuttle Company, Tokyo, Japan, p. 28, Lesson 7: Katakana: a—no. "Side by side with hiragana, modern Japanese writing makes use of another complete set of similar symbols called the katakana."
- ^ Vance, Timothy J. (1987). An Introduction to Japanese Phonology. Albany, NY: State University of New York Press. ISBN 978-0-88706-360-2.
- ^ "How did katakana and hiragana originate?".
- ^ a b c d e f See obsolete kana
- ^ a b c "The Japanese Writing System (2) Katakana", p. 29 in Yookoso! An Invitation to Contemporary Japanese. McGraw-Hill, 1993, ISBN 0070722935
- ^ "Hiragana, Katakana & Kanji". Japanese Word Characters. 8 September 2010. Retrieved 15 October 2011.
- ^ "明治安田生命 全国同姓調査 [Meiji Yasuda Life Insurance Company – National same family name investigation]" (PDF) (Press release). Meiji Yasuda Life Insurance Company. 24 September 2008. Archived from the original (PDF) on 17 January 2012. Retrieved 24 May 2018.
- ^ Tackett, Rachel. "Why old Japanese women have names in katakana". RocketNews24. Archived from the original on 10 September 2015. Retrieved 19 September 2015.
- ^ Mutsuko Endo Simon (1984) Section 3.3 "Katakana", p. 36 in A Practical Guide for Teachers of Elementary Japanese, Center for Japanese Studies, the University of Michigan. ISBN 0939512165
- ^ Simon, p. 36
- ^ Reading Japanese, Lesson 1. joyo96.org
- ^ Cabinet of Japan. "平成3年6月28日内閣告示第2号:外来語の表記" [Japanese cabinet order No.2 (28 June 1991):The notation of loanword]. Ministry of Education, Culture, Sports, Science and Technology. Archived from the original on 6 January 2019. Retrieved 25 May 2011.
- ^ "米国規格(ANSI Z39.11-1972)―要約". Retrieved 27 February 2016.[self-published source]
- ^ "英国規格(BS 4812 : 1972)―要約". Retrieved 27 February 2016.[self-published source]
- ^ "標準式ローマ字つづり―引用". Retrieved 27 February 2016.[self-published source]
- ^ Cabinet of Japan (16 November 1946). 昭和21年内閣告示第33号 「現代かなづかい」 [Japanese Cabinet Order No.33 in 1946 – Modern kana usage] (in Japanese). Archived from the original on 6 October 2001. Retrieved 25 May 2011.
- ^ Cabinet of Japan (1 July 1986). 昭和61年内閣告示第1号 「現代仮名遣い」 [Japanese Cabinet Order No.1 in 1986 – Modern kana usage] (in Japanese). Ministry of Education, Culture, Sports, Science and Technology. Archived from the original on 24 May 2011. Retrieved 25 May 2011.
- ^ Japanese katakana. Omniglot.com
- ^ 『小学略則教授法』「五十音図」
- ^ Tranter, Nicolas (2012). The Languages of Japan and Korea. Routledge. p. 218. ISBN 978-0-415-46287-7.
External links
[edit]Katakana
View on GrokipediaWriting System
Overview and Characteristics
Katakana is a syllabary in the Japanese writing system, consisting of 46 basic characters representing morae (timing units approximating syllables), with additional diacritics for voicing (dakuten/handakuten) and small characters for gemination or palatalization (e.g., kya). It features angular, straight-lined strokes, contrasting with hiragana's cursive form, and is encoded in Unicode's Katakana block (U+30A0–U+30FF).[3] This design aids readability in technical and foreign contexts. Extended forms include phonetic extensions for non-Japanese languages.[4]Usage in Japanese
In standard Japanese orthography, katakana serves several primary functions, most notably for transliterating gairaigo, or loanwords borrowed from foreign languages, to approximate their original pronunciations using Japanese phonetics.[5] For instance, English words like "ice cream" become アイスクリーム (aisukurīmu), "coffee" as コーヒー (kōhī), "television" as テレビ (terebi), "hotel" as ホテル (hoteru), and "camera" as カメラ (kamera).[6] It is also the standard script for onomatopoeia and mimetic expressions (giongo and gitaigo), capturing sounds and sensory impressions, such as ピンポン (pinpon) for a doorbell ring, ワンワン (wanwan) for a dog's bark, ザーザー (zāzā) for heavy rain, and ドキドキ (dokidoki) for a pounding heartbeat.[6] Additionally, katakana denotes scientific and technical terms, particularly for non-native species like animals and plants, exemplified by ライオン (raion, lion) and トマト (tomato, though botanically a fruit).[5] Beyond these, it functions analogously to italics in English for emphasis, highlighting key words or phrases in text.[7] Orthographic conventions mandate katakana for foreign personal and place names, such as ジョン (Jon) or アメリカ (Amerika, America), and for acronyms, like ナサ (nasa) for NASA (National Aeronautics and Space Administration).[6] It is also required for certain emphatic elements, including particles in stylistic contexts, as seen in manga where ダ (da) might replace hiragana だ to stress a declaration.[6] Historically, katakana was obligatory in telegrams until 1988 for its clarity and brevity in transmission, in pre-World War II official government documents where it served as okurigana (inflectional endings) alongside kanji, and in railway signage for station names and warnings to ensure readability from a distance.[6] Examples include telegram-style messages rendered entirely in katakana and early signage like 新宿 (Shinjuku) with katakana annotations.[8] In modern Japanese writing, katakana's role has expanded stylistically, particularly in advertising, manga, and anime, where it denotes emphasis, foreign or alien speech, or a trendy tone—such as robots uttering コンニチワ (kon'nichiwa, a mangled "hello") or characters with accents speaking in katakana to signal otherness.[9] Particularly in science fiction and fantasy genres, katakana is frequently used to adapt loanwords, create neologisms for alien names, futuristic technologies, magical spells, and special abilities, enhancing the sense of exoticism or otherworldliness. Examples include the energy wave かめはめ波 (Kamehameha) and transformation スーパーサイヤ人 (Sūpā Saiya-jin, Super Saiyan) from Dragon Ball, as well as spell names like ファイラ (Faira) in the Final Fantasy series. In anime voice acting and media contexts, these katakana terms are pronounced according to Japanese phonology, adapting foreign-inspired sounds to the native mora structure (e.g., "laser" as レーザー rēzā), which contributes to the distinctive stylized delivery by voice actors. This usage persists in pop culture, with terms like オタク (otaku, obsessive fan) originating in subcultures and now mainstream.[9] In recent years, including 2025, katakana appears increasingly in social media hashtags and branding for visual impact and international appeal, such as #コーヒータイム (kōhī taimu, coffee time) or brand names like スターバックス (Sutābakkusu, Starbucks), blending loanword transcription with digital aesthetics to engage younger audiences.[10]Usage in Other Languages
Katakana has been adapted for transcribing the Ainu language, an indigenous language of Japan, to represent sounds absent in standard Japanese phonology, such as final consonants. These adaptations include the Katakana Phonetic Extensions Unicode block (U+31F0–U+31FF), which consists of small katakana variants used for phonetic notation in Ainu texts. For example, the Ainu word for "up" (meaning "sake" or "alcohol") is written as ウㇷ゚, employing the small pu (ㇷ゚) to indicate the final bilabial stop /p/, which requires gemination-like representation not possible with standard katakana.[11][12] Historically, Japanese scholars employed this katakana-based system to document Ainu oral traditions and folklore, facilitating linguistic analysis and preservation efforts from the late 19th century onward.[13][14] In Ryukyuan languages, such as Okinawan (Uchinaaguchi), katakana serves as a phonetic supplement alongside hiragana and kanji to capture unique phonological features, including glottal stops and compressed vowels. A 2022 proposal by the Shimakutuba Orthography Review Committee, with Unicode encoding proposed in 2023, incorporates superscript katakana letters to denote these sounds precisely; for instance, the superscript tu (proposed as U+1B16D) marks a glottal stop before a following consonant in words like those initiating utterances with abrupt closure.[15] Examples appear in contemporary Okinawan signage and literature, where katakana highlights dialectal elements. This usage supports cultural revitalization amid language endangerment, with less than 1% of Okinawans fluent as of 2025 (UNESCO), primarily among the elderly.[16] During Japanese colonial rule over Taiwan (1895–1945), katakana was modified into the Taiwanese kana system to romanize and phonetically transcribe Hokkien (Taiwanese Minnan), incorporating tonal marks and influences from bopomofo for the language's seven tones and nasal finals. This system paired katakana syllables with Chinese characters, as in the obsolete phrase りしれ供さ小 (lí sī-leh kóng-siánn-siâu), approximating "What the heck are you saying?" in Hokkien, used in colonial-era education and signage to promote Japanese assimilation. Though discontinued post-1945, the system has seen niche revival in linguistic studies for reconstructing historical Hokkien dialects and analyzing colonial language policies.[17][18][19] Beyond these contexts, katakana finds minor applications in Korean-Japanese (Zainichi) communities for transliterating Korean names into Japanese script, often as an alias (tsūmei) alongside hanja, to navigate daily interactions while preserving ethnic identity; for example, a Korean name might be rendered in katakana for official documents. In recent years, including 2025, katakana continues to be used for ad hoc transliteration of non-Latin scripts in multilingual settings, such as approximating Arabic or Cyrillic names in Japanese media, though it remains secondary to romaji systems.[20][21][22][23]Katakana Characters
Standard Katakana Table
The standard katakana syllabary comprises 46 basic characters, organized in the traditional gojūon (fifty sounds) order, which structures them into rows for vowels and consonants followed by vowels, excluding obsolete forms like ヰ and ヱ in modern usage. These characters represent open syllables (consonant-vowel), the vowel ン (a syllabic nasal), and the particle marker ヲ, which is pronounced identically to オ in contemporary Japanese. The following table lists them with Hepburn romanization (the most widely used system for English speakers) and International Phonetic Alphabet (IPA) transcriptions reflecting standard Tokyo dialect phonology, where u is realized as a compressed [ɯ] and certain consonants palatalize before i (e.g., s as [ɕ], t as [tɕ], h as [ç]).[24][25]| Row/Character | ア | イ | ウ | エ | オ |
|---|---|---|---|---|---|
| Vowels (Romaji) | a | i | u | e | o |
| IPA | [a] | [i] | [ɯ] | [e] | [o] |
| Notes | Compressed vowel, often [ɯᵝ] |
| Row/Character | カ | キ | ク | ケ | コ |
|---|---|---|---|---|---|
| K- (Romaji) | ka | ki | ku | ke | ko |
| IPA | [ka] | [ki] | [kɯ] | [ke] | [ko] |
| Notes | Used in loanwords like コーヒー (kōhī, coffee) |
| Row/Character | サ | シ | ス | セ | ソ |
|---|---|---|---|---|---|
| S- (Romaji) | sa | shi | su | se | so |
| IPA | [sa] | [ɕi] | [sɯ] | [se] | [so] |
| Notes | Palatalized s; e.g., シ in テレビ (terebi, television) for "shi" approximating /ʃi/ |
| Row/Character | タ | チ | ツ | テ | ト |
|---|---|---|---|---|---|
| T- (Romaji) | ta | chi | tsu | te | to |
| IPA | [ta] | [tɕi] | [tsɯ] | [te] | [to] |
| Notes | Palatalized t; affricate | Affricate; used for "ts" sounds in loanwords like テスト (tesuto, test) |
| Row/Character | ナ | ニ | ヌ | ネ | ノ |
|---|---|---|---|---|---|
| N- (Romaji) | na | ni | nu | ne | no |
| IPA | [na] | [ni] | [nɯ] | [ne] | [no] |
| Notes |
| Row/Character | ハ | ヒ | フ | ヘ | ホ |
|---|---|---|---|---|---|
| H- (Romaji) | ha | hi | fu | he | ho |
| IPA | [ha] | [çi] | [ɸɯ] | [he] | [ho] |
| Notes | Palatalized h as fricative [ç] | Bilabial fricative; used for "f" in loanwords like ファミリー (famirī, family) |
| Row/Character | マ | ミ | ム | メ | モ |
|---|---|---|---|---|---|
| M- (Romaji) | ma | mi | mu | me | mo |
| IPA | [ma] | [mi] | [mɯ] | [me] | [mo] |
| Notes |
| Row/Character | ヤ | ユ | ヨ | ||
|---|---|---|---|---|---|
| Y- (Romaji) | ya | (none) | yu | (none) | yo |
| IPA | [ja] | [jɯ] | [jo] | ||
| Notes | Palatal glide y; i and e positions unused in basic set |
| Row/Character | ラ | リ | ル | レ | ロ |
|---|---|---|---|---|---|
| R- (Romaji) | ra | ri | ru | re | ro |
| IPA | [ɾa] | [ɾi] | [ɾɯ] | [ɾe] | [ɾo] |
| Notes | Flap [ɾ], similar to Spanish r; used in loanwords like ラジオ (rajio, radio) |
| Row/Character | ワ | ヲ | |||
|---|---|---|---|---|---|
| W- (Romaji) | wa | (none) | (none) | (none) | wo/o |
| IPA | [wa] | [o] | |||
| Notes | i, u, e positions unused | Modern pronunciation as [o], distinct from wa only in particle use; rarely distinguished in speech |
| Standalone | ン |
|---|---|
| Romaji | n |
| IPA | [ɴ] |
| Notes | Syllabic nasal mora; assimilates to [m, ŋ, ɲ, n] before labials, velars, palatals, alveolars; e.g., in コン (kon, con) as [koŋ] |
Extended and Combined Katakana
Katakana employs two primary diacritics to modify base characters for voiced and semi-voiced sounds. The dakuten (゛), a double quotation mark-like mark placed in the upper right of a character, voices unvoiced consonants in the k-, s-, t-, and h-series, producing g-, z-, d-, and b-sounds, respectively. For instance, キ (ki) becomes ギ (gi), as in ギター (gitā, guitar). This diacritic is essential for adapting foreign words with voiced obstruents. The handakuten (゚), a small circle in the same position, semi-voices the h-series to create p-sounds, such as ヒ (hi) to ピ (pi), used in words like ピザ (piza, pizza). These marks are standardized and apply only to specific series, expanding katakana's phonetic range without altering vowel sounds. Note that many extended forms, particularly ts-clusters and some palatalized variants, are non-standard and used sparingly for precise foreign sound approximation, with preferences for standard yōon where possible.[27][3][28] The voiced series includes: ガ (ga), ギ (gi), グ (gu), ゲ (ge), ゴ (go) from k-series; ザ (za), ジ (ji), ズ (zu), ゼ (ze), ゾ (zo) from s-series; ダ (da), ヂ (ji), ヅ (zu), デ (de), ド (do) from t-series; and バ (ba), ビ (bi), ブ (bu), ベ (be), ボ (bo) from h-series. The p-series comprises: パ (pa), ピ (pi), プ (pu), ペ (pe), ポ (po). Examples demonstrate their use in loanwords: ザクロ (zakuro, pomegranate, za); ジューサー (jūsā, juicer, ji); ドリンク (dorinku, drink, do); バナナ (banana, banana, ba); ピンポン (pinpon, ping-pong, pi). These combinations follow the gojūon order and are integral to modern katakana orthography.[27][28] Yōon (palatalization) forms are created by attaching small versions of ya (ャ), yu (ュ), or yo (ョ) to i-row katakana, resulting in sounds like kya, sha, and cha. This contraction represents diphthong-like syllables absent in basic katakana, commonly used for foreign names and terms. For example, シャ (sha) appears in シャンプー (shanpū, shampoo). Sokuon, denoted by the small tsu (ッ), indicates gemination or doubling of the following consonant, creating a glottal stop or emphasis, as in キッス (kissu, kiss). These small kana are half the size of standard characters and pair with various bases to produce over 30 common yōon forms across series.[27][28]| Base Series | Yōon Forms | Romaji | Example Word (Romaji) |
|---|---|---|---|
| k/g | キャ, ギャ | kya, gya | キャンプ (kyanpu, camp); ギャラクシー (gyarakushī, galaxy) |
| キュ, ギュ | kyu, gyu | キュート (kyūto, cute); ギュッ (gyu, squeeze) | |
| キョ, ギョ | kyo, gyo | キョウ (kyō, capital); ギョウ (gyō, fish) | |
| s/z | シャ, ジャ | sha, ja | シャツ (shatsu, shirt); ジャンプ (janpu, jump) |
| シュ, ジュ | shu, ju | シューズ (shūzu, shoes); ジュース (jūsu, juice) | |
| ショ, ジョ | sho, jo | ショー (shō, show); ジョーク (jōku, joke) | |
| t/d/ch | チャ, ジャ/ヂャ | cha, ja/dja | チャイ (chai, tea); ジャズ (jazu, jazz) |
| チュ, ジュ/ヂュ | chu, ju/dju | チューリップ (chūrippu, tulip); ジュニア (junia, junior) | |
| チョ, ジョ/ヂョ | cho, jo/djo | チョコ (choko, chocolate); チョップ (shoppu, chop) | |
| n | ニャ, ニュ, ニェ, ニョ | nya, nyu, nye, nyo | ニャー (nyā, meow); ニューヨーク (nyūyōku, New York) |
| h/b/p | ヒャ, ビャ, ピャ | hya, bya, pya | ヒャク (hyaku, hundred); ピャノ (pyano, piano) |
| ヒュ, ビュ, ピュ | hyu, byu, pyu | ヒュー (hyū, hue); ビュー (byū, view) | |
| ヒェ, ビェ, ピェ | hye, bye, pye | ヒェッ (hye, rare); ピェロ (pyero, pierrot) | |
| ヒョ, ビョ, ピョ | hyo, byo, pyo | ヒョウ (hyō, leopard); ピョンピョン (pyonpyon, hop) | |
| m | ミャ, ミユ, ミェ, ミョ | mya, myu, mye, myo | ミャウ (myau, meow); ミルク (miruku, milk) |
| r | リャ, リュ, リェ, リョ | rya, ryu, rye, ryo | リャマ (ryama, llama); リュック (ryukku, rucksack) |
| Sokuon Examples | ッ (small tsu) | Gemination | ブック (bukku, book); テスト (tesuto, test) |
| Extended Form | Romaji | Example Word (Romaji) | Notes |
|---|---|---|---|
| ヴァ | va | ヴァイオリン (vaiorin, violin) | v-series based on ヴ (vu), with small kana for other vowels (e.g., ヴァ = ヴ + ァ) |
| ヴィ | vi | ヴィーナス (vīnasu, Venus) | |
| ヴ | vu | ヴァキューム (vakyūmu, vacuum) | Base for v |
| ヴェ | ve | ベクター (bekutā, vector, ve variant) | |
| ヴォ | vo | ボルト (boruto, volt, vo) | |
| ファ | fa | ファスト (fasuto, fast) | f-series |
| フィ | fi | ファイル (fairu, file) | |
| フュ | fyu | フューチャー (fyūchā, future) | |
| フェ | fe | フェンス (fensu, fence) | |
| フォ | fo | フォーク (fōku, fork) | |
| ファ | fya | ファミリー (famirī, family) | f + small ya |
| フュ | fyu | As above | f + small yu |
| フョ | fyo | フィヨルド (fiyorudo, fjord) | f + small yo |
| ツァ | tsa | Rarely used, e.g., in foreign names like ツァイ (Tsai) | ts + small a; non-standard |
| ツィ | tsi | Rarely used, e.g., in specific transcriptions | ts + small i; non-standard |
| ツェ | tse | Rarely used, e.g., in foreign names | ts + small e; non-standard |
| ツォ | tso | Rarely used, e.g., in specific loanwords | ts + small o; non-standard |
| ティ | ti | ティー (tī, tea) | t + small i |
| テュ | tyu | Very rare, e.g., in specialized transcriptions | t + small yu; non-standard |
| ウィ | wi | ウィスキー (uisukī, whiskey) | w + i |
| ウェ | we | ウェスト (wesuto, west) | w + e |
| ウォ | wo | ウォッシュ (wosshu, wash) | w + o |
| イェ | ye | イェロー (yerō, yellow) | Small ye |
| シェ | she | シェフ (shefu, chef) | sh + small e |
| ジェ | je | ジェット (jetto, jet) | j + small e |
| チェ | che | チェック (chekku, check) | ch + small e |
| トゥ | tu | インタビュー (intabyū, interview) | t + small u |
| ドゥ | du | ドゥーム (dūmu, doom) | d + small u |
| クァ | kwa | クォーター (kwōtā, quarter) | k + small a |
| グァ | gwa | グァバ (gwaba, guava) | g + small a |
| イィ | yi | イェス (yesu, yes, yi variant) | Small ii |
| ウィ | wi | As above | u + small i |
| ウェイ | wei | ウェイ (wei, way) | Rare |
History
Origins and Early Development
Katakana originated from the earlier system of man'yōgana, an 8th-century practice of employing selected Chinese characters (kanji) solely for their phonetic values to transcribe Japanese sounds, particularly in glosses and annotations within classical texts. This system first appeared prominently in the Man'yōshū, Japan's oldest anthology of poetry compiled around 759 CE, where man'yōgana was used to render native Japanese words and grammatical elements alongside semantic kanji, facilitating the reading of waka poetry in the vernacular. Such phonetic usage marked an initial adaptation of imported Chinese script to express Japanese phonology, laying the groundwork for later syllabaries.[29] In the 9th century, during the early Heian period, Buddhist monks further simplified man'yōgana by extracting partial components of kanji—often the initial or radical strokes—to create katakana, primarily for annotating and transliterating Sanskrit and Chinese Buddhist terms in sutras and religious manuscripts. For instance, the katakana character カ (ka) derives from the left portion of the kanji 加 (ka, meaning "add"). This development was driven by the need for concise glosses (known as kundoku) in esoteric Buddhist texts, where monks like those in the Shingon sect required a shorthand to indicate Japanese readings of foreign vocabulary without altering the original Chinese layout. Although legend attributes the invention of kana syllabaries, including katakana, to the influential monk Kūkai (774–835 CE), who studied esoteric Buddhism in Tang China and introduced the Siddhaṃ script, modern scholarship regards this as hagiographic myth rather than historical fact, with katakana emerging collectively among monastic scribes.[30][31] Katakana's early form contrasted with the contemporaneous evolution of hiragana, which arose from cursive, full-bodied simplifications of kanji by court women for native literature; katakana, by contrast, retained angular, fragmentary kanji elements suited for marginal annotations in scholarly and religious contexts. Surviving 9th- and 10th-century manuscripts, such as the Tōdaiji fujumonkō (a draft of Buddhist liturgical chants from around 854 CE), demonstrate katakana's initial application in mixed orthography for inflectional endings and particles within sacred texts. By the late Nara period (710–794 CE) and into early Heian, these precursors spread through poetry collections and religious documents, where phonetic fragments aided recitation and interpretation of Buddhist sutras, gradually distinguishing katakana as a tool for foreign and emphatic elements.[2][32]Standardization and Modernization
During the Heian period (794–1185), katakana saw widespread adoption in official government documents, religious texts, and annotations for classical poetry, where it served as a phonetic glossing system derived from abbreviated man'yōgana forms to aid in reading complex kanji-laden texts.[2] By the Kamakura period (1185–1333), diacritics were introduced to distinguish voiced sounds, enhancing its utility in scribal and scholarly work.[2] In the Edo period (1603–1868), rising literacy rates and the proliferation of woodblock printing further entrenched katakana in educational materials, Chinese classical studies, and early encounters with Western terms, during which gradual simplifications of its angular forms occurred to facilitate faster writing in administrative and commercial contexts.[2] The Meiji era (1868–1912) marked a pivotal phase of standardization influenced by Western modernization, as Japan integrated foreign concepts in science, technology, and governance, leading to the increased use of katakana for transcribing loanwords that could not be easily rendered in kanji.[33] These efforts aligned with broader linguistic shifts to promote national education and print media accessibility amid rapid industrialization.[33] Post-World War II reforms in 1946, including the adoption of the Tōyō kanji list limiting characters to 1,850 and the introduction of shinjitai (new character forms) to simplify kanji shapes, focused on simplifying the writing system to boost literacy and standardizing kana orthography to match contemporary pronunciation, which indirectly reinforced katakana's phonetic consistency by eliminating archaic historical usages influenced by variant hentaigana forms.[34] This overhaul, implemented under U.S. Occupation oversight, encouraged katakana for non-standard terms, including scientific and foreign names, to replace obscure kanji and reduce textual complexity.[35] The 1981 introduction of the Jōyō kanji list expanded the approved characters to 1,945, softening mandatory restrictions while promoting katakana as an alternative for less common botanical and zoological nomenclature, thereby minimizing reliance on specialized kanji and enhancing readability in educational and official materials.[35] In the 20th and 21st centuries, katakana has adapted to the digital era through conventions for rendering global loanwords in online media and international communication, reflecting Japan's deepened globalization and the influx of English-derived terms in technology and pop culture.[36] These adjustments include unconventional uses of katakana for native words to evoke foreignness or emphasis, as seen in advertising and digital content, which has expanded its phonetic flexibility without formal orthographic overhauls.[37]Variant and Obsolete Forms
Variant Forms
Variant forms of katakana encompass historical graphical alternatives derived from kanji components, influenced by the same man'yōgana origins that gave rise to hentaigana for hiragana. Although katakana was designed for concise annotation of classical texts and thus standardized earlier than hiragana, pre-modern writings featured multiple shapes for the same phoneme based on different source kanji, particularly evident in earlier periods before the Edo period (1603–1868), with forms stabilizing during the Edo era in manuscripts and printed materials like Buddhist commentaries and dictionaries. These variants served as stylistic choices rather than phonetic distinctions, allowing scribes flexibility in form while maintaining sound consistency.[2] Regional and calligraphic variants further highlight differences between handwritten and printed katakana, with handwriting in old manuscripts often adopting cursive or semi-cursive styles that deviated from rigid block forms. In Edo-period calligraphy, such as in ukiyo-e annotations or personal letters, katakana strokes were fluidly connected to enhance flow, contrasting with the angular printed versions in woodblock texts. These practices persisted beyond standardization, influencing modern shodō (Japanese calligraphy) where artists employ variant shapes for aesthetic effect in decorative pieces, such as hanging scrolls or signboards. Such variants emphasize katakana's adaptability in artistic contexts without altering pronunciation.[38] In contemporary usage, katakana variants survive as stylistic alternatives in limited, non-phonetic roles, particularly in trademarks and historical reproductions that evoke tradition. Brands in sectors like sake or traditional crafts occasionally incorporate Edo-inspired shapes to convey heritage, distinguishing them from standard digital fonts. This usage underscores variants as visual flair rather than functional changes, often seen in logos or packaging reproductions of classical literature.[39]| Sound | Standard Form | Variant Description | Context/Example |
|---|---|---|---|
| ne | ネ | Derived from multiple kanji sources like 祢 or 禰 | Historical man'yōgana origins[30] |
| shi | シ | Variations in stroke curvature in calligraphy | Artistic shodō uses |
| tsu | ツ | Cursive extensions in handwriting | Historical manuscripts |
| ki | キ | Alternative forms from source kanji | Pre-standardization texts |
| ka | カ | Brush-style thickenings | Trademark logos for traditional goods[39] |
| sa | サ | Angular hook variations | Historical text reproductions |
| ta | タ | Crossed stroke alternatives | Cursive calligraphy pieces |
| na | ナ | Curved regional forms | Edo ukiyo-e annotations |
| ha | ハ | Wavy artistic renderings | Modern vintage trademarks |
| ma | マ | Rounded variant arches | Decorative historical replicas |
Obsolete Kana
Obsolete katakana refer to characters and forms that represented sounds no longer distinct in modern standard Japanese phonology, primarily eliminated through the 1946 orthographic reforms known as gendai kanazukai (modern kana usage). These reforms, promulgated by Japan's Ministry of Education, standardized kana to align more closely with contemporary pronunciation, rendering characters for the sounds wi (ヰ), we (ヱ), and the distinct wo (ヲ) phoneme obsolete in everyday writing, except for wo as a grammatical particle. Historically, wi was pronounced approximately as /βi/ or /wi/ during the Heian period (794–1185 CE), evolving from earlier /ɸi/ in the Nara period (710–794 CE), before merging with /i/ by the Kamakura period (1185–1333 CE). Similarly, we shifted from /βe/ to /e/, and wo from /βo/ to /o/. An example of historical usage is ヰネ (wine), transliterating the English word "wine" in early Meiji-era (1868–1912) texts, now written as ワイン.[40] The sounds yi, ye, and wu were even more marginal, never achieving canonical status in the modern 46-kana set due to their absence or rarity in native Japanese phonology after the 9th century. Yi (/ji/ or /i/) derived from the kanji 以 (U+4EE5, "to use"), appearing in early man'yōgana (pre-kana phonetic usage of kanji) but fading as i absorbed it. Ye originated from 衣 (U+8863, "clothing"), representing /je/ before it merged with /e/ around the 12th century; its decline is tied to the loss of the /y/ glide before /e/ in Middle Japanese. Wu stemmed from 宇 (U+5B87, "space"), for /βu/ or /wu/, which evolved into plain /u/ by the Heian period, leaving no distinct representation. These sounds occasionally appeared in archaic texts or loanword adaptations, such as omoβu (思う, "to think") spelled with wu in early orthography.[41] In addition to single-mora obsolete forms, archaic katakana texts employed rare multi-mora representations, where a single character or fused form denoted compound syllables like kya, now standardized as combinations (e.g., キャ). These polysyllabic kana, drawn from variant man'yōgana derivations, were used in pre-10th-century Buddhist glosses and classical literature but were replaced by modern digraphs during the Taishō (1912–1926) and early Shōwa (1926–1989) standardization efforts, prioritizing phonetic simplicity. Such forms appear in digitized Heian-era manuscripts, illustrating the transition from fluid, kanji-derived scripts to rigid syllabaries. The following table lists key obsolete katakana characters, their represented sounds, modern replacements, and deriving kanji:| Obsolete Character | Represented Sound | Modern Replacement | Deriving Kanji |
|---|---|---|---|
| ヰ | wi (/βi/) | イ (i) or ウィ | 為 (U+70E7) |
| ヱ | we (/βe/) | エ (e) or ウェ | 恵 (U+6075) |
| ヲ | wo (/βo/) | オ (o) | 乎 (U+4E4E) |
| 𛄠 | yi (/ji/) | イ (i) or イィ | 以 (U+4EE5) |
| 𛄡 | ye (/je/) | イェ | 衣 (U+8863) |
| 𛄢 | wu (/βu/) | ウ (u) | 宇 (U+5B87) |
Writing Practices
Stroke Order
Katakana writing adheres to fundamental principles of Japanese script, emphasizing a consistent sequence to ensure legibility, balance, and ease of recognition. The primary rules include proceeding from top to bottom and left to right, with horizontal strokes drawn before vertical ones where applicable. Unlike the more fluid hiragana, katakana employs an angular style with straight, sharp lines, starting typically from the top-left for many characters; for instance, the character ア (a) begins with a downward left stroke.[43][44] These conventions, rooted in standard educational practices, promote uniform handwriting and prevent distortions in character formation.[45] To illustrate these principles, the following representative katakana characters demonstrate stroke breakdowns, focusing on sequence and direction for learner consistency. Each example includes the number of strokes and a step-by-step description, highlighting the angular execution.- ア (a): 3 strokes. First, a vertical line downward from the top; second, a diagonal line from the top-right downward to the left; third, a short horizontal line crossing the first stroke near the bottom. This maintains a triangular proportion.[45]
- カ (ka): 3 strokes. First, a long vertical line from top to bottom; second, a diagonal from the top of the vertical to the upper right; third, a short hook curving slightly rightward from the bottom of the vertical. Emphasize straight angles over curves.[45]
- サ (sa): 3 strokes. First, a diagonal from top-left downward to the right; second, a longer diagonal from the end of the first upward to the left; third, a vertical line from the top of the second stroke downward. The slants create a zigzag pattern.[45]
- タ (ta): 4 strokes. First, a horizontal line across the top; second, a vertical from the left end downward; third, a diagonal from the right end of the horizontal downward to the left; fourth, a short hook at the bottom center. Balance the "roof" and supports evenly.[45]
- ナ (na): 3 strokes. First, a diagonal from top-left to bottom-right; second, a vertical from the top-left downward past the first stroke; third, a diagonal from the bottom of the second upward to meet the first. Form a bent arrow shape.[45]
- ハ (ha): 4 strokes. First, a horizontal across the top; second, a vertical from the left end downward; third, a vertical from the right end downward to half height; fourth, a hook from the bottom of the third to the left. Keep verticals parallel.[45]
- マ (ma): 5 strokes. First, short horizontal at top; second, medium horizontal below it; third, longer horizontal at bottom; fourth, short vertical from right of first downward; fifth, short vertical from right of third downward. Stack horizontals proportionally.[45]
- ヤ (ya): 3 strokes. First, a vertical from top to bottom; second, a diagonal from top of vertical to upper right; third, a diagonal from middle of vertical to bottom right. Resemble an inverted V on a stem.[45]
- ラ (ra): 2 strokes. First, a vertical from top to bottom; second, a diagonal from top of vertical downward to the right, ending with a slight hook. Avoid making the diagonal too curved.[45]
- ワ (wa): 3 strokes. First, a diagonal from top-left to bottom-right; second, a shorter diagonal from top-right downward to meet the first; third, a vertical from the junction downward. Create a V with a descending leg.[45]
Typography and Styles
In Japanese typography, katakana characters are rendered in full-width forms as the standard, occupying a square space aligned with other scripts like kanji and hiragana for balanced vertical composition in traditional print media.[48] Half-width katakana, such as カ (representing "ka"), emerged in the mid-20th century to conserve space in compact formats like newspapers and forms, mimicking the narrower proportions of Latin characters while maintaining readability.[49][50] Font variations for katakana include mincho styles, which feature serif-like brackets and tapered strokes for an elegant, classical appearance suited to book printing and formal documents.[51] In contrast, gothic fonts present katakana with angular, sans-serif lines for a modern, clean look, commonly used in headlines, advertisements, and digital interfaces to convey simplicity and impact.[52] Stylistic emphasis in Japanese typesetting avoids Western italics for katakana, instead employing bolder gothic weights (futoji) or switching to katakana itself from hiragana to highlight foreign terms or add emphasis, as seen in editorial and promotional texts.[53][54] Decorative applications extend to signage and logos, where stylized katakana—often elongated or curved—lend a dynamic, phonetic flair to brand identities, while in calligraphy, fluid brush variants emphasize rhythmic flow for artistic expressions like shop names or event posters.[55][56] The evolution of katakana typography traces from woodblock printing in the Edo period (1603–1868), where carved blocks produced uniform, angular forms for mass reproduction in literature and ephemera, to metal movable type in the late 19th century, which introduced precision but required standardization amid Japan's Meiji-era modernization.[57] In the mid-20th century, particularly with the 1946 orthographic reforms and the rise of phototypesetting, katakana glyphs were formalized for legibility, bridging traditional aesthetics with industrial needs until digital fonts dominated post-1980s.[35][58] As of 2025, katakana integrates into modern web aesthetics through variable fonts, such as those from Kinuta Typeface Factory, which allow dynamic weight and width adjustments for responsive designs supporting CJK scripts, enhancing cross-device readability in multilingual sites.[59][60] Additionally, katakana appears in emoji designs via East Asian width variants, where full-width forms ensure harmonious alignment in digital messaging platforms like iOS and Android.[61]Learning and Pedagogy
Teaching approaches for katakana emphasize mnemonic devices to link characters with visual or phonetic cues, enhancing retention for non-native learners. For instance, associating the character シ (shi) with an image of a ship aids in memorizing its form and sound through pictorial mnemonics. Mnemonic approaches vary by learner background; Korean-speaking learners often adapt these techniques by drawing pronunciation comparisons to Hangul and employing visual associations tailored to their linguistic context. Resources shared on Korean-language platforms such as Naver blogs provide guides and tables featuring 연상법 (mnemonic methods), with examples such as associating the character ナ (na) with a door without a knocker to facilitate recall.[62] Rote memorization using gojūon grids, as featured in standard textbooks like Genki I and Nakama I, provides a structured visual layout for learning letter-sound correspondences. Comparative exercises, such as applying transliteration rules to convert English words into katakana while contrasting them with hiragana equivalents, help distinguish the scripts and build accuracy in production. Katakana holds a standard place in Japanese language curricula, particularly in proficiency programs like the Japanese Language Proficiency Test (JLPT), where it is required from the N5 level onward for reading and understanding basic sentences in hiragana, katakana, and simple kanji. Instruction typically prioritizes recognition of katakana for loanword reading before full production, as early mastery of recognition prevents comprehension barriers in texts containing foreign terms. This integration supports foundational literacy in classroom settings, focusing on everyday topics and short conversations. As of 2025, modern tools facilitate katakana acquisition through interactive digital platforms. Apps like Duolingo incorporate gamified lessons for practicing katakana alongside vocabulary, promoting consistent exposure. Spaced repetition systems such as Anki enable customizable flashcards for rote review of character grids and loanwords. The Japan Foundation's Minato e-Learning platform offers apps like KATAKANA Memory Hint, which uses mnemonic pictures and quizzes for fun, targeted practice.[63] Virtual reality simulations, including tools like You Can Kana on Steam, provide immersive environments for character recognition and meditative repetition.[64] Non-native learners commonly face challenges distinguishing visually similar katakana characters, such as ツ (tsu) and シ (shi), due to their angular forms and limited exposure compared to hiragana.[65] This often leads to errors in loanword recognition, exacerbating comprehension issues in reading materials.[66] To overcome these, educators recommend targeted drills on confusing pairs and immersion in cultural contexts, such as pop media where katakana denotes onomatopoeia or foreign brands, to reinforce practical usage.[66]Digital Representation
Computer Encodings
In the pre-Unicode era, katakana representation in computing relied on byte-oriented encoding standards developed primarily in Japan to handle Japanese scripts within the limitations of early 8-bit and multi-byte systems. The foundational standard was JIS X 0201, established in 1969 as JIS C 6220 and revised in subsequent years, which introduced half-width katakana as a single-byte encoding (occupying positions 0xA1 to 0xDF) to facilitate compact storage and display on early computers and terminals, where full-width forms would have been inefficient for text-heavy applications like teletext and basic word processing.[67][68] This half-width variant compressed katakana characters to match the width of Roman letters, enabling seamless integration with ASCII-compatible systems while supporting 63 basic katakana glyphs without diacritics.[67] As computing needs expanded in the 1980s and 1990s, multi-byte encodings like Shift JIS and EUC-JP built upon JIS X 0201 to incorporate katakana alongside kanji and hiragana. Shift JIS, introduced in 1983 by Microsoft and other vendors, maintained compatibility with JIS X 0201 by encoding half-width katakana as single-byte characters in the range 0xA1–0xDF, while full-width katakana from JIS X 0208 used two-byte sequences starting with a high byte of 0x81–0x9F or 0xE0–0xEF followed by a low byte.[67][68] This dual approach allowed Shift JIS to handle mixed-script documents efficiently in Windows environments and early web content, but it introduced ambiguities, as the single-byte half-width range overlapped with lead bytes for multi-byte characters, leading to parsing errors in unshifted contexts.[67] EUC-JP, standardized in the late 1980s for Unix systems, encoded katakana from JIS X 0201 and JIS X 0208 using two-byte sequences (e.g., 0x8EA1–0x8EDF for half-width katakana), forgoing single-byte support to ensure strict compatibility with ASCII and avoid overlaps, though this doubled the storage for half-width forms compared to Shift JIS.[67][68] In Taiwanese computing contexts during the 1990s, where Japanese influences persisted in business and media, some katakana characters were included in the Big5 encoding, a primarily Chinese standard, using two-byte sequences to accommodate loanwords and technical terms without full Unicode adoption. Compatibility challenges arose frequently in legacy software, such as word processors like Ichitaro and early Microsoft Word for Japanese, where mixing half-width and full-width katakana could cause mojibake—garbled text from misinterpretation of byte sequences across encodings—or alignment issues in vertical typesetting for print previews.[68] For instance, a half-width "カ" (0xCA in JIS X 0201) might render as a full-width form or garbage if decoded under EUC-JP without proper conversion, a common pitfall in 1990s file transfers between DOS-based Japanese PCs and international networks.[67] These issues underscored the fragmented nature of pre-Unicode Japanese text handling, prompting gradual shifts toward unified standards by the late 1990s.[68]Unicode and Input Methods
Katakana characters are encoded in the Unicode Standard within the dedicated Katakana block spanning U+30A0 to U+30FF, which includes 96 code points for the standard forms used in modern Japanese writing.[3] For example, the character ア (katakana letter A) is assigned the code point U+30A2. Additional support appears in the Halfwidth and Fullwidth Forms block (U+FF00–U+FFEF), where halfwidth katakana variants occupy U+FF65 to U+FF9F, facilitating compatibility with legacy systems that require narrower character widths.[69] The Phonetic Extensions block (U+31F0–U+31FF) provides 16 specialized small katakana letters exclusively for Ainu language orthography, such as ㇰ (U+31F0, katakana letter small ku).[4] Katakana encoding was introduced in Unicode version 1.0, released in October 1991, with 90 characters in the main block. Version 1.1 added four more to reach 94, and version 3.2 added two to reach the current 96.[70] As of Unicode 17.0, published on September 9, 2025, no major expansions have been made to the core Katakana block, though the Unicode Collation Algorithm (UTS #10) has seen iterative refinements to enhance sorting accuracy for Japanese scripts, including better handling of katakana in multilingual contexts.[71] These collation updates ensure consistent ordering in applications like databases and search engines, prioritizing phonetic equivalence over strict code point sequence.[72] Inputting katakana primarily relies on Input Method Editors (IMEs), software that converts romanized input (romaji) into Japanese characters. Popular IMEs include the Microsoft Japanese IME, which supports direct romaji-to-katakana conversion—for instance, typing "ai" followed by a mode switch yields アイ—and offers customizable dictionaries for loanwords.[73] Similarly, Google Input Tools enables seamless katakana entry via romaji, with options to toggle between hiragana and katakana modes during composition.[74] On mobile devices, swipe-based keyboards like those in Gboard or Apple's Japanese input integrate gesture recognition for faster romaji-to-katakana transformation, reducing keystrokes for common foreign terms. In web development, katakana's Unicode encoding ensures broad compatibility through HTML5's UTF-8 declaration and CSS font stacking, where fallback mechanisms likefont-family: 'Noto Sans JP', [sans-serif](/page/Sans-serif); render characters reliably across browsers if primary fonts lack Japanese glyph support. As of 2025, emerging trends incorporate AI enhancements in IMEs, such as predictive suggestions for katakana representations of loanwords, improving accuracy for neologisms by analyzing context and etymology.