Jump to content
Main menu
Main menu
move to sidebar
hide
Navigation
Main page
Recent changes
Random page
Help about MediaWiki
Special pages
Niidae Wiki
Search
Search
Appearance
Create account
Log in
Personal tools
Create account
Log in
Pages for logged out editors
learn more
Contributions
Talk
Editing
Katakana
(section)
Page
Discussion
English
Read
Edit
View history
Tools
Tools
move to sidebar
hide
Actions
Read
Edit
View history
General
What links here
Related changes
Page information
Appearance
move to sidebar
hide
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
==Computer encoding== In addition to fonts intended for Japanese text and Unicode catch-all fonts (like [[Arial Unicode MS]]), many fonts intended for Chinese (such as MS Song) and Korean (such as Batang) also include katakana. === Hiragana and katakana === {{Unreferenced section|date=September 2009}} {{main|Half-width kana}} In addition to the usual {{nihongo|full-width|ε ¨θ§|''zenkaku''}} display forms of characters, katakana has a second form, {{nihongo|[[Half-width kana|half-width]]|εθ§|''hankaku''}}. The half-width forms were originally associated with the [[JIS X 0201]] encoding. Although their display form is not specified in the standard, in practice they were designed to fit into the same rectangle of pixels as Roman letters to enable easy implementation on the computer equipment of the day. This space is narrower than the square space traditionally occupied by Japanese characters, hence the name "half-width". In this scheme, diacritics (dakuten and handakuten) are separate characters. When originally devised, the half-width katakana were represented by a single byte each, as in JIS X 0201, again in line with the capabilities of contemporary computer technology. In the late 1970s, two-byte character sets such as [[JIS X 0208]] were introduced to support the full range of Japanese characters, including katakana, hiragana and kanji. Their display forms were designed to fit into an approximately square array of pixels, hence the name "full-width". For backward compatibility, separate support for half-width katakana has continued to be available in modern multi-byte encoding schemes such as Unicode, by having two separate blocks of characters β one displayed as usual (full-width) katakana, the other displayed as half-width katakana. Although often said to be obsolete, the half-width katakana are still used in many systems and encodings. For example, the titles of [[mini disc]]s can only be entered in ASCII or half-width katakana, and half-width katakana are commonly used in computerized cash register displays, on shop receipts, and Japanese digital television and DVD subtitles. Several popular Japanese encodings such as [[Extended Unix Code|EUC-JP]], [[Unicode]] and [[Shift JIS]] have half-width katakana code as well as full-width. By contrast, [[ISO/IEC 2022|ISO-2022-JP]] has no half-width katakana, and is mainly used over [[SMTP]] and [[NNTP]]. ===Unicode=== {{Main|Katakana (Unicode block)|Halfwidth and Fullwidth Forms (Unicode block)|Enclosed CJK Letters and Months|l3=Enclosed CJK Letters and Months (Unicode block)|Katakana Phonetic Extensions|l4=Katakana Phonetic Extensions (Unicode block)|Kana Supplement|l5=Kana Supplement (Unicode block)|Small Kana Extension|l6=Small Kana Extension (Unicode block)|Kana Extended-A|l7=Kana Extended-A (Unicode block)|Kana Extended-B|l8=Kana Extended-B (Unicode block)|CJK Compatibility}} Katakana was added to the Unicode Standard in October, 1991 with the release of version 1.0. The Unicode block for (full-width) katakana is U+30A0βU+30FF. Encoded in this block along with the katakana are the ''nakaguro'' word-separation [[middle dot]], the ''chΕon'' vowel extender, the katakana [[iteration mark]]s, and a [[Typographic ligature|ligature]] of [[Koto (kana)|γ³γ]] sometimes used in vertical writing. {{unicode chart Katakana}} Half-width equivalents to the usual full-width katakana also exist in Unicode. These are encoded within the [[Halfwidth and Fullwidth Forms]] block (U+FF00βU+FFEF) (which also includes full-width forms of Latin characters, for instance), starting at U+FF65 and ending at U+FF9F (characters U+FF61βU+FF64 are half-width punctuation marks). This block also includes the half-width dakuten and handakuten. The full-width versions of these characters are found in the Hiragana block. {{Unicode chart Halfwidth and Fullwidth Forms|subset=katakana}} Circled katakana are code points U+32D0βU+32FE in the Enclosed CJK Letters and Months block (U+3200βU+32FF). A circled γ³ (n) is not included. {{Unicode chart Enclosed CJK Letters and Months|subset=katakana}} Extensions to Katakana for phonetic transcription of Ainu and other languages were added to the Unicode standard in March 2002 with the release of version 3.2. The Unicode block for Katakana Phonetic Extensions is U+31F0βU+31FF: {{Unicode chart Katakana Phonetic Extensions}} Historic and variant forms of Japanese kana characters were added to the Unicode standard in October 2010 with the release of version 6.0. The Unicode block for Kana Supplement is U+1B000βU+1B0FF: {{Unicode chart Kana Supplement}} The Unicode block for Small Kana Extension is U+1B130βU+1B16F: {{Unicode chart Small Kana Extension}} The Kana Extended-A Unicode block is U+1B100β1B12F. It contains [[hentaigana]] (non-standard hiragana) and historic kana characters. {{Unicode chart Kana Extended-A}} The Kana Extended-B Unicode block is U+1AFF0β1AFFF. It contains kana originally created by Japanese linguists to write [[Taiwanese Hokkien]] known as [[Taiwanese kana]]. {{Unicode chart Kana Extended-B}} The CJK Compatibility block contains in the range U+3330-3357 square versions of katakana words, usually measure units or abbreviations of loanwords: {{Unicode chart CJK Compatibility|subset=katakana}} Katakana in other Unicode blocks: * [[Dakuten]] and [[dakuten|handakuten]] diacritics are located in the [[Hiragana (Unicode block)|Hiragana block]]: ** U+3099 COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK (non-spacing dakuten): γ ** U+309A COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK (non-spacing handakuten): γ ** U+309B KATAKANA-HIRAGANA VOICED SOUND MARK (spacing dakuten): γ ** U+309C KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK (spacing handakuten): γ * Two katakana-based [[emoji]] are in the [[Enclosed Ideographic Supplement|Enclosed Ideographic Supplement block]]: ** U+1F201 SQUARED KATAKANA KOKO ('here' sign): π ** U+1F202 SQUARED KATAKANA SA ('service' sign): π * A katakana-based Japanese TV symbol from the [[ARIB STD-B24]] standard is in the [[Enclosed Ideographic Supplement|Enclosed Ideographic Supplement block]]: ** U+1F213 SQUARED KATAKANA DE ('data broadcasting service linked with a main program' symbol): π Furthermore, as of Unicode {{Unicode version|version=16.0}}, the following combinatory sequences have been explicitly named, despite having no precomposed symbols in the katakana block. Font designers may want to optimize the display of these composed glyphs. Some of them are mostly used for writing the [[Ainu language]], the others are called {{ill|Dakuten#Phonetic shifts|lt=''bidakuon''|ja|ιΌ»ζΏι³}} in Japanese. Other, arbitrary combinations with U+309A handakuten are also possible. {| class="wikitable nounderlines" style="border-collapse:collapse;background:#FFFFFF;font-size:large;text-align:center" | colspan="17" style="background:#F8F8F8;font-size:small" | '''Katakana named sequences'''<br />[https://unicode.org/Public/UNIDATA/NamedSequences.txt Unicode Named Character Sequences Database] |- style="background:#F8F8F8;font-size:small" || Sequence name || colspan="2" style="width:20pt" | Codepoints || style="width:20pt" | Glyph |- | style="text-align:left;font-size:small;"| KATAKANA LETTER BIDAKUON NGA || style="font-size:small;" | U+30AB || style="font-size:small;" | U+309A || γ«γ |- | style="text-align:left;font-size:small;"| KATAKANA LETTER BIDAKUON NGI || style="font-size:small;" | U+30AD || style="font-size:small;" | U+309A || γγ |- | style="text-align:left;font-size:small;"| KATAKANA LETTER BIDAKUON NGU || style="font-size:small;" | U+30AF || style="font-size:small;" | U+309A || γ―γ |- | style="text-align:left;font-size:small;"| KATAKANA LETTER BIDAKUON NGE || style="font-size:small;" | U+30B1 || style="font-size:small;" | U+309A || γ±γ |- | style="text-align:left;font-size:small;"| KATAKANA LETTER BIDAKUON NGO || style="font-size:small;" | U+30B3 || style="font-size:small;" | U+309A || γ³γ |- | style="text-align:left;font-size:small;"| KATAKANA LETTER AINU CE || style="font-size:small;" | U+30BB || style="font-size:small;" | U+309A || γ»γ |- | style="text-align:left;font-size:small;"| KATAKANA LETTER AINU TU || style="font-size:small;" | U+30C4 || style="font-size:small;" | U+309A || γγ |- | style="text-align:left;font-size:small;"| KATAKANA LETTER AINU TO || style="font-size:small;" | U+30C8 || style="font-size:small;" | U+309A || γγ |- | style="text-align:left;font-size:small;"| KATAKANA LETTER AINU P || style="font-size:small;" | U+31F7 || style="font-size:small;" | U+309A || γ·γ |}
Summary:
Please note that all contributions to Niidae Wiki may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see
Encyclopedia:Copyrights
for details).
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Search
Search
Editing
Katakana
(section)
Add topic