Jump to content
Main menu
Main menu
move to sidebar
hide
Navigation
Main page
Recent changes
Random page
Help about MediaWiki
Special pages
Niidae Wiki
Search
Search
Appearance
Create account
Log in
Personal tools
Create account
Log in
Pages for logged out editors
learn more
Contributions
Talk
Editing
ISO/IEC 8859
Page
Discussion
English
Read
Edit
View history
Tools
Tools
move to sidebar
hide
Actions
Read
Edit
View history
General
What links here
Related changes
Page information
Appearance
move to sidebar
hide
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
{{Short description|Series of standards for 8-bit character encodings}} {{Use Oxford spelling|date=December 2011}} {{Infobox character encoding |name = ISO 8859 encoding family |alias = |mime = |image = |caption = |standard = ISO/IEC 8859 |extends = [[ASCII]] |prev = [[ISO/IEC 646]] |next = [[ISO/IEC 10646]] ([[Unicode]]) |otherrelated = [[ISO/IEC 10367]], [[Windows-125x]] |classification = 8-bit [[extended ASCII]], [[ISO/IEC 4873]] level 1 }} '''ISO/IEC 8859''' is a joint [[International Organization for Standardization|ISO]] and [[International Electrotechnical Commission|IEC]] series of standards for 8-bit [[character encoding]]s. The series of standards consists of numbered parts, such as [[ISO/IEC 8859-1]], [[ISO/IEC 8859-2]], etc. There are 15 parts, excluding the abandoned [[ISO/IEC 8859-12]].<ref>{{Citation |last1=Chaudhuri |first1=Arindam |title=Optical Character Recognition Systems for French Language |date=2016-12-24 |url=http://dx.doi.org/10.1007/978-3-319-50252-6_5 |work=Optical Character Recognition Systems for Different Languages with Soft Computing |pages=109–136 |access-date=2023-12-04 |place=Cham |publisher=Springer International Publishing |isbn=978-3-319-50251-9 |last2=Mandaviya |first2=Krupa |last3=Badelia |first3=Pratixa |last4=Ghosh |first4=Soumya K.|doi=10.1007/978-3-319-50252-6_5 }}</ref> The ISO working group maintaining this series of standards has been disbanded. ISO/IEC 8859 parts 1, 2, 3, and 4 were originally [[Ecma International]] standard '''ECMA-94'''. ==Introduction== While the bit patterns of the 95 [[graphic character|printable]] [[ASCII]] characters are sufficient to exchange information in modern [[English language|English]], most other languages that use [[Latin script|Latin alphabet]]s need additional symbols not covered by ASCII. ISO/IEC 8859 sought to remedy this problem by utilizing the eighth bit in an 8-bit [[byte]] to allow positions for another 96 printable characters. Early encodings were limited to 7 bits because of restrictions of some data transmission protocols, and partially for historical reasons. However, more characters were needed than could fit in a single 8-bit character encoding, so several mappings were developed, including at least ten suitable for various Latin alphabets. The ISO/IEC 8859 standard parts only define printable characters, although they explicitly set apart the byte ranges 0x00–1F and 0x7F–9F as "combinations that do not represent graphic characters" (i.e. which are reserved for use as [[control character]]s) in accordance with [[ISO/IEC 4873]]; they were designed to be used in conjunction with a separate standard defining the control functions associated with these bytes, such as [[ISO 6429]] or [[ISO 6630]].<ref>{{citation|mode=cs1 |quotation=This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. [...] The shaded positions in the code table correspond to bit combinations that do not represent graphic characters. Their use is outside the scope of ISO/IEC 8859; it is specified in other International Standards, for example ISO/IEC 6429. |url=http://www.open-std.org/JTC1/sc2/wg3/docs/n411.pdf |title=Final Text of DIS 8859-1, 8-bit single-byte coded graphic character sets—Part 1: Latin alphabet No.1 |author=ISO/IEC JTC 1/SC 2/WG 3 |author-link=ISO/IEC JTC 1/SC 2 |id=[[ISO]]/[[International Electrotechnical Commission|IEC]] [[International Organization for Standardization#Standardization process|FDIS]] 8859-1:1998; JTC1/SC2/N2988; WG3/N411 |date=1998-02-12}}</ref> To this end a series of encodings registered with the [[Internet Assigned Numbers Authority|IANA]] add the [[C0 and C1 control codes|C0]] control set (control characters mapped to bytes 0 to 31) from [[ISO/IEC 646|ISO 646]] and the [[C0 and C1 control codes|C1]] control set (control characters mapped to bytes 128<!-- Character 127, the delete character, does not belong to C1 --> to 159) from ISO 6429, resulting in full 8-bit character maps with most, if not all, bytes assigned. These sets have ISO-8859-''n'' as their preferred [[MIME]] name or, in cases where a preferred MIME name is not specified, their canonical name. Many people use the terms ISO/IEC 8859-''n'' and ISO-8859-''n'' interchangeably. [[ISO/IEC 8859-11]] did not get such a charset assigned, presumably because it was almost identical to [[TIS 620]]. ==Characters== The ISO/IEC 8859 standard is designed for reliable information exchange, not [[typography]]; the standard omits symbols needed for high-quality typography, such as optional ligatures, curly quotation marks, dashes, etc. As a result, high-quality typesetting systems often use proprietary or idiosyncratic extensions on top of the [[ASCII]] and ISO/IEC 8859 standards, or use [[Unicode]] instead. An inexact rule based on practical experience states that if a character or symbol was not already part of a widely used data-processing character set and was also not usually provided on typewriter keyboards for a national language, it did not get in. Hence the directional double quotation marks ''«'' and ''»'' used for some European languages were included, but not the directional double quotation marks ''“'' and ''”'' used for English and some other languages. French did not get its ''œ'' and ''Œ'' ligatures because they could be typed as 'oe'. Likewise, ''Ÿ'', needed for all-caps text, was dropped as well.<ref name="Haralambous_2007"/><ref name="Andre_2003"/><ref name="Andre_1996"/> Albeit under different codepoints, these three characters were later reintroduced with [[ISO/IEC 8859-15]] in 1999, which also introduced the new [[euro sign]] character €. Likewise Dutch did not get the ''ij'' and ''IJ'' letters, because Dutch speakers had become used to typing these as two letters instead. Romanian did not initially get its ''Ș''/''ș'' and ''Ț''/''ț'' ([[Comma (punctuation)#Diacritical usage|with comma]]) letters, because these letters were initially unified with ''Ş''/''ş'' and ''Ţ''/''ţ'' ([[Cedilla|with cedilla]]) by the [[Unicode Consortium]], considering the shapes with comma beneath to be [[typographic approximation|glyph variants]] of the shapes with cedilla. However, the letters with explicit comma below were later added to the Unicode standard and are also in [[ISO/IEC 8859-16]]. Most of the ISO/IEC 8859 encodings provide diacritic marks required for various European languages using the Latin script. Others provide non-Latin alphabets: [[Greek alphabet|Greek]], [[Cyrillic script|Cyrillic]], [[Hebrew alphabet|Hebrew]], [[Arabic alphabet|Arabic]] and [[Thai alphabet|Thai]]. Most of the encodings contain only [[spacing characters]], although the Thai, Hebrew, and Arabic ones do also contain [[combining character]]s. The standard makes no provision for the scripts of East Asian languages (''[[CJK characters|CJK]]''), as their ideographic [[writing system]]s require many thousands of code points. Although it uses Latin based characters, [[Vietnamese language|Vietnamese]] does not fit into 96 positions (without using combining diacritics such as in [[Windows-1258]]) either. Each Japanese syllabic alphabet (hiragana or katakana, see [[Kana]]) would fit, as in [[JIS X 0201]], but like several other alphabets of the world they are not encoded in the ISO/IEC 8859 system. ==The parts of ISO/IEC 8859== ISO/IEC 8859 is divided into the following parts: {| class="wikitable" ! Part ! Name ! Revisions ! Other standards ! Description |- ! [[ISO/IEC 8859-1|Part 1]] | ''Latin-1''<br />''Western European'' | [[ISO/IEC 8859-1:1987|1987]], [[ISO/IEC 8859-1:1998|1998]] | [[ECMA-94]] ([[ISO/IEC 8859-1#History|1985]], 1986) | Perhaps the most widely used part of ISO/IEC 8859, covering most Western European languages: [[Danish language|Danish]] (partial),<ref group="nb" name="NB_Missing_vowels"/> [[Dutch language|Dutch]],<ref group="nb" name="NB_IJ"/> [[English language|English]], [[Faroese language|Faeroese]], [[Finnish language|Finnish]] (partial),<ref group="nb" name="two"/> [[French language|French]] (partial),<ref group="nb" name="two"/> [[German language|German]], [[Icelandic language|Icelandic]], [[Irish language|Irish]], [[Italian language|Italian]], [[Norwegian language|Norwegian]], [[Portuguese language|Portuguese]], [[Romansh language|Rhaeto-Romanic]], [[Scottish Gaelic language|Scottish Gaelic]], [[Spanish language|Spanish]], [[Catalan language|Catalan]], and [[Swedish (language)|Swedish]]. Languages from other parts of the world are also covered, including: Eastern European [[Albanian language|Albanian]], Southeast Asian [[Indonesian language|Indonesian]], as well as the African languages [[Afrikaans language|Afrikaans]] and [[Swahili language|Swahili]]. A modification of [[DEC MCS]]; the first (1985) standard version at the ECMA level lacked the [[times sign]] and [[division sign|division obelus]], which were added the next year. The missing [[euro sign]] and capital ''Ÿ'' are in the revised version ISO/IEC 8859-15 (see below). The corresponding IANA character set is ISO-8859-1. |- ! [[ISO/IEC 8859-2|Part 2]] | ''Latin-2'' <br />''Central European'' | [[ISO/IEC 8859-2:1987|1987]], [[ISO/IEC 8859-2:1999|1999]] | rowspan=3 | [[ECMA-94]] (1986)<ref group="nb">The 1985 edition includes only a version of ISO-8859-1.</ref> | Supports those Central and Eastern European languages that use the Latin alphabet, including [[Bosnian language|Bosnian]], [[Polish language|Polish]], [[Croatian language|Croatian]], [[Czech language|Czech]], [[Slovak language|Slovak]], [[Slovene language|Slovene]], [[Serbian language|Serbian]], and [[Hungarian language|Hungarian]]. The missing [[euro sign]] can be found in version ISO/IEC 8859-16. |- ! [[ISO/IEC 8859-3|Part 3]] | ''Latin-3''<br />''South European'' | [[ISO/IEC 8859-3:1988|1988]], [[ISO/IEC 8859-3:1999|1999]] | [[Turkish language|Turkish]], [[Maltese language|Maltese]], and [[Esperanto]]. Largely superseded by [[ISO/IEC 8859-9]] for Turkish. |- ! [[ISO/IEC 8859-4|Part 4]] | ''Latin-4''<br />''North European'' | [[ISO/IEC 8859-4:1988|1988]], [[ISO/IEC 8859-4:1998|1998]] | [[Estonian language|Estonian]], [[Latvian language|Latvian]], [[Lithuanian language|Lithuanian]], [[Greenlandic language|Greenlandic]], and [[Sami languages|Sami]]. |- ! [[ISO/IEC 8859-5|Part 5]] | ''Latin/Cyrillic'' | [[ISO/IEC 8859-5:1988|1988]], [[ISO/IEC 8859-5:1999|1999]] | [[ECMA-113]] (1988, 1999)<ref group="nb">The 1986 edition defines [[KOI8-E]], which is an entirely different encoding.</ref> | Covers mostly Slavic languages that use a [[Cyrillic alphabets|Cyrillic alphabet]], including [[Belarusian language|Belarusian]], [[Bulgarian language|Bulgarian]], [[Macedonian language|Macedonian]], [[Russian language|Russian]], [[Serbian language|Serbian]], and [[Ukrainian language|Ukrainian]] (partial).<ref group="nb" name="NB_Ghe"/> |- ! [[ISO/IEC 8859-6|Part 6]] | ''Latin/Arabic'' | [[ISO/IEC 8859-6:1987|1987]], [[ISO/IEC 8859-6:1999|1999]] | {{plainlist| * [[ASMO 708]] (1986) * [[ECMA-114]] (1986, 2000) }} | Covers the most common [[Arabic language]] characters. Does not support other languages using the [[Arabic script]]. Needs to be [[BiDi]] and [[Cursive#Arabic|cursive joining]] processed for display. |- ! [[ISO/IEC 8859-7|Part 7]] | ''Latin/Greek'' | [[ISO/IEC 8859-7:1987|1987]], [[ISO/IEC 8859-7:2003|2003]] | {{plainlist| * [[ELOT 928]] (1986) * [[ECMA-118]] (1986) }} | Covers the modern [[Greek language]] ([[monotonic orthography]]). Can also be used for Ancient [[Greek language|Greek]] written without accents or in monotonic orthography, but lacks the diacritics for [[polytonic orthography]]. These were introduced with Unicode. Updated 2003 to add the [[euro sign]], [[Modern drachma|drachma]] sign and spacing [[ypogegrammeni]]. |- ! [[ISO/IEC 8859-8|Part 8]] | ''Latin/Hebrew'' | [[ISO/IEC 8859-8:1988|1988]], [[ISO/IEC 8859-8:1999|1999]] | {{plainlist| * [[ECMA-121]] (1987, 2000) * [[SI 1311]] ([[ISO/IEC 8859-8#2002 Israeli Standard extensions|2002]]) }} | Covers the modern [[Hebrew alphabet]] as used in Israel. In practice two different encodings exist, logical order (needs to be [[BiDi]] processed for display) and visual (left-to-right) order (in effect, after bidi processing and line breaking). Updated 1999 to add {{ctrl|LRM}} and {{ctrl|RLM}}. Updated at national standard level in 2002 to add euro and shekel signs and more bidirectional format effectors; the 2002 additions were never incorporated back into the ISO standard version. |- ! [[ISO/IEC 8859-9|Part 9]] | ''Latin-5''<br />''Turkish'' | [[ISO/IEC 8859-9:1989|1989]], [[ISO/IEC 8859-9:1999|1999]] | {{plainlist| * [[TS 5881]] (1988) * [[ECMA-128]] (1988, 1999) }} | Largely the same as ISO/IEC 8859-1, replacing the rarely used [[Icelandic language|Icelandic]] letters with [[Turkish language|Turkish]] ones. |- ! [[ISO/IEC 8859-10|Part 10]] | ''Latin-6'' <br />''Nordic'' | [[ISO/IEC 8859-10:1992|1992]], [[ISO/IEC 8859-10:1998|1998]] | [[ECMA-144]] (1990, 1992, 2000) | A rearrangement of Latin-4. Considered more useful for Nordic languages. Baltic languages use Latin-4 more. |- ! [[ISO/IEC 8859-11|Part 11]] | ''Latin/Thai'' | [[ISO/IEC 8859-11:2001|2001]] | [[TIS-620]] (1986, 1990) | Contains characters needed for the [[Thai language]]. First revision established in 1986 at national standard level as [[TIS 620]]. Elevated to ISO standard status as a part of ISO 8859 in 2001, with the addition of a [[non-breaking space]]. |- ! <s>Part 12</s> | ''Latin/Devanagari'' | N/A | - | Originally proposed to support the [[Celtic languages]],<ref>{{cite web|last1=Everson|first1=Michael|title=Proposed ISO 8859-12 (later 14)|url=http://www.evertype.com/pics/emonopics/latin7.jpg}}</ref><ref>{{cite web |author-first=Roman |author-last=Czyborra |title=The ISO 8859 Alphabet Soup |date=1997-10-12 |url=http://wwwwbs.cs.tu-berlin.de/user/czyborra/charsets/ |url-status=dead |archive-url=https://web.archive.org/web/20000817190215/http://wwwwbs.cs.tu-berlin.de/user/czyborra/charsets/ |archive-date=2000-08-17}} (NB. "Celtic" note on old Czyborra page.)</ref> then slated for Latin/[[Devanagari]],<ref>{{cite web| title=ISO-8859-10; registration of new charset values; error in MIME draft |author-first=Olle |author-last=Jarnefors |publisher=Royal Institute of Technology (KTH) |date=1996-04-11 |url=http://mail.apps.ietf.org/ietf/charsets/msg00190.html |url-status=dead |archive-url=https://web.archive.org/web/20120204055222/http://mail.apps.ietf.org/ietf/charsets/msg00190.html |archive-date=2012-02-04}} (NB. Note about forthcoming "Devanagari" standard part on IETF charsets mailing list.)</ref> but abandoned in 1997, during the 12th meeting of ISO/IEC JTC 1/SC 2/WG 3.<ref>{{cite web |title=Resolutions of the 12th Meeting of ISO/IEC JTC 1/SC 2/WG 3, Iraklion-Crete, Greece, 1997-07-04, 07 |location=Iraklion-Crete, Greece |publisher=ISO/IEC JTC 1/SC 2 N 2933, ISO/IEC JTC 1/SC 2/WG 3 N 401 |date=1997-07-04 |url=http://anubis.dkuug.dk/jtc1/sc2/open/2n2933.pdf |url-status=dead |archive-url=https://web.archive.org/web/20110607062301/http://anubis.dkuug.dk/jtc1/sc2/open/2n2933.pdf |archive-date=2011-06-07 |quote=WG 3 resolves to suspend any activities on this subject until general agreement on combining characters is obtained and until the further contributions are received.}}</ref> The Celtic proposal was changed to ISO 8859-14, with part 12 possibly being reserved for ISCII Indian.<ref>{{cite web |author-first=Roman |author-last=Czyborra |title=The ISO 8859 Alphabet Soup |date=1998-12-01 |url=http://czyborra.com/charsets/iso8859.html |url-status=live |archive-url=https://web.archive.org/web/20160320021904/http://czyborra.com/charsets/iso8859.html |archive-date=2016-03-20}} (NB. "ISCII" note on new Czyborra page.)</ref> |- ! [[ISO/IEC 8859-13|Part 13]] | ''Latin-7'' <br />''Baltic Rim'' | [[ISO/IEC 8859-13:1998|1998]] | - | Added some characters for Baltic languages which were missing from Latin-4 and Latin-6. Related to the earlier-published{{refn|group=nb|Published 1995, registered 1996.<ref>{{cite web |url=https://www.iana.org/assignments/charset-reg/windows-1257 |title=Registration of new MIME charset: Windows-1257 |first=Katya |last=Lazhintseva |date=1996-05-03 |publisher=IANA}}</ref>}} [[Windows-1257]]. |- ! [[ISO/IEC 8859-14|Part 14]] | ''Latin-8'' <br />''Celtic'' | [[ISO/IEC 8859-14:1998|1998]] | - | Covers Celtic languages such as [[Scottish Gaelic language|Gaelic]] and the [[Breton language]]. Welsh letters correspond to the earlier (1994) [[ISO-IR-182]]. |- ! [[ISO/IEC 8859-15|Part 15]] | ''Latin-9'' | [[ISO/IEC 8859-15:1999|1999]] | - | A revision of 8859-1 that removes some little-used symbols, replacing them with the [[euro sign]] ''€'' and the letters ''Š'', ''š'', ''Ž'', ''ž'', ''Œ'', ''œ'', and ''Ÿ'', which completes the coverage of [[French language|French]], [[Finnish language|Finnish]] and [[Estonian language|Estonian]]. |- ! [[ISO/IEC 8859-16|Part 16]] | ''Latin-10'' <br />''South-Eastern European'' | [[ISO/IEC 8859-16:2001|2001]] | [[SR 14111]] (1998) | Intended for [[Albanian language|Albanian]], [[Croatian language|Croatian]], [[Hungarian language|Hungarian]], [[Italian language|Italian]], [[Polish language|Polish]], [[Romanian language|Romanian]] and [[Slovene language|Slovene]], but also Finnish, French, German and [[Irish language|Irish Gaelic]] (new orthography). The focus lies more on letters than symbols. The [[Currency sign (generic)|generic currency sign]] is replaced with the [[euro sign]]. |} Each part of ISO/IEC 8859 is designed to support languages that often borrow from each other, so the characters needed by each language are usually accommodated by a single part. However, there are some characters and language combinations that are not accommodated without transcriptions. Efforts were made to make conversions as smooth as possible. For example, German has all of its seven special characters at the same positions in all Latin variants (1–4, 9, 10, 13–16), and in many positions the characters only differ in the diacritics between the sets. In particular, variants 1–4 were designed jointly, and have the property that every encoded character appears either at a given position or not at all. ===Table=== {| class="wikitable" style="text-align:center;" |+Comparison of the various parts (1–16) of ISO/IEC 8859 ![[Binary numeral system|Binary]]!![[Octal|Oct]]!![[Decimal|Dec]]!![[Hexadecimal|Hex]] ![[ISO/IEC 8859-1|1]]!![[ISO/IEC 8859-2|2]]!![[ISO/IEC 8859-3|3]]!![[ISO/IEC 8859-4|4]]!![[ISO/IEC 8859-5|5]]!![[ISO/IEC 8859-6|6]]!![[ISO/IEC 8859-7|7]]!![[ISO/IEC 8859-8|8]]!![[ISO/IEC 8859-9|9]]!![[ISO/IEC 8859-10|10]]!![[ISO/IEC 8859-11|11]]!![[ISO/IEC 8859-13|13]]!![[ISO/IEC 8859-14|14]]!![[ISO/IEC 8859-15|15]]!![[ISO/IEC 8859-16|16]] |- !1010 0000!!240!!160!!A0 |colspan="16" |[[Non-breaking space]] (NBSP) |- !1010 0001!!241!!161!!A1 |[[¡]]||[[Ą]]||[[Ħ]]||[[Ą]]||[[Yo (Cyrillic)|Ё]]||style="background-color:#ccffcc;"| ||[[‘]]||style="background-color:#ccffcc;"| ||[[¡]]||[[Ą]]||[[ก]]||[[”]]||[[Ḃ]]||[[¡]]||[[Ą]] |- !1010 0010!!242!!162!!A2 |[[¢]]||colspan="2"|[[˘]]||[[ĸ]]||[[Dje|Ђ]]||style="background-color:#ccffcc;"| ||[[Apostrophe|’]]||colspan="2"|[[¢]]||[[Ē]]||[[ข]]||[[¢]]||[[ḃ]]||[[¢]]||[[ą]] |- !1010 0011!!243!!163!!A3 |[[Pound sign|£]]||[[Ł]]||[[Pound sign|£]]||[[Ŗ]]||[[Gje|Ѓ]]||style="background-color:#ccffcc;"| ||colspan="3"|[[Pound sign|£]]||[[Ģ]]||[[ฃ]]||colspan="3"|[[Pound sign|£]]||[[Ł]] |- !1010 0100 !!244!!164!!A4 |colspan="4"|[[¤]]||[[Ukrainian Ye|Є]]||[[¤]]||style="background-color:#ffff99;"|[[euro sign|€]]||colspan="2"|[[¤]]||[[Ī]]||[[ค]]||[[¤]]||[[Ċ]]||colspan="2"|[[euro sign|€]] |- !1010 0101!!245!!165!!A5 |[[¥]]||[[Ľ]]||style="background-color:#ccffcc;"| ||[[Ĩ]]||[[Dze|Ѕ]]||style="background-color:#ccffcc;"| ||style="background-color:#ffff99;"|[[Modern drachma|₯]]||colspan="2"|[[¥]]||[[Ĩ]]||[[ฅ]]||[[„]]||[[ċ]]||[[¥]]||[[„]] |- !1010 0110!!246!!166!!A6 |[[¦]]||[[Ś]]||[[Ĥ]]||[[Ļ]]||[[Dotted I (Cyrillic)|І]]||style="background-color:#ccffcc;"| ||colspan="3"|[[¦]]||[[Ķ]]||[[ฆ]]||[[¦]]||[[Ḋ]]||colspan="2"|[[Š]] |- !1010 0111!!247!!167!!A7 |colspan="4"|[[§]]||[[Yi (Cyrillic)|Ї]]||style="background-color:#ccffcc;"| ||colspan="4"|[[§]]||[[ง]]||colspan="4"|[[§]] |- !1010 1000!!250!!168!!A8 |colspan="4"|[[¨]]||[[Je (Cyrillic)|Ј]]||style="background-color:#ccffcc;"| ||colspan="3"|[[¨]]||[[Ļ]]||[[จ]]||[[Ø]]||[[Ẁ]]||colspan="2"|[[š]] |- !1010 1001!!251!!169!!A9 |[[©]]||[[Š]]||[[İ]]||[[Š]]||[[Lje|Љ]]||style="background-color:#ccffcc;"| ||colspan="3"|[[©]]||[[Đ]]||[[ฉ]]||colspan="4"|[[©]] |- !1010 1010!!252!!170!!AA |[[ª]]||colspan="2"|[[Ş]]||[[Ē]]||[[Nje|Њ]]||style="background-color:#ccffcc;"| ||style="background-color:#ffff99;"|[[ͺ]]||[[×]]||[[ª]]||[[Š]]||[[ช]]||[[Ŗ]]||[[Ẃ]]||[[ª]]||[[Ș]] |- !1010 1011!!253!!171!!AB |[[«]]||[[Ť]]||[[Ğ]]||[[Ģ]]||[[Tshe|Ћ]]||style="background-color:#ccffcc;"| ||colspan="3"|[[«]]||[[Ŧ]]||[[ซ]]||[[«]]||[[ḋ]]||colspan="2"|[[«]] |- !1010 1100!!254!!172!!AC |[[¬]]||[[Ź]]||[[Ĵ]]||[[Ŧ]]||[[Kje|Ќ]]||[[،]]||colspan="3"|[[¬]]||[[Ž]]||[[ฌ]]||[[¬]]||[[Ỳ]]||[[¬]]||[[Ź]] |- !1010 1101!!255!!173!!AD |colspan=10 align=center|[[Soft hyphen]] (SHY)||[[ญ]]||colspan=4 align=center|[[Soft hyphen|SHY]] |- !1010 1110!!256!!174!!AE |[[®]]||[[Ž]]||style="background-color:#ccffcc;"| ||[[Ž]]||[[Short U (Cyrillic)|Ў]]||style="background-color:#ccffcc;"| ||style="background-color:#ccffcc;"| ||colspan="2"|[[®]]||[[Ū]]||[[ฎ]]||colspan="3"|[[®]]||[[ź]] |- !1010 1111!!257!!175!!AF |[[¯]]||colspan="2"|[[Ż]]||[[¯]]||[[Dzhe|Џ]]||style="background-color:#ccffcc;"| ||[[―]]||colspan="2"|[[¯]]||[[Ŋ]]||[[ฏ]]||[[Æ]]||[[Ÿ]]||[[¯]]||[[Ż]] |- !1011 0000!!260!!176!!B0 |colspan="4"|[[°]]||[[A (Cyrillic)|А]]||style="background-color:#ccffcc;"| ||colspan="4"|[[°]]||[[ฐ]]||[[°]]||[[Ḟ]]||colspan="2"|[[°]] |- !1011 0001!!261!!177!!B1 |[[±]]||[[ą]]||[[ħ]]||[[ą]]||[[Be (Cyrillic)|Б]]||style="background-color:#ccffcc;"| ||colspan="3"|[[±]]||[[ą]]||[[ฑ]]||[[±]]||[[ḟ]]||colspan="2"|[[±]] |- !1011 0010!!262!!178!!B2 |[[²]]||[[˛]]||[[²]]||[[˛]]||[[Ve (Cyrillic)|В]]||style="background-color:#ccffcc;"| ||colspan="3"|[[²]]||[[ē]]||[[ฒ]]||[[²]]||[[Ġ]]||[[²]]||[[Č]] |- !1011 0011!!263!!179!!B3 |[[³]]||[[ł]]||[[³]]||[[ŗ]]||[[Ge (Cyrillic)|Г]]||style="background-color:#ccffcc;"| ||colspan="3"|[[³]]||[[ģ]]||[[ณ]]||[[³]]||[[ġ]]||[[³]]||[[ł]] |- !1011 0100!!264!!180!!B4 |colspan="4"|[[´]]||[[De (Cyrillic)|Д]]||style="background-color:#ccffcc;"| ||[[΄]]||colspan="2"|[[´]]||[[ī]]||[[ด]]||[[“]]||[[Ṁ]]||colspan="2"|[[Ž]] |- !1011 0101!!265!!181!!B5 |{{not a typo|[[µ]]}}||[[ľ]]||{{not a typo|[[µ]]}}||[[ĩ]]||[[Ye (Cyrillic)|Е]]||style="background-color:#ccffcc;"| ||[[΅]]||colspan="2"|{{not a typo|[[µ]]}}||[[ĩ]]||[[ต]]||{{not a typo|[[µ]]}}||[[ṁ]]||{{not a typo|[[µ]]}}||[[”]] |- !1011 0110!!266!!182!!B6 |[[¶]]||[[ś]]||[[ĥ]]||[[ļ]]||[[Zhe (Cyrillic)|Ж]]||style="background-color:#ccffcc;"| ||[[Ά]]||colspan="2"|[[¶]]||[[ķ]]||[[ถ]]||colspan="4"|[[¶]] |- !1011 0111!!267!!183!!B7 |[[·]]||[[ˇ]]||[[·]]||[[ˇ]]||[[Ze (Cyrillic)|З]]||style="background-color:#ccffcc;"| ||colspan="4"|[[·]]||[[ท]]||[[·]]||[[Ṗ]]||colspan="2"|[[·]] |- !1011 1000!!270!!184!!B8 |colspan="4"|[[¸]]||[[I (Cyrillic)|И]]||style="background-color:#ccffcc;"| ||[[Έ]]||colspan="2"|[[¸]]||[[ļ]]||[[ธ]]||[[ø]]||[[ẁ]]||colspan="2"|[[ž]] |- !1011 1001!!271!!185!!B9 |[[¹]]||[[š]]||[[ı]]||[[š]]||[[Short I|Й]]||style="background-color:#ccffcc;"| ||[[Ή]]||colspan="2"|[[¹]]||[[đ]]||[[น]]||[[¹]]||[[ṗ]]||[[¹]]||[[č]] |- !1011 1010!!272!!186!!BA |[[º]]||colspan="2"|[[ş]]||[[ē]]||[[Ka (Cyrillic)|К]]||style="background-color:#ccffcc;"| ||[[Ί]]||[[÷]]||[[º]]||[[š]]||[[บ]]||[[ŗ]]||[[ẃ]]||[[º]]||[[ș]] |- !1011 1011!!273!!187!!BB |[[»]]||[[ť]]||[[ğ]]||[[ģ]]||[[El (Cyrillic)|Л]]||[[؛]]||colspan="3"|[[»]]||[[ŧ]]||[[ป]]||[[»]]||[[Ṡ]]||colspan="2"|[[»]] |- !1011 1100!!274!!188!!BC |[[Fraction|¼]]||[[ź]]||[[ĵ]]||[[ŧ]]||[[Em (Cyrillic)|М]]||style="background-color:#ccffcc;"| ||[[Ό]]||colspan="2"|¼||[[ž]]||[[ผ]]||¼||[[ỳ]]||colspan="2"|[[Œ]] |- !1011 1101!!275!!189!!BD |[[½]]||[[˝]]||[[½]]||[[Ŋ]]||[[En (Cyrillic)|Н]]||style="background-color:#ccffcc;"| ||colspan="3"|½||[[―]]||[[ฝ]]||½||[[Ẅ]]||colspan="2"|[[œ]] |- !1011 1110!!276!!190!!BE |[[fraction|¾]]||[[ž]]||style="background-color:#ccffcc;"| ||[[ž]]||[[O (Cyrillic)|О]]||style="background-color:#ccffcc;"| ||[[Ύ]]||colspan="2"|¾||[[ū]]||[[พ]]||¾||[[ẅ]]||colspan="2"|[[Ÿ]] |- !1011 1111!!277!!191!!BF |[[¿]]||colspan="2"|[[ż]]||[[ŋ]]||[[Pe (Cyrillic)|П]]||[[؟]]||[[Ώ]]||style="background-color:#ccffcc;"| ||[[¿]]||[[ŋ]]||[[ฟ]]||[[æ]]||[[ṡ]]||[[¿]]||[[ż]] |- !1100 0000!!300!!192!!C0 |[[À]]||[[Ŕ]]||[[À]]||[[Ā]]||[[Er (Cyrillic)|Р]]||style="background-color:#ccffcc;"| ||[[Ϊ́|ΐ]]|| style="background-color:#ccffcc;" | ||[[À]]||[[Ā]]||[[ภ]]||[[Ą]]||colspan="3"|[[À]] |- !1100 0001!!301!!193!!C1 |colspan="4"|[[Á]]||[[Es (Cyrillic)|С]]||[[ء]]||[[Α]]||style="background-color:#ccffcc;"| ||colspan="2"|[[Á]]||[[ม]]||[[Į]]||colspan="3"|[[Á]] |- !1100 0010!!302!!194!!C2 |colspan="4"|[[Â]]||[[Te (Cyrillic)|Т]]||[[آ]]||[[Beta (letter)|Β]]||style="background-color:#ccffcc;"| ||colspan="2"|[[Â]]||[[ย]]||[[Ā]]||colspan="3"|[[Â]] |- !1100 0011!!303!!195!!C3 |[[Ã]]||[[Ă]]||style="background-color:#ccffcc;"| ||[[Ã]]||[[U (Cyrillic)|У]]||[[أ]]||[[Γ]]||style="background-color:#ccffcc;"| ||colspan="2"|[[Ã]]||[[ร]]||[[Ć]]||colspan="2"|[[Ã]]||[[Ă]] |- !1100 0100!!304!!196!!C4 |colspan="4"|[[Ä]]||[[Ef (Cyrillic)|Ф]]||[[ؤ]]||[[Δ]]||style="background-color:#ccffcc;"| ||colspan="2"|[[Ä]]||[[ฤ]]||colspan="4"|[[Ä]] |- !1100 0101!!305!!197!!C5 |[[Å]]||[[Ĺ]]||[[Ċ]]||[[Å]]||[[Kha (Cyrillic)|Х]]||[[إ]]||[[Ε]]||style="background-color:#ccffcc;"| ||colspan="2"|[[Å]]||[[ล]]||colspan="3"|[[Å]]||[[Ć]] |- !1100 0110!!306!!198!!C6 |[[Æ]]||[[Ć]]||[[Ĉ]]||[[Æ]]||[[Tse (Cyrillic)|Ц]]||[[ئ]]||[[Ζ]]||style="background-color:#ccffcc;"| ||colspan="2"|[[Æ]]||[[ฦ]]||[[Ę]]||colspan="3"|[[Æ]] |- !1100 0111!!307!!199!!C7 |colspan="3"|[[Ç]]||[[Į]]||[[Che (Cyrillic)|Ч]]||[[ا]]||[[Η]]||style="background-color:#ccffcc;"| ||[[Ç]]||[[Į]]||[[ว]]||[[Ē]]||colspan="3"|[[Ç]] |- !1100 1000!!310!!200!!C8 |[[È]]||[[Č]]||[[È]]||[[Č]]||[[Sha (Cyrillic)|Ш]]||[[ب]]||[[Θ]]||style="background-color:#ccffcc;"| ||[[È]]||[[Č]]||[[ศ]]||[[Č]]||colspan="3"|[[È]] |- !1100 1001!!311!!201!!C9 |colspan="4"|[[É]]||[[Shcha|Щ]]||[[ة]]||[[Ι]]||style="background-color:#ccffcc;"| ||colspan="2"|[[É]]||[[ษ]]||colspan="4"|[[É]] |- !1100 1010!!312!!202!!CA |[[Ê]]||[[Ę]]||[[Ê]]||[[Ę]]||[[Yer|Ъ]]||[[ت]]||[[Κ]]||style="background-color:#ccffcc;"| ||[[Ê]]||[[Ę]]||[[ส]]||[[Ź]]||colspan="3"|[[Ê]] |- !1100 1011!!313!!203!!CB |colspan="4"|[[Ë]]||[[Yery|Ы]]||[[ث]]||[[Λ]]||style="background-color:#ccffcc;"| ||colspan="2"|[[Ë]]||[[ห]]||[[Ė]]||colspan="3"|[[Ë]] |- !1100 1100!!314!!204!!CC |[[Ì]]||[[Ě]]||[[Ì]]||[[Ė]]||[[Soft sign|Ь]]||[[ج]]||[[Μ]]||style="background-color:#ccffcc;"| ||[[Ì]]||[[Ė]]||[[ฬ]]||[[Ģ]]||colspan="3"|[[Ì]] |- !1100 1101!!315!!205!!CD |colspan="4"|[[Í]]||[[E (Cyrillic)|Э]]||[[ح]]||[[Ν]]||style="background-color:#ccffcc;"| ||colspan="2"|[[Í]]||[[อ]]||[[Ķ]]||colspan="3"|[[Í]] |- !1100 1110!!316!!206!!CE |colspan="4"|Î||[[Yu (Cyrillic)|Ю]]||[[خ]]||[[Ξ]]||style="background-color:#ccffcc;"| ||colspan="2"|Î||[[ฮ]]||[[Ī]]||colspan="3"|Î |- !1100 1111!!317!!207!!CF |Ï||[[Ď]]||Ï||[[Ī]]||[[Ya (Cyrillic)|Я]]||[[د]]||[[Ο]]||style="background-color:#ccffcc;"| ||colspan="2"|Ï||[[ฯ]]||[[Ļ]]||colspan="3"|Ï |- ![[Binary numeral system|Binary]]!![[Octal|Oct]]!![[Decimal|Dec]]!![[Hexadecimal|Hex]] !1!!2!!3!!4!!5!!6!!7!!8!!9!!10!!11!!13!!14!!15!!16 |- !1101 0000!!320!!208!!D0 |Ð||[[Đ]]||style="background-color:#ccffcc;"| ||[[Đ]]||[[A (Cyrillic)|а]]||[[ذ]]||[[Pi (letter)|Π]]||style="background-color:#ccffcc;"| ||[[Ğ]]||Ð||[[ะ]]||[[Š]]||[[Ŵ]]||colspan="2"|Ð |- !1101 0001!!321!!209!!D1 |Ñ||[[Ń]]||Ñ||[[Ņ]]||[[Be (Cyrillic)|б]]||[[ر]]||[[Ρ]]||style="background-color:#ccffcc;"| ||Ñ||[[Ņ]]||[[ั]]||[[Ń]]||colspan="2"|Ñ||[[Ń]] |- !1101 0010!!322!!210!!D2 |Ò||[[Ň]]||Ò||[[Ō]]||[[Ve (Cyrillic)|в]]||[[ز]]||style="background-color:#ccffcc;"| ||style="background-color:#ccffcc;"| ||Ò||[[Ō]]||[[า]]||[[Ņ]]||colspan="3"|Ò |- !1101 0011!!323!!211!!D3 |colspan="3"|Ó||[[Ķ]]||[[Ge (Cyrillic)|г]]||[[س]]||[[Sigma|Σ]]||style="background-color:#ccffcc;"| ||colspan="2"|Ó||[[ำ]]||colspan="4"|Ó |- !1101 0100!!324!!212!!D4 |colspan="4"|Ô||[[De (Cyrillic)|д]]||[[ش]]||[[Τ]]||style="background-color:#ccffcc;"| ||colspan="2"|Ô||[[ิ]]||[[Ō]]||colspan="3"|Ô |- !1101 0101!!325!!213!!D5 |Õ||[[Ő]]||[[Ġ]]||Õ||[[Ye (Cyrillic)|е]]||[[ص]]||[[Υ]]||style="background-color:#ccffcc;"| ||colspan="2"|Õ||[[ี]]|| colspan="3" |Õ |[[Ő]] |- !1101 0110!!326!!214!!D6 |colspan="4"|Ö||[[Zhe (Cyrillic)|ж]]||[[ض]]||[[Φ]]||style="background-color:#ccffcc;"| ||colspan="2"|Ö||[[ึ]]||colspan="4"|Ö |- !1101 0111!!327!!215!!D7 |colspan="4"|×||[[Ze (Cyrillic)|з]]||[[ط]]||[[Χ]]||style="background-color:#ccffcc;"| ||×||[[Ũ]]||[[ื]]||×||[[Ṫ]]||×||[[Ś]] |- !1101 1000!!330!!216!!D8 |Ø||[[Ř]]||[[Ĝ]]||Ø||[[I (Cyrillic)|и]]||[[ظ]]||[[Ψ]]||style="background-color:#ccffcc;"| ||colspan="2"|Ø||[[ุ]]||[[Ų]]||colspan="2"|Ø||[[Ű]] |- !1101 1001!!331!!217!!D9 |Ù||[[Ů]]||Ù||[[Ų]]||[[Short I|й]]||[[ع]]||[[Ω]]||style="background-color:#ccffcc;"| ||Ù||[[Ų]]||[[ู]]||[[Ł]]||colspan="3"|Ù |- !1101 1010!!332!!218!!DA |colspan="4"|Ú||[[Ka (Cyrillic)|к]]||[[غ]]||[[Ϊ]]||style="background-color:#ccffcc;"| ||colspan="2"|Ú||[[ฺ]]||[[Ś]]||colspan="3"|Ú |- !1101 1011!!333!!219!!DB |Û||[[Ű]]||colspan="2"|Û||[[El (Cyrillic)|л]]||style="background-color:#ccffcc;"| ||[[Ϋ]]||style="background-color:#ccffcc;"| ||colspan="2"|Û||style="background-color:#ccffcc;"| ||[[Ū]]||colspan="3"|Û |- !1101 1100!!334!!220!!DC |colspan="4"|Ü||[[Em (Cyrillic)|м]]||style="background-color:#ccffcc;"| ||[[ά]]||style="background-color:#ccffcc;"| ||colspan="2"|Ü||style="background-color:#ccffcc;"| ||colspan="4"|Ü |- !1101 1101!!335!!221!!DD |colspan="2"|Ý||[[Ŭ]]||[[Ũ]]||[[En (Cyrillic)|н]]||style="background-color:#ccffcc;"| ||[[έ]]||style="background-color:#ccffcc;"| ||[[İ]]||Ý||style="background-color:#ccffcc;"| ||[[Ż]]||colspan="2"|Ý||[[Ę]] |- !1101 1110!!336!!222!!DE |[[Þ]]||[[Ţ]]||[[Ŝ]]||[[Ū]]||[[O (Cyrillic)|о]]||style="background-color:#ccffcc;"| ||[[ή]]||style="background-color:#ccffcc;"| ||[[Ş]]||[[Þ]]||style="background-color:#ccffcc;"| ||[[Ž]]||[[Ŷ]]||[[Þ]]||[[Ț]] |- !1101 1111!!337!!223!!DF |colspan="4"|[[ß]]||[[Pe (Cyrillic)|п]]||style="background-color:#ccffcc;"| ||[[ί]]||[[‗]]||colspan="2"|[[ß]]||[[฿]]||colspan="4"|[[ß]] |- !1110 0000!!340!!224!!E0 |à||[[ŕ]]||à||[[ā]]||[[Er (Cyrillic)|р]]||[[ـ]]||[[Ϋ́|ΰ]]||[[א]]||à||[[ā]]||[[เ]]||[[ą]]||colspan="3"|à |- !1110 0001!!341!!225!!E1 |colspan="4"|á||[[Es (Cyrillic)|с]]||[[Pe (Semitic letter)|ف]]||[[α]]||[[ב]]||colspan="2"|á||[[แ]]||[[į]]||colspan="3"|á |- !1110 0010!!342!!226!!E2 |colspan="4"|â||[[Te (Cyrillic)|т]]||[[ق]]||[[Beta (letter)|β]]||[[ג]]||colspan="2"|â||[[โ]]||[[ā]]||colspan="3"|â |- !1110 0011!!343!!227!!E3 |ã||[[ă]]||style="background-color:#ccffcc;"| ||ã||[[U (Cyrillic)|у]]||[[ك]]||[[γ]]||[[ד]]||colspan="2"|ã||[[ใ]]||[[ć]]||colspan="2"|ã||[[ă]] |- !1110 0100!!344!!228!!E4 |colspan="4"|ä||[[Ef (Cyrillic)|ф]]||[[ل]]||[[δ]]||[[ה]]||colspan="2"|ä||[[ไ]]||colspan="4"|ä |- !1110 0101!!345!!229!!E5 |å||[[ĺ]]||[[ċ]]||å||[[Kha (Cyrillic)|х]]||[[م]]||[[ε]]||[[ו]]||colspan="2"|å||[[ๅ]]||colspan="3"|å||[[ć]] |- !1110 0110!!346!!230!!E6 |æ||[[ć]]||[[ĉ]]||æ||[[Tse (Cyrillic)|ц]]||[[ن]]||[[ζ]]||[[ז]]||colspan="2"|æ||[[ๆ]]||[[ę]]||colspan="3"|æ |- !1110 0111!!347!!231!!E7 |colspan="3"|[[ç]]||[[į]]||[[Che (Cyrillic)|ч]]||[[ه]]||[[η]]||[[ח]]||[[ç]]||[[į]]||[[็]]||[[ē]]||colspan="3"|[[ç]] |- !1110 1000!!350!!232!!E8 |[[è]]||[[č]]||[[è]]||[[č]]||[[Sha (Cyrillic)|ш]]||[[و]]||[[θ]]||[[ט]]||[[è]]||[[č]]||[[่]]||[[č]]||colspan="3"|[[è]] |- !1110 1001!!351!!233!!E9 |colspan="4"|[[é]]||[[Shcha|щ]]||[[ى]]||[[ι]]||[[י]]||colspan="2"|[[é]]||[[้]]||colspan="4"|[[é]] |- !1110 1010!!352!!234!!EA |[[ê]]||[[ę]]||[[ê]]||[[ę]]||[[Yer|ъ]]||[[ي]]||[[κ]]||[[ך]]||[[ê]]||[[ę]]||[[๊]]||[[ź]]||colspan="3"|[[ê]] |- !1110 1011!!353!!235!!EB |colspan="4"|[[ë]]||[[Yery|ы]]||[[ً]]||[[λ]]||[[כ]]||colspan="2"|[[ë]]||[[๋]]||[[ė]]||colspan="3"|[[ë]] |- !1110 1100!!354!!236!!EC |[[ì]]||[[ě]]||[[ì]]||[[ė]]||[[Soft sign|ь]]||[[ٌ]]||[[μ]]||[[ל]]||[[ì]]||[[ė]]||[[์]]||[[ģ]]||colspan="3"|[[ì]] |- !1110 1101!!355!!237!!ED |colspan="4"|[[í]]||[[E (Cyrillic)|э]]||[[ٍ]]||[[ν]]||[[ם]]||colspan="2"|[[í]]||[[ํ]]||[[ķ]]||colspan="3"|[[í]] |- !1110 1110!!356!!238!!EE |colspan="4"|î||[[Yu (Cyrillic)|ю]]||[[َ]]||[[ξ]]||[[מ]]||colspan="2"|î||[[๎]]||[[ī]]||colspan="3"|î |- !1110 1111!!357!!239!!EF |ï||[[ď]]||ï||[[ī]]||[[Ya (Cyrillic)|я]]||[[ُ]]||[[ο]]||[[ן]]||colspan="2"|ï||[[๏]]||[[ļ]]||colspan="3"|ï |- !1111 0000!!360!!240!!F0 |ð||[[đ]]||style="background-color:#ccffcc;"| ||[[đ]]||[[Numero sign|{{as written|№}}]]||[[ِ]]||[[Pi (letter)|π]]||[[נ]]||[[ğ]]||ð||๐||[[š]]||[[ŵ]]||ð||[[đ]] |- !1111 0001!!361!!241!!F1 |ñ||[[ń]]||ñ||[[ņ]]||[[Yo (Cyrillic)|ё]]||[[ّ]]||[[ρ]]||[[ס]]||ñ||[[ņ]]||๑||[[ń]]||colspan="2"|ñ||[[ń]] |- !1111 0010!!362!!242!!F2 |ò||[[ň]]||ò||[[ō]]||[[Dje|ђ]]||[[ْ]]||[[Sigma|ς]]||[[ע]]||ò||[[ō]]||๒||[[ņ]]||colspan="3"|ò |- !1111 0011!!363!!243!!F3 |colspan="3"|ó||[[ķ]]||[[Gje|ѓ]]||style="background-color:#ccffcc;"| ||[[Sigma|σ]]||[[ף]]||colspan="2"|ó||๓||colspan="4"|ó |- !1111 0100!!364!!244!!F4 |colspan="4"|ô||[[Ukrainian Ye|є]]||style="background-color:#ccffcc;"| ||[[τ]]||[[פ]]||colspan="2"|ô||๔||[[ō]]||colspan="3"|ô |- !1111 0101!!365!!245!!F5 |õ||[[ő]]||[[ġ]]||õ||[[Dze|ѕ]]||style="background-color:#ccffcc;"| ||[[υ]]||[[ץ]]||colspan="2"|õ||๕|| colspan="3" |õ |[[ő]] |- !1111 0110!!366!!246!!F6 |colspan="4"|ö||[[Dotted I (Cyrillic)|і]]||style="background-color:#ccffcc;"| ||[[φ]]||[[צ]]||colspan="2"|ö||๖||colspan="4"|ö |- !1111 0111!!367!!247!!F7 |colspan="4"|÷||[[Yi (Cyrillic)|ї]]||style="background-color:#ccffcc;"| ||[[χ]]||[[ק]]||÷||[[ũ]]||๗||÷||[[ṫ]]||÷||[[ś]] |- !1111 1000!!370!!248!!F8 |ø||[[ř]]||[[ĝ]]||ø||[[Je (Cyrillic)|ј]]||style="background-color:#ccffcc;"| ||[[ψ]]||[[ר]]||colspan="2"|ø||๘||[[ų]]||colspan="2"|ø||[[ű]] |- !1111 1001!!371!!249!!F9 |ù||[[ů]]||ù||[[ų]]||[[Lje|љ]]||style="background-color:#ccffcc;"| ||[[ω]]||[[ש]]||ù||[[ų]]||๙||[[ł]]||colspan="3"|ù |- !1111 1010!!372!!250!!FA |colspan="4"|ú||[[Nje|њ]]||style="background-color:#ccffcc;"| ||[[ϊ]]||[[ת]]||colspan="2"|ú||[[๚]]||[[ś]]||colspan="3"|ú |- !1111 1011!!373!!251!!FB |û||[[ű]]||colspan="2"|û||[[Tshe|ћ]]||style="background-color:#ccffcc;"| ||[[ϋ]]||style="background-color:#ccffcc;"| ||colspan="2"|û||[[๛]]||[[ū]]||colspan="3"|û |- !1111 1100!!374!!252!!FC |colspan="4"|ü||[[Kje|ќ]]||style="background-color:#ccffcc;"| ||[[ό]]||style="background-color:#ccffcc;"| ||colspan="2"|ü||style="background-color:#ccffcc;"| ||colspan="4"|ü |- !1111 1101!!375!!253!!FD |colspan="2"|ý||[[ŭ]]||[[ũ]]||§||style="background-color:#ccffcc;"| ||[[ύ]]||style="background-color:#ffff99; font-family: monospace;"|<small>[[left-to-right mark|LRM]]</small>||[[ı]]||ý||style="background-color:#ccffcc;"| ||[[ż]]||colspan="2"|ý||[[ę]] |- !1111 1110!!376!!254!!FE |[[þ]]||[[ţ]]||[[ŝ]]||[[ū]]||[[Short U (Cyrillic)|ў]]||style="background-color:#ccffcc;"| ||[[ώ]]||style="background-color:#ffff99; font-family: monospace;"|<small>[[right-to-left mark|RLM]]</small>||[[ş]]||[[þ]]||style="background-color:#ccffcc;"| ||[[ž]]||[[ŷ]]||[[þ]]||[[ț]] |- !1111 1111!!377!!255!!FF |ÿ||colspan="3"|[[˙]]||[[Dzhe|џ]]||style="background-color:#ccffcc;"| ||style="background-color:#ccffcc;"| ||style="background-color:#ccffcc;"| ||ÿ||[[ĸ]]||style="background-color:#ccffcc;"| ||[[Apostrophe|’]]||colspan="3"|ÿ |- ![[Binary numeral system|Binary]]!![[Octal|Oct]]!![[Decimal|Dec]]!![[Hexadecimal|Hex]] !1!!2!!3!!4!!5!!6!!7!!8!!9!!10!!11!!13!!14!!15!!16 |} {{color box|#ccffcc}}{{nbsp}}unassigned code points.<br/> {{color box|#ffff99}}{{nbsp}}new additions in [[ISO/IEC 8859-7:2003]] and [[ISO/IEC 8859-8:1999]] versions, previously unassigned. ==Relationship to Unicode and the UCS== Since 1991, the Unicode Consortium has been working with ISO and IEC to develop the [[Unicode|Unicode Standard]] and [[Universal Character Set|ISO/IEC 10646: the Universal Character Set]] (UCS) in tandem. Newer editions of ISO/IEC 8859 express characters in terms of their Unicode/UCS names and the ''U+nnnn'' notation, effectively causing each part of ISO/IEC 8859 to be a Unicode/UCS character encoding scheme that maps a very small subset of the UCS to single 8-bit bytes. The first 256 characters in Unicode and the UCS are identical to those in ISO/IEC-8859-1 ([[Latin-1]]). Single-byte character sets including the parts of ISO/IEC 8859 and derivatives of them were favoured throughout the 1990s, having the advantages of being well-established and more easily implemented in software: the equation of one byte to one character is simple and adequate for most single-language applications, and there are no combining characters or variant forms. As Unicode-enabled operating systems became more widespread, ISO/IEC 8859 and other legacy encodings became less popular. While remnants of ISO 8859 and single-byte character models remain entrenched in many operating systems, programming languages, data storage systems, networking applications, display hardware, and end-user application software, most modern computing applications use Unicode internally, and rely on conversion tables to map to and from other encodings, when necessary. ==Current status== The ISO/IEC 8859 standard was maintained by ISO/IEC Joint Technical Committee 1, Subcommittee 2, Working Group 3 (ISO/IEC JTC 1/SC 2/WG 3). In June 2004, WG 3 disbanded, and maintenance duties were transferred to [[ISO/IEC JTC 1/SC 2|SC 2]]. The standard is not currently being updated, as the Subcommittee's only remaining [[working group]], WG 2, is concentrating on development of Unicode's [[Universal Coded Character Set]]. The [[WHATWG]] Encoding Standard, which specifies the character encodings permitted in [[HTML5]] which compliant browsers must support,<ref>{{Cite web |url=https://www.w3.org/TR/html51/syntax.html#character-encodings |title=8.2.2.3. Character encodings |work=HTML 5.1 2nd Edition |publisher=[[W3C]] |quotation=User agents must support the encodings defined in the WHATWG Encoding standard, including, but not limited to [...]}}</ref> includes most parts of ISO/IEC 8859,<ref name="lsbe">{{cite web |url=https://encoding.spec.whatwg.org/#legacy-single-byte-encodings |title=Legacy single-byte encodings |work=Encoding Standard |last=van Kesteren |first=Anne |author-link=Anne van Kesteren |publisher=[[WHATWG]]}}</ref> except for parts 1, 9 and 11, which are instead interpreted as [[Windows-1252]], [[Windows-1254]] and [[Windows-874]] respectively.<ref name="nal" /> Authors of new pages and the designers of new protocols are instructed to use [[UTF-8]] instead.<ref name="nal">{{cite web |url=https://encoding.spec.whatwg.org/#names-and-labels |title=Names and labels |work=Encoding Standard |last=van Kesteren |first=Anne |author-link=Anne van Kesteren |publisher=[[WHATWG]]}}</ref> ==See also== * [[List of information system character sets]] * [[Number Forms]] * [[RPL character set]] (an ISO/IEC 8859-1 superset on HP calculators, referred to as "ECMA-94" as well) * [[DEC Multinational Character Set]] (MCS) * [[DEC National Replacement Character Set]] (NRCS) ==Notes== {{Reflist|group="nb"|refs= <ref group="nb" name="NB_Missing_vowels">Missing several accented vowels including ''[[Ø|Ǿ]]'' and ''ǿ''. These can be replaced with non-accented vowels at the cost of increased ambiguity.</ref> <ref group="nb" name="NB_IJ">The ISO 8859 encodings treat [[IJ (digraph)|IJ]] as a digraph. Some other encodings treat it as a letter.</ref> <ref group="nb" name="NB_Ghe">8859-5 misses the [[Ghe with upturn|Ґ/ґ]] letter, which was reintroduced into the [[Ukrainian alphabet]] in 1990.</ref> <ref group="nb" name="two">Missing characters are in ISO/IEC 8859-15.</ref> }} ==References== {{Reflist|refs= <ref name="Haralambous_2007">{{cite book |title=Fonts & Encodings |url=https://archive.org/details/fontsencodings00hara_390 |url-access=limited |author-first=Yannis |author-last=Haralambous |translator-first=P. Scott |translator-last=Horne |location=Sebastopol, California, USA |publisher=[[O'Reilly Media, Inc.]] |date=September 2007 |edition=1st |isbn=978-0-596-10242-5 |pages=[https://archive.org/details/fontsencodings00hara_390/page/n57 37]–38 |quote=According to an urban legend, the French delegate was out sick the day when the standard came up for a vote and had to have his Belgian counterpart act as his proxy. In fact, the French delegate was an engineer, who was convinced that this ligature was useless, and the Swiss and German representatives pressed hard to have the mathematical symbols ''×'' and ''÷'' included at the positions where ''Œ'' and ''œ'' would logically appear.}}</ref> <ref name="Andre_2003">{{cite journal |author-first1=Jacques |author-last1=André |title=Histoire d'Œ, histoire d'@ des rumeurs typographiques et de leurs enseignements |language=fr |date=2003-10-15 |orig-year=2003-10-02 |editor-first1=Bernard |editor-last1=André |editor-first2=Georges-Louis |editor-last2=Baron |editor-first3=Éric |editor-last3=Bruillard |journal=Traitement de Texte et Production de Documents INRP/GEDIAPS |pages=19–34 |url=http://jacques-andre.fr/japublis/dida/andrej.htm |access-date=2016-12-09 |url-status=live |archive-url=https://web.archive.org/web/20161208195405/http://jacques-andre.fr/japublis/dida/andrej.htm |archive-date=2016-12-08}}</ref> <ref name="Andre_1996">{{cite journal |author-first=Jacques |author-last=André |title=ISO Latin-1, norme de codage des caractères européens? trois caractères français en sont absents! |journal=Cahiers GUTenberg |number=25 |date=November 1996 |pages=65–77 |language=fr |url=http://www.gutenberg.eu.org/pub/GUTenberg/publicationsPDF/25-andre.pdf |url-status=dead |archive-url=https://web.archive.org/web/20081130153329/http://www.gutenberg.eu.org/pub/GUTenberg/publicationsPDF/25-andre.pdf |archive-date=2008-11-30}}</ref> }} ==Further reading== * Published versions of each part of ISO/IEC 8859 are available, for a fee, from the [https://web.archive.org/web/20040704224429/http://www.iso.ch/iso/en/stdsdevelopment/tc/tclist/TechnicalCommitteeStandardsListPage.TechnicalCommitteeStandardsList?COMMID=23 ISO catalogue site] and from the [http://webstore.iec.ch/webstore/webstore.nsf/searchview/?searchView=&SearchOrder=4&SearchWV=TRUE&SearchMax=1000&Submit=OK&Query=ISO/IEC%208859 IEC Webstore]. * PDF versions of the final drafts of some parts of ISO/IEC 8859 as submitted to the ISO/IEC JTC 1/SC 2/WG 3 for review & publication are available at the [http://www.open-std.org/JTC1/SC2/WG3/ WG 3 web site]: ** [http://www.open-std.org/JTC1/SC2/WG3/docs/n411.pdf ISO/IEC 8859-1:1998] - 8-bit single-byte coded graphic character sets, Part 1: Latin alphabet No. 1 ''(draft dated February 12, 1998, published April 15, 1998)'' ** [http://www.open-std.org/JTC1/SC2/WG3/docs/n413.pdf ISO/IEC 8859-4:1998] - 8-bit single-byte coded graphic character sets, Part 4: Latin alphabet No. 4 ''(draft dated February 12, 1998, published July 1, 1998)'' ** [http://www.open-std.org/jtc1/sc2/open/02n3329.pdf ISO/IEC 8859-7:1999] - 8-bit single-byte coded graphic character sets, Part 7: Latin/Greek alphabet ''(draft dated June 10, 1999; superseded by ISO/IEC 8859-7:2003, published October 10, 2003)'' ** [http://www.open-std.org/JTC1/SC2/WG3/docs/n415.pdf ISO/IEC 8859-10:1998] - 8-bit single-byte coded graphic character sets, Part 10: Latin alphabet No. 6 ''(draft dated February 12, 1998, published July 15, 1998)'' ** [http://www.open-std.org/jtc1/sc2/open/02n3333.pdf ISO/IEC 8859-11:1999] - 8-bit single-byte coded graphic character sets, Part 11: Latin/Thai character set ''(draft dated June 22, 1999; superseded by ISO/IEC 8859-11:2001, published 15 December 2001)'' ** [http://www.open-std.org/JTC1/SC2/WG3/docs/n451.pdf ISO/IEC 8859-13:1998] - 8-bit single-byte coded graphic character sets, Part 13: Latin alphabet No. 7 ''(draft dated April 15, 1998, published October 15, 1998)'' ** [http://www.open-std.org/JTC1/SC2/WG3/docs/n404.pdf ISO/IEC 8859-15:1998] - 8-bit single-byte coded graphic character sets, Part 15: Latin alphabet No. 9 ''(draft dated August 1, 1997; superseded by ISO/IEC 8859-15:1999, published March 15, 1999)'' ** [http://www.open-std.org/jtc1/sc2/open/02n3389.pdf ISO/IEC 8859-16:2000] - 8-bit single-byte coded graphic character sets, Part 16: Latin alphabet No. 10 ''(draft dated November 15, 1999; superseded by ISO/IEC 8859-16:2001, published July 15, 2001)'' * [[European Computer Manufacturers Association|ECMA]] standards, which in intent correspond exactly to the ISO/IEC 8859 character set standards, can be found at: ** [https://ecma-international.org/publications-and-standards/standards/ecma-94 Standard ECMA-94]: 8-Bit Single Byte Coded Graphic Character Sets - Latin Alphabets No. 1 to No. 4 ''2nd edition (June 1986)'' ** [https://ecma-international.org/publications-and-standards/standards/ecma-113 Standard ECMA-113]: 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Cyrillic Alphabet ''3rd edition (December 1999)'' ** [https://ecma-international.org/publications-and-standards/standards/ecma-114 Standard ECMA-114]: 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Arabic Alphabet ''2nd edition (December 2000)'' ** [https://ecma-international.org/publications-and-standards/standards/ecma-118 Standard ECMA-118]: 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Greek Alphabet ''(December 1986)'' ** [https://ecma-international.org/publications-and-standards/standards/ecma-121 Standard ECMA-121]: 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Hebrew Alphabet ''2nd edition (December 2000)'' ** [https://ecma-international.org/publications-and-standards/standards/ecma-128 Standard ECMA-128]: 8-Bit Single-Byte Coded Graphic Character Sets - Latin Alphabet No. 5 ''2nd edition (December 1999)'' ** [https://ecma-international.org/publications-and-standards/standards/ecma-144 Standard ECMA-144]: 8-Bit Single-Byte Coded Character Sets - Latin Alphabet No. 6 ''3rd edition (December 2000)'' * ISO/IEC 8859-1 to Unicode [https://unicode.org/Public/MAPPINGS/ISO8859 mapping tables] as plain text files are at the Unicode FTP site. * Informal descriptions and code charts for most ISO/IEC 8859 standards are available in [http://czyborra.com/charsets/iso8859.html ISO/IEC 8859 Alphabet Soup] [https://web.archive.org/web/20031218123509/http://www2.lysator.liu.se/~jmo/czyborra_index.html (Mirror)] {{List of International Electrotechnical Commission standards}} {{Character encoding}} {{Ecma International Standards}} {{ISO standards}} {{Unicode navigation}} {{DEFAULTSORT:ISO IEC 8859}} [[Category:ISO/IEC 8859| ]] [[Category:Character sets]] [[Category:Ecma standards]] [[Category:ISO/IEC standards|#08859]]
Summary:
Please note that all contributions to Niidae Wiki may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see
Encyclopedia:Copyrights
for details).
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Templates used on this page:
Template:As written
(
edit
)
Template:Character encoding
(
edit
)
Template:Citation
(
edit
)
Template:Cite web
(
edit
)
Template:Color box
(
edit
)
Template:Ctrl
(
edit
)
Template:Ecma International Standards
(
edit
)
Template:ISO standards
(
edit
)
Template:Infobox character encoding
(
edit
)
Template:List of International Electrotechnical Commission standards
(
edit
)
Template:Nbsp
(
edit
)
Template:Not a typo
(
edit
)
Template:Plainlist
(
edit
)
Template:Reflist
(
edit
)
Template:Refn
(
edit
)
Template:Short description
(
edit
)
Template:Unicode navigation
(
edit
)
Template:Use Oxford spelling
(
edit
)
Search
Search
Editing
ISO/IEC 8859
Add topic