ISO 646 is an ISO standard that since 1972 has specified a 7-bit character code from which several national standards are derived. A bit is a binary digit, taking a value of either 0 or 1 Binary digits are a basic unit of Information storage and communication
Since the portion of ISO 646 shared by all countries specified only those letters used in the English alphabet, other countries using the Latin alphabet with extensions needed to create national variants of ISO 646 to be able to use their native languages. The modern English alphabet consists of 26 letters derived from the Latin alphabet: History See also History of the Since universal acceptance of the 8 bit byte did not exist at that time, the national characters had to be made to fit within the constraints of 7 bits, meaning that some characters that appear in ASCII do not appear in other national variants of ISO 646. A byte (pronounced "bite" baɪt is the basic unit of measurement of information storage in Computer science.
ISO/IEC 646 was also ratified by ECMA as ECMA-6. Ecma International (Ecma is an international private (membership-based non-profit Standards organization for information and communication systems
History
ISO/IEC 646 and its predecessor ASCII (ANSI X3. American Standard Code for Information Interchange ( ASCII) 4) largely endorses existing practice regarding character encodings in the telecommunications industry.
During the 1960s, there was debate about whether character encoding standards (at either the national or international levels) for computers should follow 1) existing practice in the telecommunications industry (which was largely paper-tape based, but which was commonly transmitted on-line digitally over wires), or conversely, 2) existing practice in the punched-card portion of the computer industry, whose heritage was especially the off-line storage of World War II-era electro-mechanical punched-card machines predating electronic computers. The 1960s decade refers to the years from the beginning of 1960 to the end of 1969 World War II, or the Second World War, (often abbreviated WWII) was a global military conflict which involved a majority of the world's nations, including A computer is a Machine that manipulates data according to a list of instructions. For obvious corporate-history reasons regarding Hollerith punched cards, IBM sided with the punched-card character encodings, embodied by EBCDIC, whereas many other computer manufacturers sided with the telecommunications industry's character encodings. Herman Hollerith ( February 29, 1860 &ndash November 17, 1929) was a German-American statistician who developed a International Business Machines Corporation abbreviated IBM and nicknamed "Big Blue", is a multinational Computer Technology Extended Binary Coded Decimal Interchange Code ( EBCDIC) is an 8- Bit Character encoding ( Code page) used on IBM mainframe Operating
The ISO 8859 series of standards governing 8-bit character encodings supersede the ISO 646 international standard and its national variants. ISO/IEC 8859 is a joint ISO and IEC standard for 8-bit Character encodings for use by computers The ISO 10646 standard, directly related to Unicode, supersedes all of the ISO 646 and ISO 8859 sets of national-variant character encodings with arguably one unified set of character encodings. The Universal Character Set (UCS defined by the ISO / IEC 10646 International Standard, is a standard set of characters upon which In Computing, Unicode is an Industry standard allowing Computers to consistently represent and manipulate text expressed in most of the world's
Codepage layout
The following table shows the ISO/IEC 646 character set. Each character is shown with its decimal code and its Unicode equivalent. In Computing, Unicode is an Industry standard allowing Computers to consistently represent and manipulate text expressed in most of the world's Grey shaded cells indicate code points with character glyphs that vary from region to region. A glyph is an element of writing Two or more glyphs representing the same symbol whether interchangeable or context-dependent are called Allographs the abstract unit they These are discussed in detail below.
| ISO/IEC 646 |
|
—0 |
—1 |
—2 |
—3 |
—4 |
—5 |
—6 |
—7 |
—8 |
—9 |
—A |
—B |
—C |
—D |
—E |
—F |
0−
|
NUL
0000
0 |
SOH
0001
1 |
STX
0002
2 |
ETX
0003
3 |
EOT
0004
4 |
ENQ
0005
5 |
ACK
0006
6 |
BEL
0007
7 |
BS
0008
8 |
HT
0009
9 |
LF
000A
10 |
VT
000B
11 |
FF
000C
12 |
CR
000D
13 |
SO
000E
14 |
SI
000F
15 |
1−
|
DLE
0010
16 |
DC1
0011
17 |
DC2
0012
18 |
DC3
0013
19 |
DC4
0014
20 |
NAK
0015
21 |
SYN
0016
22 |
ETB
0017
23 |
CAN
0018
24 |
EM
0019
25 |
SUB
001A
26 |
ESC
001B
27 |
FS
001C
28 |
GS
001D
29 |
RS
001E
30 |
US
001F
31 |
2−
|
SP
0020
32 |
!
0021
33 |
"
0022
34 |
0023
35 |
0024
36 |
%
0025
37 |
&
0026
38 |
'
0027
39 |
(
0028
40 |
)
0029
41 |
*
002A
42 |
+
002B
43 |
,
002C
44 |
-
002D
45 |
.
002E
46 |
/
002F
47 |
3−
|
0
0030
48 |
1
0031
49 |
2
0032
50 |
3
0033
51 |
4
0034
52 |
5
0035
53 |
6
0036
54 |
7
0037
55 |
8
0038
56 |
9
0039
57 |
:
003A
58 |
;
003B
59 |
<
003C
60 |
=
003D
61 |
>
003E
62 |
?
003F
63 |
4−
|
0040
64 |
A
0041
65 |
B
0042
66 |
C
0043
67 |
D
0044
68 |
E
0045
69 |
F
0046
70 |
G
0047
71 |
H
0048
72 |
I
0049
73 |
J
004A
74 |
K
004B
75 |
L
004C
76 |
M
004D
77 |
N
004E
78 |
O
004F
79 |
5−
|
P
0050
80 |
Q
0051
81 |
R
0052
82 |
S
0053
83 |
T
0054
84 |
U
0055
85 |
V
0056
86 |
W
0057
87 |
X
0058
88 |
Y
0059
89 |
Z
005A
90 |
005B
91 |
005C
92 |
005D
93 |
005E
94 |
_
005F
95 |
6−
|
0060
96 |
a
0061
97 |
b
0062
98 |
c
0063
99 |
d
0064
100 |
e
0065
101 |
f
0066
102 |
g
0067
103 |
h
0068
104 |
i
0069
105 |
j
006A
106 |
k
006B
107 |
l
006C
108 |
m
006D
109 |
n
006E
110 |
o
006F
111 |
7−
|
p
0070
112 |
q
0071
113 |
r
0072
114 |
s
0073
115 |
t
0074
116 |
u
0075
117 |
v
0076
118 |
w
0077
119 |
x
0078
120 |
y
0079
121 |
z
007A
122 |
007B
123 |
007C
124 |
007D
125 |
007E
126 |
DEL
007F
127 |
National variants
Some national variants of ISO 646 are:
| Code |
ISO-
IR |
Standard |
Used in |
| CA-1 |
121 |
CSA Z243. The null character (also null terminator) is a character with the value zero present in the ASCII and Unicode character sets and available Control character article i need to think about merging these Control character article i need to think about merging these The End Of Text character (ETX is an ASCII Control character used to inform the receiving computer that the end of the data stream has been reached In Telecommunication, an end-of-transmission character (EOT is a transmission Control character used to indicate the conclusion of a transmission that Control character article i need to think about merging these For Teleprinters Acknowledge character (ACK is a transmission control character transmitted by the receiving station as an affirmative response to the sending station Bell character is an ASCII Control character, code 7 (^G When it is sent to a printer or a terminal, nothing is printed but an Backspace is the keyboard key that originally pushed the Typewriter carriage one position backwards and in modern computer displays moves the cursor one position backwards Tab key (abbreviation of tabulator key) on a keyboard is used to advance the cursor to the next Tab stop. In Computing, a newline (also known as a line break or end-of-line / EOL character is a special character or sequence of characters Tab key (abbreviation of tabulator key) on a keyboard is used to advance the cursor to the next Tab stop. A page break is a marker in an electronic Document, which tells the document interpreter that the contents which follows is part of a new page Originally carriage return was the term for the control character in Baudot code on a teletypewriter for end of line Return to beginning of line and Shift Out (SO and Shift In (SI are ASCII Control characters 14 and 15 respectively (0xE and 0xF Shift Out (SO and Shift In (SI are ASCII Control characters 14 and 15 respectively (0xE and 0xF Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these In Telecommunications a negative-acknowledge character (NAK is a transmission Control character sent by a station as a negative Response Control character article i need to think about merging these In the C0 control code set used in ASCII, ETB is a short name for the "End Transmission Block" control character (code 23 or 0x17 Control character article i need to think about merging these Substitute character (␚ A control character that is used in the place of a character that is recognized to be invalid or in error or that cannot be represented on a given device In Computing and Telecommunication, an escape character is a single character which in a sequence of characters signifies that what is to follow takes an alternative Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these In writing a space () is a blank area that is devoid of content which separates words letters numbers and punctuation Quotation marks or inverted commas (informally referred to as quotes and speech marks) are Punctuation marks used in pairs to set off speech The percent sign ( %) is the symbol used to indicate a Percentage (that the preceding number is divided by one hundred An ampersand ( &) also commonly called an " 'and' sign," is a Logogram representing the conjunction "and" Brackets are Punctuation marks used in pairs to set apart or interject text within other text Brackets are Punctuation marks used in pairs to set apart or interject text within other text An asterisk ( *) (Latin asteriscum "little star" from Greek ἀστερίσκος) is a Typographical symbol or Glyph The plus and minus signs ( + and &minus) are Mathematical symbols used to represent the notions of positive and negative as well as the operations A comma ( ,   is a Punctuation mark It has the same shape as an Apostrophe or single closing Quotation mark in many typefaces but it differs The plus and minus signs ( + and &minus) are Mathematical symbols used to represent the notions of positive and negative as well as the operations A full stop or period (sometimes stop, full point, decimal point, or dot) is the Punctuation mark commonly placed at the The slash ( /) is a punctuation mark It is also called a virgule, diagonal, stroke, forward slash, oblique dash, The underscore _ (also called understrike, underbar, low line, or low dash is a character that originally appeared on the Typewriter. The delete key, known less ambiguously as forward delete, Del, or ⌦, performs a function when struck on a Computer 4-1985 |
Canada (nr. Country to "Dominion of Canada" or "Canadian Federation" or anything else please read the Talk Page 1 alternative, with “î”)
(French, classical) |
| CA-2 |
122 |
CSA Z243. French ( français,) is a Romance language spoken around the world by 118 million people as a native language and by about 180 to 260 million people 4-1985 |
Canada (nr. Country to "Dominion of Canada" or "Canadian Federation" or anything else please read the Talk Page 2 alternative, with “É”)
(French, reformed orthography) |
| CN |
057 |
GB/T 1988-80 |
People's Republic of China (Basic Latin) |
| CU |
151 |
NC 99-10:81 |
Cuba (Spanish) |
| DE |
021 |
DIN 66003 |
Germany (German) |
| DK |
— |
DS 2089 |
Denmark (Danish) |
| FR |
069 |
AFNOR NF Z 62010-1982 |
France (French) |
| FR-0 |
025 |
AFNOR NF Z 62010-1973 |
France (obsolete since April 1985) |
| GB |
004 |
BSI 4730 |
United Kingdom (English) |
| GR |
088 |
HOS ELOT |
Greece (obsolete) |
| HU |
086 |
MSZ 7795/3 |
Hungary(Hungarian) |
| IE |
207 |
NSAI 433:1996 |
Ireland (Irish Goidelic) |
|
|
| Code |
ISO-
IR |
Standard |
Used in |
| INV |
—
ISO 646:1983
|
international (Invariant subset) |
| IRV |
002
ISO 646:1983
|
International Reference Variant |
| JA |
014 |
JIS C 6220-1969 |
Japan (Romaji) |
| JA-O |
092 |
JIS C 6229-1984 |
Japan (OCR-B) |
| KR |
— |
? |
South Korea |
| MT |
— |
? |
Malta (Maltese, English) |
| NO |
060 |
NS 4551 version 1 |
Norway |
| NO-2 |
061 |
NS 4551 version 2 |
Norway (obsolete since June 1987) |
| SE |
010 |
SEN 85 02 00 Annex B |
Sweden (basic Swedish) |
| SE-C |
011 |
SEN 85 02 00 Annex C |
Sweden (extended Swedish for names) |
| T. French ( français,) is a Romance language spoken around the world by 118 million people as a native language and by about 180 to 260 million people Talk People's Republic of China) PEOPLE'S REPUBLIC OF CHINA ARTICLE GUIDELINES The Republic of Cuba (ˈkjuːbə or) consists of the island of Cuba (the largest and second-most populous island of the Greater Antilles) Isla de la Deutsches Institut für Normung eV ( DIN; in English, the German Institute for Standardization) is the German national organization for Germany, officially the Federal Republic of Germany ( ˈbʊndəsʁepuˌbliːk ˈdɔʏtʃlant is a Country in Central Europe. The German language (de ''Deutsch'') is a West Germanic language and one of the world's major languages. The Kingdom of Denmark ( ˈd̥ænmɑɡ̊ (archaic ˈd̥anmɑːɡ̊ commonly known as Denmark, is a country in the Scandinavian region of northern Europe Danish ( d̥ænsɡ̊ is one of the North Germanic languages (also called Scandinavian languages a sub-group of the Germanic branch of the Association française de Normalisation ( AFNOR) is the French national organization for Standardization and is that country's ISO member body This article is about the country For a topic outline on this subject see List of basic France topics. French ( français,) is a Romance language spoken around the world by 118 million people as a native language and by about 180 to 260 million people Association française de Normalisation ( AFNOR) is the French national organization for Standardization and is that country's ISO member body This article is about the country For a topic outline on this subject see List of basic France topics. British Standards are produced by BSI British Standards, a division of BSI Group that is incorporated under a Royal Charter and is formally designated The United Kingdom of Great Britain and Northern Ireland, commonly known as the United Kingdom, the UK or Britain,is a Sovereign state located English is a West Germanic language originating in England and is the First language for most people in the United Kingdom, the United States Greece (Ελλάδα transliterated: Elláda, historically, Ellás,) officially the Hellenic Republic (Ελληνική Δημοκρατία Hungary (Magyarország 'mɔɟɔrorsaːg) officially in English the Republic of Hungary ( Magyar Köztársaság, literally Magyar (Hungarian Republic Hungarian ( magyar nyelv) is a Uralic language (more specifically a Ugric language) unrelated to most other languages in Europe. Ireland (pronounced /ˈaɾlənd/ Éire) is the third largest island in Europe, and the twentieth-largest island in the world Irish (ga ''Gaeilge'' is a Goidelic language of the Indo-European language family originating in Ireland and historically spoken by the Irish. For a topic outline on this subject see List of basic Japan topics. The romanization of Japanese or ( is the use of the Latin alphabet to write the Japanese language. For a topic outline on this subject see List of basic Japan topics. South Korea, officially the Republic of Korea and often referred to as Korea ( Korean: 대한민국 tɛː Malta, officially the Republic of Malta (Repubblika ta' Malta is a European Microstate, comprising an Archipelago of three islands Maltese (Maltese Malti is the National language of Malta, and a co-official language of the country alongside English, English is a West Germanic language originating in England and is the First language for most people in the United Kingdom, the United States Norway ( Norwegian: Norge ( Bokmål) or Noreg ( Nynorsk) officially the Kingdom of Norway, is a Constitutional Norway ( Norwegian: Norge ( Bokmål) or Noreg ( Nynorsk) officially the Kingdom of Norway, is a Constitutional "Sverige" redirects here For other uses see Sweden (disambiguation and Sverige (disambiguation. Swedish ( is a North Germanic language spoken by more than nine million people predominantly in Sweden and parts of Finland, especially along the "Sverige" redirects here For other uses see Sweden (disambiguation and Sverige (disambiguation. Swedish ( is a North Germanic language spoken by more than nine million people predominantly in Sweden and parts of Finland, especially along the 61 |
102 |
ITU/CCITT T. The ITU Telecommunication Standardization Sector ( ITU-T) coordinates standards for telecommunications on behalf of the International Telecommunication 61 Recommendation |
International (Teletex) |
| US |
006 |
ANSI X3. Teletex (not to be confused with Telex or Teletext) was an ITU-T specification for a text and document communications service that could be provided over 4-1968 |
United States (ASCII) |
| YU |
141 |
JUS I. The United States of America —commonly referred to as the American Standard Code for Information Interchange ( ASCII) B1. 002 (YUSCII) |
former Yugoslavia (Croatian, Slovenian, Serbian, Latin) |
|
Other proprietary standards approved later for international use by some standard committees:
The specifics of the changes for some of these variants are given in this table:
| Codes |
Characters for each ISO 646 compatible charset |
| binary |
decimal |
hexa |
INV |
US |
T. YUSCII was an informal name for JUS IB1002 national variant of ISO 646, 7- Bit Latinic character encoding standard, and used in Yugoslavia The Socialist Federal Republic of Yugoslavia ( Serbo-Croatian, Bosnian, Croatian, Serbian, Slovene, Macedonian: Croatian language ( hrvatski jezik) is a South Slavic language which is used primarily in Croatia, by Croats in Bosnia and Herzegovina in neighbouring Slovene or Slovenian ( slovenski jezik or slovenščina, not to be confused with Slovenčina) is a South Slavic language Serbian (sr-Cyrl српски језик sr-Latn ''srpski jezik'' is a South Slavic language, Latin ( lingua Latīna, laˈtiːna is an Italic language, historically spoken in Latium and Ancient Rome. International Business Machines Corporation abbreviated IBM and nicknamed "Big Blue", is a multinational Computer Technology Spain () or the Kingdom of Spain (Reino de España is a country located mostly in southwestern Europe on the Iberian Peninsula. Basque ( native name: euskara) is the Language spoken by the Basque people who inhabit the Pyrenees in North-Central Spain Catalan ˈkætəˌlæn ( català kətəˈla or) is a Romance language, the national and official language of Andorra, and a co-official Ing C Olivetti & Co SpA, known as Olivetti, is an Italian manufacturer of Computers printers and other business machines "Sverige" redirects here For other uses see Sweden (disambiguation and Sverige (disambiguation. The Kingdom of Denmark ( ˈd̥ænmɑɡ̊ (archaic ˈd̥anmɑːɡ̊ commonly known as Denmark, is a country in the Scandinavian region of northern Europe "Sverige" redirects here For other uses see Sweden (disambiguation and Sverige (disambiguation. Finland, officially the Republic of Finland ( is a Nordic country situated in the Fennoscandian region of northern Europe. Ing C Olivetti & Co SpA, known as Olivetti, is an Italian manufacturer of Computers printers and other business machines Italian ( or lingua italiana) is a Romance language spoken by about 63 million people as a First language, primarily in Italy. International Business Machines Corporation abbreviated IBM and nicknamed "Big Blue", is a multinational Computer Technology Portugal, officially the Portuguese Republic (República Portuguesa is a country on the Iberian Peninsula. Portuguese ( or língua portuguesa) is a Romance language that originated in what is now Galicia (Spain and northern Portugal. Ing C Olivetti & Co SpA, known as Olivetti, is an Italian manufacturer of Computers printers and other business machines Portuguese ( or língua portuguesa) is a Romance language that originated in what is now Galicia (Spain and northern Portugal. The binary numeral system, or base-2 number system, is a Numeral system that represents numeric values using two symbols usually 0 and 1. The decimal ( base ten or occasionally denary) Numeral system has ten as its base. In Mathematics and Computer science, hexadecimal (also base -, hexa, or hex) is a Numeral system with a 61 |
JA |
JA-O |
KR |
CN |
IRV |
GB |
DK |
NO |
NO-2 |
SE |
SE-C |
DE |
HU |
FR |
FR-0 |
CA-1 |
CA-2 |
IE |
IS |
ita |
por |
PT |
esp |
ES |
CU |
MT |
YU |
| 010 0010 |
34 |
22 |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
" |
| 010 0011 |
35 |
23 |
|
# |
# |
# |
# |
# |
# |
# |
£ |
# |
# |
§ |
# |
# |
# |
# |
£ |
£ |
# |
# |
£ |
# |
£ |
# |
£ |
# |
# |
# |
# |
# |
| 010 0100 |
36 |
24 |
|
$ |
¤ |
$ |
$ |
$ |
¥ |
$ |
$ |
$ |
$ |
$ |
¤ |
¤ |
$ |
¤ |
$ |
$ |
$ |
$ |
$ |
$ |
$ |
$ |
$ |
$ |
$ |
¤ |
$ |
$ |
| 010 1001 |
39 |
27 |
' |
' |
' |
' |
' |
' |
' |
’ |
’ |
’ |
’ |
’ |
’ |
’ |
’ |
’ |
’ |
’ |
’ |
’ |
’ |
' |
’ |
’ |
’ |
’ |
’ |
’ |
’ |
’ |
| 010 1100 |
44 |
2C |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
, |
| 010 1101 |
45 |
2D |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
- |
| 010 1111 |
47 |
2F |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
/ |
| 100 0000 |
64 |
40 |
|
@ |
@ |
@ |
@ |
@ |
@ |
@ |
@ |
@ |
@ |
@ |
@ |
É |
§ |
Á |
à |
à |
à |
à |
Ó |
Ð |
§ |
§ |
´ |
§ |
· |
@ |
@ |
Ž |
| 101 1011 |
91 |
5B |
|
[ |
[ |
[ |
[ |
[ |
[ |
[ |
[ |
Æ |
Æ |
Æ |
Ä |
Ä |
Ä |
É |
° |
° |
â |
â |
É |
Þ |
° |
à |
à |
¡ |
¡ |
¡ |
ġ |
Š |
| 101 1100 |
92 |
5C |
|
\ |
|
¥ |
¥ |
₩ |
\ |
\ |
\ |
Ø |
Ø |
Ø |
Ö |
Ö |
Ö |
Ö |
ç |
ç |
ç |
ç |
Í |
\ |
ç |
Ç |
Ç |
Ñ |
Ñ |
Ñ |
ż |
Đ |
| 101 1101 |
93 |
5D |
|
] |
] |
] |
] |
] |
] |
] |
] |
Å |
Å |
Å |
Å |
Å |
Ü |
Ü |
§ |
§ |
ê |
ê |
Ú |
Æ |
é |
Õ |
Õ |
¿ |
Ç |
] |
ħ |
Ć |
| 101 1110 |
94 |
5E |
|
^ |
|
^ |
^ |
^ |
^ |
ˆ |
ˆ |
ˆ |
ˆ |
ˆ |
ˆ |
Ü |
ˆ |
ˆ |
^ |
ˆ |
î |
É |
Á |
Ö |
ˆ |
ˆ |
ˆ |
ˆ |
¿ |
¿ |
ˆ |
Č |
| 101 1111 |
95 |
5F |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
_ |
| 110 0000 |
96 |
60 |
|
` |
|
` |
|
` |
` |
` |
` |
` |
` |
` |
` |
é |
` |
á |
µ |
µ |
ô |
ô |
ó |
ð |
ù |
` |
` |
` |
` |
` |
ċ |
ž |
| 111 1011 |
123 |
7B |
|
{ |
|
{ |
{ |
{ |
{ |
{ |
{ |
æ |
æ |
æ |
ä |
ä |
ä |
é |
é |
é |
é |
é |
é |
þ |
à |
ã |
ã |
° |
´ |
´ |
Ġ |
š |
| 111 1100 |
124 |
7C |
|
| |
| |
| |
| |
| |
| |
| |
| |
ø |
ø |
ø |
ö |
ö |
ö |
ö |
ù |
ù |
ù |
ù |
í |
| |
ò |
ç |
ç |
ñ |
ñ |
ñ |
Ż |
đ |
| 111 1101 |
125 |
7D |
|
} |
|
} |
} |
} |
} |
} |
} |
å |
å |
å |
å |
å |
ü |
ü |
è |
è |
è |
è |
ú |
æ |
è |
õ |
õ |
ç |
ç |
[ |
Ħ |
ć |
| 111 1110 |
126 |
7E |
|
~ |
|
‾ |
|
‾ |
~ |
˜ |
˜ |
˜ |
¯ |
| |
˜ |
ü |
ß |
˝ |
¨ |
¨ |
û |
û |
á |
ö |
ì |
° |
˜ |
˜ |
¨ |
¨ |
Ċ |
č |
In the table above, the cells with non-white background emphasize the differences from the US variant used in the Basic Latin subset of ISO/IEC 10646 and Unicode.
The characters displayed in cells with red background could be used as combining diacritics, when preceded or followed with a backspace C0 control (this encoding method is deprecated or is not recommended as it was part of some withdrawn national standards). In Digital typography, combining characters are characters that are intended to modify other characters Control character article i need to think about merging these In Computer software standards and documentation the term deprecation is applied to Software features that are superseded and should be avoided Without such complex encoding, they are no different from the symbols used in the US variant (although glyph variants are still possible, especially on the quotation marks, and circumflex or tilde symbols).
Later, when 8 bit character sets gained more acceptance, ISO 8859-1, ISO 8859-2, and ISO 8859-3 became the preferred method of coding most of these variants. ISO 8859-1, more formally cited as ISO/IEC 8859-1 is part 1 of ISO/IEC 8859, a standard Character encoding of the Latin alphabet. ISO 8859-2, more formally cited as ISO/IEC 8859-2 or less formally as Latin-2, is part 2 of ISO/IEC 8859, a standard Character encoding defined by ISO 8859-3, also known as Latin-3 or "South European" is an 8-bit Character encoding, part of the ISO 8859 standard
Variants of ASCII that are not ISO 646
There are also some 7-bit character sets that are not officially part of the ISO 646 standard. Examples include:
- 7-bit Greek, ELOT 927. The Greek alphabet is mapped to positions 0x61–0x71 and 0x73–0x79, on top of the Latin lowercase letters. The Greek alphabet (Ελληνικό αλφάβητο is a set of twenty-four letters that has been used to write the Greek language since the late 9th or early This mapping with the high bit set is ISO 8859-7. ISO 8859-7, also known as Greek, is an 8-bit Character encoding, part of the ISO 8859 standard
- 7-bit Cyrillic, KOI-7 or Short KOI. KOI7 is a 7-bit Character encoding, designed to cover Russian, which uses the Cyrillic alphabet The Cyrillic characters are mapped to positions 0x60–0x7E, on top of the Latin lowercase letters. Superseded by the KOI-8 variants. KOI8-R is an 8-bit Character encoding, designed to cover Russian, which uses the Cyrillic alphabet
- 7-bit Hebrew, SI 960. The Hebrew alphabet is mapped to positions 0x60–0x7A, on top of the lowercase Latin letters (and grave accent for aleph). The Hebrew alphabet (אָלֶף-בֵּית עִבְרִי alephbet ’ivri) consists of 22 letters used for writing the Hebrew language. 7-bit Hebrew was always stored in visual order. This mapping with the high bit set, i. e. with the Hebrew letters in 0xE0–0xFA, is ISO 8859-8. ISO 8859-8, more formally cited as ISO/IEC 8859-8 (but not as Latin-8! is part 8 of ISO/IEC 8859, a standard Character encoding
- 7-bit Arabic, ASMO 449. The Arabic alphabet is mapped to positions 0x41–0x5A and 0x60–0x6A, on top of both uppercase and lowercase Latin letters. The Arabic alphabet is the script used for writing several languages of Asia and Africa such as Arabic, Persian, and Urdu. This mapping with the high bit set is ISO 8859-6. ISO 8859-6, also known as Arabic, is an 8-bit Character encoding, part of the ISO 8859 standard
See also
External links
- Zeichensatz nach ISO 646 (ASCII) (in German)
- History at GNU Aspell website
- Character Tables by Koichi Yasuoka (see Domestic ISO646 Character Tables and Quasi-ISO646 Character Tables)
- Turkish Text Deasciifier a tool (based on statistical pentagram analysis of the Turkish language) which reverts an ASCII'fied Turkish text by determining the appropriate (but ambiguous) diacritics normally needed in Turkish but missing in the US-ASCII set. American Standard Code for Information Interchange ( ASCII) The term extended ASCII (or high ASCII) describes Eight-bit or larger Character encodings that include the standard seven- Bit Extended Binary Coded Decimal Interchange Code ( EBCDIC) is an 8- Bit Character encoding ( Code page) used on IBM mainframe Operating ISO 2022, more formally ISO/IEC 2022 "Information Technology—Character code structure and extension techniques" is an ISO standard (equivalent to the ISO/IEC 8859 is a joint ISO and IEC standard for 8-bit Character encodings for use by computers The Universal Character Set (UCS defined by the ISO / IEC 10646 International Standard, is a standard set of characters upon which In Computing, Unicode is an Industry standard allowing Computers to consistently represent and manipulate text expressed in most of the world's In the C family of Programming languages a trigraph is a sequence of three characters, the first two of which are both Question marks that represents ISO/IEC 6937 is a multibyte extension of ASCII, or rather of ISO/IEC 646 -IRV developed in common with ITU-T (then CCITT) for telematic services GNU Aspell, usually called just Aspell, is a Free software Spell checker designed to replace Ispell.
© 2009 citizendia.org; parts available under the terms of GNU Free Documentation License, from http://en.wikipedia.org
network: | |