Citizendia
Your Ad Here

ISO 8859-1, more formally cited as ISO/IEC 8859-1 is part 1 of ISO/IEC 8859, a standard character encoding of the Latin alphabet. ISO/IEC 8859 is a joint ISO and IEC standard for 8-bit Character encodings for use by computers A character encoding consists of a code that pairs a sequence of characters from a given character set (sometimes incorrectly referred to as Code page It is less formally referred to as Latin-1. It was originally developed by the ISO, but later jointly maintained by the ISO and the IEC. The International Electrotechnical Commission ( IEC) is a not-for-profit, non-governmental international Standards organization that prepares and publishes The standard, when supplemented with additional character assignments (in the C0 and C1 ranges: 0x00 to 0x1F and 0x7F, and 0x80 to 0x9F), is the basis of two widely-used character maps known as ISO-8859-1 (note the extra hyphen) and Windows-1252. Windows-1252 (also known as WinLatin1) is a Character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows

In June 2004, the ISO/IEC working group responsible for maintaining eight-bit coded character sets disbanded and ceased all maintenance of ISO 8859, including ISO 8859-1, in order to concentrate on the Universal Character Set and Unicode. "MMIV" redirects here For the Modest Mouse album see " Baron von Bullshit Rides Again " A bit is a binary digit, taking a value of either 0 or 1 Binary digits are a basic unit of Information storage and communication The Universal Character Set (UCS defined by the ISO / IEC 10646 International Standard, is a standard set of characters upon which In Computing, Unicode is an Industry standard allowing Computers to consistently represent and manipulate text expressed in most of the world's In computing applications, encodings that provide full UCS support (such as UTF-8 and UTF-16) are finding increasing favor over encodings based on ISO 8859-1. UTF-8 (8- Bit UCS / Unicode Transformation Format) is a variable-length Character encoding for Unicode. In Computing, UTF-16 (16- Bit Unicode Transformation Format is a variable-length Character encoding for Unicode, capable of encoding

Contents

Coverage

ISO 8859-1 encodes what it refers to as "Latin alphabet no. 1," consisting of 191 characters from the Latin script. For other uses see Character. In Computer and machine-based Telecommunications terminology a character is a unit of A writing system is a type of Symbolic system used to represent elements or statements expressible in Language. This character encoding is used throughout The Americas, Western Europe, Oceania, and much of Africa. The Americas are the lands of the Western hemisphere or New World, consisting of the Continents of North America and South America Western Europe at its most general meaning means 'all the countries in the West of Europe ' It is also commonly used in most standard romanizations of East-Asian languages.

Each character is encoded as a single eight-bit code value. These code values can be used in almost any data interchange system to communicate in the following European languages (with a few exceptions due to missing characters, as noted):

Modern languages with complete coverage of their alphabet
Languages commonly supported with nearly complete coverage of their alphabet
  • Dutch (missing IJ, ij but these should always be represented as IJ or ij in electronic form)
  • Estonian (missing Š, š, Ž, ž for loan words)
    • Note that Windows-1252 and ISO-8859-15 do contain these
  • French (missing Œ, œ and the very rare Ÿ; they are generally replaced by 'OE' and 'oe' without the normally required ligature, and 'Y' without the diaeresis)
    • Note that Windows-1252 and ISO-8859-15 do contain these
  • Finnish (missing Š, š, Ž, ž for loan words)
    • Note that Windows-1252 and ISO-8859-15 do contain these
  • Welsh (missing Ŵ, ŵ, Ŷ, ŷ)
Coverage of punctuation signs and apostrophes

For some languages listed above the correct typographical quotation marks are missing, for only « », " ", and ' ' are included. Afrikaans is an Indo-European language, derived from 17th century Dutch and classified as Low Franconian Germanic, mainly spoken in Albanian (sq ''Gjuha shqipe'' ˈɟuha ˈʃcipɛ is an Indo-European language spoken by nearly 6 million peoplewhile others claim that it derives from Daco - The Breton language ( Brezhoneg) formerly often called Armoric or Armorican, is a Celtic language spoken by some of the inhabitants of Brittany Danish ( d̥ænsɡ̊ is one of the North Germanic languages (also called Scandinavian languages a sub-group of the Germanic branch of the English is a West Germanic language originating in England and is the First language for most people in the United Kingdom, the United States Faroese ( føroyskt ˈføːɹɪst or) often also spelled Faeroese (cf The German language (de ''Deutsch'') is a West Germanic language and one of the world's major languages. Icelandic ( is a North Germanic language, the language of Iceland. Irish (ga ''Gaeilge'' is a Goidelic language of the Indo-European language family originating in Ireland and historically spoken by the Irish. Italian ( or lingua italiana) is a Romance language spoken by about 63 million people as a First language, primarily in Italy. The Kurdish alphabet is a Writing system for the Kurdish language. Latin ( lingua Latīna, laˈtiːna is an Italic language, historically spoken in Latium and Ancient Rome. Luxembourgish (lb Lëtzebuergesch Luxembourgeois Luxemburgisch Luxemburgs Lussimbordjwès also called Luxembourgian, also spelled Luxemburgish, is one of Norwegian ( norsk) is a North Germanic Language spoken primarily in Norway, where it is an official language Occitan ( IPA BrE: /ˈɒksɪtn/ AmE: /ˈɑksəˌtɑn/ known also as Lenga d'òc or Langue d'oc (native name occitan Portuguese ( or língua portuguesa) is a Romance language that originated in what is now Galicia (Spain and northern Portugal. Romansh or Romansch may refer to Romansh language Romansh people Scottish Gaelic ( Gàidhlig) is a member of the Goidelic branch of Celtic languages. Swahili (called Kiswahili in the language itself is the First language of the Swahili people (Waswahili who inhabit several large stretches Swedish ( is a North Germanic language spoken by more than nine million people predominantly in Sweden and parts of Finland, especially along the Walloon ( Walon) is a Romance language spoken as a second language by some in Wallonia, Belgium. Basque ( native name: euskara) is the Language spoken by the Basque people who inhabit the Pyrenees in North-Central Spain Dutch ( is a West Germanic language spoken by around 24 million people 22 million of which are from the Netherlands, Belgium and Suriname Estonian (; ˈeːsti ˈkeːl is the official language of Estonia, spoken by about 1 The Grapheme Š, š (Latin S with Háček) is used in various contexts usually denoting the Voiceless postalveolar fricative, including The grapheme Ž ( minuscule: ž) is formed from Latin Z with the addition of Háček. ISO 8859-15 is part 15 of ISO 8859, a standard Character encoding defined by International Organization for Standardization. French ( français,) is a Romance language spoken around the world by 118 million people as a native language and by about 180 to 260 million people This article is about the typographic ligature for other uses see Oe Œ Diaeresis or trema See also Diaeresis History Historically the diaeresis mark or trema is far older than the umlaut mark ISO 8859-15 is part 15 of ISO 8859, a standard Character encoding defined by International Organization for Standardization. Finnish ( or suomen kieli) is the language spoken by the majority of the population in Finland (92% As of 2006) and by ethnic Finns outside The Grapheme Š, š (Latin S with Háček) is used in various contexts usually denoting the Voiceless postalveolar fricative, including The grapheme Ž ( minuscule: ž) is formed from Latin Z with the addition of Háček. ISO 8859-15 is part 15 of ISO 8859, a standard Character encoding defined by International Organization for Standardization. Welsh ( cy Cymraeg or cy y Gymraeg, kəmˈrɑːɨɡ and {{IPA|[ə ɡəmˈrɑːɨɡ]}}, is a member of the Brythonic branch of Celtic Pitch The circumflex accent was first used in the polytonic orthography of Ancient Greek, where it occurred (subject to certain rules on the accented syllable Pitch The circumflex accent was first used in the polytonic orthography of Ancient Greek, where it occurred (subject to certain rules on the accented syllable ISO 8859-14, also known as Latin-8 or "Celtic" is an 8-bit Character encoding, part of the ISO 8859 standard Quotation marks or inverted commas (informally referred to as quotes and speech marks) are Punctuation marks used in pairs to set off speech

Also, this encoding does not provide the correct character for the apostrophe, and oriented single high quotation marks, although some texts use the spacing grave accent and spacing acute accent which are both part of ISO 8859-1, instead of the 6-shaped/9-shaped quotations marks or apostrophes (and this works reliably with some font styles where all these characters are displayed as slanted wedge glyphs).

See also: Alphabets derived from the Latin

History

ISO 8859-1 was based on the Multinational Character Set used by Digital Equipment Corporation in the popular VT220 terminal. Variants of the Latin alphabet are used by the Writing systems of many languages throughout the world The Multinational Character Set is a Character encoding created by Digital Equipment Corporation for use in the popular VT220 terminal. Digital Equipment Corporation was a pioneering American company in the Computer industry The VT220 was a terminal produced by Digital Equipment Corporation from 1983 to 1987. It was developed within ECMA, the European Computer Manufacturers Association, and published in March 1985 as ECMA-94, by which name it is still sometimes known. Ecma International (Ecma is an international private (membership-based non-profit Standards organization for information and communication systems The second edition of ECMA-94 (June 1986) also included ISO 8859-2, ISO 8859-3, and ISO 8859-4 as part of the specification


Relationship to ISO/IEC 8859-15

Although ISO/IEC 8859-1 has enough characters for most French text, it is missing a few letters that are less common. ISO 8859-2, more formally cited as ISO/IEC 8859-2 or less formally as Latin-2, is part 2 of ISO/IEC 8859, a standard Character encoding defined by ISO 8859-3, also known as Latin-3 or "South European" is an 8-bit Character encoding, part of the ISO 8859 standard ISO 8859-4, also known as Latin-4 or "North European" is an 8-bit Character encoding, part of the ISO 8859 standard It is also missing a single-glyph representation for the letter IJ, two Finnish letters used for transcription of some foreign names and in a few loanwords (Š and Ž), typographic quotation marks and dashes, and common symbols such as the euro sign (€) and dagger (†). Quotation marks or inverted commas (informally referred to as quotes and speech marks) are Punctuation marks used in pairs to set off speech A dash is a Punctuation mark It is longer than a Hyphen and is used differently The euro sign (€ is the Currency sign used for the Euro, the official currency of the European Union (EU A dagger ( †, &dagger U+ 2020 is a typographical symbol or Glyph.

In order to provide some of these characters, ISO/IEC 8859-15 was developed as an update of ISO/IEC 8859-1. ISO 8859-15 is part 15 of ISO 8859, a standard Character encoding defined by International Organization for Standardization. This required, however, the removal of some infrequently-used characters from ISO/IEC 8859-1, including fraction symbols and letter-free diacritics: ¤, ¦, ¨, ´, ¸, ¼, ½, and ¾.

Codepage layout

Since all 191 characters encoded by ISO/IEC 8859-1 are 'graphic' (ISO's term for characters that are not control codes) and are compatible with most web browsers, they can be shown as glyphs in the following table. A glyph is an element of writing Two or more glyphs representing the same symbol whether interchangeable or context-dependent are called Allographs the abstract unit they Since the space, no-break space, and soft hyphen characters would not normally be visible, they are represented by abbreviations for their names. All other characters are represented literally. Row and column headings indicate the hexadecimal digit combinations to produce the eight-bit code value; e. In Mathematics and Computer science, hexadecimal (also base -, hexa, or hex) is a Numeral system with a g. , the letter L is at code value 4C.

ISO/IEC 8859-1 (Latin-1)
—0 —1 —2 —3 —4 —5 —6 —7 —8 —9 —A —B —C —D —E —F
 
0−
 
 
1−
 
 
2−
 
SP
0020
32
!
0021
33
"
0022
34
#
0023
35
$
0024
36
%
0025
37
&
0026
38
'
0027
39
(
0028
40
)
0029
41
*
002A
42
+
002B
43
,
002C
44
-
002D
45
.
002E
46
/
002F
47
 
3−
 
0
0030
48
1
0031
49
2
0032
50
3
0033
51
4
0034
52
5
0035
53
6
0036
54
7
0037
55
8
0038
56
9
0039
57
:
003A
58
;
003B
59
<
003C
60
=
003D
61
>
003E
62
?
003F
63
 
4−
 
@
0040
64
A
0041
65
B
0042
66
C
0043
67
D
0044
68
E
0045
69
F
0046
70
G
0047
71
H
0048
72
I
0049
73
J
004A
74
K
004B
75
L
004C
76
M
004D
77
N
004E
78
O
004F
79
 
5−
 
P
0050
80
Q
0051
81
R
0052
82
S
0053
83
T
0054
84
U
0055
85
V
0056
86
W
0057
87
X
0058
88
Y
0059
89
Z
005A
90
[
005B
91
\
005C
92
]
005D
93
^
005E
94
_
005F
95
 
6−
 
`
0060
96
a
0061
97
b
0062
98
c
0063
99
d
0064
100
e
0065
101
f
0066
102
g
0067
103
h
0068
104
i
0069
105
j
006A
106
k
006B
107
l
006C
108
m
006D
109
n
006E
110
o
006F
111
 
7−
 
p
0070
112
q
0071
113
r
0072
114
s
0073
115
t
0074
116
u
0075
117
v
0076
118
w
0077
119
x
0078
120
y
0079
121
z
007A
122
{
007B
123
|
007C
124
}
007D
125
~
007E
126
 
8−
 
 
9−
 
 
A−
 
NBSP
00A0
160
¡
00A1
161
¢
00A2
162
£
00A3
163
¤
00A4
164
¥
00A5
165
¦
00A6
166
§
00A7
167
¨
00A8
168
©
00A9
169
ª
00AA
170
«
00AB
171
¬
00AC
172
SHY
00AD
173
®
00AE
174
¯
00AF
175
 
B−
 
°
00B0
176
±
00B1
177
²
00B2
178
³
00B3
179
´
00B4
180
µ
00B5
181

00B6
182
·
00B7
183
¸
00B8
184
¹
00B9
185
º
00BA
186
»
00BB
187
¼
00BC
188
½
00BD
189
¾
00BE
190
¿
00BF
191
 
C−
 
À
00C0
192
Á
00C1
193
Â
00C2
194
Ã
00C3
195
Ä
00C4
196
Å
00C5
197
Æ
00C6
198
Ç
00C7
199
È
00C8
200
É
00C9
201
Ê
00CA
202
Ë
00CB
203
Ì
00CC
204
Í
00CD
205
Î
00CE
206
Ï
00CF
207
 
D−
 
Ð
00D0
208
Ñ
00D1
209
Ò
00D2
210
Ó
00D3
211
Ô
00D4
212
Õ
00D5
213
Ö
00D6
214
×
00D7
215
Ø
00D8
216
Ù
00D9
217
Ú
00DA
218
Û
00DB
219
Ü
00DC
220
Ý
00DD
221
Þ
00DE
222
ß
00DF
223
 
E−
 
à
00E0
224
á
00E1
225
â
00E2
226
ã
00E3
227
ä
00E4
228
å
00E5
229
æ
00E6
230
ç
00E7
231
è
00E8
232
é
00E9
233
ê
00EA
234
ë
00EB
235
ì
00EC
236
í
00ED
237
î
00EE
238
ï
00EF
239
 
F−
 
ð
00F0
240
ñ
00F1
241
ò
00F2
242
ó
00F3
243
ô
00F4
244
õ
00F5
245
ö
00F6
246
÷
00F7
247
ø
00F8
248
ù
00F9
249
ú
00FA
250
û
00FB
251
ü
00FC
252
ý
00FD
253
þ
00FE
254
ÿ
00FF
255
—0 —1 —2 —3 —4 —5 —6 —7 —8 —9 —A —B —C —D —E —F

Code values 00–1F, 7F–9F are not assigned to characters by ISO/IEC 8859-1. In writing a space () is a blank area that is devoid of content which separates words letters numbers and punctuation Quotation marks or inverted commas (informally referred to as quotes and speech marks) are Punctuation marks used in pairs to set off speech Number sign is a name for the symbol #; it is the preferred Unicode name for the Code point associated with that Glyph. The percent sign ( %) is the symbol used to indicate a Percentage (that the preceding number is divided by one hundred An ampersand ( &) also commonly called an " 'and' sign," is a Logogram representing the conjunction "and" Brackets are Punctuation marks used in pairs to set apart or interject text within other text Brackets are Punctuation marks used in pairs to set apart or interject text within other text An asterisk ( *) (Latin asteriscum "little star" from Greek ἀστερίσκος) is a Typographical symbol or Glyph The plus and minus signs ( + and &minus) are Mathematical symbols used to represent the notions of positive and negative as well as the operations A comma ( ,   is a Punctuation mark It has the same shape as an Apostrophe or single closing Quotation mark in many typefaces but it differs The plus and minus signs ( + and &minus) are Mathematical symbols used to represent the notions of positive and negative as well as the operations A full stop or period (sometimes stop, full point, decimal point, or dot) is the Punctuation mark commonly placed at the The slash ( /) is a punctuation mark It is also called a virgule, diagonal, stroke, forward slash, oblique dash, Mathematics For any number x: x ·1 = 1· x = x (1 is the multiplicative identity In mathematics Two has many properties in Mathematics. An Integer is called Even if it is divisible by 2 ---- In mathematics Three is the first odd Prime number, and the second smallest prime In mathematics Four is the smallest Composite number, its proper Divisors being and. This article discusses the number five. For the year 5 AD see 5. In mathematics Six is the second smallest Composite number, its proper Divisors being 1, 2 and 3. In mathematics Seven is the fourth Prime number. It is not only a Mersenne prime (since 23 &minus 1 = 7 but also a In mathematics 8 is a Composite number, its Proper divisors being 1, 2, and 4. In mathematics Nine is a Composite number, its proper Divisors being 1 and 3. A semicolon (   ) is a conventional Punctuation mark with several usages Brackets are Punctuation marks used in pairs to set apart or interject text within other text History The "=" symbol that is now universally accepted by mathematics for equality was first recorded by Welsh mathematician Robert Recorde in The Brackets are Punctuation marks used in pairs to set apart or interject text within other text The question mark (? also known as an interrogation point, question point, query, or eroteme, is a punctuation mark that replaces The typographic character @, the at sign, denotes a pan-lingual abbreviation of the word 'at' The letter A is the first letter in the Latin alphabet. Its name in English is a (eɪ plural B is the second letter in the Latin alphabet. Its name in English is spelled bee or occasionally be (biː plural bees. C is the third letter in the Latin alphabet. Its name in English is spelled cee or occasionally ce (siː D is the fourth letter in the Latin alphabet. Its name in English is spelled dee or occasionally de (diː E is the fifth letter in the Latin alphabet. Its name in English is spelled e (iː plural es or ees (also written E's E F is the sixth letter in the Latin alphabet. Its name in English is spelled ef or eff (ɛf G is the seventh letter in the Latin alphabet. Its name in English is spelled gee or occasionally ge (dʒiː I is the ninth letter of the Latin alphabet. Its English name is i (aɪ J is the tenth letter in the modern Latin alphabet; it was the last of the 26 letters to be added K is the eleventh letter of the modern Latin alphabet. Its name in English is spelled kay (keɪ L is the twelfth letter of the Latin alphabet. Its name in English is el or occasionally ell (ɛl M is the thirteenth letter of the modern Latin alphabet. Its name in English is spelled em (ɛm N is the fourteenth letter in the Latin alphabet. Its name in English is spelled en (ɛn O is the fifteenth letter of the modern Latin Alphabet. Its name in English is spelled o (oʊ plural usually o's or os; sometimes P is the sixteenth letter of the modern Latin alphabet. Its name in English is spelled pee or occasionally pe (piː Q is the seventeenth letter of the modern Latin alphabet. Its name in English is spelled cue (kjuː R is the eighteenth letter of the modern Latin alphabet. Its name in English is spelled ar (ɑr pronounced or) S is the nineteenth letter in the modern Latin alphabet. Its name in English is spelled ess or occasionally es (ɛs generally es- T is the twentieth letter in the modern Latin alphabet. Its name in English is spelled tee or occasionally te (tiː U is the twenty-first letter in the modern Latin alphabet. Its name in English is spelled u (juː V is the twenty-second letter in the modern Latin alphabet. Its name in English is spelled vee or occasionally ve (viː W is the twenty-third letter in the Latin alphabet. Its name in English is spelled double-u (ˈdʌbljuː X is the twenty-fourth letter in the modern Latin alphabet. Its name in English is spelled ex or occasionally ecks (ɛks plural exes The letter Y is the twenty-fifth letter in the modern Latin alphabet. Z is the twenty-sixth and last letter of the modern Latin alphabet. Brackets are Punctuation marks used in pairs to set apart or interject text within other text The backslash ( \) is a typographical mark ( Glyph) used chiefly in Computing. Brackets are Punctuation marks used in pairs to set apart or interject text within other text Pitch The circumflex accent was first used in the polytonic orthography of Ancient Greek, where it occurred (subject to certain rules on the accented syllable The underscore _ (also called understrike, underbar, low line, or low dash is a character that originally appeared on the Typewriter. Pitch The grave accent was first used in the polytonic orthography of Ancient Greek, where it occurred only on the last syllable of a word in cases where the The letter A is the first letter in the Latin alphabet. Its name in English is a (eɪ plural B is the second letter in the Latin alphabet. Its name in English is spelled bee or occasionally be (biː plural bees. C is the third letter in the Latin alphabet. Its name in English is spelled cee or occasionally ce (siː D is the fourth letter in the Latin alphabet. Its name in English is spelled dee or occasionally de (diː E is the fifth letter in the Latin alphabet. Its name in English is spelled e (iː plural es or ees (also written E's E F is the sixth letter in the Latin alphabet. Its name in English is spelled ef or eff (ɛf G is the seventh letter in the Latin alphabet. Its name in English is spelled gee or occasionally ge (dʒiː I is the ninth letter of the Latin alphabet. Its English name is i (aɪ J is the tenth letter in the modern Latin alphabet; it was the last of the 26 letters to be added K is the eleventh letter of the modern Latin alphabet. Its name in English is spelled kay (keɪ L is the twelfth letter of the Latin alphabet. Its name in English is el or occasionally ell (ɛl M is the thirteenth letter of the modern Latin alphabet. Its name in English is spelled em (ɛm N is the fourteenth letter in the Latin alphabet. Its name in English is spelled en (ɛn O is the fifteenth letter of the modern Latin Alphabet. Its name in English is spelled o (oʊ plural usually o's or os; sometimes P is the sixteenth letter of the modern Latin alphabet. Its name in English is spelled pee or occasionally pe (piː Q is the seventeenth letter of the modern Latin alphabet. Its name in English is spelled cue (kjuː R is the eighteenth letter of the modern Latin alphabet. Its name in English is spelled ar (ɑr pronounced or) S is the nineteenth letter in the modern Latin alphabet. Its name in English is spelled ess or occasionally es (ɛs generally es- T is the twentieth letter in the modern Latin alphabet. Its name in English is spelled tee or occasionally te (tiː U is the twenty-first letter in the modern Latin alphabet. Its name in English is spelled u (juː V is the twenty-second letter in the modern Latin alphabet. Its name in English is spelled vee or occasionally ve (viː W is the twenty-third letter in the Latin alphabet. Its name in English is spelled double-u (ˈdʌbljuː X is the twenty-fourth letter in the modern Latin alphabet. Its name in English is spelled ex or occasionally ecks (ɛks plural exes The letter Y is the twenty-fifth letter in the modern Latin alphabet. Z is the twenty-sixth and last letter of the modern Latin alphabet. Brackets are Punctuation marks used in pairs to set apart or interject text within other text In Unix-like computer Operating systems a pipeline is the original software pipeline: a set of processes chained by their Standard Brackets are Punctuation marks used in pairs to set apart or interject text within other text The tilde (~ (/ˈtɪldə/ is a Grapheme with several uses The name of the character comes from Spanish, from the Latin titulus In computer-based Text processing and Digital typesetting, a non-breaking space or no-break space ( NBSP) is In many national currencies, the cent is a monetary unit that equals 1/100 of the basic monetary unit See also Pound (currency.The pound sign (" £ " or " ₤ " is the symbol for the Pound sterling —the currency of the The currency sign ( ¤) is a character used to denote a currency when the symbol for a particular currency is unavailable Note "broken bar" and the glyph "¦" redirect here The section sign (§ Unicode U+00A7 HTML entity &sect is a typographical character used mainly to refer to a particular section In Linguistics, diaeresis, or dieresis, is the pronunciation of two adjacent Vowels in two separate Syllables rather than as a Diphthong copyright symbol, designated by © (a circled "C" is the Symbol used to provide notice of Copyright in works other than sound recordings (which º redirects here It is not to be confused with the Degree symbol ° Quotation marks, also called quotes, speech marks or inverted commas, are Punctuation marks used in pairs to set off speech a quotation In Logic and Mathematics, negation or not is an operation on Logical values for example the logical value of a Proposition A hyphen ( -) is a Punctuation mark It is used for both Words to join and to separate Syllables It is often confused with the dashes A trademark or trade mark, represented by the symbols ™ and ®, or mark is a distinctive sign or indicator used by an individual A macron, from Greek el μακρόv ( makrón) meaning "long" is a Diacritic ¯ placed over or under a Vowel which was originally The degree symbol (° Unicode: U+00B0 HTML: &deg is a typographical symbol or Glyph, that is used to represent degrees of arc (see ± The In Algebra, the square of a number is that number multiplied by itself In Arithmetic and Algebra, the cube of a number n is its third power &mdash the result of multiplying it by itself three times History An early precursor of the acute accent was the apex, used in Latin inscriptions to mark long vowels. micro- ( µ) is a prefix in the SI and other systems of units denoting a factor of 10&minus6 (one Millionth. The pilcrow (¶ Unicode U+00B6 HTML entity &para also called the Paragraph sign or the alinea ( An interpunct ( ·) is a small dot used for Interword separation in ancient Latin script, being perhaps the first consistent visual representation of word boundaries Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. º redirects here It is not to be confused with the Degree symbol ° Quotation marks, also called quotes, speech marks or inverted commas, are Punctuation marks used in pairs to set off speech a quotation The question mark (? also known as an interrogation point, question point, query, or eroteme, is a punctuation mark that replaces Pitch The grave accent was first used in the polytonic orthography of Ancient Greek, where it occurred only on the last syllable of a word in cases where the Á, á ( A - acute) is a letter of the Czech, Faroese, Hungarian, Icelandic, and Slovak languages Â, â ( A - Circumflex) is a letter of the Romanian and Vietnamese alphabets à / ã ( A - Tilde) is a letter used in some languages generally considered a variant of the letter A. " Ä " or " ä " is a character which represents either a letter from several extended Latin alphabets or the letter A with The letter Å represents various sounds in the Swedish, Finnish (although no native Finnish words contain the letter å Danish, Norwegian Æ ( minuscule: æ) is a Grapheme formed from the letters A and E. È can be The letter E with a Grave accent. 鄂 or È is an abbreviation for the Hubei province of the É, é ( E - acute) is a letter of Hungarian, Icelandic, Kashubian, Czech, Slovak, and Uyghur language Ê, ê ( E - Circumflex) is a letter of Kurdish and Vietnamese language. Ë, ë ( E - umlaut or diaeresis) is a letter of Albanian and Kashubian language. Letter Ü The letter Ü occurs in Hungarian, Karelian, Turkish, Estonian, Azeri, Turkmen, Crimean Í, í ( I - acute) is a letter of Faroese, Hungarian, Icelandic, Czech, Slovak, and Tatar language Î, î ( I - Circumflex) is a letter of Kurdish and Romanian language. Ï is a symbol used in various languages written with the Latin alphabet. Eth ( Ð, ð; also spelled edh or eð) is a letter used in Old English, Icelandic, Faroese (in Ñ (lower case ñ) is a letter of the modern Roman alphabet formed by an N with a diacritical Tilde. Ò, ò ( O - grave) is a letter of Kashubian language. This letter also appears in Catalan, Italian, Occitan, Ó, ó ( O - acute) is a letter in the Faroese, Hungarian, Icelandic, Kashubian, Polish, Czech Pitch The circumflex accent was first used in the polytonic orthography of Ancient Greek, where it occurred (subject to certain rules on the accented syllable Estonian In Estonian, Õ is the 27th letter of the alphabet (between W and Ä) representing the Close-mid back unrounded vowel O-Umlaut The glyph O with Umlaut appears in the German alphabet. The multiplication sign is the symbol × ( multiplication sign is the preferred Unicode name for the Codepoint represented by that Glyph The " Ø " ( minuscule: " ø " is a Vowel and a letter used in the Danish, Faroese and Norwegian Pitch The grave accent was first used in the polytonic orthography of Ancient Greek, where it occurred only on the last syllable of a word in cases where the Ê, ê ( E - Circumflex) is a letter of Kurdish and Vietnamese language. Û is used in the ISO 91995 system of Cyrillic transliteration as the letter Ю. Letter Ü The letter Ü occurs in Hungarian, Karelian, Turkish, Estonian, Azeri, Turkmen, Crimean History An early precursor of the acute accent was the apex, used in Latin inscriptions to mark long vowels. Thorn, or þorn (Þ þ is a letter in the Anglo-Saxon and Icelandic Alphabets It was also used in Medieval Scandinavia The letter ß ( Unicode U+00DF is a letter in the German alphabet. Pitch The grave accent was first used in the polytonic orthography of Ancient Greek, where it occurred only on the last syllable of a word in cases where the Á, á ( A - acute) is a letter of the Czech, Faroese, Hungarian, Icelandic, and Slovak languages Â, â ( A - Circumflex) is a letter of the Romanian and Vietnamese alphabets à / ã ( A - Tilde) is a letter used in some languages generally considered a variant of the letter A. " Ä " or " ä " is a character which represents either a letter from several extended Latin alphabets or the letter A with The letter Å represents various sounds in the Swedish, Finnish (although no native Finnish words contain the letter å Danish, Norwegian Æ ( minuscule: æ) is a Grapheme formed from the letters A and E. Ç, ç ( C - Cedilla) is a letter of Albanian, Turkish, Azerbaijani, Turkmen, Tatar, Kurdish language È can be The letter E with a Grave accent. 鄂 or È is an abbreviation for the Hubei province of the É, é ( E - acute) is a letter of Hungarian, Icelandic, Kashubian, Czech, Slovak, and Uyghur language Ê, ê ( E - Circumflex) is a letter of Kurdish and Vietnamese language. Ë, ë ( E - umlaut or diaeresis) is a letter of Albanian and Kashubian language. Letter Ü The letter Ü occurs in Hungarian, Karelian, Turkish, Estonian, Azeri, Turkmen, Crimean Í, í ( I - acute) is a letter of Faroese, Hungarian, Icelandic, Czech, Slovak, and Tatar language Î, î ( I - Circumflex) is a letter of Kurdish and Romanian language. Ï is a symbol used in various languages written with the Latin alphabet. Eth ( Ð, ð; also spelled edh or eð) is a letter used in Old English, Icelandic, Faroese (in Ñ (lower case ñ) is a letter of the modern Roman alphabet formed by an N with a diacritical Tilde. Ò, ò ( O - grave) is a letter of Kashubian language. This letter also appears in Catalan, Italian, Occitan, Ó, ó ( O - acute) is a letter in the Faroese, Hungarian, Icelandic, Kashubian, Polish, Czech Pitch The circumflex accent was first used in the polytonic orthography of Ancient Greek, where it occurred (subject to certain rules on the accented syllable Estonian In Estonian, Õ is the 27th letter of the alphabet (between W and Ä) representing the Close-mid back unrounded vowel O-Umlaut The glyph O with Umlaut appears in the German alphabet. The word " obelus " is also an alternative name for the dagger († symbol The " Ø " ( minuscule: " ø " is a Vowel and a letter used in the Danish, Faroese and Norwegian Pitch The grave accent was first used in the polytonic orthography of Ancient Greek, where it occurred only on the last syllable of a word in cases where the Ê, ê ( E - Circumflex) is a letter of Kurdish and Vietnamese language. Û is used in the ISO 91995 system of Cyrillic transliteration as the letter Ю. Letter Ü The letter Ü occurs in Hungarian, Karelian, Turkish, Estonian, Azeri, Turkmen, Crimean History An early precursor of the acute accent was the apex, used in Latin inscriptions to mark long vowels. Thorn, or þorn (Þ þ is a letter in the Anglo-Saxon and Icelandic Alphabets It was also used in Medieval Scandinavia Diaeresis or trema See also Diaeresis History Historically the diaeresis mark or trema is far older than the umlaut mark

The lower range 20 to 7E (the G0 subset) maps exactly to the same coded G0 subset of the ISO 646 US variant (commonly known as ASCII), whose ISO 2022 standard switch sequence is "ESC ( B". American Standard Code for Information Interchange ( ASCII) ISO 2022, more formally ISO/IEC 2022 "Information Technology—Character code structure and extension techniques" is an ISO standard (equivalent to the The higher range A0 to FF (the G1 subset) maps exactly to the same subset initiated by the ISO 2022 standard switch sequence "ESC . ISO 2022, more formally ISO/IEC 2022 "Information Technology—Character code structure and extension techniques" is an ISO standard (equivalent to the A".

Related character maps

The ISO/IEC 8859-1 standard has long been the basis of a number of character maps, also known as character sets, charsets, or code pages, the most popular being ISO-8859-1 (note the extra hyphen) and Windows-1252. Windows-1252 (also known as WinLatin1) is a Character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows Both of these maps are a superset of ISO/IEC 8859-1; they supplement the standard's 191 character assignments by mapping additional characters to at least some portion of the code value ranges 00–1F, 7F, and 80–9F.

ISO-8859-1

In 1992, the IANA registered the character map ISO_8859-1:1987, more commonly known by its preferred MIME name of ISO-8859-1 (note the extra hyphen over ISO 8859-1), a superset of ISO 8859-1, for use on the Internet. Year 1992 ( MCMXCII) was a Leap year starting on Wednesday (link will display full 1992 Gregorian calendar) The Internet Assigned Numbers Authority (IANA is the entity that oversees global IP address allocation, DNS root zone management, media types Multipurpose Internet Mail Extensions ( MIME) is an Internet standard that extends the format of e-mail to support text in Character The Internet is a global system of interconnected Computer networks This map assigns the C0 and C1 control characters to the code values 00–1F, 7F, and 80–9F. Control character article i need to think about merging these It thus provides for 256 characters via every possible 8-bit value.

ISO-8859-1 is (according to the standards at least) the default encoding of documents delivered via HTTP with a MIME type beginning with "text/". Hypertext Transfer Protocol ( HTTP) is a Communications protocol for the transfer of information on the Internet. An Internet media type, originally called a MIME type after MIME and sometimes a Content-type after the name of a header in several protocols whose value It is the default encoding of the values of certain descriptive HTTP headers, and is the standard encoding used by the X Window System on most Unix machines in locales which use that character set. Unix (officially trademarked as UNIX, sometimes also written as Unix with Small caps) is a computer It was also the basis of the repertoire of characters allowed in HTML 3. HTML, an initialism of HyperText Markup Language, is the predominant Markup language for Web pages It provides a means to describe the structure 2 documents (HTML 4. 0, however, is based on Unicode). In Computing, Unicode is an Industry standard allowing Computers to consistently represent and manipulate text expressed in most of the world's

Escape sequences (from ISO/IEC 6429 or ISO/IEC 2022) are not to be interpreted in documents labeled as ISO-8859-1 encoded. ISO 2022, more formally ISO/IEC 2022 "Information Technology—Character code structure and extension techniques" is an ISO standard (equivalent to the As well as the canonical name and preferred MIME name mentioned above, the following other aliases are registered for ISO-8859-1: ISO_8859-1, ISO-8859-1, iso-ir-100, csISOLatin1, latin1, l1, IBM819, CP819. ISO-8859-1 was also incorporated as the first 256 code points of ISO/IEC 10646 and Unicode. The Universal Character Set (UCS defined by the ISO / IEC 10646 International Standard, is a standard set of characters upon which In Computing, Unicode is an Industry standard allowing Computers to consistently represent and manipulate text expressed in most of the world's

Code point Control character Abbreviation
00 Null NUL
01 Start Of Heading SOH
02 Start of Text STX
03 End of Text ETX
04 End Of Transmission EOT
05 Enquiry ENQ
06 Acknowledge ACK
07 Bell BEL
08 Backspace BS
09 Horizontal Tab HT
0A Line Feed LF
0B Vertical Tab VT
0C Form Feed FF
0D Carriage Return CR
0E Shift Out SO
0F Shift In SI
10 Data Link Escape DLE
11 Device Control 1 DC1
12 Device Control 2 DC2
13 Device Control 3 DC3
14 Device Control 4 DC4
15 Negative Acknowledge NAK
16 Synchronous idle SYN
17 End of Transmission Block ETB
18 Cancel CAN
19 End of Medium EM
1A Substitute (character) SUB
1B Escape character ESC
1C File separator FS
1D Group separator GS
1E Record separator RS
1F Unit separator US
7F Delete DEL
 
Code point Control character Abbreviation
80 Padding Character PAD
81 High Octet Preset HOP
82 Break Permitted Here BPH
83 No Break Here NBH
84 Index IND
85 Next Line NEL
86 Start of Selected Area SSA
87 End of Selected Area ESA
88 Character Tabulation Set HTS
89 Character Tabulation with Justification HTJ
8A Line Tabulation Set VTS
8B Partial Line Forward PLD
8C Partial Line Backward PLU
8D Reverse Line Feed RI
8E Single Shift 2 SS2
8F Single Shift 3 SS3
90 Device Control String DCS
91 Private Use 1 PU1
92 Private Use 2 PU2
93 Set Transmit State STS
94 Cancel Character CCH
95 Message Waiting MW
96 Start of Guarded Area SPA
97 End of Guarded Area EPA
98 Start of String SOS
99 Single Graphic Character Introducer SGCI
9A Single Character Introducer SCI
9B Control Sequence Introducer CSI
9C String Terminator ST
9D Operating System Command OSC
9E Privacy Message PM
9F Application Program Command APC
ISO-8859-1
—0 —1 —2 —3 —4 —5 —6 —7 —8 —9 —A —B —C —D —E —F
 
0−
 
NUL
0000
0
SOH
0001
1
STX
0002
2
ETX
0003
3
EOT
0004
4
ENQ
0005
5
ACK
0006
6
BEL
0007
7
BS
0008
8
HT
0009
9
LF
000A
10
VT
000B
11
FF
000C
12
CR
000D
13
SO
000E
14
SI
000F
15
 
1−
 
DLE
0010
16
DC1
0011
17
DC2
0012
18
DC3
0013
19
DC4
0014
20
NAK
0015
21
SYN
0016
22
ETB
0017
23
CAN
0018
24
EM
0019
25
SUB
001A
26
ESC
001B
27
FS
001C
28
GS
001D
29
RS
001E
30
US
001F
31
 
2−
 
SP
0020
32
!
0021
33
"
0022
34
#
0023
35
$
0024
36
%
0025
37
&
0026
38
'
0027
39
(
0028
40
)
0029
41
*
002A
42
+
002B
43
,
002C
44
-
002D
45
.
002E
46
/
002F
47
 
3−
 
0
0030
48
1
0031
49
2
0032
50
3
0033
51
4
0034
52
5
0035
53
6
0036
54
7
0037
55
8
0038
56
9
0039
57
:
003A
58
;
003B
59
<
003C
60
=
003D
61
>
003E
62
?
003F
63
 
4−
 
@
0040
64
A
0041
65
B
0042
66
C
0043
67
D
0044
68
E
0045
69
F
0046
70
G
0047
71
H
0048
72
I
0049
73
J
004A
74
K
004B
75
L
004C
76
M
004D
77
N
004E
78
O
004F
79
 
5−
 
P
0050
80
Q
0051
81
R
0052
82
S
0053
83
T
0054
84
U
0055
85
V
0056
86
W
0057
87
X
0058
88
Y
0059
89
Z
005A
90
[
005B
91
\
005C
92
]
005D
93
^
005E
94
_
005F
95
 
6−
 
`
0060
96
a
0061
97
b
0062
98
c
0063
99
d
0064
100
e
0065
101
f
0066
102
g
0067
103
h
0068
104
i
0069
105
j
006A
106
k
006B
107
l
006C
108
m
006D
109
n
006E
110
o
006F
111
 
7−
 
p
0070
112
q
0071
113
r
0072
114
s
0073
115
t
0074
116
u
0075
117
v
0076
118
w
0077
119
x
0078
120
y
0079
121
z
007A
122
{
007B
123
|
007C
124
}
007D
125
~
007E
126
DEL
007F
127
 
8−
 
PAD
0080
128
HOP
0081
129
BPH
0082
130
NBH
0083
131
IND
0084
132
NEL
0085
133
SSA
0086
134
ESA
0087
135
HTS
0088
136
HTJ
0089
137
VTS
008A
138
PLD
008B
139
PLU
008C
140
RI
008D
141
SS2
008E
142
SS3
008F
143
 
9−
 
DCS
0090
144
PU1
0091
145
PU2
0092
146
STS
0093
147
CCH
0094
148
MW
0095
149
SPA
0096
150
EPA
0097
151
SOS
0098
152
SGCI
0099
153
SCI
009A
154
CSI
009B
155
ST
009C
156
OSC
009D
157
PM
009E
158
APC
009F
159
 
A−
 
NBSP
00A0
160
¡
00A1
161
¢
00A2
162
£
00A3
163
¤
00A4
164
¥
00A5
165
¦
00A6
166
§
00A7
167
¨
00A8
168
©
00A9
169
ª
00AA
170
«
00AB
171
¬
00AC
172
SHY
00AD
173
®
00AE
174
¯
00AF
175
 
B−
 
°
00B0
176
±
00B1
177
²
00B2
178
³
00B3
179
´
00B4
180
µ
00B5
181

00B6
182
·
00B7
183
¸
00B8
184
¹
00B9
185
º
00BA
186
»
00BB
187
¼
00BC
188
½
00BD
189
¾
00BE
190
¿
00BF
191
 
C−
 
À
00C0
192
Á
00C1
193
Â
00C2
194
Ã
00C3
195
Ä
00C4
196
Å
00C5
197
Æ
00C6
198
Ç
00C7
199
È
00C8
200
É
00C9
201
Ê
00CA
202
Ë
00CB
203
Ì
00CC
204
Í
00CD
205
Î
00CE
206
Ï
00CF
207
 
D−
 
Ð
00D0
208
Ñ
00D1
209
Ò
00D2
210
Ó
00D3
211
Ô
00D4
212
Õ
00D5
213
Ö
00D6
214
×
00D7
215
Ø
00D8
216
Ù
00D9
217
Ú
00DA
218
Û
00DB
219
Ü
00DC
220
Ý
00DD
221
Þ
00DE
222
ß
00DF
223
 
E−
 
à
00E0
224
á
00E1
225
â
00E2
226
ã
00E3
227
ä
00E4
228
å
00E5
229
æ
00E6
230
ç
00E7
231
è
00E8
232
é
00E9
233
ê
00EA
234
ë
00EB
235
ì
00EC
236
í
00ED
237
î
00EE
238
ï
00EF
239
 
F−
 
ð
00F0
240
ñ
00F1
241
ò
00F2
242
ó
00F3
243
ô
00F4
244
õ
00F5
245
ö
00F6
246
÷
00F7
247
ø
00F8
248
ù
00F9
249
ú
00FA
250
û
00FB
251
ü
00FC
252
ý
00FD
253
þ
00FE
254
ÿ
00FF
255
—0 —1 —2 —3 —4 —5 —6 —7 —8 —9 —A —B —C —D —E —F

Note that most of these control characters are not made for use in portable ISO-8859-1 encoded plain text documents, but only within specific protocols or devices, except a few ones whose behavior are standardized: TAB (09), LF (0A), CR (0D) and NEL (85); all but the first one are used to encode end of lines or to separate paragraphs, and TAB is often considered equivalent to whitespace. The null character (also null terminator) is a character with the value zero present in the ASCII and Unicode character sets and available Control character article i need to think about merging these Control character article i need to think about merging these The End Of Text character (ETX is an ASCII Control character used to inform the receiving computer that the end of the data stream has been reached In Telecommunication, an end-of-transmission character (EOT is a transmission Control character used to indicate the conclusion of a transmission that For Teleprinters Acknowledge character (ACK is a transmission control character transmitted by the receiving station as an affirmative response to the sending station Bell character is an ASCII Control character, code 7 (^G When it is sent to a printer or a terminal, nothing is printed but an Backspace is the keyboard key that originally pushed the Typewriter carriage one position backwards and in modern computer displays moves the cursor one position backwards Tab key (abbreviation of tabulator key) on a keyboard is used to advance the cursor to the next Tab stop. In Computing, a newline (also known as a line break or end-of-line / EOL character is a special character or sequence of characters Tab key (abbreviation of tabulator key) on a keyboard is used to advance the cursor to the next Tab stop. A page break is a marker in an electronic Document, which tells the document interpreter that the contents which follows is part of a new page Originally carriage return was the term for the control character in Baudot code on a teletypewriter for end of line Return to beginning of line and Shift Out (SO and Shift In (SI are ASCII Control characters 14 and 15 respectively (0xE and 0xF Shift Out (SO and Shift In (SI are ASCII Control characters 14 and 15 respectively (0xE and 0xF Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these In Telecommunications a negative-acknowledge character (NAK is a transmission Control character sent by a station as a negative Response Control character article i need to think about merging these In the C0 control code set used in ASCII, ETB is a short name for the "End Transmission Block" control character (code 23 or 0x17 In Telecommunication, the term cancel character has the following meanings A precision Control character (In Unicode, the Control character article i need to think about merging these Substitute character (␚ A control character that is used in the place of a character that is recognized to be invalid or in error or that cannot be represented on a given device In Computing and Telecommunication, an escape character is a single character which in a sequence of characters signifies that what is to follow takes an alternative Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these The delete key, known less ambiguously as forward delete, Del, or ⌦, performs a function when struck on a Computer In Telecommunication, the term cancel character has the following meanings A precision Control character (In Unicode, the ANSI escape codes are used to control text formatting and other output options on Text terminals In this context ANSI refers to the ANSI X3 The null character (also null terminator) is a character with the value zero present in the ASCII and Unicode character sets and available Control character article i need to think about merging these Control character article i need to think about merging these The End Of Text character (ETX is an ASCII Control character used to inform the receiving computer that the end of the data stream has been reached In Telecommunication, an end-of-transmission character (EOT is a transmission Control character used to indicate the conclusion of a transmission that Control character article i need to think about merging these For Teleprinters Acknowledge character (ACK is a transmission control character transmitted by the receiving station as an affirmative response to the sending station Bell character is an ASCII Control character, code 7 (^G When it is sent to a printer or a terminal, nothing is printed but an Backspace is the keyboard key that originally pushed the Typewriter carriage one position backwards and in modern computer displays moves the cursor one position backwards Tab key (abbreviation of tabulator key) on a keyboard is used to advance the cursor to the next Tab stop. In Computing, a newline (also known as a line break or end-of-line / EOL character is a special character or sequence of characters Tab key (abbreviation of tabulator key) on a keyboard is used to advance the cursor to the next Tab stop. A page break is a marker in an electronic Document, which tells the document interpreter that the contents which follows is part of a new page Originally carriage return was the term for the control character in Baudot code on a teletypewriter for end of line Return to beginning of line and Shift Out (SO and Shift In (SI are ASCII Control characters 14 and 15 respectively (0xE and 0xF Shift Out (SO and Shift In (SI are ASCII Control characters 14 and 15 respectively (0xE and 0xF Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these In Telecommunications a negative-acknowledge character (NAK is a transmission Control character sent by a station as a negative Response Control character article i need to think about merging these In the C0 control code set used in ASCII, ETB is a short name for the "End Transmission Block" control character (code 23 or 0x17 Control character article i need to think about merging these Substitute character (␚ A control character that is used in the place of a character that is recognized to be invalid or in error or that cannot be represented on a given device In Computing and Telecommunication, an escape character is a single character which in a sequence of characters signifies that what is to follow takes an alternative Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these Control character article i need to think about merging these In writing a space () is a blank area that is devoid of content which separates words letters numbers and punctuation Quotation marks or inverted commas (informally referred to as quotes and speech marks) are Punctuation marks used in pairs to set off speech Number sign is a name for the symbol #; it is the preferred Unicode name for the Code point associated with that Glyph. The percent sign ( %) is the symbol used to indicate a Percentage (that the preceding number is divided by one hundred An ampersand ( &) also commonly called an " 'and' sign," is a Logogram representing the conjunction "and" Brackets are Punctuation marks used in pairs to set apart or interject text within other text Brackets are Punctuation marks used in pairs to set apart or interject text within other text An asterisk ( *) (Latin asteriscum "little star" from Greek ἀστερίσκος) is a Typographical symbol or Glyph The plus and minus signs ( + and &minus) are Mathematical symbols used to represent the notions of positive and negative as well as the operations A comma ( ,   is a Punctuation mark It has the same shape as an Apostrophe or single closing Quotation mark in many typefaces but it differs The plus and minus signs ( + and &minus) are Mathematical symbols used to represent the notions of positive and negative as well as the operations A full stop or period (sometimes stop, full point, decimal point, or dot) is the Punctuation mark commonly placed at the The slash ( /) is a punctuation mark It is also called a virgule, diagonal, stroke, forward slash, oblique dash, Mathematics For any number x: x ·1 = 1· x = x (1 is the multiplicative identity In mathematics Two has many properties in Mathematics. An Integer is called Even if it is divisible by 2 ---- In mathematics Three is the first odd Prime number, and the second smallest prime In mathematics Four is the smallest Composite number, its proper Divisors being and. This article discusses the number five. For the year 5 AD see 5. In mathematics Six is the second smallest Composite number, its proper Divisors being 1, 2 and 3. In mathematics Seven is the fourth Prime number. It is not only a Mersenne prime (since 23 &minus 1 = 7 but also a In mathematics 8 is a Composite number, its Proper divisors being 1, 2, and 4. In mathematics Nine is a Composite number, its proper Divisors being 1 and 3. A semicolon (   ) is a conventional Punctuation mark with several usages Brackets are Punctuation marks used in pairs to set apart or interject text within other text History The "=" symbol that is now universally accepted by mathematics for equality was first recorded by Welsh mathematician Robert Recorde in The Brackets are Punctuation marks used in pairs to set apart or interject text within other text The question mark (? also known as an interrogation point, question point, query, or eroteme, is a punctuation mark that replaces The typographic character @, the at sign, denotes a pan-lingual abbreviation of the word 'at' The letter A is the first letter in the Latin alphabet. Its name in English is a (eɪ plural B is the second letter in the Latin alphabet. Its name in English is spelled bee or occasionally be (biː plural bees. C is the third letter in the Latin alphabet. Its name in English is spelled cee or occasionally ce (siː D is the fourth letter in the Latin alphabet. Its name in English is spelled dee or occasionally de (diː E is the fifth letter in the Latin alphabet. Its name in English is spelled e (iː plural es or ees (also written E's E F is the sixth letter in the Latin alphabet. Its name in English is spelled ef or eff (ɛf G is the seventh letter in the Latin alphabet. Its name in English is spelled gee or occasionally ge (dʒiː I is the ninth letter of the Latin alphabet. Its English name is i (aɪ J is the tenth letter in the modern Latin alphabet; it was the last of the 26 letters to be added K is the eleventh letter of the modern Latin alphabet. Its name in English is spelled kay (keɪ L is the twelfth letter of the Latin alphabet. Its name in English is el or occasionally ell (ɛl M is the thirteenth letter of the modern Latin alphabet. Its name in English is spelled em (ɛm N is the fourteenth letter in the Latin alphabet. Its name in English is spelled en (ɛn O is the fifteenth letter of the modern Latin Alphabet. Its name in English is spelled o (oʊ plural usually o's or os; sometimes P is the sixteenth letter of the modern Latin alphabet. Its name in English is spelled pee or occasionally pe (piː Q is the seventeenth letter of the modern Latin alphabet. Its name in English is spelled cue (kjuː R is the eighteenth letter of the modern Latin alphabet. Its name in English is spelled ar (ɑr pronounced or) S is the nineteenth letter in the modern Latin alphabet. Its name in English is spelled ess or occasionally es (ɛs generally es- T is the twentieth letter in the modern Latin alphabet. Its name in English is spelled tee or occasionally te (tiː U is the twenty-first letter in the modern Latin alphabet. Its name in English is spelled u (juː V is the twenty-second letter in the modern Latin alphabet. Its name in English is spelled vee or occasionally ve (viː W is the twenty-third letter in the Latin alphabet. Its name in English is spelled double-u (ˈdʌbljuː X is the twenty-fourth letter in the modern Latin alphabet. Its name in English is spelled ex or occasionally ecks (ɛks plural exes The letter Y is the twenty-fifth letter in the modern Latin alphabet. Z is the twenty-sixth and last letter of the modern Latin alphabet. Brackets are Punctuation marks used in pairs to set apart or interject text within other text The backslash ( \) is a typographical mark ( Glyph) used chiefly in Computing. Brackets are Punctuation marks used in pairs to set apart or interject text within other text Pitch The circumflex accent was first used in the polytonic orthography of Ancient Greek, where it occurred (subject to certain rules on the accented syllable The underscore _ (also called understrike, underbar, low line, or low dash is a character that originally appeared on the Typewriter. Pitch The grave accent was first used in the polytonic orthography of Ancient Greek, where it occurred only on the last syllable of a word in cases where the The letter A is the first letter in the Latin alphabet. Its name in English is a (eɪ plural B is the second letter in the Latin alphabet. Its name in English is spelled bee or occasionally be (biː plural bees. C is the third letter in the Latin alphabet. Its name in English is spelled cee or occasionally ce (siː D is the fourth letter in the Latin alphabet. Its name in English is spelled dee or occasionally de (diː E is the fifth letter in the Latin alphabet. Its name in English is spelled e (iː plural es or ees (also written E's E F is the sixth letter in the Latin alphabet. Its name in English is spelled ef or eff (ɛf G is the seventh letter in the Latin alphabet. Its name in English is spelled gee or occasionally ge (dʒiː I is the ninth letter of the Latin alphabet. Its English name is i (aɪ J is the tenth letter in the modern Latin alphabet; it was the last of the 26 letters to be added K is the eleventh letter of the modern Latin alphabet. Its name in English is spelled kay (keɪ L is the twelfth letter of the Latin alphabet. Its name in English is el or occasionally ell (ɛl M is the thirteenth letter of the modern Latin alphabet. Its name in English is spelled em (ɛm N is the fourteenth letter in the Latin alphabet. Its name in English is spelled en (ɛn O is the fifteenth letter of the modern Latin Alphabet. Its name in English is spelled o (oʊ plural usually o's or os; sometimes P is the sixteenth letter of the modern Latin alphabet. Its name in English is spelled pee or occasionally pe (piː Q is the seventeenth letter of the modern Latin alphabet. Its name in English is spelled cue (kjuː R is the eighteenth letter of the modern Latin alphabet. Its name in English is spelled ar (ɑr pronounced or) S is the nineteenth letter in the modern Latin alphabet. Its name in English is spelled ess or occasionally es (ɛs generally es- T is the twentieth letter in the modern Latin alphabet. Its name in English is spelled tee or occasionally te (tiː U is the twenty-first letter in the modern Latin alphabet. Its name in English is spelled u (juː V is the twenty-second letter in the modern Latin alphabet. Its name in English is spelled vee or occasionally ve (viː W is the twenty-third letter in the Latin alphabet. Its name in English is spelled double-u (ˈdʌbljuː X is the twenty-fourth letter in the modern Latin alphabet. Its name in English is spelled ex or occasionally ecks (ɛks plural exes The letter Y is the twenty-fifth letter in the modern Latin alphabet. Z is the twenty-sixth and last letter of the modern Latin alphabet. Brackets are Punctuation marks used in pairs to set apart or interject text within other text In Unix-like computer Operating systems a pipeline is the original software pipeline: a set of processes chained by their Standard Brackets are Punctuation marks used in pairs to set apart or interject text within other text The tilde (~ (/ˈtɪldə/ is a Grapheme with several uses The name of the character comes from Spanish, from the Latin titulus The delete key, known less ambiguously as forward delete, Del, or ⌦, performs a function when struck on a Computer In computer-based Text processing and Digital typesetting, a non-breaking space or no-break space ( NBSP) is In many national currencies, the cent is a monetary unit that equals 1/100 of the basic monetary unit See also Pound (currency.The pound sign (" £ " or " ₤ " is the symbol for the Pound sterling —the currency of the The currency sign ( ¤) is a character used to denote a currency when the symbol for a particular currency is unavailable Note "broken bar" and the glyph "¦" redirect here The section sign (§ Unicode U+00A7 HTML entity &sect is a typographical character used mainly to refer to a particular section In Linguistics, diaeresis, or dieresis, is the pronunciation of two adjacent Vowels in two separate Syllables rather than as a Diphthong copyright symbol, designated by © (a circled "C" is the Symbol used to provide notice of Copyright in works other than sound recordings (which º redirects here It is not to be confused with the Degree symbol ° Quotation marks, also called quotes, speech marks or inverted commas, are Punctuation marks used in pairs to set off speech a quotation In Logic and Mathematics, negation or not is an operation on Logical values for example the logical value of a Proposition A hyphen ( -) is a Punctuation mark It is used for both Words to join and to separate Syllables It is often confused with the dashes A trademark or trade mark, represented by the symbols ™ and ®, or mark is a distinctive sign or indicator used by an individual A macron, from Greek el μακρόv ( makrón) meaning "long" is a Diacritic ¯ placed over or under a Vowel which was originally The degree symbol (° Unicode: U+00B0 HTML: &deg is a typographical symbol or Glyph, that is used to represent degrees of arc (see ± The In Algebra, the square of a number is that number multiplied by itself In Arithmetic and Algebra, the cube of a number n is its third power &mdash the result of multiplying it by itself three times History An early precursor of the acute accent was the apex, used in Latin inscriptions to mark long vowels. micro- ( µ) is a prefix in the SI and other systems of units denoting a factor of 10&minus6 (one Millionth. The pilcrow (¶ Unicode U+00B6 HTML entity &para also called the Paragraph sign or the alinea ( An interpunct ( ·) is a small dot used for Interword separation in ancient Latin script, being perhaps the first consistent visual representation of word boundaries Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. º redirects here It is not to be confused with the Degree symbol ° Quotation marks, also called quotes, speech marks or inverted commas, are Punctuation marks used in pairs to set off speech a quotation The question mark (? also known as an interrogation point, question point, query, or eroteme, is a punctuation mark that replaces Pitch The grave accent was first used in the polytonic orthography of Ancient Greek, where it occurred only on the last syllable of a word in cases where the Á, á ( A - acute) is a letter of the Czech, Faroese, Hungarian, Icelandic, and Slovak languages Â, â ( A - Circumflex) is a letter of the Romanian and Vietnamese alphabets à / ã ( A - Tilde) is a letter used in some languages generally considered a variant of the letter A. " Ä " or " ä " is a character which represents either a letter from several extended Latin alphabets or the letter A with The letter Å represents various sounds in the Swedish, Finnish (although no native Finnish words contain the letter å Danish, Norwegian Æ ( minuscule: æ) is a Grapheme formed from the letters A and E. È can be The letter E with a Grave accent. 鄂 or È is an abbreviation for the Hubei province of the É, é ( E - acute) is a letter of Hungarian, Icelandic, Kashubian, Czech, Slovak, and Uyghur language Ê, ê ( E - Circumflex) is a letter of Kurdish and Vietnamese language. Ë, ë ( E - umlaut or diaeresis) is a letter of Albanian and Kashubian language. Letter Ü The letter Ü occurs in Hungarian, Karelian, Turkish, Estonian, Azeri, Turkmen, Crimean Í, í ( I - acute) is a letter of Faroese, Hungarian, Icelandic, Czech, Slovak, and Tatar language Î, î ( I - Circumflex) is a letter of Kurdish and Romanian language. Ï is a symbol used in various languages written with the Latin alphabet. Eth ( Ð, ð; also spelled edh or eð) is a letter used in Old English, Icelandic, Faroese (in Ñ (lower case ñ) is a letter of the modern Roman alphabet formed by an N with a diacritical Tilde. Ò, ò ( O - grave) is a letter of Kashubian language. This letter also appears in Catalan, Italian, Occitan, Ó, ó ( O - acute) is a letter in the Faroese, Hungarian, Icelandic, Kashubian, Polish, Czech Pitch The circumflex accent was first used in the polytonic orthography of Ancient Greek, where it occurred (subject to certain rules on the accented syllable Estonian In Estonian, Õ is the 27th letter of the alphabet (between W and Ä) representing the Close-mid back unrounded vowel O-Umlaut The glyph O with Umlaut appears in the German alphabet. The multiplication sign is the symbol × ( multiplication sign is the preferred Unicode name for the Codepoint represented by that Glyph The " Ø " ( minuscule: " ø " is a Vowel and a letter used in the Danish, Faroese and Norwegian Pitch The grave accent was first used in the polytonic orthography of Ancient Greek, where it occurred only on the last syllable of a word in cases where the Ê, ê ( E - Circumflex) is a letter of Kurdish and Vietnamese language. Û is used in the ISO 91995 system of Cyrillic transliteration as the letter Ю. Letter Ü The letter Ü occurs in Hungarian, Karelian, Turkish, Estonian, Azeri, Turkmen, Crimean History An early precursor of the acute accent was the apex, used in Latin inscriptions to mark long vowels. Thorn, or þorn (Þ þ is a letter in the Anglo-Saxon and Icelandic Alphabets It was also used in Medieval Scandinavia The letter ß ( Unicode U+00DF is a letter in the German alphabet. Pitch The grave accent was first used in the polytonic orthography of Ancient Greek, where it occurred only on the last syllable of a word in cases where the Á, á ( A - acute) is a letter of the Czech, Faroese, Hungarian, Icelandic, and Slovak languages Â, â ( A - Circumflex) is a letter of the Romanian and Vietnamese alphabets à / ã ( A - Tilde) is a letter used in some languages generally considered a variant of the letter A. " Ä " or " ä " is a character which represents either a letter from several extended Latin alphabets or the letter A with The letter Å represents various sounds in the Swedish, Finnish (although no native Finnish words contain the letter å Danish, Norwegian Æ ( minuscule: æ) is a Grapheme formed from the letters A and E. Ç, ç ( C - Cedilla) is a letter of Albanian, Turkish, Azerbaijani, Turkmen, Tatar, Kurdish language È can be The letter E with a Grave accent. 鄂 or È is an abbreviation for the Hubei province of the É, é ( E - acute) is a letter of Hungarian, Icelandic, Kashubian, Czech, Slovak, and Uyghur language Ê, ê ( E - Circumflex) is a letter of Kurdish and Vietnamese language. Ë, ë ( E - umlaut or diaeresis) is a letter of Albanian and Kashubian language. Letter Ü The letter Ü occurs in Hungarian, Karelian, Turkish, Estonian, Azeri, Turkmen, Crimean Í, í ( I - acute) is a letter of Faroese, Hungarian, Icelandic, Czech, Slovak, and Tatar language Î, î ( I - Circumflex) is a letter of Kurdish and Romanian language. Ï is a symbol used in various languages written with the Latin alphabet. Eth ( Ð, ð; also spelled edh or eð) is a letter used in Old English, Icelandic, Faroese (in Ñ (lower case ñ) is a letter of the modern Roman alphabet formed by an N with a diacritical Tilde. Ò, ò ( O - grave) is a letter of Kashubian language. This letter also appears in Catalan, Italian, Occitan, Ó, ó ( O - acute) is a letter in the Faroese, Hungarian, Icelandic, Kashubian, Polish, Czech Pitch The circumflex accent was first used in the polytonic orthography of Ancient Greek, where it occurred (subject to certain rules on the accented syllable Estonian In Estonian, Õ is the 27th letter of the alphabet (between W and Ä) representing the Close-mid back unrounded vowel O-Umlaut The glyph O with Umlaut appears in the German alphabet. The word " obelus " is also an alternative name for the dagger († symbol The " Ø " ( minuscule: " ø " is a Vowel and a letter used in the Danish, Faroese and Norwegian Pitch The grave accent was first used in the polytonic orthography of Ancient Greek, where it occurred only on the last syllable of a word in cases where the Ê, ê ( E - Circumflex) is a letter of Kurdish and Vietnamese language. Û is used in the ISO 91995 system of Cyrillic transliteration as the letter Ю. Letter Ü The letter Ü occurs in Hungarian, Karelian, Turkish, Estonian, Azeri, Turkmen, Crimean History An early precursor of the acute accent was the apex, used in Latin inscriptions to mark long vowels. Thorn, or þorn (Þ þ is a letter in the Anglo-Saxon and Icelandic Alphabets It was also used in Medieval Scandinavia Diaeresis or trema See also Diaeresis History Historically the diaeresis mark or trema is far older than the umlaut mark However FF (0C) is commonly accepted in some applications interpreting plain-text documents as an additional ignorable whitespace at the beginning of lines, to mark the position of an explicit page break when printing.

However, some encodings allow using BS (08) to create additional characters by emulating the superposition of multiple characters on printing devices.

Some ISO standards assign specific functions to some controls (for example in ISO 2022) where SO (0E), SI (0F), DLE (10), ESC (1B) and SS2 (8E) are used to control the encoding of characters after them or to switch between multiple encodings.

The NUL character (00) is commonly used as a string terminator in some programming languages, or as a filler in database records that must be ignored and is not part of the encoded text. STX (02) and ETX (03) are commonly used for delimiting frames in some transmission protocols. SUB (1A) is also commonly used as a replacement character to mark errors detected in input transmission streams, and it may be rendered graphically. DC1 (11) and DC3 (13) are commonly used in the XON/XOFF protocol for controlling the transmission speed. Finally, EM (19) or EOT (04) may be used as an end-of-file marker in some text file formats.

The ISO-8859-1/Windows-1252 mixup

It is very common to mislabel text data with the charset label ISO-8859-1, even though the data are really Windows-1252 encoded. Windows-1252 (also known as WinLatin1) is a Character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows In Windows-1252, codes between 0x80 and 0x9F are used for letters and punctuation, whereas they are control codes in ISO-8859-1. Many web browsers and e-mail clients will interpret ISO-8859-1 control codes as Windows-1252 characters in order to accommodate such mislabeling but it is not a standard behaviour and care should be taken to avoid generating these characters in ISO-8859-1 labeled content.

Similar character sets

The Apple Macintosh computer introduced a character encoding called Mac Roman, or Mac-Roman, in 1984. Several binary representations of character sets for common Western European languages are compared in this article Macintosh, commonly nicknamed Mac is a Brand name which covers several lines of Personal computers designed developed and marketed by Apple Inc Mac OS Roman is a Character encoding primarily used by Mac OS to represent text Year 1984 ( MCMLXXXIV) was a Leap year starting on Sunday (link displays the 1984 Gregorian calendar) It was meant to be suitable for Western European desktop publishing. Desktop publishing (also known as DTP) combines a Personal computer and WYSIWYG page layout Software to create Publication Documents It is a superset of ASCII, like ISO-8859-1, and has most of the characters that are in ISO-8859-1 but in a totally different arrangement. American Standard Code for Information Interchange ( ASCII) A later version, registered with IANA as "Macintosh", replaced the generic currency sign ¤ with the euro sign €. The currency sign ( ¤) is a character used to denote a currency when the symbol for a particular currency is unavailable The euro sign (€ is the Currency sign used for the Euro, the official currency of the European Union (EU The few printable characters that are in ISO 8859-1 but not in this set are often a source of trouble when editing text on websites using older Macintosh browsers (including the last version of Internet Explorer for Mac). Internet Explorer for Mac (also referred to as Internet Explorermac, IEmac or Internet Explorer Macintosh Edition) was a proprietary Web However the extra characters that Windows-1252 has in the C1 codepoint range are all supported in MacRoman and except for the few missing ISO-8859-1 characters a Macintosh can send/receive files (and email) that are encoded/marked as ISO-8859-1 (with the C1 Control Characters) and Windows-1252 by remapping the glyph's codepoint numbers.

DOS had code page 850, which had all printable characters that ISO-8859-1 had (albeit in a totally different arrangement) plus the most widely used graphics characters from code page 437. Code page 850 is a Code page that was used in western Europe under systems such as DOS. IBM PC or MS-DOS Code page 437, often abbreviated CP437 and also known as DOS-US, OEM-US or sometimes misleadingly referred

See also

External links


© 2009 citizendia.org; parts available under the terms of GNU Free Documentation License, from http://en.wikipedia.org
Dapyx Software network: MP3 Explorer | Ebook Manager | Zenithic