[The Mathematical Atlas] [Search][Subject Index][MathMap][Tour][Help!]

The HTML character set

HTML 1.0 assumes a character set for transmission to be ISO8859 (Latin-1). This includes 256 characters; the first 128 coincide with ASCII 7-bit. The others may be represented in several ways. Here we list, for each character,

  1. its ordinal number in decimal
  2. the same number, in octal (e.g. the Unix command to change "¡" to "!" would be "cat filename | tr \241 \041"
  3. the same number, in hexadecimal (e.g. an extended URL might include the string "%A1" to transmit the character "¡"; a MIME encoding would be "=A1")
  4. a single 8-bit character (as stored in European documents, say)
  5. the HTML "entity name" for the character (e.g. we enter the ASCII string "¡" into the HTML files at this site to render a "¡")
  6. the rendering of that HTML entity (if your browser is HTML 1.0 compliant, this should be e.g. an inverted exclamation mark. For most browsers, this will appear identical to the character two columns earlier.)
A compliant browser should also allow reference to a character by its decimal ordinal number, e.g. the ASCII string "¡" should also produce an inverted exclamation mark. (Your browser renders this as "¡").

In these pages we use none of the non-printing characters (&#001 or ctrl-A, etc.) except line-feeds (&#010; or ctrl-J) and spaces (&#032;) -- in particular we do not use tabs (&#009; or ctrl-I) or carriage-returns (&#013; or ctrl-M). Since the ampersand (&), less-than (<), and greater-than (>) characters have special meaning in HTML files, we encode them using entity names, too: "&amp;", "&lt;", and "&gt;", respectively. Finally, we avoid dollar-signs "$" to avoid a possible conflict with TeX encodings; when the symbol is needed, we use the decimal HTML code "&#036" since for some reason the entity code "&dollar;" is not recognized here.

The printing ASCII characters are those with (decimal) ordinals 33 "!" to 126 "~". Character numbers 127 through 159 are unassigned (but see below); the rest of the HTML 1.0 character set is as follows:

.decimal. ..octal.. hexadecimal ..8-bit.. .entity. rendered
160 240 A0 &nbsp;  
161 241 A1 &iexcl; ¡
162 242 A2 &cent; ¢
163 243 A3 &pound; £
164 244 A4 &curren; ¤
165 245 A5 &yen; ¥
166 246 A6 &brvbar; ¦
167 247 A7 &sect; §
168 250 A8 &uml; ¨
169 251 A9 &copy; ©
170 252 AA &ordf; ª
171 253 AB &laquo; «
172 254 AC &not; ¬
173 255 AD &shy; ­
174 256 AE &reg; ®
175 257 AF &macr; ¯
176 260 B0 &deg; °
177 261 B1 &plusmn; ±
178 262 B2 &sup2; ²
179 263 B3 &sup3; ³
180 264 B4 &acute; ´
181 265 B5 &micro; µ
182 266 B6 &para;
183 267 B7 &middot; ·
184 270 B8 &cedil; ¸
185 271 B9 &sup1; ¹
186 272 BA &ordm; º
187 273 BB &raquo; »
188 274 BC &frac14; ¼
189 275 BD &frac12; ½
190 276 BE &frac34; ¾
191 277 BF &iquest; ¿
192 300 C0 &Agrave; À
193 301 C1 &Aacute; Á
194 302 C2 &Acirc; Â
195 303 C3 &Atilde; Ã
196 304 C4 &Auml; Ä
197 305 C5 &Aring; Å
198 306 C6 &AElig; Æ
199 307 C7 &Ccedil; Ç
200 310 C8 &Egrave; È
201 311 C9 &Eacute; É
202 312 CA &Ecirc; Ê
203 313 CB &Euml; Ë
204 314 CC &Igrave; Ì
205 315 CD &Iacute; Í
206 316 CE &Icirc; Î
207 317 CF &Iuml; Ï
208 320 D0 &ETH; Ð
209 321 D1 &Ntilde; Ñ
210 322 D2 &Ograve; Ò
211 323 D3 &Oacute; Ó
212 324 D4 &Ocirc; Ô
213 325 D5 &Otilde; Õ
214 326 D6 &Ouml; Ö
215 327 D7 &times; ×
216 330 D8 &Oslash; Ø
217 331 D9 &Ugrave; Ù
218 332 DA &Uacute; Ú
219 333 DB &Ucirc; Û
220 334 DC &Uuml; Ü
221 335 DD &Yacute; Ý
222 336 DE &THORN; Þ
223 337 DF &szlig; ß
224 340 E0 &agrave; à
225 341 E1 &aacute; á
226 342 E2 &acirc; â
227 343 E3 &atilde; ã
228 344 E4 &auml; ä
229 345 E5 &aring; å
230 346 E6 &aelig; æ
231 347 E7 &ccedil; ç
232 350 E8 &egrave; è
233 351 E9 &eacute; é
234 352 EA &ecirc; ê
235 353 EB &euml; ë
236 354 EC &igrave; ì
237 355 ED &iacute; í
238 356 EE &icirc; î
239 357 EF &iuml; ï
240 360 F0 &eth; ð
241 361 F1 &ntilde; ñ
242 362 F2 &ograve; ò
243 363 F3 &oacute; ó
244 364 F4 &ocirc; ô
245 365 F5 &otilde; õ
246 366 F6 &ouml; ö
247 367 F7 &divide; ÷
248 370 F8 &oslash; ø
249 371 F9 &ugrave; ù
250 372 FA &uacute; ú
251 373 FB &ucirc; û
252 374 FC &uuml; ü
253 375 FD &yacute; ý
254 376 FE &thorn; þ
255 377 FF &yuml; ÿ

Although HTML1.0 does not specify a character to be rendered for decimal codes 127-159, a few of those codes are treated consistently by a number of recent browsers; they are shown below. (This is primarily a Microsoft convention.) Please note that since these characters are neither specified by standard nor universally recognized by browsers, they are not used in these documents. They are simply listed below for your experimentation! (the last column is rendered with e.g. "&#128;", etc., since entity codes are not available.)

.decimal. ..octal.. hexadecimal ..8-bit.. .entity. rendered
128 200 80 [none]
129 201 81 [none] 
130 202 82 [']
131 203 83 [none] ƒ
132 204 84 ["]
133 205 85 [...]
134 206 86 [/-]
135 207 87 [/=]
136 210 88 [none] ˆ
137 211 89 [0/00]
138 212 8A [none] Š
139 213 8B [<]
140 214 8C [none] Œ
141 215 8D [none] 
142 216 8E [none] Ž
143 217 8F [none] 
144 220 90 [none] 
145 221 91 [`]
146 222 92 [']
147 223 93 ["]
148 224 94 ["]
149 225 95 [o]
150 226 96 [-]
151 227 97 [-]
152 230 98 [~] ˜
153 231 99 [(TM), ™ ™]
154 232 9A [none] š
155 233 9B [>]
156 234 9C [none] œ
157 235 9D [none] 
158 236 9E [none] ž
159 237 9F [none] Ÿ

HTML 1.0 uses the ISO8859 character set as the medium of data exchange. For background information on this decision see (http://www.uni-passau.de/~ramsch/iso8859-1.html). For more information visit htmlgoodies.com and search under "character set" You can view other tables of ISO characters by decimal number (http://ucs.orst.edu/~ramsperg/framed_iso.html) or entity name (http://www.htmlgoodies.com/tutors/&command.html)

HTML 4.0 will use the Universal Character Set (UCS) as its character set; UCS is a character-by-character equivalent to Unicode 2.0. Unfortunately, most browsers are not yet ready to display these characters. For more information and a few of the "next generation" characters, see
http://www.natural-innovations.com/boo/doc-charset.html
To really whet your appetite for the symbols which will be available for web documents once this standard is widely accepted, look at the Unicode charts at http://charts.unicode.org/charts.html.


You can reach this page through http://www.math-atlas.org/welcome.html
Last modified 2000/12/17 by Dave Rusin. Mail: rusin@math.niu.edu