(Pronounced "ass-kee." or "ask-key.")
American Code for Information Interchange - Pronounced "ass-kee." or "ask-key". A way of represent characters. The first 128 characters are standardized, and the first 32 of those are control codes, which are not visible characters but codes that can be used for text formatting or actions, such as form-feed. After the 32 control codes, the next 96 standardized characters represent numbers, letters (both uppercase and lowercase), and standard punctuation marks. The last 128 characters represent different things on different platforms. The total number of characters is 256, (8 bits of binary data). Only the first 7 bits 0 to 127 are defined. (The 8th bit may have been used as a parity check, generally this eight bit is now used to store extra characters).
A tutorial on character code issues
ASCII Character Codes. This is a list of character codes in decimal, hex, octal and binary.
ASCII Code Characters Includes (ASCII Control code details and The IBM (DOS) Extended Character set)
A tutorial on character code issues
Fix Text - Converts out strange non-web character codes when filling out web forms
Using Special Characters on Web Pages: Cross-Platform Considerations
Creating Multilingual Web Pages: Unicode Support in HTML, HTML Editors and Web Browsers
Please note there could be errors below. Also there are different numbering systems. If you spot any errors please click here and let me know the exact error.
| Decimal | ASCII |
Description | HTML | Hex |
Binary |
Octal | |
|
0 |
NUL | null |
0h |
00000000 |
000 | ||
|
1 |
SOH |
Start of Heading Control-A |
1h |
00000001 |
001 | ||
|
2 |
STX | Start of Text Control-B |
2h |
00000010 |
002 | ||
|
3 |
ETX | End of Text Control-C |
3h |
00000011 |
003 | ||
|
4 |
EOT | End of Transmission Control-D |
4h |
00000100 |
004 | ||
|
5 |
ENQ | Enquiry Control-E |
5h |
00000101 |
005 | ||
|
6 |
ACK | Acknowledge Control-F |
6h |
00000110 |
006 | ||
|
7 |
BELL | bell |
7h |
00000111 |
007 | ||
|
8 |
BS | Back Space |
8h |
00001000 |
010 | ||
|
9 |
TAB | Horizontal tab |
9h |
00001001 |
011 | ||
|
10 | LF | Line feed |
Ah |
00001010 |
012 | ||
|
11 | VF | Vertical Tab |
Bh |
00001011 |
013 | ||
|
12 | FF | Form Feed (New Page) |
Ch |
00001100 |
014 | ||
|
13 | CR | Carriage Return |
Dh |
00001101 |
015 | ||
|
14 | SO | Shift Out Control-N |
Eh |
00001110 |
016 | ||
|
15 | SI | Shift In Control-O |
Fh |
00001111 |
017 | ||
|
16 | DLE | Data Link Escape Control-P |
10h |
00010000 |
020 | ||
|
17 | DC1 | Device Control 1 Control-Q |
11h |
00010001 |
021 | ||
|
18 |
DC2 | Device Control 2 Control-R |
12h |
00010010 |
022 | ||
|
19 |
DC3 | Device Control 3 Control-S |
13h |
00010011 |
023 | ||
|
20 | DC4 | Device Control 4 Control-T |
14h |
00010100 |
024 | ||
|
21 | NAK | Negative Acknowledge Control-U |
15h |
00010101 |
025 | ||
|
22 |
SYN | Synchronise Idle Control-V |
16h |
00010110 |
026 | ||
|
23 |
EBT | End of Transmission Block Control-W |
17h |
00010111 |
027 | ||
|
24 |
CAN | Cancel Control-X |
18h |
00011000 |
030 | ||
|
25 |
EM | End of Medium Control-X |
19h |
00011001 |
031 | ||
|
26 |
SUB | Substiute Control-Y |
1Ah |
00011010 |
032 | ||
|
27 |
ESC | Escape Control-Z |
1Bh |
00011011 |
033 | ||
|
28 |
FS | File Separator |
1Ch |
00011100 |
034 | ||
|
29 |
GS | Group Separator |
1Dh |
00011101 |
035 | ||
|
30 | RS | Record Separator |
1Eh |
00011110 |
036 | ||
|
31 | UN | Unit Separator |
1Fh |
00011111 |
037 | ||
|
Printable Characters 32 to 127 decimal. |
|||||||
| 32 | space | Space | 20h | 00100000 | 040 | ||
| 33 | ! | Exclamation mark | 21h | 00100001 | 041 | ||
| 34 | " | Quotation mark | 22h | 00100010 | 042 | ||
| 35 | # | Number sign | 23h | 00100011 | 043 | ||
| 36 | $ | Dollar sign | 24h | 00100100 | 044 | ||
| 37 | % | Percent sign | 25h | 00100101 | 045 | ||
| 38 | & | Ampersand | 26h | 00100110 | 046 | ||
| 39 | ' | Apostrophe | 27h | 00100111 | 047 | ||
| 40 | ( | Left parenthesis | 28h | 00101000 | 050 | ||
| 41 | ) | Right parenthesis | 29h | 00101001 | 051 | ||
| 42 | * | Asterisk | 2Ah | 00101010 | 052 | ||
| 43 | + | Plus sign | 2Bh | 00101011 | 053 | ||
| 44 | , | Comma | 2Ch | 00101100 | 054 | ||
| 45 | - | Hyphen | 2Dh | 00101101 | 055 | ||
| 46 | . | Period (fullstop) | 2Eh | 00101110 | 056 | ||
| 47 | / | Solidus (slash) | 2Fh | 00101111 | 057 | ||
| 48 | 0 | 30h | 00110000 | 060 | |||
| 49 | 1 | 31h | 00110001 | 061 | |||
| 50 | 2 | 32h | 00110010 | 062 | |||
| 51 | 3 | 33h | 00110011 | 063 | |||
| 52 | 4 | 34h | 00110100 | 064 | |||
| 53 | 5 | 35h | 00110101 | 065 | |||
| 54 | 6 | 36h | 00110110 | 066 | |||
| 55 | 7 | 37h | 00110111 | 067 | |||
| 56 | 8 | 38h | 00111000 | 070 | |||
| 57 | 9 | 39h | 00111001 | 071 | |||
| 58 | : | Colon | 3Ah | 00111010 | 072 | ||
| 59 | ; | Semi-colon | 3Bh | 00111011 | 073 | ||
| 60 | < | Less than | 3Ch | 00111100 | 074 | ||
| 61 | = | Equals sign | 3Dh | 00111101 | 075 | ||
| 62 | > | Greater than | 3Eh | 00111110 | 076 | ||
| 63 | ? | Question mark | 3Fh | 00111111 | 077 | ||
| 64 | @ | Commercial at | 40h | 01000000 | 100 | ||
| 65 | A | 41h | 01000001 | 101 | |||
| 66 | B | 42h | 01000010 | 102 | |||
| 67 | C | 43h | 01000011 | 103 | |||
| 68 | D | 44h | 01000100 | 104 | |||
| 69 | E | 45h | 01000101 | 105 | |||
| 70 | F | 46h | 01000110 | 106 | |||
| 71 | G | 47h | 01000111 | 107 | |||
| 72 | H | 48h | 01001000 | 110 | |||
| 73 | I | 49h | 01001001 | 111 | |||
| 74 | J | 4Ah | 01001010 | 112 | |||
| 75 | K | 4Bh | 01001011 | 113 | |||
| 76 | L | 4Ch | 01001100 | 114 | |||
| 77 | M | 4Dh | 01001101 | 115 | |||
| 78 | N | 4Eh | 01001110 | 116 | |||
| 79 | O | 4Fh | 01001111 | 117 | |||
| 80 | P | 50h | 01010000 | 120 | |||
| 81 | Q | 51h | 01010001 | 121 | |||
| 82 | R | 52h | 01010010 | 122 | |||
| 83 | S | 53h | 01010011 | 123 | |||
| 84 | T | 54h | 01010100 | 124 | |||
| 85 | U | 55h | 01010101 | 125 | |||
| 86 | V | 56h | 01010110 | 126 | |||
| 87 | W | 57h | 01010111 | 127 | |||
| 88 | X | 58h | 01011000 | 130 | |||
| 89 | Y | 59h | 01011001 | 131 | |||
| 90 | Z | 5Ah | 01011010 | 132 | |||
| 91 | [ | Left square bracket | 5Bh | 01011011 | 133 | ||
| 92 | \ | Reverse solidus (backslash) | 5Ch | 01011100 | 134 | ||
| 93 | ] | Right square bracket | 5Dh | 01011101 | 135 | ||
| 94 | ^ | Caret | 5Eh | 01011110 | 136 | ||
| 95 | _ | Horizontal bar (underscore) | 5Fh | 01011111 | 137 | ||
| 96 | ` | Acute accent | 60h | 01100000 | 140 | ||
| 97 | a | 61h | 01100001 | 141 | |||
| 98 | b | 62h | 01100010 | 142 | |||
| 99 | c | 63h | 01100011 | 143 | |||
| 100 | d | 64h | 01100100 | 144 | |||
| 101 | e | 65h | 01100101 | 145 | |||
| 102 | f | 66h | 01100110 | 146 | |||
| 103 | g | 67h | 01100111 | 147 | |||
| 104 | h | 68h | 01101000 | 150 | |||
| 105 | i | 69h | 01101001 | 151 | |||
| 106 | j | 6Ah | 01101010 | 152 | |||
| 107 | k | 6Bh | 01101011 | 153 | |||
| 108 | l | 6Ch | 01101100 | 154 | |||
| 109 | m | 6Dh | 01101101 | 155 | |||
| 110 | n | 6Eh | 01101110 | 156 | |||
| 111 | o | 6Fh | 01101111 | 157 | |||
| 112 | p | 70h | 01110000 | 160 | |||
| 113 | q | 71h | 01110001 | 161 | |||
| 114 | r | 72h | 01110010 | 162 | |||
| 115 | s | 73h | 01110011 | 163 | |||
| 116 | t | 74h | 01110100 | 164 | |||
| 117 | u | 75h | 01110101 | 165 | |||
| 118 | v | 76h | 01110110 | 166 | |||
| 119 | w | 77h | 01110111 | 167 | |||
| 120 | x | 78h | 01111000 | 170 | |||
| 121 | y | 79h | 01111001 | 171 | |||
| 122 | z | 7Ah | 01111010 | 172 | |||
| 123 | { | Left curly brace | 7Bh | 01111011 | 173 | ||
| 124 | | | Vertical bar | 7Ch | 01111100 | 174 | ||
| 125 | } | Right curly brace | 7Dh | 01111101 | 175 | ||
| 126 |
~ |
Tilde | 7Eh | 01111110 | 176 | ||
| 127 |
DEL |
7Fh | 01111111 | 177 | |||
|
Values Above 127 decimal are some times used for extended characters. These may not be the same on all systems. |
|||||||
| 128 | 80h | 10000000 | 200 | ||||
| 129 | 81h | 10000001 | 201 | ||||
| 130 | 82h | 10000010 | 202 | ||||
| 130 | 83h | 10000011 | 203 | ||||
| 132 | 84h | 10000100 | 204 | ||||
| 133 | 85h | 10000101 | 205 | ||||
| 134 | 86h | 10000110 | 206 | ||||
| 135 | 87h | 10000111 | 207 | ||||
| 136 | 88h | 10001000 | 210 | ||||
| 137 | 89h | 10001001 | 211 | ||||
| 138 | 8Ah | 10001010 | 212 | ||||
| 139 | 8Bh | 10001011 | 213 | ||||
| 140 | 8Ch | 10001100 | 214 | ||||
| 141 | 8Dh | 10001101 | 215 | ||||
| 142 | 8Eh | 10001110 | 216 | ||||
| 143 | 8Fh | 10001111 | 217 | ||||
| 144 | 90h | 10010000 | 220 | ||||
| 145 | 91h | 10010001 | 221 | ||||
|
146 |
92h | 10010010 | 222 | ||||
| 147 | 93h | 10010011 | 223 | ||||
| 148 | 94h | 10010100 | 224 | ||||
| 149 | 95h | 10010101 | 225 | ||||
| 150 | 96h | 10010110 | 226 | ||||
| 151 | 97h | 10010111 | 227 | ||||
| 152 | 98h | 10011000 | 230 | ||||
| 153 | 99h | 10011001 | 231 | ||||
| 154 | 9Ah | 10011010 | 232 | ||||
| 155 | 9Bh | 10011011 | 233 | ||||
| 156 | 9Ch | 10011100 | 234 | ||||
| 157 | 9Dh | 10011101 | 235 | ||||
| 158 | 9Eh | 10011110 | 236 | ||||
| 159 | 9Fh | 10011111 | 237 | ||||
| 160 | Non-breaking Space | | A0h | 10100000 | 240 | ||
| 161 | ¡ | Inverted exclamation | ¡ | A1h | 10100001 | 241 | |
| 162 | ¢ | Cent sign | ¢ | A2h | 10100010 | 242 | |
| 163 | £ | Pound sterling | £ | A3h | 10100011 | 243 | |
| 164 | ¤ | General currency sign | ¤ | A4h | 10100100 | 244 | |
| 165 | ¥ | Yen sign | ¥ | A5h | 10100101 | 245 | |
| 166 | ¦ | Broken vertical bar | ¦ | A6h | 10100110 | 246 | |
| 167 | § | Section sign | § | A7h | 10100111 | 247 | |
| 168 | ¨ | Umlaut (dieresis) | ¨ | A8h | 10101000 | 250 | |
| 169 | © | Copyright | © | A9h | 10101001 | 251 | |
| 170 | ª | Feminine ordinal | ª | AAh | 10101010 | 252 | |
| 171 | « | Left angle quote, guillemotleft | « | ABh | 10101011 | 253 | |
| 172 | ¬ | Not sign | ¬ | ACh | 10101100 | 254 | |
| 173 | | Soft hyphen | ­ | ADh | 10101101 | 255 | |
| 174 | ® | Registered trademark | ® | AEh | 10101110 | 256 | |
| 175 | ¯ | Macron accent | ¯ | AFh | 10101111 | 257 | |
| 176 | ° | Degree sign | ° | B0h | 10110000 | 260 | |
| 177 | ± | Plus or minus | ± | B1h | 10110001 | 261 | |
| 178 | ² | Superscript two | ² | B2h | 10110010 | 262 | |
| 179 | ³ | Superscript three | ³ | B3h | 10110011 | 263 | |
| 180 | ´ | Acute accent | ´ | B4h | 10110100 | 264 | |
| 181 | µ | Micro sign | µ | B5h | 10110101 | 265 | |
| 182 | ¶ | Paragraph sign | ¶ | B6h | 10110110 | 266 | |
| 183 | · | Middle dot | · | B7h | 10110111 | 267 | |
| 184 | ¸ | Cedilla | ¸ | B8h | 10111000 | 270 | |
| 185 | ¹ | Superscript one | ¹ | B9h | 10111001 | 271 | |
| 186 | º | Masculine ordinal | º | BAh | 10111010 | 272 | |
| 187 | » | Right angle quote, guillemotright | » | BBh | 10111011 | 273 | |
| 188 | ¼ | Fraction one-fourth | ¼ | BCh | 10111100 | 274 | |
| 189 | ½ | Fraction one-half | ½ | BDh | 10111101 | 275 | |
| 190 | ¾ | Fraction three-fourths | ¾ | BEh | 10111110 | 276 | |
| 191 | ¿ | Inverted question mark | ¿ | BFh | 10111111 | 277 | |
| 192 | À | Capital A, grave accent | À | C0h | 11000000 | 300 | |
| 193 | Á | Capital A, acute accent | Á | C1h | 11000001 | 301 | |
| 194 | Â | Capital A, circumflex accent | Â | C2h | 11000010 | 302 | |
| 195 | Ã | Capital A, tilde | Ã | C3h | 11000011 | 303 | |
| 196 | Ä | Capital A, dieresis or umlaut mark | Ä | C4h | 11000100 | 304 | |
| 197 | Å | Capital A, ring | Å | C5h | 11000101 | 305 | |
| 198 | Æ | Capital AE dipthong (ligature) | Æ | C6h | 11000110 | 306 | |
| 199 | Ç | Capital C, cedilla | Ç | C7h | 11000111 | 307 | |
| 200 | È | Capital E, grave accent | È | C8h | 11001000 | 310 | |
| 201 | É | Capital E, acute accent | É | C9h | 11001001 | 311 | |
| 202 | Ê | Capital E, circumflex accent | Ê | CAh | 11001010 | 312 | |
| 203 | Ë | Capital E, dieresis or umlaut mark | Ë | CBh | 11001011 | 313 | |
| 204 | Ì | Capital I, grave accent | Ì | CCh | 11001100 | 314 | |
| 205 | Í | Capital I, acute accent | Í | CDh | 11001101 | 315 | |
| 206 | Î | Capital I, circumflex accent | Î | CEh | 11001110 | 316 | |
| 207 | Ï | Capital I, dieresis or umlaut mark | Ï | CFh | 11001111 | 317 | |
| 208 | Ð | Capital Eth, Icelandic | Ð | D0h | 11010000 | 320 | |
| 209 | Ñ | Capital N, tilde | Ñ | D1h | 11010001 | 321 | |
| 210 | Ò | Capital O, grave accent | Ò | D2h | 11010010 | 322 | |
| 211 | Ó | Capital O, acute accent | Ó | D3h | 11010011 | 323 | |
| 212 | Ô | Capital O, circumflex accent | Ô | D4h | 11010100 | 324 | |
| 213 | Õ | Capital O, tilde | Õ | D5h | 11010101 | 325 | |
| 214 | Ö | Capital O, dieresis or umlaut mark | Ö | D6h | 11010110 | 326 | |
| 215 | × | Multiply sign | × | D7h | 11010111 | 327 | |
| 216 | Ø | Capital O, slash | Ø | D8h | 11011000 | 330 | |
| 217 | Ù | Capital U, grave accent | Ù | D9h | 11011001 | 331 | |
| 218 | Ú | Capital U, acute accent | Ú | DAh | 11011010 | 332 | |
| 219 | Û | Capital U, circumflex accent | Û | DBh | 11011011 | 333 | |
| 220 | Ü | Capital U, dieresis or umlaut mark | Ü | DCh | 11011100 | 334 | |
| 221 | Ý | Capital Y, acute accent | Ý | DDh | 11011101 | 335 | |
| 222 | Þ | Capital THORN, Icelandic | Þ | DEh | 11011110 | 336 | |
| 223 | ß | Small sharp s, German (sz ligature) | ß | DFh | 11011111 | 337 | |
| 224 | à | Small a, grave accent | à | E0h | 11100000 | 340 | |
| 225 | á | Small a, acute accent | á | E1h | 11100001 | 341 | |
| 226 | â | Small a, circumflex accent | â | E2h | 11100010 | 342 | |
| 227 | ã | Small a, tilde | ã | E3h | 11100011 | 343 | |
| 228 | ä | Small a, dieresis or umlaut mark | ä | E4h | 11100100 | 344 | |
| 229 | å | Small a, ring | å | E5h | 11100101 | 345 | |
| 230 | æ | Small ae dipthong (ligature) | æ | E6h | 11100110 | 346 | |
| 231 | ç | Small c, cedilla | ç | E7h | 11100111 | 347 | |
| 232 | è | Small e, grave accent | è | E8h | 11101000 | 350 | |
| 233 | é | Small e, acute accent | é | E9h | 11101001 | 351 | |
| 234 | ê | Small e, circumflex accent | ê | EAh | 11101010 | 352 | |
| 235 | ë | Small e, dieresis or umlaut mark | ë | EBh | 11101011 | 353 | |
| 236 | ì | Small i, grave accent | ì | ECh | 11101100 | 354 | |
| 237 | í | Small i, acute accent | í | EDh | 11101101 | 355 | |
| 238 | î | Small i, circumflex accent | î | EEh | 11101110 | 356 | |
| 239 | ï | Small i, dieresis or umlaut mark | ï | EFh | 11101111 | 357 | |
| 240 | ð | Small eth, Icelandic | ð | F0h | 11110000 | 360 | |
| 241 | ñ | Small n, tilde | ñ | F1h | 11110001 | 361 | |
| 242 | ò | Small o, grave accent | ò | F2h | 11110010 | 362 | |
| 243 | ó | Small o, acute accent | ó | F3h | 11110011 | 363 | |
| 244 | ô | Small o, circumflex accent | ô | F4h | 11110100 | 364 | |
| 245 | õ | Small o, tilde | õ | F5h | 11110101 | 365 | |
| 246 | ö | Small o, dieresis or umlaut mark | ö | F6h | 11110110 | 366 | |
| 247 | ÷ | Division sign | ÷ | F7h | 11110111 | 367 | |
| 248 | ø | Small o, slash | ø | F8h | 11111000 | 370 | |
| 249 | ù | Small u, grave accent | ù | F9h | 11111001 | 371 | |
| 250 | ú | Small u, acute accent | ú | FAh | 11111010 | 372 | |
| 251 | û | Small u, circumflex accent | û | FBh | 11111011 | 373 | |
| 252 | ü | Small u, dieresis or umlaut mark | ü | FCh | 11111100 | 374 | |
| 253 | ý | Small y, acute accent | ý | FDh | 11111101 | 375 | |
| 254 | þ | Small thorn, Icelandic | þ | FEh | 11111110 | 376 | |
| 255 | ÿ | Small y, dieresis or umlaut mark | ÿ | FFh | 11111111 | 377 | |
|
Some special Symbols and Letters |
||
| Symbols | ||
| Code | Symbol | Description |
| ‘ | ‘ | left single quote |
| ’ | ’ | right single quote |
| ‚ | ‚ | single low-9 quote |
| “ | “ | left double quote |
| ” | ” | right double quote |
| „ | „ | double low-9 quote |
| † | † | dagger |
| ‡ | ‡ | double dagger |
| ‰ | ‰ | per mill sign |
| ‹ | ‹ | left angle quote (single) |
| › | › | right angle quote (single) |
| ♠ | ♠ | black spade suit |
| ♣ | ♣ | black club suit |
| ♥ | ♥ | black heart suit |
| ♦ | ♦ | black diamond suit |
| ‾ | ‾ | overline |
| ← | ← | leftward arrow |
| ↑ | ↑ | upward arrow |
| → | → | rightward arrow |
| ↓ | ↓ | downward arrow |
| ™ | ™ | trademark sign |
| & | & | ampersand |
| < | < | less-than sign |
| > | > | greater-than sign |
| " | " | quotation mark |
| × | × | multiplication sign |
| ÷ | ÷ | division sign |
| – | | en dash |
| — | | em dash |
| | nonbreaking space (note *) | |
| ¡ | ¡ | inverted exclamation |
| ¢ | ¢ | cent sign |
| £ | £ | pound sterling |
| ¤ | ¤ | general currency sign |
| ¥ | ¥ | yen sign |
| € | | euro sign |
| ¦ | ¦ | broken vertical bar |
| § | § | section sign |
|  | | square (note **) |
| ¨ | ¨ | umlaut |
| © | © | copyright |
| ª | ª | feminine ordinal |
| « | « | left angle quote |
| ¬ | ¬ | not sign |
| ® | ® | registered trademark |
| ¯ | ¯ | macron accent |
| ° | ° | degree sign |
| ± | ± | plus or minus |
| ² | ² | superscript two |
| ³ | ³ | superscript tree |
| ´ | ´ | acute accent |
| µ | µ | micro sign |
| ¶ | ¶ | paragraph sign |
| · | · | middle dot |
| ¸ | ¸ | cedilla |
| ¹ | ¹ | superscript one |
| º | º | masculine ordinal |
| » | » | right angle quote |
| ¼ | ¼ | one quarter |
| ½ | ½ | one half |
| ¾ | ¾ | three quarters |
| ¿ | ¿ | inverted question mark |
| Letters Uppercase | ||
| Code | Symbol | Description |
| À | À | uppercase A, grave accent |
| Á | Á | uppercase A, acute accent |
| Â | Â | uppercase A, circumflex accent |
| Ã | Ã | uppercase A, tilde |
| Ä | Ä | uppercase A, umlaut |
| Å | Å | uppercase A, ring |
| Æ | Æ | uppercase AE |
| Ç | Ç | uppercase C, cedilla |
| È | È | uppercase E, grave accent |
| É | É | uppercase E, acute accent |
| Ê | Ê | uppercase E, circumflex accent |
| Ë | Ë | uppercase E, umlaut |
| Ì | Ì | uppercase I, grave accent |
| Í | Í | uppercase I, acute accent |
| Î | Î | uppercase I, circumflex accent |
| Ï | Ï | uppercase I, umlaut |
| &ET; | Ð | uppercase Et, Icelandic |
| Ñ | Ñ | uppercase N, tilde |
| Ò | Ò | uppercase O, grave accent |
| Ó | Ó | uppercase O, acute accent |
| Ô | Ô | uppercase O, circumflex accent |
| Õ | Õ | uppercase O, tilde |
| Ö | Ö | uppercase O, umlaut |
| Ø | Ø | uppercase O, slash |
| Ù | Ù | uppercase U, grave accent |
| Ú | Ú | uppercase U, acute accent |
| Û | Û | uppercase U, circumflex accent |
| Ü | Ü | uppercase U, umlaut |
| Ý | Ý | uppercase Y, acute accent |
| Þ | Þ | uppercase THORN, Icelandic |
| Letters Lowercase | ||
| Code | Symbol | Description |
| ß | ß | lowercase sarps, German |
| à | à | lowercase a, grave accent |
| á | á | lowercase a, acute accent |
| â | â | lowercase a, circumflex accent |
| ã | ã | lowercase a, tilde |
| ä | ä | lowercase a, umlaut |
| å | å | lowercase a, ring |
| æ | æ | lowercase ae |
| ç | ç | lowercase c, cedilla |
| è | è | lowercase e, grave accent |
| é | é | lowercase e, acute accent |
| ê | ê | lowercase e, circumflex accent |
| ë | ë | lowercase e, umlaut |
| ì | ì | lowercase i, grave accent |
| í | í | lowercase i, acute accent |
| î | î | lowercase i, circumflex accent |
| ï | ï | lowercase i, umlaut |
| &et; | ð | lowercase et, Icelandic |
| ñ | ñ | lowercase n, tilde |
| ò | ò | lowercase o, grave accent |
| ó | ó | lowercase o, acute accent |
| ô | ô | lowercase o, circumflex accent |
| õ | õ | lowercase o, tilde |
| ö | ö | lowercase o, umlaut |
| &oslas; | ø | lowercase o, slash |
| ù | ù | lowercase u, grave accent |
| ú | ú | lowercase u, acute accent |
| û | û | lowercase u, circumflex accent |
| ü | ü | lowercase u, umlaut |
| ý | ý | lowercase y, acute accent |
| &torn; | þ | lowercase torn, Icelandic |
| ÿ | ÿ | lowercase y, umlaut |
Note * : this creates a non-breaking space. You can indent stuff off the margin with a few .
Note .is useful if you don't want words to be placed onto a new lines you may separate them with This is especially useful for names etc.. Firstname Surname may be cause the Surname to be paced on different lines whereas Firstname Surname should keep these two together, the two word may be placed on a new line but they will be kept together.
Example space used:- Firstname Surname
Fred
JonesExample non-breaking space, ( ), used:- Firstname Surname
Fred Jones
Note : when there is no named value for a symbol, you can use its numerical value instead. All characters have a number attached to them, but if there is a name, you may as well use that.
Caution : If you are putting in the code correctly but the symbol isn't coming up you may want to check which font the text is in. Some typefaces won't have the symbol included. You can fix this by changing all your text's font type, or by wrapping the FONT tag (with a common font like Arial) around the symbol itself.
ASCII instead of EBCDIC, Rad50, or Unicode, discrepancies arise. (From O'reilly PERL cookbook)
Unicode provides a unique number for every character, no matter what the platform, no matter what the program, no matter what the language.
Unicode HTML Entities From Mr. Rat
Unicode Easy Keyboard Layout This is a keyboard layout for Windows XP that adds easy access for lots of additional characters. It differs from the "United States International" keyboard layout included with Windows in that none of the standard punctuation marks are dead keys. This means you can activate this layout and not even notice it until you intentionally use it, whereas using the "United States International" layout requires retraining yourself in typing certain character combinations. Also, the Unicode Easy keyboard layout includes all of the Latin composite characters and all of the Greek characters in Unicode, whereas the built-in layout includes only characters in the ISO-8859-1 code page. Supported diacritical marks are: acute, grave, circumflex, diaeresis, tilde, cedilla, breve, caron, ring above, stroke, macron, dot above, and dot below.
Under Unix or Plan9, a "\n", (Newline character), represents the physical sequence "\cJ" (the PERL double-quote escape for Ctrl-J), a linefeed. An Enter key generates an incoming "\cM" (a carriage return) which turns into "\cJ", whereas an outgoing "\cJ" turns into "\cM\cJ", (carriage return and a linefeed) .
Apple Mac, a "\n" is usually represented by "\cM" because the standard requires that "\n" and "\r" be different a "\r" represents a "\cJ". This is the opposite of the way that Unix, Plan9, VMS, CP/M, or nearly anyone else does it. So, Mac programmers writing files for other systems or talking over a network have to be careful. If you send out "\n", you'll deliver a "\cM", and no "\cJ" will be seen. Most network services prefer to receive and send "\cM\cJ" as a line terminator, but most accept merely a "\cJ".
With VMS, DOS, or their derivatives, a "\n" represents "\cJ", similar to Unix and Plan9. From the perspective of a tty, Unix and DOS act identically a user who hits Enter generates a "\cM", which arrives at the program as a "\n", which is "\cJ". A "\n" ("\cJ", remember) sent to a terminal shows up as a "\cM\cJ".
This occurs with Windows files as well. A DOS text file actually physically contains two characters at the end of every line, "\cM\cJ". The last block in the file has a "\cZ" to indicate where the EOF, End of File). When you write a line like "This is a line\n" on those systems, the file contains "This is a line\cM\cJ", just as if it were a terminal.
If you read a line on such systems. The file itself contains "This is a line\cM\cJ", this is a 16-byte string. When it is read by you program it gets "This is a line\n", where that "\n" is the virtual newline character, that is, a linefeed ( "\cJ" ). To remove it, a single chop or chomp, in PERL will do.
Be careful with the position markers in such files. The way the file is stored may have a different way to how it is displayed and how the data is interpreted by the program. This could lead to miscalculations of the position.
Use the tell function, in PERL (Returns the current position for FILEHANDLE, or -1 on error. FILEHANDLE may be an expression whose value gives the name of the actual filehandle. If FILEHANDLE is omitted, assumes the file last read.), to determine your location.
This legacy of the old CP/M filesystem, whose equivalent of a Unix inode stored only block counts and not file sizes, has frustrated programmers for decades, and no end is in sight. DOS is compatible with CP/M file formats, Windows with DOS, and NT with Windows.
On Windows, text and binary files are quite different. In a text file, the characters'\r\n'are translated to'\n'when read from the disk. The End Of File marker, (EOF), is the'^Z'character. If you open a file in text mode, and the data has a'^Z'somewhere in it, the file may abort prematurely. In binary files, these characters are not translated, they're treated like an ordinary character value.
Character Entity References in HTML and XHTML. A set of tables allowed entities in HTML and XHTML, as described in section 24 of the official HTML specifications, published by the W3C. Divided into logical, categories. From Cookwood Press.
OPERATOR
DIFFERENCES The .. range operator treats certain
character ranges with care on EBCDIC machines. For example the following array
will have twenty six elements on either an EBCDIC machine or an ASCII machine.
Find out about cJ, cI, (You may see these when loading Excel
characters codes via PERL), etc...
Unicode (UTF-8) provides a unique number for every character, no matter what the platform, no matter what the program, no matter what the language.
UTF-8 and Unicode Standards. What is UTF-8? UTF-8 stands for Unicode Transformation Format-8. It is an octet,( see Octal). (8-bit) lossless encoding of Unicode characters.
Hexadecimal Values
The hexadecimal system (hex for short) uses numbers from 0 to 15. It starts off like the decimal system: 0, 1, 2, 3, 4, 5, 6, 7, 8 and 9 but then comes A which equals 10 and then B, C, D, E and F (which of course equals 15). The next number is 10 which is actually 16 in decimal and so on....
Because it can be impossible to distinguish between a hex and a decimal number (is that 25 a decimal 25 or is it 25 in hex which equals 37 decimal?) it is custom to put a lower case h after each hex number.
Octal Values
Similar to the hexadecimal system but uses a base of 8. It therefore includes only the digits 0 through 7 (any other digit would make the number an invalid octal number).
Binary Values
Most computers use switches, (transistors very small,) which have two states (or "on" and "off" or "false" and "true") 0 or 1.
So will be something like 01000100. Not all the bits may be shown e.g. 101 is the same as 00000101 which is 5 decimal.
Microsoft Global Input Method Editors (IMEs). Now, you can input Chinese, Japanese, and Korean text into any Microsoft Office XP applications, Web forms, and e-mail messages on any language version of Windows XP, Windows 2000, Windows Millennium, Windows 98, and Windows NT 4.0 with Service Pack 6 (SP6) or later. And it's easy. After you have installed the Global IME for Office XP, you just start your Office XP program, select the language from the Language bar, and you can type the language you choose, regardless of the language version of Office XP or Windows you are using. Translation
ECMA Standards. ECMA International is an industry association founded in 1961, dedicated to the standardization of information and communication systems.
Web Masters. Click Here Now to start making money. A Great opportunity to make some money. Receive 50% by offering your users Ton's of Keywords on A Great Portal websites. Our Affiliate Program Pays you 50% on Level 1 of Every Sale of our Text Link both searchable and static Text Link!
A Computer Portal. Freeware, Shareware. Download software. Computer languages and Programming code. Including PERL Scripts and Java Scripts. Webmaster Tools. Internet Marketing, Website promotion. Hardware Help from BIOS to Windows and UNIX.
® © ™ are owned by respective authors and websites. There may be a charge for some software. Google™ is a trademark of Google Inc, These pages are not endorsed by Google or any other Company