Single-byte character encodings
Multi-byte character encodings
Querying and setting a code page
Which code page TSE uses to display text
Tested system locales and their global code page
Example code pages and their maximum character size in bytes
Single-byte character encodings
Multi-byte character encodings
Querying and setting a code page
Which code page TSE uses to display text
Tested system locales and their global code page
Example code pages and their maximum character size in bytes
Disclaimer:
This document is limited in its viewpoints
by focussing on computer files, Windows,
languages of Western European origin,
and relevance to TSE.
It might have no relevance to peripherals, like printers,
and especially has no relevance to keyboards.
People use characters to read and write. Computers use numbers. Therefore people assigned numbers to characters so that computers could use "characters".
A character set is a defined group of characters.
A character encoding is a character set for which each character has a defined number.
Historically countries and companies started out creating their
own character encodings by the hundreds.
We still live with the remnants of that legacy,
and that is a problem.
The following examples are huge simplifications. The rest of this document goes into the details that make the example behaviors happen.
Example:
Suppose an American writes a "√" (square root) character
in a file using the Console variant of TSE.
If a non-American opens that file using the Console variant of TSE,
then they see the "¹" (superscript one) character instead.
If anyone opens that file using the GUI or Linux variant of TSE,
then they see the "û" (small letter u with circumflex) character
instead.
Fortunately, over time some character encodings emerged as more commonly used, and nowadays the Unicode character encodings are a world standard.
Unfortunately, TSE does not natively support Unicode.
Example:
If you copy a file or content from the internet,
or open a file that was created with a modern editor,
then the odds are that the file was created using one of the Unicode
encodings.
This makes TSE show 2 to 4 inappropriate characters for some of the
file's original characters.
In conclusion:
If TSE shows some weird or unexpected characters in a text,
then read the rest of this document.
Because computer memory was extremely expensive, the initial trend was to use an as small as possible-sized number per character. Such a number is called a byte . Not coincidentally a byte is also the smallest addressable chunk of computer memory.
A byte size of 8 bits emerged as the most-used standard. Formally such a byte is called an "octet" after the Latin word for 8, but in English everybody calls it a "byte".
Bits can one-on-one be represented by binary numbers . An 8-bit binary number can express the decimal numbers 0 through 255. Byte-sized character numbers therefore enforce a limit of 256 characters.
For languages that started as Western European, this limit could contain all characters necessary for ordinary reading and writing.
The first 128 characters of each such a character encoding were standardized under the name "ASCII", and are the same for all such languages.
Initially each country and company created its own single-byte character encoding, in which the last 128 characters differed per country and language. This let to a huge amount of character encodings.
Nowadays, in Windows, in practice, for single-byte character encodings, that huge amount has been reduced to 3 mainly used ones.
Microsoft refers to each of its own character encodings as a code page, and identifies each code page by a code page number.
(There are other companies with their own "code pages" and their own terminology.)
Microsoft's mainly used single-byte code pages are:
Editing a file that uses a single-byte character encoding will always show a character for each byte-value, but it might be a wrong character if the editor assumes a wrong character encoding.
A file's single-byte character encoding is not stored as a property of that file. An editor opening a file with single-byte character-encoding has no reliable way to determine which code page the file's creator used.
Sometimes the user can make a best guess based on context. Then other editors let the user on-the-fly change the code page they use for showing the file.
With the Console variant of TSE, if the user's guessed code page implements a single-byte character encoding, then the user can close the editor, type "chcp <guessed code page>" on the command prompt, and (re)start Console TSE to edit the file.
TSE natively only supports single-byte character encodings.
Using TSE to edit a file that uses a multi-byte character encoding will show multiple wrong characters per each multi-byte character.
TSE and single-byte code pages:
Be aware of this both helpful and misleading Windows command prompt feature:
Nowadays single-byte character encodings still exist for historical reasons, but modern technologies and applications default use one of Unicode's multi-byte character encodings.
For example:
"UTF-…" are names for Unicode character encodings.
Unicode has 1 character set and 5 main character encodings
for that character set.
The character set is updated once a year in September.
It takes a long time for operating systems, browsers, editors, et cetera
to implement a new Unicode version.
A code point is Unicode's administrative number for a character.
Because a Unicode code point is not a character encoding, Unicode code points are ubiquitously used for uniquely identifying a character across character encodings, environments, and other contexts.
Unicode "UTF-…" character encodings can also be referred to by a Microsoft code page number:
| Unicode name | Code page number | Bytes per character (*) |
|---|---|---|
| UTF-8 | 65001 | 1 to 4 |
| UTF-16LE | 1200 | 2 or 4 |
| UTF-16BE | 1201 | 2 or 4 |
| UTF-32LE | 12000 | 4 |
| UTF-32BE | 12001 | 4 |
But note: Windows prompts and APIs do not recognize code pages 1200, 1201, 12000, and 12001. These code pages are only used by .net and in applications.
GUI TSE and Linux TSE can use extensions to handle "Unicode files".
Unicode files are files in which the characters are encoded using one of the Unicode encodings.
Unicode files can optionally contain an invisible leading character, named a "byte order mark", a.k.a. "BOM", that indicates which Unicode encoding is used.
Fortunately, even if no byte order mark is used, it is usually easy to automatically detect with a very high reliability whether a file is Unicode-encoded, and which Unicode encoding is used.
In Windows you can set a code page at 2 levels:
A Windows command prompt default uses the global code page, but can be set to a local code page.
You can query a command prompt's local code page
with the command "chcp" without parameters:
chcp
You change a command prompt's local code page
with the command "chcp <code page number>":
chcp 1252
Note that there is a huge amount of prompt commands and programs that ignore the local code page and always use the global one.
When you start a .bat or .cmd file or a command from TSE, then you have the option to begin the command (file) with a "chcp <code page number>" command to set the desired character encoding for its output.
Again, be aware that lots of commands and programs ignore the local code page and use the global code page instead.
Be aware of this both helpful and misleading Windows command prompt feature:
You cannot change Windows' global code page directly.
You have to change the Windows "system locale" setting, which has a "Language (Country)" format, and then Windows implicitly sets its global code page based on that.
In Windows 10 you can change the system locale with: Settings → Time & Language → Language → Administrative language settings → Change system locale.
In Windows 11 you can change the system locale with: Settings → Time & Language → Beside "Windows display language" select the right-most ˅ → Select a "Language for non-Unicode programs".
Warning:
If you check-mark the
"Beta: Use Unicode UTF-8 for worldwide language support" setting,
then this overrules the system locale's code page,
and sets the global code page to 65001 (Unicode's UTF-8 encoding).
This is an interesting option, but be aware of its impact
on all command prompts and spawned command files.
Because many system locales share a same global code page,
often nothing happens when you change the system locale,
but when the system locale changes the code page too,
then Windows 10 will ask you to restart Windows to activate the change.
Windows 11 will not ask, and sometimes does not need to be restarted,
and sometimes does. I always do so to be safe.
Console TSE
GUI TSE
Linux TSE
Notes:
| System Locale | Global Code Page |
|---|---|
| Danish (Denmark) | 850 |
| Dutch (Belgium) | 850 |
| Dutch (Netherlands) | 850 |
| English (Australia) | 850 |
| English (Canada) | 850 |
| English (New Zeeland) | 850 |
| English (United Kingdom) | 850 |
| English (United States) | 437 |
| French (Belgium) | 850 |
| French (Canada) | 850 |
| French (France) | 850 |
| German (Austria) | 850 |
| German (Germany) | 850 |
| German (Liechtenstein) | 850 |
| German (Luxembourg) | 850 |
| German (Switzerland) | 850 |
| Norwegian (Bokmål) | 850 |
| Norwegian (Nynorsk) | 850 |
| Portuguese (Brazil) | 850 |
| Portuguese (Portugal) | 850 |
| Spanish (Chile) | 850 |
| Spanish (Latin America) | 850 |
| Spanish (Mexican) | 850 |
| Spanish (Spain) | 850 |
| Spanish (United States) | 850 |
This is an example table.
It was specific to my specific Windows computer at a specific date.
To list the available code pages on your Windows computer use the
CodePageList
tool.
This table is sorted on Max Char Size, and within Max Char Size on Code Page.
| Code Page | Max Char Size | Description |
|---|---|---|
| 37 | 1 | (IBM EBCDIC - U.S./Canada) |
| 437 | 1 | (OEM - United States) |
| 500 | 1 | (IBM EBCDIC - International) |
| 708 | 1 | (Arabic - ASMO) |
| 720 | 1 | (Arabic - Transparent ASMO) |
| 737 | 1 | (OEM - Greek 437G) |
| 775 | 1 | (OEM - Baltic) |
| 850 | 1 | (OEM - Multilingual Latin I) |
| 852 | 1 | (OEM - Latin II) |
| 855 | 1 | (OEM - Cyrillic) |
| 857 | 1 | (OEM - Turkish) |
| 858 | 1 | (OEM - Multilingual Latin I + Euro) |
| 860 | 1 | (OEM - Portuguese) |
| 861 | 1 | (OEM - Icelandic) |
| 862 | 1 | (OEM - Hebrew) |
| 863 | 1 | (OEM - Canadian French) |
| 864 | 1 | (OEM - Arabic) |
| 865 | 1 | (OEM - Nordic) |
| 866 | 1 | (OEM - Russian) |
| 869 | 1 | (OEM - Modern Greek) |
| 870 | 1 | (IBM EBCDIC - Multilingual/ROECE (Latin-2)) |
| 874 | 1 | (ANSI/OEM - Thai) |
| 875 | 1 | (IBM EBCDIC - Modern Greek) |
| 1026 | 1 | (IBM EBCDIC - Turkish (Latin-5)) |
| 1047 | 1 | (IBM EBCDIC - Latin-1/Open System) |
| 1140 | 1 | (IBM EBCDIC - U.S./Canada (37 + Euro)) |
| 1141 | 1 | (IBM EBCDIC - Germany (20273 + Euro)) |
| 1142 | 1 | (IBM EBCDIC - Denmark/Norway (20277 + Euro)) |
| 1143 | 1 | (IBM EBCDIC - Finland/Sweden (20278 + Euro)) |
| 1144 | 1 | (IBM EBCDIC - Italy (20280 + Euro)) |
| 1145 | 1 | (IBM EBCDIC - Latin America/Spain (20284 + Euro)) |
| 1146 | 1 | (IBM EBCDIC - United Kingdom (20285 + Euro)) |
| 1148 | 1 | (IBM EBCDIC - International (500 + Euro)) |
| 1149 | 1 | (IBM EBCDIC - Icelandic (20871 + Euro)) |
| 1250 | 1 | (ANSI - Central Europe) |
| 1251 | 1 | (ANSI - Cyrillic) |
| 1252 | 1 | (ANSI - Latin I) |
| 1253 | 1 | (ANSI - Greek) |
| 1254 | 1 | (ANSI - Turkish) |
| 1255 | 1 | (ANSI - Hebrew) |
| 1256 | 1 | (ANSI - Arabic) |
| 1257 | 1 | (ANSI - Baltic) |
| 1258 | 1 | (ANSI/OEM - Viet Nam) |
| 10000 | 1 | (MAC - Roman) |
| 10004 | 1 | (MAC - Arabic) |
| 10005 | 1 | (MAC - Hebrew) |
| 10006 | 1 | (MAC - Greek I) |
| 10007 | 1 | (MAC - Cyrillic) |
| 10010 | 1 | (MAC - Romania) |
| 10017 | 1 | (MAC - Ukraine) |
| 10021 | 1 | (MAC - Thai) |
| 10029 | 1 | (MAC - Latin II) |
| 10079 | 1 | (MAC - Icelandic) |
| 10081 | 1 | (MAC - Turkish) |
| 10082 | 1 | (MAC - Croatia) |
| 20105 | 1 | (IA5 IRV International Alphabet No.5) |
| 20106 | 1 | (IA5 German) |
| 20107 | 1 | (IA5 Swedish) |
| 20108 | 1 | (IA5 Norwegian) |
| 20127 | 1 | (US-ASCII) |
| 20269 | 1 | (ISO 6937 Non-Spacing Accent) |
| 20273 | 1 | (IBM EBCDIC - Germany) |
| 20277 | 1 | (IBM EBCDIC - Denmark/Norway) |
| 20278 | 1 | (IBM EBCDIC - Finland/Sweden) |
| 20280 | 1 | (IBM EBCDIC - Italy) |
| 20284 | 1 | (IBM EBCDIC - Latin America/Spain) |
| 20285 | 1 | (IBM EBCDIC - United Kingdom) |
| 20290 | 1 | (IBM EBCDIC - Japanese Katakana Extended) |
| 20297 | 1 | (IBM EBCDIC - France) |
| 20420 | 1 | (IBM EBCDIC - Arabic) |
| 20423 | 1 | (IBM EBCDIC - Greek) |
| 20424 | 1 | (IBM EBCDIC - Hebrew) |
| 20833 | 1 | (IBM EBCDIC - Korean Extended) |
| 20838 | 1 | (IBM EBCDIC - Thai) |
| 20866 | 1 | (Russian - KOI8) |
| 20871 | 1 | (IBM EBCDIC - Icelandic) |
| 20880 | 1 | (IBM EBCDIC - Cyrillic (Russian)) |
| 20905 | 1 | (IBM EBCDIC - Turkish) |
| 20924 | 1 | (IBM EBCDIC - Latin-1/Open System (1047 + Euro)) |
| 21025 | 1 | (IBM EBCDIC - Cyrillic (Serbian, Bulgarian)) |
| 21027 | 1 | (Ext Alpha Lowercase) |
| 21866 | 1 | (Ukrainian - KOI8-U) |
| 28591 | 1 | (ISO 8859-1 Latin I) |
| 28592 | 1 | (ISO 8859-2 Central Europe) |
| 28593 | 1 | (ISO 8859-3 Latin 3) |
| 28594 | 1 | (ISO 8859-4 Baltic) |
| 28595 | 1 | (ISO 8859-5 Cyrillic) |
| 28596 | 1 | (ISO 8859-6 Arabic) |
| 28597 | 1 | (ISO 8859-7 Greek) |
| 28598 | 1 | (ISO 8859-8 Hebrew: Visual Ordering) |
| 28599 | 1 | (ISO 8859-9 Latin 5) |
| 28603 | 1 | (ISO 8859-13 Latin 7) |
| 28605 | 1 | (ISO 8859-15 Latin 9) |
| 38598 | 1 | (ISO 8859-8 Hebrew: Logical Ordering) |
| 932 | 2 | (ANSI/OEM - Japanese Shift-JIS) |
| 936 | 2 | (ANSI/OEM - Simplified Chinese GBK) |
| 949 | 2 | (ANSI/OEM - Korean) |
| 950 | 2 | (ANSI/OEM - Traditional Chinese Big5) |
| 1361 | 2 | (Korean - Johab) |
| 10001 | 2 | (MAC - Japanese) |
| 10002 | 2 | (MAC - Traditional Chinese Big5) |
| 10003 | 2 | (MAC - Korean) |
| 10008 | 2 | (MAC - Simplified Chinese GB 2312) |
| 20000 | 2 | (CNS - Taiwan) |
| 20001 | 2 | (TCA - Taiwan) |
| 20002 | 2 | (Eten - Taiwan) |
| 20003 | 2 | (IBM5550 - Taiwan) |
| 20004 | 2 | (TeleText - Taiwan) |
| 20005 | 2 | (Wang - Taiwan) |
| 20261 | 2 | (T.61) |
| 20932 | 2 | (JIS X 0208-1990 & 0212-1990) |
| 20936 | 2 | (Simplified Chinese GB2312) |
| 51949 | 2 | (EUC-Korean) |
| 55000 | 2 | (SMS GSM 7bit) |
| 55001 | 2 | (SMS GSM 7bit Spanish) |
| 55002 | 2 | (SMS GSM 7bit Portuguese) |
| 55003 | 2 | (SMS GSM 7bit Turkish) |
| 55004 | 2 | (SMS GSM 7bit Greek) |
| 54936 | 4 | (GB18030 Simplified Chinese) |
| 57002 | 4 | (ISCII - Devanagari) |
| 57003 | 4 | (ISCII - Bengali) |
| 57004 | 4 | (ISCII - Tamil) |
| 57005 | 4 | (ISCII - Telugu) |
| 57006 | 4 | (ISCII - Assamese) |
| 57007 | 4 | (ISCII - Odia (Oriya)) |
| 57008 | 4 | (ISCII - Kannada) |
| 57009 | 4 | (ISCII - Malayalam) |
| 57010 | 4 | (ISCII - Gujarati) |
| 57011 | 4 | (ISCII - Punjabi (Gurmukhi)) |
| 65001 | 4 | (UTF-8) |
| 50220 | 5 | (ISO-2022 Japanese with no halfwidth Katakana) |
| 50221 | 5 | (ISO-2022 Japanese with halfwidth Katakana) |
| 50222 | 5 | (ISO-2022 Japanese JIS X 0201-1989) |
| 50225 | 5 | (ISO-2022 Korean) |
| 50227 | 5 | (ISO-2022 Simplified Chinese) |
| 50229 | 5 | (ISO-2022 Traditional Chinese) |
| 52936 | 5 | (HZ-GB2312 Simplified Chinese) |
| 65000 | 5 | (UTF-7) |
Click a code page's header to view its additional info on Wikipedia.
| Byte Number | Code Page 437 | Code Page 850 | Code Page 1252 | ||||
|---|---|---|---|---|---|---|---|
| Decimal | Hexadecimal | Character | Code Point | Character | Code Point | Character | Code Point |
| 0 | 0 | NUL | 0000 | NUL | 0000 | NUL | 0000 |
| 1 | 1 | ☺︎ | 263A | ☺︎ | 263A | SOH | 0001 |
| 2 | 2 | ☻ | 263B | ☻ | 263B | STX | 0002 |
| 3 | 3 | ♥︎ | 2665 | ♥︎ | 2665 | ETX | 0003 |
| 4 | 4 | ♦︎ | 2666 | ♦︎ | 2666 | EOT | 0004 |
| 5 | 5 | ♣︎ | 2663 | ♣︎ | 2663 | ENQ | 0005 |
| 6 | 6 | ♠︎ | 2660 | ♠︎ | 2660 | ACK | 0006 |
| 7 | 7 | • | 2022 | • | 2022 | BEL | 0007 |
| 8 | 8 | ◘ | 25D8 | ◘ | 25D8 | BS | 0008 |
| 9 | 9 | ○ | 25CB | ○ | 25CB | HT | 0009 |
| 10 | A | ◙ | 25D9 | ◙ | 25D9 | LF | 000A |
| 11 | B | ♂︎ | 2642 | ♂︎ | 2642 | VT | 000B |
| 12 | C | ♀︎ | 2640 | ♀︎ | 2640 | FF | 000C |
| 13 | D | ♪ | 266A | ♪ | 266A | CR | 000D |
| 14 | E | ♫ | 266B | ♫ | 266B | SO | 000E |
| 15 | F | ☼ | 263C | ☼ | 263C | SI | 000F |
| 16 | 10 | ► | 25BA | ► | 25BA | DLE | 0010 |
| 17 | 11 | ◄ | 25C4 | ◄ | 25C4 | DC1 | 0011 |
| 18 | 12 | ↕︎ | 2195 | ↕︎ | 2195 | DC2 | 0012 |
| 19 | 13 | ‼︎ | 203C | ‼︎ | 203C | DC3 | 0013 |
| 20 | 14 | ¶ | 00B6 | ¶ | 00B6 | DC4 | 0014 |
| 21 | 15 | § | 00A7 | § | 00A7 | NAK | 0015 |
| 22 | 16 | ▬ | 25AC | ▬ | 25AC | SYN | 0016 |
| 23 | 17 | ↨ | 21A8 | ↨ | 21A8 | ETB | 0017 |
| 24 | 18 | ↑ | 2191 | ↑ | 2191 | CAN | 0018 |
| 25 | 19 | ↓ | 2193 | ↓ | 2193 | EM | 0019 |
| 26 | 1A | → | 2192 | → | 2192 | SUB | 001A |
| 27 | 1B | ← | 2190 | ← | 2190 | ESC | 001B |
| 28 | 1C | ∟ | 221F | ∟ | 221F | FS | 001C |
| 29 | 1D | ↔︎ | 2194 | ↔︎ | 2194 | GS | 001D |
| 30 | 1E | ▲ | 25B2 | ▲ | 25B2 | RS | 001E |
| 31 | 1F | ▼ | 25BC | ▼ | 25BC | US | 001F |
| 32 | 20 | SP | 0020 | SP | 0020 | SP | 0020 |
| 33 | 21 | ! | 0021 | ! | 0021 | ! | 0021 |
| 34 | 22 | " | 0022 | " | 0022 | " | 0022 |
| 35 | 23 | # | 0023 | # | 0023 | # | 0023 |
| 36 | 24 | $ | 0024 | $ | 0024 | $ | 0024 |
| 37 | 25 | % | 0025 | % | 0025 | % | 0025 |
| 38 | 26 | & | 0026 | & | 0026 | & | 0026 |
| 39 | 27 | ' | 0027 | ' | 0027 | ' | 0027 |
| 40 | 28 | ( | 0028 | ( | 0028 | ( | 0028 |
| 41 | 29 | ) | 0029 | ) | 0029 | ) | 0029 |
| 42 | 2A | * | 002A | * | 002A | * | 002A |
| 43 | 2B | + | 002B | + | 002B | + | 002B |
| 44 | 2C | , | 002C | , | 002C | , | 002C |
| 45 | 2D | - | 002D | - | 002D | - | 002D |
| 46 | 2E | . | 002E | . | 002E | . | 002E |
| 47 | 2F | / | 002F | / | 002F | / | 002F |
| 48 | 30 | 0 | 0030 | 0 | 0030 | 0 | 0030 |
| 49 | 31 | 1 | 0031 | 1 | 0031 | 1 | 0031 |
| 50 | 32 | 2 | 0032 | 2 | 0032 | 2 | 0032 |
| 51 | 33 | 3 | 0033 | 3 | 0033 | 3 | 0033 |
| 52 | 34 | 4 | 0034 | 4 | 0034 | 4 | 0034 |
| 53 | 35 | 5 | 0035 | 5 | 0035 | 5 | 0035 |
| 54 | 36 | 6 | 0036 | 6 | 0036 | 6 | 0036 |
| 55 | 37 | 7 | 0037 | 7 | 0037 | 7 | 0037 |
| 56 | 38 | 8 | 0038 | 8 | 0038 | 8 | 0038 |
| 57 | 39 | 9 | 0039 | 9 | 0039 | 9 | 0039 |
| 58 | 3A | : | 003A | : | 003A | : | 003A |
| 59 | 3B | ; | 003B | ; | 003B | ; | 003B |
| 60 | 3C | < | 003C | < | 003C | < | 003C |
| 61 | 3D | = | 003D | = | 003D | = | 003D |
| 62 | 3E | > | 003E | > | 003E | > | 003E |
| 63 | 3F | ? | 003F | ? | 003F | ? | 003F |
| 64 | 40 | @ | 0040 | @ | 0040 | @ | 0040 |
| 65 | 41 | A | 0041 | A | 0041 | A | 0041 |
| 66 | 42 | B | 0042 | B | 0042 | B | 0042 |
| 67 | 43 | C | 0043 | C | 0043 | C | 0043 |
| 68 | 44 | D | 0044 | D | 0044 | D | 0044 |
| 69 | 45 | E | 0045 | E | 0045 | E | 0045 |
| 70 | 46 | F | 0046 | F | 0046 | F | 0046 |
| 71 | 47 | G | 0047 | G | 0047 | G | 0047 |
| 72 | 48 | H | 0048 | H | 0048 | H | 0048 |
| 73 | 49 | I | 0049 | I | 0049 | I | 0049 |
| 74 | 4A | J | 004A | J | 004A | J | 004A |
| 75 | 4B | K | 004B | K | 004B | K | 004B |
| 76 | 4C | L | 004C | L | 004C | L | 004C |
| 77 | 4D | M | 004D | M | 004D | M | 004D |
| 78 | 4E | N | 004E | N | 004E | N | 004E |
| 79 | 4F | O | 004F | O | 004F | O | 004F |
| 80 | 50 | P | 0050 | P | 0050 | P | 0050 |
| 81 | 51 | Q | 0051 | Q | 0051 | Q | 0051 |
| 82 | 52 | R | 0052 | R | 0052 | R | 0052 |
| 83 | 53 | S | 0053 | S | 0053 | S | 0053 |
| 84 | 54 | T | 0054 | T | 0054 | T | 0054 |
| 85 | 55 | U | 0055 | U | 0055 | U | 0055 |
| 86 | 56 | V | 0056 | V | 0056 | V | 0056 |
| 87 | 57 | W | 0057 | W | 0057 | W | 0057 |
| 88 | 58 | X | 0058 | X | 0058 | X | 0058 |
| 89 | 59 | Y | 0059 | Y | 0059 | Y | 0059 |
| 90 | 5A | Z | 005A | Z | 005A | Z | 005A |
| 91 | 5B | [ | 005B | [ | 005B | [ | 005B |
| 92 | 5C | \ | 005C | \ | 005C | \ | 005C |
| 93 | 5D | ] | 005D | ] | 005D | ] | 005D |
| 94 | 5E | ^ | 005E | ^ | 005E | ^ | 005E |
| 95 | 5F | _ | 005F | _ | 005F | _ | 005F |
| 96 | 60 | ` | 0060 | ` | 0060 | ` | 0060 |
| 97 | 61 | a | 0061 | a | 0061 | a | 0061 |
| 98 | 62 | b | 0062 | b | 0062 | b | 0062 |
| 99 | 63 | c | 0063 | c | 0063 | c | 0063 |
| 100 | 64 | d | 0064 | d | 0064 | d | 0064 |
| 101 | 65 | e | 0065 | e | 0065 | e | 0065 |
| 102 | 66 | f | 0066 | f | 0066 | f | 0066 |
| 103 | 67 | g | 0067 | g | 0067 | g | 0067 |
| 104 | 68 | h | 0068 | h | 0068 | h | 0068 |
| 105 | 69 | i | 0069 | i | 0069 | i | 0069 |
| 106 | 6A | j | 006A | j | 006A | j | 006A |
| 107 | 6B | k | 006B | k | 006B | k | 006B |
| 108 | 6C | l | 006C | l | 006C | l | 006C |
| 109 | 6D | m | 006D | m | 006D | m | 006D |
| 110 | 6E | n | 006E | n | 006E | n | 006E |
| 111 | 6F | o | 006F | o | 006F | o | 006F |
| 112 | 70 | p | 0070 | p | 0070 | p | 0070 |
| 113 | 71 | q | 0071 | q | 0071 | q | 0071 |
| 114 | 72 | r | 0072 | r | 0072 | r | 0072 |
| 115 | 73 | s | 0073 | s | 0073 | s | 0073 |
| 116 | 74 | t | 0074 | t | 0074 | t | 0074 |
| 117 | 75 | u | 0075 | u | 0075 | u | 0075 |
| 118 | 76 | v | 0076 | v | 0076 | v | 0076 |
| 119 | 77 | w | 0077 | w | 0077 | w | 0077 |
| 120 | 78 | x | 0078 | x | 0078 | x | 0078 |
| 121 | 79 | y | 0079 | y | 0079 | y | 0079 |
| 122 | 7A | z | 007A | z | 007A | z | 007A |
| 123 | 7B | { | 007B | { | 007B | { | 007B |
| 124 | 7C | | | 007C | | | 007C | | | 007C |
| 125 | 7D | } | 007D | } | 007D | } | 007D |
| 126 | 7E | ~ | 007E | ~ | 007E | ~ | 007E |
| 127 | 7F | ⌂ | 2302 | ⌂ | 2302 | DEL | 007F |
| 128 | 80 | Ç | 00C7 | Ç | 00C7 | € | 20AC |
| 129 | 81 | ü | 00FC | ü | 00FC | UNUSED | |
| 130 | 82 | é | 00E9 | é | 00E9 | ‚ | 201A |
| 131 | 83 | â | 00E2 | â | 00E2 | ƒ | 0192 |
| 132 | 84 | ä | 00E4 | ä | 00E4 | „ | 201E |
| 133 | 85 | à | 00E0 | à | 00E0 | … | 2026 |
| 134 | 86 | å | 00E5 | å | 00E5 | † | 2020 |
| 135 | 87 | ç | 00E7 | ç | 00E7 | ‡ | 2021 |
| 136 | 88 | ê | 00EA | ê | 00EA | ˆ | 02C6 |
| 137 | 89 | ë | 00EB | ë | 00EB | ‰ | 2030 |
| 138 | 8A | è | 00E8 | è | 00E8 | Š | 0160 |
| 139 | 8B | ï | 00EF | ï | 00EF | ‹ | 2039 |
| 140 | 8C | î | 00EE | î | 00EE | Œ | 0152 |
| 141 | 8D | ì | 00EC | ì | 00EC | UNUSED | |
| 142 | 8E | Ä | 00C4 | Ä | 00C4 | Ž | 017D |
| 143 | 8F | Å | 00C5 | Å | 00C5 | UNUSED | |
| 144 | 90 | É | 00C9 | É | 00C9 | UNUSED | |
| 145 | 91 | æ | 00E6 | æ | 00E6 | ‘ | 2018 |
| 146 | 92 | Æ | 00C6 | Æ | 00C6 | ’ | 2019 |
| 147 | 93 | ô | 00F4 | ô | 00F4 | “ | 201C |
| 148 | 94 | ö | 00F6 | ö | 00F6 | ” | 201D |
| 149 | 95 | ò | 00F2 | ò | 00F2 | • | 2022 |
| 150 | 96 | û | 00FB | û | 00FB | – | 2013 |
| 151 | 97 | ù | 00F9 | ù | 00F9 | — | 2014 |
| 152 | 98 | ÿ | 00FF | ÿ | 00FF | ˜ | 02DC |
| 153 | 99 | Ö | 00D6 | Ö | 00D6 | ™ | 2122 |
| 154 | 9A | Ü | 00DC | Ü | 00DC | š | 0161 |
| 155 | 9B | ¢ | 00A2 | ø | 00F8 | › | 203A |
| 156 | 9C | £ | 00A3 | £ | 00A3 | œ | 0153 |
| 157 | 9D | ¥ | 00A5 | Ø | 00D8 | UNUSED | |
| 158 | 9E | ₧ | 20A7 | × | 00D7 | ž | 017E |
| 159 | 9F | ƒ | 0192 | ƒ | 0192 | Ÿ | 0178 |
| 160 | A0 | á | 00E1 | á | 00E1 | NBSP | 00A0 |
| 161 | A1 | í | 00ED | í | 00ED | ¡ | 00A1 |
| 162 | A2 | ó | 00F3 | ó | 00F3 | ¢ | 00A2 |
| 163 | A3 | ú | 00FA | ú | 00FA | £ | 00A3 |
| 164 | A4 | ñ | 00F1 | ñ | 00F1 | ¤ | 00A4 |
| 165 | A5 | Ñ | 00D1 | Ñ | 00D1 | ¥ | 00A5 |
| 166 | A6 | ª | 00AA | ª | 00AA | ¦ | 00A6 |
| 167 | A7 | º | 00BA | º | 00BA | § | 00A7 |
| 168 | A8 | ¿ | 00BF | ¿ | 00BF | ¨ | 00A8 |
| 169 | A9 | ⌐ | 2310 | ® | 00AE | © | 00A9 |
| 170 | AA | ¬ | 00AC | ¬ | 00AC | ª | 00AA |
| 171 | AB | ½ | 00BD | ½ | 00BD | « | 00AB |
| 172 | AC | ¼ | 00BC | ¼ | 00BC | ¬ | 00AC |
| 173 | AD | ¡ | 00A1 | ¡ | 00A1 | SHY | 00AD |
| 174 | AE | « | 00AB | « | 00AB | ® | 00AE |
| 175 | AF | » | 00BB | » | 00BB | ¯ | 00AF |
| 176 | B0 | ░ | 2591 | ░ | 2591 | ° | 00B0 |
| 177 | B1 | ▒ | 2592 | ▒ | 2592 | ± | 00B1 |
| 178 | B2 | ▓ | 2593 | ▓ | 2593 | ² | 00B2 |
| 179 | B3 | │ | 2502 | │ | 2502 | ³ | 00B3 |
| 180 | B4 | ┤ | 2524 | ┤ | 2524 | ´ | 00B4 |
| 181 | B5 | ╡ | 2561 | Á | 00C1 | µ | 00B5 |
| 182 | B6 | ╢ | 2562 | Â | 00C2 | ¶ | 00B6 |
| 183 | B7 | ╖ | 2556 | À | 00C0 | · | 00B7 |
| 184 | B8 | ╕ | 2555 | © | 00A9 | ¸ | 00B8 |
| 185 | B9 | ╣ | 2563 | ╣ | 2563 | ¹ | 00B9 |
| 186 | BA | ║ | 2551 | ║ | 2551 | º | 00BA |
| 187 | BB | ╗ | 2557 | ╗ | 2557 | » | 00BB |
| 188 | BC | ╝ | 255D | ╝ | 255D | ¼ | 00BC |
| 189 | BD | ╜ | 255C | ¢ | 00A2 | ½ | 00BD |
| 190 | BE | ╛ | 255B | ¥ | 00A5 | ¾ | 00BE |
| 191 | BF | ┐ | 2510 | ┐ | 2510 | ¿ | 00BF |
| 192 | C0 | └ | 2514 | └ | 2514 | À | 00C0 |
| 193 | C1 | ┴ | 2534 | ┴ | 2534 | Á | 00C1 |
| 194 | C2 | ┬ | 252C | ┬ | 252C | Â | 00C2 |
| 195 | C3 | ├ | 251C | ├ | 251C | Ã | 00C3 |
| 196 | C4 | ─ | 2500 | ─ | 2500 | Ä | 00C4 |
| 197 | C5 | ┼ | 253C | ┼ | 253C | Å | 00C5 |
| 198 | C6 | ╞ | 255E | ã | 00E3 | Æ | 00C6 |
| 199 | C7 | ╟ | 255F | Ã | 00C3 | Ç | 00C7 |
| 200 | C8 | ╚ | 255A | ╚ | 255A | È | 00C8 |
| 201 | C9 | ╔ | 2554 | ╔ | 2554 | É | 00C9 |
| 202 | CA | ╩ | 2569 | ╩ | 2569 | Ê | 00CA |
| 203 | CB | ╦ | 2566 | ╦ | 2566 | Ë | 00CB |
| 204 | CC | ╠ | 2560 | ╠ | 2560 | Ì | 00CC |
| 205 | CD | ═ | 2550 | ═ | 2550 | Í | 00CD |
| 206 | CE | ╬ | 256C | ╬ | 256C | Î | 00CE |
| 207 | CF | ╧ | 2567 | ¤ | 00A4 | Ï | 00CF |
| 208 | D0 | ╨ | 2568 | ð | 00F0 | Ð | 00D0 |
| 209 | D1 | ╤ | 2564 | Ð | 00D0 | Ñ | 00D1 |
| 210 | D2 | ╥ | 2565 | Ê | 00CA | Ò | 00D2 |
| 211 | D3 | ╙ | 2559 | Ë | 00CB | Ó | 00D3 |
| 212 | D4 | ╘ | 2558 | È | 00C8 | Ô | 00D4 |
| 213 | D5 | ╒ | 2552 | ı | 0131 | Õ | 00D5 |
| 214 | D6 | ╓ | 2553 | Í | 00CD | Ö | 00D6 |
| 215 | D7 | ╫ | 256B | Î | 00CE | × | 00D7 |
| 216 | D8 | ╪ | 256A | Ï | 00CF | Ø | 00D8 |
| 217 | D9 | ┘ | 2518 | ┘ | 2518 | Ù | 00D9 |
| 218 | DA | ┌ | 250C | ┌ | 250C | Ú | 00DA |
| 219 | DB | █ | 2588 | █ | 2588 | Û | 00DB |
| 220 | DC | ▄ | 2584 | ▄ | 2584 | Ü | 00DC |
| 221 | DD | ▌ | 258C | ¦ | 00A6 | Ý | 00DD |
| 222 | DE | ▐ | 2590 | Ì | 00CC | Þ | 00DE |
| 223 | DF | ▀ | 2580 | ▀ | 2580 | ß | 00DF |
| 224 | E0 | α | 03B1 | Ó | 00D3 | à | 00E0 |
| 225 | E1 | ß | 00DF | ß | 00DF | á | 00E1 |
| 226 | E2 | Γ | 0393 | Ô | 00D4 | â | 00E2 |
| 227 | E3 | π | 03C0 | Ò | 00D2 | ã | 00E3 |
| 228 | E4 | Σ | 03A3 | õ | 00F5 | ä | 00E4 |
| 229 | E5 | σ | 03C3 | Õ | 00D5 | å | 00E5 |
| 230 | E6 | µ | 00B5 | µ | 00B5 | æ | 00E6 |
| 231 | E7 | τ | 03C4 | þ | 00FE | ç | 00E7 |
| 232 | E8 | Φ | 03A6 | Þ | 00DE | è | 00E8 |
| 233 | E9 | Θ | 0398 | Ú | 00DA | é | 00E9 |
| 234 | EA | Ω | 03A9 | Û | 00DB | ê | 00EA |
| 235 | EB | δ | 03B4 | Ù | 00D9 | ë | 00EB |
| 236 | EC | ∞ | 221E | ý | 00FD | ì | 00EC |
| 237 | ED | φ | 03C6 | Ý | 00DD | í | 00ED |
| 238 | EE | ε | 03B5 | ¯ | 00AF | î | 00EE |
| 239 | EF | ∩ | 2229 | ´ | 00B4 | ï | 00EF |
| 240 | F0 | ≡ | 2261 | SHY | 00AD | ð | 00F0 |
| 241 | F1 | ± | 00B1 | ± | 00B1 | ñ | 00F1 |
| 242 | F2 | ≥ | 2265 | ‗ | 2017 | ò | 00F2 |
| 243 | F3 | ≤ | 2264 | ¾ | 00BE | ó | 00F3 |
| 244 | F4 | ⌠ | 2320 | ¶ | 00B6 | ô | 00F4 |
| 245 | F5 | ⌡ | 2321 | § | 00A7 | õ | 00F5 |
| 246 | F6 | ÷ | 00F7 | ÷ | 00F7 | ö | 00F6 |
| 247 | F7 | ≈ | 2248 | ¸ | 00B8 | ÷ | 00F7 |
| 248 | F8 | ° | 00B0 | ° | 00B0 | ø | 00F8 |
| 249 | F9 | ∙ | 2219 | ¨ | 00A8 | ù | 00F9 |
| 250 | FA | · | 00B7 | · | 00B7 | ú | 00FA |
| 251 | FB | √ | 221A | ¹ | 00B9 | û | 00FB |
| 252 | FC | ⁿ | 207F | ³ | 00B3 | ü | 00FC |
| 253 | FD | ² | 00B2 | ² | 00B2 | ý | 00FD |
| 254 | FE | ■ | 25A0 | ■ | 25A0 | þ | 00FE |
| 255 | FF | NBSP | 00A0 | NBSP | 00A0 | ÿ | 00FF |
SP is the space character.
NBSP is the "no break space" character.
SHY is the "soft hyphen" character.
"₧" is one character, signifying the "pesetas" currency.