This page is still under development. Please take with a grain of salt!
The pages linked to are each displayed in a different charset, showing how different bytes-values are displayed in the given charset.

ISO-8859 Character sets

iso-8859-1 - Latin-1 (Northern European)
Western Europe and Scandanavian: Afrikaans, Basque, Catalan, Danish, Dutch, English, Faeroese, Finnish, French, Galician, German, Icelandic, Irish, Italian, Norwegian, Portuguese, Spanish and Swedish.
Note: The Dutch IJ and ij (IJ & &307;), the German versions of double-quotes: „ & ” („ & ”), and the French œ & Œ (œ & Œ) are in the Supplementary character Set.

Additional Characters for Vietnamese

iso-8859-2 - Latin-2 (Eastern European)
Latin-written Slavic and Central European: Czech, German, Hungarian, Polish, Romanian, Croatian, Slovak, Slovene.
Note: Š & š (Š & š) Č & č (Č & č) and Ž & ž (Ž & ž) are in the Supplementary Character Set.

iso-8859-3 - Latin-3 (Southern European)
Esperanto, Galician, Maltese, and Turkish.

iso-8859-5 - Cyrillic
Bulgarian, Byelorussian, Macedonian, Russian, Serbian and Ukrainian.

Cyrillic Characters in Supplementary Set

iso-8859-6 - Non-accented Arabic

Arabic Characters in Supplementary Set

iso-8859-7 Greek

Greek Characters in Supplementary Set

iso-8859-8 - Non-accented Hebrew

iso-8859-9 - Latin-5 (Turkish)
As for iso-8859-1, but Turkish instead of Icelandic.

iso-8859-10 - Latin-6 (Nordic)
Lappish/Nordic/Eskimo languages: Adds the last Inuit (Greenlandic) and Sami (Lappish) letters that were missing in Latin 4 to cover the entire Nordic area.

iso-8859-11 - Thai

Boishakhi font - Bengali


Supplementary Character set 256 to 8993

Favourites from Supplementary Character set

non-SGML Characters (129 to 159: not to be used!)

See Global Development lists.

See Examples of Mathematical Formulae in HTML.


Windows Character Sets

windows-1250 - Central European

windows-1251 - Russian

windows-1252 - Western Europe

windows-1253 - Greek

windows-1254 - Turkish

windows-1255 - Hebrew

windows-1256 - Arabic

windows-1257 - Baltic

windows-874 - Thai

See Global Development lists.


Other Character Sets

unicode1200Universal Alphabet
unicodeFEFF1201Universal Alphabet (Big-Endian)
utf-765000Universal Alphabet (UTF-7)
utf-865001Universal Alphabet (UTF-8)
iso-2022-jp50220Japanese (JIS)
iso-2022-jp50222Japanese (JIS-Allow 1 byte Kana)
iso-2022-kr50225Korean (ISO)
DIN_6600320106IA5 (German)
NS_4551-120108IA5 (Norwegian)
SEN_850200_B20107IA5 (Swedish)
_autodetect50932Japanese (Auto Select)
_autodetect_kr50949Korean (Auto Select)
big5950Chinese Traditional (Big5)
csISO2022JP50221Japanese (JIS-Allow 1 byte Kana)
euc-kr51949Korean (EUC)
gb2312936Chinese Simplified (GB2312)
hz-gb-231252936Chinese Simplified (HZ)
ibm852852Central European (DOS)
ibm866866Cyrillic Alphabet (DOS)
irv20105IA5 (IRV)
koi8-r 20866Cyrillic Alphabet (KOI8-R)
ks_c_5601949Korean
shift-jis932Japanese (Shift-JIS)
windows-874874Thai (Windows)
x-euc51932Japanese (EUC)