Menu

HTML TUTORIALS - HTML Character Encodings

HTML Character Encodings

ADVERTISEMENTS

Character Set Description
ISO-8859-1 Latin alphabet part 1
Covering North America,Western Europe, Latin America, theCaribbean, Canada, Africa
ISO-8859-2 Latin alphabet part 2
Covering Eastern Europe
ISO-8859-3 Latin alphabet part 3
Covering SE Europe, Esperanto, miscellaneous others
ISO-8859-4 Latin alphabet part 4
Covering Scandinavia/Baltics (and others not in ISO-8859-1)
ISO-8859-5 Latin/Cyrillic alphabet part 5
ISO-8859-6 Latin/Arabic alphabet part 6
ISO-8859-7 Latin/Greek alphabet part 7
ISO-8859-8 Latin/Hebrew alphabet part 8
ISO-8859-9 Latin 5 alphabet part 9
Same as ISO-8859-1 except Turkish characters replace Icelandic ones
ISO-8859-10 Latin 6 Latin 6 Lappish, Nordic, and Eskimo
ISO-8859-15 The same as ISO-8859-1 but with more characters added
ISO-2022-JP Latin/Japanese alphabet part 1
ISO-2022-JP-2 Latin/Japanese alphabet part 2
ISO-2022-KR Latin/Korean alphabet part 1

ADVERTISEMENTS

Character Set Description
UTF-8 A Unicode Translation Format that comes in 8-bit units. That is, it comes in bytes. A character in UTF8 can be from 1 to 4 bytes long, making UTF8 variable width.
UTF-16 A Unicode Translation Format that comes in 16-bit units. That is, it comes in shorts. It can be 1 or 2 shorts long, making UTF16 variable width.
UTF-32 A Unicode Translation Format that comes in 32-bit units. That is, it comes in longs. It is a fixed-width format and is always 1 "long" in length.

ADVERTISEMENTS