|
UTF-8 stands for Unicode Transformation Format-8. It is a lossless, octet-based (8-bit) encoding of Unicode characters. UTF-8 encodes each Unicode character as a variable-length sequence of 1 to 4 octets, where the number of octets depends on the integer value (code point) assigned to the character. It is an efficient encoding for Unicode documents that consist mostly of US-ASCII characters, because each US-ASCII character occupies a single octet. UTF-8 is the Unicode encoding supported on UNIX platforms and used for HTML and most Internet browsers. Other platforms, such as Windows and Java, use the 16-bit UCS-2 encoding or its successor, UTF-16. |

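The relationship between a character's integer value and its encoded length can be illustrated with a short Python sketch. The helper name `utf8_length` is hypothetical and is introduced here only for illustration; the range boundaries are those of the standard UTF-8 definition, and the loop cross-checks the result against Python's built-in `str.encode`.

```python
def utf8_length(code_point: int) -> int:
    """Return the number of octets UTF-8 uses for one Unicode code point.

    Hypothetical helper for illustration; not part of any library.
    """
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("outside the Unicode code point range")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("surrogate code points cannot be encoded in UTF-8")
    if code_point <= 0x7F:      # US-ASCII range
        return 1
    if code_point <= 0x7FF:
        return 2
    if code_point <= 0xFFFF:    # rest of the Basic Multilingual Plane
        return 3
    return 4                    # supplementary planes


# Compare the computed length with what Python's own encoder produces.
for ch in ("A", "é", "€", "😀"):
    encoded = ch.encode("utf-8")
    assert len(encoded) == utf8_length(ord(ch))
    print(f"U+{ord(ch):04X}: {len(encoded)} octet(s), hex {encoded.hex()}")
```

A US-ASCII letter such as "A" takes one octet, which is why mostly-ASCII documents stay compact in UTF-8, while characters with larger integer values take two, three, or four octets.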