Re: [xml-dev] Use of UTF-8 and UTF-16
To: xml-dev@-----.---.---
Date: 11/2/2005 4:35:00 PM

In article <4368C783.3070008@s...> you write:

>UTF-8 uses 6 bytes for ISO/IEC 10646
>UTF-8 uses 4 bytes for Unicode

UTF-8 would need 6 bytes to represent code points up to 2^31-1, but
the Unicode codespace only goes to 10ffff, so only 4 bytes are needed
for Unicode characters. 10ffff is (presumably not coincidentally) the
limit of what UTF-16 can represent using surrogates.

-- Richard
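
The arithmetic behind the two limits can be checked with a short sketch. It assumes the original UTF-8 layout, in which sequences of 1 to 6 bytes carry 7, 11, 16, 21, 26 and 31 payload bits respectively. The names `utf8_length` and `UTF16_MAX` are made up for this illustration and do not come from any library.

```python
def utf8_length(code_point: int) -> int:
    """Bytes needed under the original UTF-8 scheme (up to 6 bytes)."""
    if code_point < 0 or code_point > 0x7FFFFFFF:
        raise ValueError("code point outside 0..2^31-1")
    # Payload bits available in a 1-, 2-, 3-, 4-, 5- and 6-byte sequence.
    for length, bits in enumerate((7, 11, 16, 21, 26, 31), start=1):
        if code_point < (1 << bits):
            return length
    raise AssertionError("unreachable: range was checked above")


# A surrogate pair carries 10 bits in each half (20 bits in total),
# offset by 0x10000, so this is the highest code point UTF-16 can reach.
UTF16_MAX = 0x10000 + ((0x3FF << 10) | 0x3FF)

print(hex(UTF16_MAX))             # top of the surrogate range
print(utf8_length(UTF16_MAX))     # bytes needed for that code point
print(utf8_length(0x7FFFFFFF))    # bytes needed for 2^31-1
```

Python's built-in encoder agrees on the Unicode side: `chr(0x10FFFF).encode("utf-8")` yields a 4-byte sequence, and `chr()` rejects anything above 0x10FFFF.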
