Altova Mailing List Archives>Archive Index >comp.text.xml Archive Home >Recent entries >Thread Prev - Re: XML parse validation [Thread Next] Re: XML parse validationTo: NULL Date: 1/9/2008 5:36:00 AM On 9 Jan, 06:29, sarosh.shir...@gmail.com wrote: > To Joseph: It's part of the requirement that such characters would > come up. I doubt this very much. The _character_ / codepoint "&x00" is a different concept to the byte or octet "&x00". Although Unicode encodings may well involve such a byte value at the level of the raw wire protocol, they certainly don't allow it as a valid character (sic, codepoint). XML, at the level you describe it, is a character stream. In XML the entity � is a reference to this possible (albeit forbidden) 00 value as a _character_, not just a raw byte. It sounds as if your problem here is an encoding problem (i.e. a Unicode problem, not an XML problem), even before it gets as far as being an XML well-formedness issue. Raw bytes 0f 00 are just bytes (which might have some correct place in the encoding you're using) but they're not intended to encode a resultant _character_ of 00, or the same thing as a numeric entity of � | ||||||
| Company | Legal | Press | Partners | Careers | Sitemap | Contact Us | Altova Blog | Mobile | Full Site | |||
|
