Altova Mailing List Archives


Re: [xml-dev] xml over http - RFC 3023

From: Rick Jelliffe <rjelliffe@-------.---.-->
To: xml-dev <xml-dev@-----.---.--->
Date: 12/1/2008 7:47:00 AM

Andrew Welch wrote:
> Hi all,
>
> There's a very good article here about the problem of reading feeds
> from all over the world in different encodings:
>
> http://www.xml.com/pub/a/2004/07/21/dive.html
>   
That article, when it came out, was a little irritating, because it 
claimed to have discovered that, behind the mechanisms that we had put 
into place in XML to work around the crapulous problems in the 
internet/MIME/HTTP specs, there was a problem. 

It is like seeing a plaster cast on a broken arm and saying "I have 
discovered that arm is broken!". Or, in the way the heading was worded, 
saying "I have discovered that plaster casts do not prevent broken arms: 
look underneath it, the arm is broken!"

The out-of-band signalling of character encoding is a fundamentally 
broken idea, because programs that generate data have no mechanism for 
recording the character encoding they used in a form that the rest of 
the food chain can pick up. It was workable before the WWW and outside 
of East Asia, but as soon as UTF-8 came along it was impractical even for 
the West: this was obvious by the mid-90s. So what does a standards group do 
when the official standards are broken and there is little hope of 
fixing them? It creatively ignores them. Ignoring dumb standards is a 
virtue.

So XML got the XML header in the full knowledge that many (most) web 
systems that used text/* implemented the ASCII default by being 8-bit 
clean and non-transcoding, which leaves the XML file uncorrupted and the 
XML header in full play.
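
A minimal sketch of what "the XML header in full play" means for a 
receiver, assuming the bytes arrived untouched through an 8-bit clean 
channel and are in an ASCII-compatible encoding. The helper name 
declared_encoding and the UTF-8 fallback are illustrative assumptions, 
not part of any library; a real parser also handles UTF-16 and the other 
autodetection cases in the XML spec.

```python
import re

# Hypothetical helper: pull the encoding pseudo-attribute out of the XML
# declaration. Only covers ASCII-compatible encodings; UTF-16/UTF-32
# autodetection is deliberately left out of this sketch.
_DECL = re.compile(
    rb'^<\?xml\s[^>]*?encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\']'
)

def declared_encoding(data: bytes, default: str = "utf-8") -> str:
    # Skip a UTF-8 byte order mark if one is present.
    if data.startswith(b"\xef\xbb\xbf"):
        data = data[3:]
    match = _DECL.match(data)
    # No declaration (or no encoding in it): fall back to the assumed default.
    return match.group(1).decode("ascii") if match else default

# The document carries its own label, so nothing outside it is needed.
raw = b'<?xml version="1.0" encoding="ISO-8859-1"?><a>\xe9</a>'
text = raw.decode(declared_encoding(raw))
```

As long as nothing between sender and receiver transcodes the bytes, the 
label travels with the data and the out-of-band charset is not needed.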

> At the moment it all seems pretty complicated... especially
> considering XML was designed for the web.  The problem of parsing
> feeds from all over the world must have been tackled a few times over by
> now?
>   
It is not complicated. Use application/xml.
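
To illustrate why the media type matters, here is a rough sketch of 
where a receiver following RFC 3023 looks for the encoding. The function 
name encoding_source and the strings it returns are made up for this 
example; only the rules it encodes come from the RFC: an explicit 
charset parameter wins, application/xml without one defers to the 
document itself, and text/xml without one defaults to us-ascii.

```python
from email.message import Message

def encoding_source(content_type_header: str) -> str:
    # Use the stdlib header parser to split media type and parameters.
    msg = Message()
    msg["Content-Type"] = content_type_header
    charset = msg.get_param("charset")
    if charset is not None:
        # An explicit charset parameter is authoritative for both types.
        return f"charset parameter: {charset}"
    media_type = msg.get_content_type()
    if media_type == "application/xml":
        # No charset: the XML declaration / BOM in the document decides.
        return "XML declaration or BOM inside the document"
    if media_type == "text/xml":
        # No charset: RFC 3023 says us-ascii, whatever the document claims.
        return "us-ascii default"
    return "not an XML media type covered here"
```

With application/xml and no charset parameter, the XML header is the 
only label in play, which is the case the advice above aims for.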

If you do find intermediate web systems that implement the ASCII default 
or the ISO 8859-1 default as anything other than 8-bit clean for text/xml, 
submit a bug report.

If you find systems that accept text/xml but not application/xml, then 
find some way to discreetly help the developers out of their 
embarrassing bozo-the-clown moment.

Cheers
Rick Jelliffe

_______________________________________________________________________

XML-DEV is a publicly archived, unmoderated list hosted by OASIS
to support XML implementation and development. To minimize
spam in the archives, you must subscribe before posting.

[Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
Or unsubscribe: xml-dev-unsubscribe@l...
subscribe: xml-dev-subscribe@l...
List archive: http://lists.xml.org/archives/xml-dev/
List Guidelines: http://www.oasis-open.org/maillists/guidelines.php


