Home. 
.

transparent

transparent

transparent

Altova Mailing List Archives


Re: [xml-dev] XML not ideal for Big Data

From: David Carver <d_a_carver@-----.--->
To: Jim Tivy <jimt@----------.--->
Date: 9/3/2009 8:24:00 PM
> Three limitations to processing XML files are:
>
> 1. XML File Size as set by the OS.
> 2. RAM consumption.
> 3. CPU consumption.
>
> Most XML Parsers can be used on big files (100GB) without exceeding these
> limitations.  This is because XML Parsers are stream based - reading small
> chunks at a time.  If you want to process the XML file, however, you will
> need to use a streaming technology like SAX.  Other XML processing
> technologies like many DOM implementations will cause you to exceed RAM.
>
> Choosing a max like 1MB to 50MB will allow you to more freely use a wide
> variety of XML and other technologies (like EMail attachments) making your
> XML less constrained.  Again, it depends on your use cases for the XML.
>   

Jim in many ways you have hit the nail on the head. XML can be 
successfully used with large data dumps. The underlying problem is not 
necessarily with the data in XML format but the tools and frameworks 
that are used to process it. The knee jerk reaction to the common 
programmer when they have to deal with XML is to try and data bind 
against it. There are a wide variety of ways to process the XML, and 
normally the most common method isn't going to be the correct method 
with large data stores. XML Data Bases, Streaming, STAX, SAX, etc are 
much more efficient ways, then just trying to databind and store 
everything in memory (which is what typically is the first reaction). 
When the data binding fails, the programmer typically blames XML not 
their own choice in technology to process it.

Dave



_______________________________________________________________________

XML-DEV is a publicly archived, unmoderated list hosted by OASIS
to support XML implementation and development. To minimize
spam in the archives, you must subscribe before posting.

[Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
Or unsubscribe: xml-dev-unsubscribe@l...
subscribe: xml-dev-subscribe@l...
List archive: http://lists.xml.org/archives/xml-dev/
List Guidelines: http://www.oasis-open.org/maillists/guidelines.php



transparent
Print
Mail
Like It
Disclaimer
.

These Archives are provided for informational purposes only and have been generated directly from the Altova mailing list archive system and are comprised of the lists set forth on www.altova.com/list/index.html. Therefore, Altova does not warrant or guarantee the accuracy, reliability, completeness, usefulness, non-infringement of intellectual property rights, or quality of any content on the Altova Mailing List Archive(s), regardless of who originates that content. You expressly understand and agree that you bear all risks associated with using or relying on that content. Altova will not be liable or responsible in any way for any content posted including, but not limited to, any errors or omissions in content, or for any losses or damage of any kind incurred as a result of the use of or reliance on any content. This disclaimer and limitation on liability is in addition to the disclaimers and limitations contained in the Website Terms of Use and elsewhere on the site.

.
.

transparent

transparent