Altova Mailing List Archives


RE: [xml-dev] Auto-generate a DTD from multiple XML documents?

From: "Michael Kay" <mike@--------.--->
To: "'Jon Noring'" <jon@------.---->,<xml-dev@-----.---.--->
Date: 10/7/2007 3:40:00 AM
The Saxon DTDGenerator only handles one input document, but I've used it for
this scenario in the past, simply by concatenating all the documents within
a dummy root element, generating the DTD, and then discarding the dummy root
element definition. (It doesn't need to be a physical concatenation: you can
set up a document that brings all the others in as external entities.)
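
A minimal sketch of the external-entity approach, written in Python purely
for illustration. The function name `build_wrapper`, the entity names
(`doc1`, `doc2`, ...) and the dummy root name `dummy-root` are all
hypothetical, not part of Saxon. It assumes the input files carry no DOCTYPE
declaration of their own, since an external parsed entity may not contain
one.

```python
from pathlib import Path
from xml.sax.saxutils import quoteattr


def build_wrapper(paths, dummy_root="dummy-root"):
    """Return the text of a wrapper document that pulls in each file in
    `paths` as an external parsed entity under a single dummy root element.

    Assumption: the referenced files have no DOCTYPE declaration.
    """
    decls = []
    refs = []
    for i, p in enumerate(paths, start=1):
        name = f"doc{i}"  # hypothetical entity name, one per input file
        uri = Path(p).resolve().as_uri()  # absolute file: URI as system identifier
        decls.append(f"  <!ENTITY {name} SYSTEM {quoteattr(uri)}>")
        refs.append(f"  &{name};")
    return "\n".join([
        '<?xml version="1.0"?>',
        f"<!DOCTYPE {dummy_root} [",
        *decls,
        "]>",
        f"<{dummy_root}>",
        *refs,
        f"</{dummy_root}>",
        "",
    ])


if __name__ == "__main__":
    # Example: write a wrapper for every .xml file in the current directory.
    files = sorted(str(f) for f in Path(".").glob("*.xml"))
    Path("wrapper.out").write_text(build_wrapper(files), encoding="utf-8")
```

The resulting wrapper would then be the single input given to the DTD
generator, and the definition generated for the dummy root element is the
part to discard afterwards.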

As it happens, I produced a new version of the DTDGenerator a couple of
weeks ago. It is the first release for 6 years, and it fixes a bug that
caused memory use to grow slowly in proportion to the number of distinct
attribute values. After fixing that, I was able to process 20Gb of input in
about 25 minutes. It's available as a free-standing download from the Saxon
project on SourceForge.

Michael Kay
http://www.saxonica.com/ 


> -----Original Message-----
> From: Jon Noring [mailto:jnoring@g...] On Behalf Of Jon Noring
> Sent: 06 October 2007 21:10
> To: xml-dev@l...
> Subject: [xml-dev] Auto-generate a DTD from multiple XML documents?
> 
> Everyone,
> 
> Several of us involved with Distributed Proofreaders and 
> Project Gutenberg are analyzing a number of TEI documents 
> representing PG etexts.
> 
> As part of the analysis, we'd like to find an application 
> which will autogenerate a minimal DTD (or schema) for 
> validating a given set of documents sharing the same root element.
> 
> I have seen several applications which will autogenerate a 
> DTD for a single XML document, but have not yet found one 
> which will generate a DTD representing a set of XML documents.
> 
> Of course, we prefer a "free" application for this one-time 
> job, but if there's a good commercial package, hopefully we 
> can find someone with a copy who'd be happy to volunteer 
> running the job for us.
> 
> Thanks in advance.
> 
> Jon Noring

