Home. 
.

transparent

transparent

transparent

Altova Mailing List Archives


Re: Get untagged text between elements

From: "Dimitre Novatchev" <dimitren@---.---.-->
To: NULL
Date: 2/4/2007 7:57:00 AM

> I agree that this is ugly XML but it get parsed witout errors. The
> untaged text varies. It may be also numbers for chapters etc. I also
> thougt about the text() function, but it doesn' seem to help here

Francois,

Peter FlynnYou gave a good idea about a possible solution.

The fact that you don't understand what he's suggesting and are actually 
rejecting a solution is entirely your problem. It is difficult to get help 
in this way. And, of course, one needs to read a litlle bit more about XPath 
in order to understand what solutions people propose; Hint: text() is not a 
"function", nor is it "output function".

Cheers,
Dimitre Novatchev




"FMAS" <massion@g...> wrote in message 
news:1170578863.267061.54470@k......
On 4 Feb., 00:03, Peter Flynn <peter.n...@m.silmaril.ie> wrote:
> FMASwrote:
> > I am transforming XML files which sometimes have untaged text between
> > elements.
>
> > It looks like this:
>
> > <ut Style="external" DisplayText="bb">&lt;bb &quot;Document Numbering
> > Formats&quot;&gt;</ut>
> > <ut Style="external" DisplayText="bb">&lt;bb &quot;Custom Footnote
> > Numbering&quot;&gt;</ut>*??
> > <ut Style="external" DisplayText="bb">&lt;bb &quot;Volume
> > Numbering&quot;&gt;</ut>
>
> > No problem to transform the content of tags. But I do not find a way
> > to copy untagged text such as "*??" in the example above.
>
> > Any suggestion?
>
> a) This looks like rather poor design by someone who really
> hasn't grasped the idea of XML properly.
>
> b) It also appears that they are trying to hide data marked in
> another language within the XML.
>
> c) Use the text() function to access character data in Mixed Content
> (which is what this resembles).
>
> ///Peter
> --
> XML FAQ:http://xml.silmaril.ie/- Zitierten Text ausblenden -
>
> - Zitierten Text anzeigen -

I agree that this is ugly XML but it get parsed witout errors. The
untaged text varies. It may be also numbers for chapters etc. I also
thougt about the text() function, but it doesn' seem to help here as
we are talking about input and not output. I have a structure like
this:

<tag>text I can get</tag>text I can't get<tag>text2 I can get</tag>

I wonder if I have to preprocess the XML file, maybe with perl. But if
it can be avoided so muc the better.

Francois





transparent
Print
Mail
Digg
delicious
Disclaimer
.

These Archives are provided for informational purposes only and have been generated directly from the Altova mailing list archive system and are comprised of the lists set forth on www.altova.com/list/index.html. Therefore, Altova does not warrant or guarantee the accuracy, reliability, completeness, usefulness, non-infringement of intellectual property rights, or quality of any content on the Altova Mailing List Archive(s), regardless of who originates that content. You expressly understand and agree that you bear all risks associated with using or relying on that content. Altova will not be liable or responsible in any way for any content posted including, but not limited to, any errors or omissions in content, or for any losses or damage of any kind incurred as a result of the use of or reliance on any content. This disclaimer and limitation on liability is in addition to the disclaimers and limitations contained in the Website Terms of Use and elsewhere on the site.

.
.

transparent

transparent