Altova Mailing List Archives>Archive Index >microsoft.public.xml Archive Home >Recent entries >Thread Prev - Re: HTML parsing >Thread Next - Re: HTML parsing Re: HTML parsingTo: NULL Date: 3/16/2008 11:23:00 AM "Sam Hobbs" <samuel@s...> wrote in message news:uHKEKQ0hIHA.6092@T...... > I think the XMLTV project does that; are you familiar with it? > > > "Carmen Sei" <fatwallet951@y...> wrote in message > news:kpsot39lpeujctq3jr67n0ovssak7un9fd@4...... >> I need to parse the following HTML page and extract TV listing data >> using VC++ >> >> http://tvlistings.zap2it.com/tvlistings/ZCGrid.do >> >> any good way to extract the data? >> >> is easy for VC++ to call PERL script and do some regular expression? >> >> since the HTML page is not XML well formed, I cannot use a XML parser >> right? >> >> any other good ways to extract HTML page data? >> > > > You can also use tools such as HtmlTidy that will convert to XHTML which can be parsed by standard XML means. -- Joe Fawcett (MVP - XML) http://joe.fawcett.name | ||||||
| Company | Legal | Press | Partners | Careers | Sitemap | Contact Us | Altova Blog | Mobile | Full Site | |||
|
