Altova Mailing List Archives>Archive Index >microsoft.public.xml Archive Home >Recent entries >Thread Prev - Re: HTML parsing [Thread Next] Re: HTML parsingTo: NULL Date: 3/18/2008 12:22:00 PM "Joe Fawcett" <joefawcett@n...> wrote in message news:E9673FB9-F083-4B05-AC30-9FE92258EAF1@m...... > > > "Sam Hobbs" <samuel@s...> wrote in message > news:uHKEKQ0hIHA.6092@T...... >> I think the XMLTV project does that; are you familiar with it? >> >> >> "Carmen Sei" <fatwallet951@y...> wrote in message >> news:kpsot39lpeujctq3jr67n0ovssak7un9fd@4...... >>> I need to parse the following HTML page and extract TV listing data >>> using VC++ >>> >>> http://tvlistings.zap2it.com/tvlistings/ZCGrid.do >>> >>> any good way to extract the data? >>> >>> is easy for VC++ to call PERL script and do some regular expression? >>> >>> since the HTML page is not XML well formed, I cannot use a XML parser >>> right? >>> >>> any other good ways to extract HTML page data? >>> >> >> >> > You can also use tools such as HtmlTidy that will convert to XHTML which > can be parsed by standard XML means. Note that you are replying to me but I did not ask. | ||||||
| Company | Legal | Press | Partners | Careers | Sitemap | Contact Us | Altova Blog | Mobile | Full Site | |||
|
