[Home] [By Thread] [By Date] [Recent Entries]
Badrul Anuar wrote:
I tried to capture some information and data from online XHTML file using XSL. But there are few error before the file can be processed. Maybe because of the html file. So i try to clean the xhtml file and still face the problem. Currently, I'm using Tidy, but still face same problem. Does anybody know how to clean any XHTML file or to convert into XML. If I have the XML, the it would be easier for me to use XSL. Saxon can be used together with TagSoup (http://home.ccil.org/~cowan/XML/tagsoup/), that can deal with the HTML tag soup you find on the web. The page that I try to convert is http://tools.cisco.com/security/center/viewAlert.x?alertId=18261 However that document seems to have a complete XHTML document in the body of the outer document so you will have to try how TagSoup deals with that. -- Martin Honnen http://msmvps.com/blogs/martin_honnen/
|

Cart



