Subject: Re: fault tolerant saxon:parse()
From: David Carlisle <davidc@xxxxxxxxx>
Date: Mon, 17 Nov 2008 11:29:39 GMT
|
> I'm wondering if there's a standard approach for a fault tolerant
> saxon:parse() (or alternative equivalent)
personally I've used tagsoup and htmplparse.xsl, but parhaps the nearest
to a standard these days is http://about.validator.nu/ which implements
the HTML5 parsing algorithm in Java and exposes (so I'm told) sax and
DOM interfaces as if it were reading XML.
David
________________________________________________________________________
The Numerical Algorithms Group Ltd is a company registered in England
and Wales with company number 1249803. The registered office is:
Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom.
This e-mail has been scanned for all viruses by Star. The service is
powered by MessageLabs.
________________________________________________________________________
|