[Home] [By Thread] [By Date] [Recent Entries]


Elliotte Rusty Harold scripsit:

> > Feed the element content into
> >a tag-soup parser, infer start- and end- tags to turn it into
> >a tree, and strip out all the elements you don't want showing up
> >in the aggregator output.  Took me about two hours to code this up
> >(to be fair, I did use an off-the shelf lexer for the first step).
> 
> If you need to write your own tag soup parser, it ain't XML. That's 
> too much work for a job that shouldn't be necessary in the first 
> place.

Fortunately, Java programmers don't need to write their own tag soup parsers;
I did that.

http://www.ccil.org/~cowan/XML/tagsoup

-- 
It was impossible to inveigle           John Cowan <jcowan@r...>
Georg Wilhelm Friedrich Hegel           http://www.ccil.org/~cowan
Into offering the slightest apology     http://www.reutershealth.com
For his Phenomenology.                      --W. H. Auden, from "People" (1953)

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member