Unfortunately the "Mutual Termination for Patent Action" makes tagsoup a time-bomb. There's no way I could possibly use this.

-----Original Message-----
From: John Cowan [mailto:cowan@m...]
Sent: Wednesday, July 23, 2003 4:41 AM
To: Elliotte Rusty Harold
Cc: xml-dev@l...
Subject: Re: rss regularis(z)ation

Elliotte Rusty Harold scripsit:

> > Feed the element content into
> >a tag-soup parser, infer start- and end- tags to turn it into
> >a tree, and strip out all the elements you don't want showing up
> >in the aggregator output. Took me about two hours to code this up
> >(to be fair, I did use an off-the shelf lexer for the first step).
>
> If you need to write your own tag soup parser, it ain't XML. That's
> too much work for a job that shouldn't be necessary in the first
> place.

Fortunately, Java programmers don't need to write their own tag soup
parsers; I did that. http://www.ccil.org/~cowan/XML/tagsoup

--
It was impossible to inveigle           John Cowan <jcowan@r...>
Georg Wilhelm Friedrich Hegel           http://www.ccil.org/~cowan
Into offering the slightest apology     http://www.reutershealth.com
For his Phenomenology.                  --W. H. Auden, from "People" (1953)

-----------------------------------------------------------------
The xml-dev list is sponsored by XML.org <http://www.xml.org>, an
initiative of OASIS <http://www.oasis-open.org>

The list archives are at http://lists.xml.org/archives/xml-dev/

To subscribe or unsubscribe from this list use the subscription
manager: <http://lists.xml.org/ob/adm.pl>