[Home] [By Thread] [By Date] [Recent Entries]



Hello -

Somewhere in the expat docs this list is given as a place to
ask questions. I've looked on the web for an answer
and have come up empty, so I hope it's OK to ask here.

What I want to do is use expat to read HTML 4.01. I have
no problem with the requirement of the input being
well formed. But, to do this I have to read the standard
"loose" or "strict" DTD. 

What I've tried:

XML_SetParamEntityParsing(parser, XML_PARAM_ENTITY_PARSING_ALWAYS);
XML_SetExternalEntityRefHandler(parser,ext_ref_handler);

and then in ext_ref_handler, create a sub_parser:

XML_ExternalEntityParserCreate(parser,context,encoding)

and feed the DTD file to it.

The problem:

It chokes on the first non-comment line (an ENTITY) in loose.dtd
from w3.org with a syntax error. It doesn't seem to like the "--"
as a comment delimiter.

Should I not use loose.dtd? I kinda assumed it is a valid DTD...

Are there any examples/samples of parsing a DTD? I found lots of
descriptions of how to do it (which seems to be what I did) but
no code.

This is with version expat-1.95.8 

Thanks,

Rick

rickc
rdrop.com



Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member