[Home] [By Thread] [By Date] [Recent Entries]
In 2.4, the holy script says: "Text consists of intermingled character data and markup. Markup takes the form of start-tags, end-tags, empty-element tags, entity references, character references, comments, CDATA section delimiters, document type declarations, processing instructions, XML declarations, text declarations, and any white space that is at the top level of the document entity (that is, outside the document element and not inside any other markup)." Then it says: "All text that is not markup constitutes the character data of the document." As an exercise, I tried to make a similar list for character data. Here's what I came up with: - Element content - Attribut values - CDATA content - Whitespace within the root element Did I miss anything? I wonder about entities. Is the literal entity value character data too? I guess not, because all entity declarations ought to be considered part of the document type declaration mentioned in 2.4. A sentence like "everything outside the document element is considered part of the markup" would have been helpful, assuming this is correct of course. Gustaf
|

Cart



