[Home] [By Thread] [By Date] [Recent Entries]

  • From: "Costello, Roger L." <costello@m...>
  • To: "xml-dev@l..." <xml-dev@l...>
  • Date: Mon, 1 Oct 2012 14:47:15 +0000

David Carlisle wrote:

>> 1. Why does XML not support many of the US-ASCII characters?
>>
> Because it's a textual format and code points like
> U+0017 END OF TRANSMISSION BLOCK
> don't really belong in text.

Not everyone agrees with that point of view. For example, RFC 5322 [1], Internet Message Format, is for *text* document and it says:

      This document specifies the Internet Message Format (IMF), 
      a syntax for *text* messages that are sent between computer 
      users, within the framework of "electronic mail" messages.

      This document specifies a syntax only for *text* messages.  In
      particular, it makes no provision for the transmission of images,
      audio, or other sorts of structured data in electronic mail messages.

      A message that is conformant with this specification is composed of
      characters with values in the range of 1 through 127 and interpreted
      as US-ASCII [ANSI.X3-4.1986] characters.

So, according to this RFC "U+0017 END OF TRANSMISSION BLOCK" does belong in text (as do all the other 27 US-ASCII characters that XML does not support).

/Roger


[1] http://tools.ietf.org/html/rfc5322


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member