[Home] [By Thread] [By Date] [Recent Entries]
On 21.12.2010 00:05, Syd Bauman wrote:
Watching this conversation, I found myself interested in how many experts say there is rarely a need to match text nodes. I do that all the time. Some examples include: I'd like to second that (didnt dare until you did), and add advanced whitespace normalization to the list: converting <b>term </b> or <i> term</i> (you see something like that in office manuscripts frequently) into <supersemantic>term</supersemantic>. The whitespace mustnt be normalized away, but it has to be pulled out of the marked-up term into the surrounding text. Glad you posted that, really. -Gerrit * transforming straight quotes to curly quotes * conditionally changing "-" to en-dash or em-dash * convert PUA characters to<tei:g> elements * deal with soft hyphens * ditching the punctuation that follows a<list> * finding the character before a footnote
|

Cart



