Subject: global find and replace certain "special" characters and replicate structure as it is
From: "SANWAL, ABHISHEK (HP-Houston)" <abhishek.sanwal@xxxxxx>
Date: Wed, 8 Sep 2004 00:27:40 -0500
|
I am using Infopath to collect rich text from a word document and for some
weird reason the output I get in the XML has the special B character in it.
I understand that the UTF-8 encoding etcetra causes it to be visible that way.
Also I have the bEUR" character at several locations.
I do not have control over the way Infopath converts this rich text into XHTML
inside mixed custom XML elements.
I do not have control over the source word documents that have enormous
amounts of "spaces" in odd locations.
I am wondering how I can write a convenient global find/replacer that would
essentially go through ANY and EVERY element or text node in the given source
XML(hybrid HTML) and replace the following:
B with (nothing) (eliminate it)
<I don't even want the spaces to appear>
bEUR" with - (dash)
<?xml version="1.0" encoding="UTF-8"?>
....
<font face="Times New Roman">
<strong>Dangerous bEUR" Management Danger </strong>
<span style="FONT-SIZE: 10pt">(former Danger & Is Now)</span>
<strong>B B B B B B B B B B B B B B B B B B B B B B B B B (0/99 bEUR"
Present)</strong>
</font>
</div>
<h1>hello1</h1>
......
<tr>
<td><div style="PADDING-RIGHT: 0in; MARGIN-TOP: 0in; PADDING-LEFT: 0in;
FONT-WEIGHT: normal; FONT-SIZE: 12pt; MARGIN-BOTTOM: 0pt">
<font face="Times New Roman">
<strong>Interesting bEUR" Avenue Of </strong>
<span style="FONT-SIZE: 10pt">(formerly earl & of england)</span>
<strong>B B B B B B B B B B B B B B B B B B B B B B B B B (0/99 bEUR"
Present)</strong>
</font>
</div></td>
<td>two</td>
<td>three</td>
</tr>
....
____________________________________________________________
Abhishek Sanwal
HP - Houston Campus
..........................................................................
|