Subject: Re: breaking string into substrings or arrays (XSLT 1.0 solution) tokenize
From: Dimitre Novatchev <dnovatchev@xxxxxxxxx>
Date: Thu, 21 Apr 2005 06:04:47 +1000
On 4/21/05, RahilQ <qamar_rahil@xxxxxxxxxxx> wrote:
> Hi
>
> If I want to recognize a sequence of letters and characters such as
> 'atnnnn' where 'at' are letters and 'nnnn' are numbers between 0-9 what
> would be the best way to write a token for it ?
>
> Would it be 'at0-9' or something else ?
See the other thread in this list discussing exactly this topic.
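
For readers without that thread at hand, here is a minimal XSLT 1.0 sketch of one possible check. It is not necessarily the approach discussed there. It assumes 'atnnnn' means the literal letters 'at' followed by exactly four digits, and it uses a hypothetical <code> element to hold each candidate string. The digit test relies on translate() deleting every digit, so an empty remainder means the tail held digits only.

```xml
<?xml version="1.0"?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <xsl:output method="text"/>

  <!-- <code> is a hypothetical element name used only for illustration -->
  <xsl:template match="/">
    <xsl:for-each select="//code">
      <xsl:variable name="s" select="string(.)"/>
      <xsl:value-of select="$s"/>
      <xsl:text>: </xsl:text>
      <!-- true only for 'at' plus four characters that are all digits -->
      <xsl:value-of select="string-length($s) = 6
                            and starts-with($s, 'at')
                            and translate(substring($s, 3), '0123456789', '') = ''"/>
      <xsl:text>&#xA;</xsl:text>
    </xsl:for-each>
  </xsl:template>
</xsl:stylesheet>
```

If 'at' instead stands for any two letters, the starts-with() test could be replaced by a second translate() over substring($s, 1, 2) with the alphabet as the character list.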
>
> Possibly trivial but could you please explain what the expression '&#xA;'
> in your token below signifies? Is the actual expression this or '#xA;' ?
The actual expression is exactly as given. &#xA; is the character
reference for linefeed.
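
To make the role of the linefeed delimiter concrete, here is a minimal self-contained XSLT 1.0 sketch. It is not the FXSL str-split-to-words template itself, only a plain recursive splitter written for illustration. The template names "split" and "emit", the variable "delims" and the <sentences> wrapper element are all assumptions of this sketch. The delimiter is written as &#xA; because a character reference inside an attribute value survives attribute-value normalization as a real linefeed, whereas a literal line break typed there would be turned into a space.

```xml
<?xml version="1.0"?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <xsl:output method="xml" indent="yes"/>

  <!-- Delimiter set: period, colon, semicolon, comma, linefeed -->
  <xsl:variable name="delims" select="'.:;,&#xA;'"/>

  <xsl:template match="paragraph">
    <!-- <sentences> is a hypothetical wrapper so the output has one root -->
    <sentences>
      <xsl:call-template name="split">
        <xsl:with-param name="text" select="string(.)"/>
      </xsl:call-template>
    </sentences>
  </xsl:template>

  <xsl:template name="split">
    <xsl:param name="text"/>
    <!-- Keep only the delimiter characters of $text, in order of appearance -->
    <xsl:variable name="found"
        select="translate($text, translate($text, $delims, ''), '')"/>
    <xsl:choose>
      <xsl:when test="$found != ''">
        <!-- The first delimiter that occurs in $text -->
        <xsl:variable name="d" select="substring($found, 1, 1)"/>
        <xsl:call-template name="emit">
          <xsl:with-param name="token" select="substring-before($text, $d)"/>
        </xsl:call-template>
        <xsl:call-template name="split">
          <xsl:with-param name="text" select="substring-after($text, $d)"/>
        </xsl:call-template>
      </xsl:when>
      <xsl:otherwise>
        <!-- No delimiter left: the remainder is the last token -->
        <xsl:call-template name="emit">
          <xsl:with-param name="token" select="$text"/>
        </xsl:call-template>
      </xsl:otherwise>
    </xsl:choose>
  </xsl:template>

  <xsl:template name="emit">
    <xsl:param name="token"/>
    <!-- Trim and collapse whitespace, then drop tokens that end up empty -->
    <xsl:variable name="clean" select="normalize-space($token)"/>
    <xsl:if test="$clean != ''">
      <sentence><xsl:value-of select="$clean"/></sentence>
    </xsl:if>
  </xsl:template>
</xsl:stylesheet>
```

Unlike the two-pass approach, this sketch applies normalize-space() to each token as it is emitted, so the empty tokens produced by the leading and trailing linefeeds are skipped rather than output as empty elements.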
Cheers,
Dimitre Novatchev
>
> Thanks
> Rahil
>
>
> Dimitre Novatchev wrote:
>
> >If you set the delimiters to:
> >
> > "'.:;,&#xA;'"
> >
> >then the result is:
> >
> ><word>good morning</word>
> ><word> can you help me</word>
> ><word> alpha </word>
> ><word> alpha beta</word>
> ><word/>
> >
> >If you want, you may additionally normalize-space() in a second pass.
> >
> >
> >Cheers,
> >Dimitre Novatchev
> >
> >
> >
> >On 4/20/05, Ali Choumane <ali.choumane@xxxxxxx> wrote:
> >
> >
> >>Hi
> >>Thanks for your help, but in my project, if I have this entry:
> >><?xml version="1.0"?>
> >><paragraph>
> >>good morning; can you help me: alpha , alpha beta.
> >></paragraph>
> >>
> >>I would like a result such as:
> >><?xml version="1.0"?>
> >><sentence>good morning</sentence>
> >><sentence>can you help me</sentence>
> >><sentence>alpha</sentence>
> >><sentence>alpha beta</sentence>
> >>
> >>but I think the solutions you provided give output such as:
> >><?xml version="1.0"?>
> >><sentence>good</sentence>
> >><sentence>morning</sentence>
> >><sentence>can</sentence>
> >><sentence>you</sentence>
> >><sentence>help</sentence>
> >><sentence>me</sentence>
> >><sentence>alpha</sentence>
> >><sentence>alpha</sentence>
> >><sentence>beta</sentence>
> >>
> >>can you help me?
> >>
> >>
> >>Quoting "Pawson, David" <David.Pawson@xxxxxxxxxxx>:
> >>
> >>
> >>
> >>> -----Original Message-----
> >>> From: Dimitre Novatchev
> >>>
> >>> Please, note that the str-split-to-words template generally
> >>> handles *a
> >>> set* of delimiters, not just one delim character.
> >>>
> >>> Therefore, one can set the pDelimiters parameters say to:
> >>>
> >>> " ,;?\!"
> >>>
> >>> and it will retrieve words that are delimited by any pair of these.
> >>>
> >>>
> >>>which is exactly what was asked for earlier! Nice one.
> >>> I've never needed multiple separators before.
> >>>This guy did.
> >>>regards DaveP
> >>>
> >>Ali CHOUMANE
> >>M2R III ,Univ Joseph fourrier, UFR IMA
> >>WAM/INRIA & GETA/CLIPS
> >>portable. 06 71 55 80 65
> >>Bureau(INRIA). 04 76 61 54 13
> >>Email. ali.choumane@xxxxxxx