Subject: Re: java Regex call
From: Jeni Tennison <jeni@xxxxxxxxxxxxxxxx>
Date: Thu, 10 Jul 2003 09:51:24 +0100
|
Hi John,
> Hi, Does anyone have an example of using the java.util.regex
> functions to return the "components" of the regex that matched.
[snip]
> Also, I'm using the Saxon 7.6 processor.
If you're using Saxon 7.6, why not use the built-in regular expression
support rather than Java extensions?
> Example: if my regex is defined as:-
>
> (([^_]*)_PARA)|((.*?)(PARA)(.*?))
>
> and my input is "ABC_PARA"
>
> Then I need to know what portions of the input matched
> each (if any) part of the regex groups, ([^_]*) and (.*?) etc,.
> in terms of group number and matching string.
If you use <xsl:analyze-string> then within the
<xsl:matching-substring>, the regex-group() function gives you access
to the values of the regex groups. Try:
<xsl:for-each select="$rule//Type">
<xsl:variable name="tdlType" select="." />
<xsl:analyze-string select="." regex="{$regex}">
<xsl:matching-substring>
<RULE_MATCH>
<xsl:value-of select="$rule//@name" />
</RULE_MATCH>
<REGEX><xsl:copy-of select="$regex" /></REGEX>
<GROUP1><xsl:value-of select="regex-group(1)" /></GROUP1>
<GROUP2><xsl:value-of select="regex-group(2)" /></GROUP2>
<GROUP3><xsl:value-of select="regex-group(3)" /></GROUP3>
...
</xsl:matching-substring>
</xsl:analyze-string>
</xsl:for-each>
Of course you need to know what regular expression is being used to be
able to tell which regex groups you're actually interested in...
By the way, I'm not sure why you're using "//" in $rule//@name -- this
will return all the name attributes of all the elements that are
descendants of $rule, as well as of $rule itself, and return them in a
whitespace-separated list (this being XSLT 2.0). Is that what you
really want?
Cheers,
Jeni
---
Jeni Tennison
http://www.jenitennison.com/
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
| Current Thread |
- java Regex call
- John Sharp - Wed, 9 Jul 2003 18:47:38 -0400 (EDT)
- Jeni Tennison - Thu, 10 Jul 2003 04:51:38 -0400 (EDT) <=
- John Sharp - Thu, 10 Jul 2003 10:21:34 -0400 (EDT)
- Michael Kay - Thu, 10 Jul 2003 06:34:07 -0400 (EDT)
- <Possible follow-ups>
- David . Pawson - Thu, 10 Jul 2003 06:32:45 -0400 (EDT)
|
|