|Regular Expressions email@example.com (2004-10-09)|
|Re: Regular Expressions firstname.lastname@example.org (Eric Bodden) (2004-10-12)|
|Re: Regular Expressions email@example.com (Randall Hyde) (2004-10-12)|
|Re: Regular Expressions firstname.lastname@example.org (Sylvain Schmitz) (2004-10-12)|
|Re: Regular Expressions Martin.Ward@durham.ac.uk (Martin Ward) (2004-10-12)|
|Re: Regular Expressions email@example.com (2004-10-12)|
|Re: Regular Expressions firstname.lastname@example.org (David Z Maze) (2004-10-12)|
|Re: Regular Expressions Martin.Ward@durham.ac.uk (Martin Ward) (2004-10-17)|
|Re: Regular Expressions email@example.com (ChokSheak Lau) (2004-10-21)|
|[8 later articles]|
|From:||"Randall Hyde" <firstname.lastname@example.org>|
|Date:||12 Oct 2004 00:51:07 -0400|
|Organization:||EarthLink Inc. -- http://www.EarthLink.net|
|Posted-Date:||12 Oct 2004 00:51:07 EDT|
"Mark" <email@example.com> wrote in message
> I just can't seem to figure out how to invent a regular expression
> that will strip all HTML tags (except TABLE tags) out of a string and
> leave the rest of the text. When a TABLE tag is encountered i need to
> strip everything under it.
> This will strip all HTML out <[^>]*>
> But how do I make it also strip entire TABLE elements?
> Perhaps something like <table[^</table>]*</table>|<[^>]*>
> [That seems awfully complex for a single regex. -John]
Indeed, in general this requires a context-free grammar. I don't
understand the OP's exact problem well enough to determine if you can
get by with a regex.
Return to the
Search the comp.compilers archives again.