Regular Expressions

m_j_mather@yahoo.com.au (Mark)
9 Oct 2004 22:34:37 -0400

From comp.compilers


From:	m_j_mather@yahoo.com.au (Mark)
Newsgroups:	comp.compilers
Date:	9 Oct 2004 22:34:37 -0400
Organization:	http://groups.google.com
Keywords:	lex, comment
Posted-Date:	09 Oct 2004 22:34:37 EDT

Hi everyone

I can't figure out how to write a regular expression that will strip
all HTML tags out of a string and leave the rest of the text, with one
exception: when a TABLE tag is encountered, I need to strip the whole
element, including everything inside it.

This will strip out all HTML tags: <[^>]*>

But how do I make it also strip entire TABLE elements?

Perhaps something like <table[^</table>]*</table>|<[^>]*>

Thanks,
Mark
[That seems awfully complex for a single regex. -John]
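
One point about the attempt above: [^</table>] is a character class, so it excludes the individual characters <, /, t, a, b, l, e and >, not the string "</table>". A possible alternative, sketched here in Python's re module (the post does not name a regex flavor, so that choice is an assumption, as is the helper name strip_html), is to match a whole TABLE element with a non-greedy wildcard and list that branch first in the alternation. This is only a sketch: it does not handle nested tables, or a ">" inside an attribute value.

```python
import re

# Assumption: Python's "re" flavor; the original post does not say which
# regex engine is in use. strip_html is a hypothetical helper name.
# The TABLE branch comes first so that a whole <table>...</table> element
# is consumed before the generic tag branch can match its opening tag alone.
STRIP = re.compile(
    r"<table\b[^>]*>.*?</table\s*>"  # an entire TABLE element, non-greedy
    r"|<[^>]*>",                     # any other single tag
    re.IGNORECASE | re.DOTALL,       # match <TABLE> too, and let . span newlines
)

def strip_html(text: str) -> str:
    """Remove TABLE elements entirely and every other tag, keeping the text."""
    return STRIP.sub("", text)

if __name__ == "__main__":
    sample = "<p>Hello</p><table><tr><td>x</td></tr></table> <b>world</b>"
    print(strip_html(sample))
```

With nested tables the non-greedy match stops at the first closing tag, so text between the inner and outer </table> would be left behind; handling that case generally needs a real HTML parser rather than a single regex.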
