Related articles |
---|
re2c-1.0 released! skvadrik@gmail.com (Ulya Trofimovich) (2017-08-27) |
Re: re2c-1.0 released! 398-816-0742@kylheku.com (Kaz Kylheku) (2017-09-02) |
Re: re2c-1.0 released! 398-816-0742@kylheku.com (Kaz Kylheku) (2017-09-02) |
Re: re2c-1.0 released! anton@mips.complang.tuwien.ac.at (2017-09-02) |
Re: re2c-1.0 released! gneuner2@comcast.net (George Neuner) (2017-09-02) |
Re: re2c-1.0 released! skvadrik@gmail.com (Ulya Trofimovich) (2017-09-03) |
Re: re2c-1.0 released! jamin.hanson@googlemail.com (Ben Hanson) (2017-09-03) |
Re: re2c-1.0 released! jamin.hanson@googlemail.com (Ben Hanson) (2017-09-03) |
[5 later articles] |
Reroute: | compilers@archive.iecc.com |
From: | Kaz Kylheku <398-816-0742@kylheku.com> |
Newsgroups: | comp.compilers |
Date: | Sat, 2 Sep 2017 11:18:29 -0400 (EDT) |
Organization: | Aioe.org NNTP Server |
References: | 17-08-007 |
Injection-Info: | miucha.iecc.com; posting-host="www.ucenet.org:2001:470:1f07:1126:0:7563:656e:6574"; logging-data="31337"; mail-complaints-to="abuse@iecc.com" |
Keywords: | lex |
Posted-Date: | 02 Sep 2017 11:18:29 EDT |
On 2017-08-27, Ulya Trofimovich <skvadrik@gmail.com> wrote:
> Hello everybody,
>
>
> I'm glad to announce the new major release of re2c lexer generator, 1.0.
> re2c specializes in generating fast and flexible lexers; it was written
> by Peter Bumbulis around 1994 [0] and since then maintained by many
> people. See release notes for the full story [1].
>
> The main breakthrough of this release is the addition of fast submatch
> extraction: re2c now supports capturing groups with POSIX longest-match
> semantics, as well as standalone capturing "tags" with leftmost greedy
> semantics. The implementation is based on the efficient novel algorithm [2].
>
> The challenge of adding submatch extraction to a lexer generator is not
> immediately obvious: this feature is provided by many regular expression
Neither is obvious the advantage/utility of this.
> libraries and command-line tools like grep and sed.
They exist in support of very flimsy, ad hoc approaches for
dealing with some text processing problems.
Briefly, why would you do some hacky regex thing in lex with \1, \2,
\3, when in the level immediately above yylex() you have proper phrase
recognition, with $1, $2, $3.
(What is that, if not a better grade of backreferencing.)
> It may seem that
> only a lack of effort prevents developers of lexer generators like Flex
> from implementing it (as well as fixing the ever-broken trailing
> contexts [3]).
Lack of justification due to, I suspect, lack of demand from
sophisticated users of the tools, who use them for their intended
purpose.
Return to the
comp.compilers page.
Search the
comp.compilers archives again.