Related articles |
---|
Multibyte lexical analysis rod@querix.co.uk (Rod Chamberlin) (1997-08-07) |
Re: Multibyte lexical analysis sreeni@csc.albany.edu (1997-08-09) |
Re: Multibyte lexical analysis ok@cs.rmit.edu.au (1997-08-09) |
Re: Multibyte lexical analysis henry@zoo.toronto.edu (Henry Spencer) (1997-08-09) |
From: | sreeni@csc.albany.edu (Sreenivasa Rao Viswanadha) |
Newsgroups: | comp.compilers |
Date: | 9 Aug 1997 20:09:11 -0400 |
Organization: | The University at Albany |
References: | 97-08-011 |
Keywords: | i18n, lex |
Rod Chamberlin <rod@querix.co.uk> wrote:
> [ multibyte lexical analysis ? ]
I wrote the lexical analyzer generator in JavaCC (a parser and lexer
generator for Java). It generates lexical analyzers for 2-byte (Java-style)
characters with very little overhead (< 10% increase in the class file size
of the Java lexer when doing Unicode processing). Note that it generates NFAs.
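
For illustration only: the JavaCC algorithm is not documented here, and the
following is not it. It is a minimal sketch of one common way to run an NFA
directly over 16-bit Java chars, assuming transitions are labelled with
character ranges (rather than indexing a table by byte value) and the set of
live states is tracked as input is consumed. The class UnicodeNfaSketch, its
methods, and the identifier ranges chosen are all hypothetical.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;

// Hypothetical sketch: NFA simulation over 16-bit chars with range-labelled
// transitions. Not taken from JavaCC.
final class UnicodeNfaSketch {
    /** A transition from -> to, taken when lo <= c <= hi. */
    static final class Edge {
        final int from, to;
        final char lo, hi;
        Edge(int from, char lo, char hi, int to) {
            this.from = from; this.lo = lo; this.hi = hi; this.to = to;
        }
    }

    private final List<Edge> edges = new ArrayList<>();
    private final BitSet accepting = new BitSet();

    void addEdge(int from, char lo, char hi, int to) {
        edges.add(new Edge(from, lo, hi, to));
    }

    void accept(int state) {
        accepting.set(state);
    }

    /**
     * Length of the longest accepted prefix of input beginning at start,
     * or -1 if no prefix is accepted. State 0 is the start state.
     */
    int longestMatch(CharSequence input, int start) {
        BitSet current = new BitSet();
        current.set(0);
        int best = accepting.get(0) ? 0 : -1;
        for (int i = start; i < input.length() && !current.isEmpty(); i++) {
            char c = input.charAt(i);
            BitSet next = new BitSet();
            for (Edge e : edges) {
                // Range comparison works for any 16-bit char value.
                if (current.get(e.from) && e.lo <= c && c <= e.hi) {
                    next.set(e.to);
                }
            }
            current = next;
            if (current.intersects(accepting)) {
                best = i - start + 1;
            }
        }
        return best;
    }

    /** Example automaton: a letter followed by letters or digits. */
    static UnicodeNfaSketch identifierNfa() {
        UnicodeNfaSketch nfa = new UnicodeNfaSketch();
        // Illustrative letter ranges only, not a complete Unicode letter set.
        char[][] letters = {
            {'a', 'z'}, {'A', 'Z'},
            {'\u03B1', '\u03C9'},   // Greek small alpha..omega
            {'\u4E00', '\u9FA5'}    // CJK unified ideographs
        };
        for (char[] r : letters) {
            nfa.addEdge(0, r[0], r[1], 1);
            nfa.addEdge(1, r[0], r[1], 1);
        }
        nfa.addEdge(1, '0', '9', 1);
        nfa.accept(1);
        return nfa;
    }
}
```

With this sketch, UnicodeNfaSketch.identifierNfa().longestMatch(s, 0) returns
the length of the identifier at the start of s. Storing ranges keeps the
transition data small even though the alphabet has 65536 symbols, at the cost
of a linear scan over the edges for each input character.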
If you are just looking for such a tool, you can get it from:
http://www.suntest.com/JavaCC
Unfortunately, I haven't documented the algorithm. If you want to know how
it works, we can discuss it by private email.