Re: Multibyte lexical analysis

sreeni@csc.albany.edu (Sreenivasa Rao Viswanadha)
9 Aug 1997 20:09:11 -0400

From comp.compilers

Related articles
Multibyte lexical analysis rod@querix.co.uk (Rod Chamberlin) (1997-08-07)
*Re: Multibyte lexical analysis sreeni@csc.albany.edu* (1997-08-09)**
Re: Multibyte lexical analysis ok@cs.rmit.edu.au (1997-08-09)
Re: Multibyte lexical analysis henry@zoo.toronto.edu (Henry Spencer) (1997-08-09)

| List of all articles for this month |

From:	sreeni@csc.albany.edu (Sreenivasa Rao Viswanadha)
Newsgroups:	comp.compilers
Date:	9 Aug 1997 20:09:11 -0400
Organization:	The University at Albany
References:	97-08-011
Keywords:	i18n, lex

Rod Chamberlin <rod@querix.co.uk> wrote:
> [ multibyte lexical analysis ? ]

I wrote the JavaCC (parser and lexer generator for Java) lexical analyzer
generator that generates lexical analyzers for 2 byte (java-style) characters
with very little overhead (< 10% increse inthe class file for Java lexer wehn
doing unicode processing). But it generates NFAs.

If you are just looking for such a tool, get it from :

http://www.suntest.com/JavaCC

If you want to know the algorithm, unfortunately I haven't documented it.
But if you want, we can do it by private email.
--

Post a followup to this message

Return to the comp.compilers page.
Search the comp.compilers archives again.

Re: Multibyte lexical analysis

sreeni@csc.albany.edu (Sreenivasa Rao Viswanadha)9 Aug 1997 20:09:11 -0400

sreeni@csc.albany.edu (Sreenivasa Rao Viswanadha)
9 Aug 1997 20:09:11 -0400