Related articles |
---|
Universal Character Names eric.b.lemings@lmco.com (Eric Lemings) (1998-10-10) |
Re: Universal Character Names qjackson@wave.home.com (Quinn Tyler Jackson) (1998-10-13) |
Re: Universal Character Names Brian.Inglis@cadvision.com (1998-10-13) |
Re: Universal Character Names eric.b.lemings@lmco.com (Eric Lemings) (1998-10-17) |
Re: Universal Character Names ok@atlas.otago.ac.nz (Dr Richard A. O'Keefe) (1998-10-17) |
Re: Universal Character Names fjh@cs.mu.OZ.AU (1998-10-22) |
Re: Universal Character Names eggert@twinsun.com (1998-10-30) |
From: | Eric Lemings <eric.b.lemings@lmco.com> |
Newsgroups: | comp.compilers |
Date: | 10 Oct 1998 01:05:03 -0400 |
Organization: | Lockheed Martin Astronautics |
Keywords: | i18n, lex |
Dear readers,
In the new C and C++ language standards, identifiers
may contain universal character names (UCN). The
rule for universal character names is:
universal-character-name:
\u hex-quad
\U hex-quad hex-quad
hex-quad:
hex-digit hex-digit hex-digit hex-digit
Needless to say this makes the old regexp for identifiers:
[a-zA-Z_]+[a-zA-Z0-9_]*
obsolete. How would you modify it to handle UCN?
Thanks,
Eric.
Return to the
comp.compilers page.
Search the
comp.compilers archives again.