Re: Parsing postal addresses

John Lindsay <lindsay_j@rmc.ca>
16 Oct 1997 00:19:59 -0400

          From comp.compilers

Related articles
Parsing postal addresses brians0@aol.com (1997-10-14)
Re: Parsing postal addresses lindsay_j@rmc.ca (John Lindsay) (1997-10-16)
Re: Parsing postal addresses cfc@world.std.com (1997-10-16)
Re: Parsing postal addresses dweller@news.imagin.net (1997-10-17)
Re: Parsing postal addresses henry@zoo.toronto.edu (Henry Spencer) (1997-10-19)
Re: Parsing postal addresses mac@coos.dartmouth.edu (1997-10-21)
Re: Parsing postal addresses ct7@mitre.org (W. Craig Trader) (1997-10-26)
| List of all articles for this month |
From: John Lindsay <lindsay_j@rmc.ca>
Newsgroups: comp.compilers
Date: 16 Oct 1997 00:19:59 -0400
Organization: Royal Military College
References: 97-10-067
Keywords: parse

Brians0 wrote:
>
> I am looking for information on parsing postal addresses (especially
> US addresses). Specifically, I am looking for code and/or libraries to
> standardize addresses (in accordance with USPS rules). Examples in
> JavaCC, Lex/Yacc, and PCCTS or pointers to libraries that do this
> would be best, but any help would be much appreciated.


Seriously, in the face of a probable mess of a problem like this,
consider SNOBOL4, and in particular the SPITBOL compiler for SNOBOL4.
This system and its cousins have bailed out more than one programmer
in the face of nests of various incongruous data formats. The
language is now very well matured, and has an active following and a
net discussion group. It's both a character string handling - pattern
matching and a data structures handling language.


The SPITBOL compiler's < $300.00 U.S.; get it from CatsPaw for several
PC OS's, and other sources exist for other platforms. It comes with
its own full documentation, and more is available from CatsPaw.


An alternate is Icon -- it's free and exists for many platforms. Get
it by FTP from the University of Arizona. While Icon is more recent,
for pattern matching it's a bit more demanding to code than SNOBOL4 in
my experience.
--
All the best !


John H. Lindsay, Assistant Professor,
DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE
ROYAL MILITARY COLLEGE OF CANADA
PO BOX 17000 STN FORCES
KINGSTON ON K7K7B4 CANADA


e-mail: Lindsay_J@RMC.CA
Phone: (613) 541-5010-6419
Fax: (613) 542-8129
[I'd also consider perl. Its pattern matching isn't as powerful as
Snobol4's, but it makes up for it with decent control structures and a
high quality free implementation. -John]




--


Post a followup to this message

Return to the comp.compilers page.
Search the comp.compilers archives again.