|Natural Language Parser email@example.com (Seima Rao) (2015-09-29)|
|Re: Natural Language Parser firstname.lastname@example.org (George Neuner) (2015-09-29)|
|Re: Natural Language Parser email@example.com (2015-09-30)|
|Re: unnatural natural language, was Natural Language Parser firstname.lastname@example.org (George Neuner) (2015-10-01)|
|From:||George Neuner <email@example.com>|
|Date:||Thu, 01 Oct 2015 12:41:34 -0400|
|Organization:||A noiseless patient Spider|
|References:||15-09-025 15-09-028 15-09-032|
|Keywords:||parse, administrivia, comment|
|Posted-Date:||01 Oct 2015 21:57:33 EDT|
>| > I am looking for a C++ API based English Language Parser. ....
>| You need to be aware that natural languages *can't* be parsed
>| without semantics - i.e. without considering "parts of speech".
>Exactly. Consider the classical "Buffalo" example:
> "Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo"
My family is from Buffalo.
That particular example actually gets even more perverse than the
article indicates. The staff at the Buffalo Zoo claim that their
animals have a unique method of harassing each other which they call
the "Buffalo buffalo".
If the animals really do "Buffalo buffalo" each other, then it follows
that "Buffalo buffalo Buffalo buffalo Buffalo buffalo Buffalo buffalo
Buffalo buffalo". [That's 10 "buffalos" if you are counting.]
1) The animals in the Buffalo Zoo actually are bison.
2) Bison are related to buffalo, but are a different species.
3) Natives pronounce the city's name as "buf4 lo",
accent on the 1st syllable and without any 'a' sound.
There are competing theories regarding the evolution
of this pronunciation: one claim that it is a mangled
Seneca Indian name, and two different claims that it
is mangled French.
[I think it's time to get back to computer languages, perhaps after we
discuss the derivation of beef on weck. -John]
Return to the
Search the comp.compilers archives again.