Re: Looking for existing research on a variation of CFGs

Antti Huima <huima@localhost.localdomain>
23 Feb 2001 00:08:24 -0500

          From comp.compilers

Related articles
Looking for existing research on a variation of CFGs tperry@stanford.edu (Todd Curtis Perry) (2001-02-17)
Re: Looking for existing research on a variation of CFGs torbenm@diku.dk (2001-02-23)
Re: Looking for existing research on a variation of CFGs huima@localhost.localdomain (Antti Huima) (2001-02-23)
| List of all articles for this month |

From: Antti Huima <huima@localhost.localdomain>
Newsgroups: comp.compilers
Date: 23 Feb 2001 00:08:24 -0500
Organization: KPNQwest Finland customer news service
References: 01-02-098
Keywords: parse, theory
Posted-Date: 23 Feb 2001 00:08:24 EST

Todd Curtis Perry <tperry@stanford.edu> writes:

> Could someone point me towards research that has done on the properities
> of grammars made up of productions of the form A -> B where _both_ A and B
> are strings of terminals or non-terminals.

> [Those are context sensitive grammars. Look in books about compiler theory
> and linguistics. -John]

If X, Y, ... range over non-terminals and a, b, ... range over
terminals, and S ranges over all symbols, then

(1) If every production is of the form

        X -> aY or X -> Y

        the grammar is `regular'.

(2) If every production is of the form

        X -> S+

        the grammar is `positive context-free'. (`Positive' corresponds
        to the requirement that the right-hand side of a production is not
        allowed to be empty.)

(3) If every production is of the general form

        S* -> S*

        but so that the length of the right-hand side is not smaller than
        that of the left-hand side, then the grammar is `context
        sensitive'.

(4) Otherwise, if every production is of the general form

        S* -> S*

        without restrictions, the grammar is a `phrase structure grammar'
        and the language can be whatever accepted by some algorithm.

The corresponding languages are

    (1) regular languages
    (2) context-free languages
    (3) context-sensitive languages
    (4) recursively enumerable languages

and correspond to the types 3, 2, 1 and 0 in what is known the Chomsky
Hierarchy.

The corresponding automata classes are

    (1) finite-state automata
    (2) finite-state push-down automata
    (3) linear bounded automata (non-deterministic Turing machines with
            a linear bound on the amount of tape to be used)
    (4) Turing machines

Thus, Todd's question is a bit too general, as there are different
classes of grammars that all have strings of symbols on the both sides
of a production. Parsing is decidable for context-sensitive grammars
but not for phrase structure grammars, and both grammars fit Todd's
description.

My two cents...

--
Antti Huima


Post a followup to this message

Return to the comp.compilers page.
Search the comp.compilers archives again.