Re: PDF grammar and PostScript grammar

derekn@foolabs.com (Derek B. Noonburg)
16 Feb 2002 01:13:36 -0500

From comp.compilers

Related articles
PDF grammar and PostScript grammar p11111@kr.onet.pl (psobkiew) (2002-01-30)
Re: PDF grammar and PostScript grammar adamo@dblab.ece.ntua.gr (2002-02-06)
Re: PDF grammar and PostScript grammar dmaze@mit.edu (David Z Maze) (2002-02-06)
*Re: PDF grammar and PostScript grammar derekn@foolabs.com* (2002-02-16)**
Re: PDF grammar and PostScript grammar yves.deweerdt@softhome.net (2002-02-28)

| List of all articles for this month |

From:	derekn@foolabs.com (Derek B. Noonburg)
Newsgroups:	comp.compilers
Date:	16 Feb 2002 01:13:36 -0500
Organization:	Prodigy Internet http://www.prodigy.com
References:	02-01-171 02-02-009
Keywords:	parse
Posted-Date:	16 Feb 2002 01:13:36 EST

> [PDF is basically tarted up Postscript, and Postscript has a trivial
> token stack syntax like that of Forth. -John]

A PDF page content stream is simplified PostScript -- no control flow,
no real stack. It's a sequence of operations, where each operation is
zero or more operands followed by an operator, e.g., "10 20 m 100 200
l" means move to the point (10, 20), and then draw a line to (100,
200). Each operator completely consumes its operands and leaves
nothing on the stack (unlike Forth and PostScript).

PDF files are more complex. A PDF file consists of a sequence of
numbered objects. Examples of objects are fonts, images, hyperlinks,
page content streams, and lots more. There's a cross-reference
("xref") table at the end of the file that maps object number to
position in the file (byte offset from the beginning of the file).
It's actually even messier - a file can be "updated": you tack some
more objects on the end, some of which can logically replace existing
objects, and then append a new xref table with offsets for the new
objects and a pointer to the previous xref table.

PDF really isn't something you want to attack with lex and yacc.

- Derek

Post a followup to this message

Return to the comp.compilers page.
Search the comp.compilers archives again.

Re: PDF grammar and PostScript grammar

derekn@foolabs.com (Derek B. Noonburg)16 Feb 2002 01:13:36 -0500

derekn@foolabs.com (Derek B. Noonburg)
16 Feb 2002 01:13:36 -0500