编译器/解释器

开发平台：
Others

antlr.1：源码内容
							.TH ANTLR 1 "September 1995" "ANTLR" "PCCTS Manual Pages"
.SH NAME
antlr - ANother Tool for Language Recognition
.SH SYNTAX
.LP
fBantlrfR [fIoptionsfR] fIgrammar_filesfR
.SH DESCRIPTION
.PP
fIAntlrfP converts an extended form of context-free grammar into a
set of C functions which directly implement an efficient form of
deterministic recursive-descent LL(k) parser.  Context-free grammars
may be augmented with predicates to allow semantics to influence
parsing; this allows a form of context-sensitive parsing.  Selective
backtracking is also available to handle non-LL(k) and even
non-LALR(k) constructs.  fIAntlrfP also produces a definition of a
lexer which can be automatically converted into C code for a DFA-based
lexer by fIdlgfR.  Hence, fIantlrfR serves a function much like
that of fIyaccfR, however, it is notably more flexible and is more
integrated with a lexer generator (fIantlrfR directly generates
fIdlgfR code, whereas fIyaccfR and fIlexfR are given independent
descriptions).  Unlike fIyaccfR which accepts LALR(1) grammars,
fIantlrfR accepts LL(k) grammars in an extended BNF notation (em
which eliminates the need for precedence rules.
.PP
Like fIyaccfR grammars, fIantlrfR grammars can use
automatically-maintained symbol attribute values referenced as dollar
variables.  Further, because fIantlrfR generates top-down parsers,
arbitrary values may be inherited from parent rules (passed like
function parameters).  fIAntlrfP also has a mechanism for creating
and manipulating abstract-syntax-trees.
.PP
There are various other niceties in fIantlrfR, including the ability to
spread one grammar over multiple files or even multiple grammars in a single
file, the ability to generate a version of the grammar with actions stripped
out (for documentation purposes), and lots more.
.SH OPTIONS
.IP "fB-ck fInfR"
Use up to fInfR symbols of lookahead when using compressed (linear
approximation) lookahead.  This type of lookahead is very cheap to
compute and is attempted before full LL(k) lookahead, which is of
exponential complexity in the worst case.  In general, the compressed
lookahead can be much deeper (e.g, f(CW-ck 10fP) than the full
lookahead (which usually must be less than 4).
.IP fB-CCfP
Generate C++ output from both ANTLR and DLG.
.IP fB-crfP
Generate a cross-reference for all rules.  For each rule, print a list
of all other rules that reference it.
.IP fB-e1fP
Ambiguities/errors shown in low detail (default).
.IP fB-e2fP
Ambiguities/errors shown in more detail.
.IP fB-e3fP
Ambiguities/errors shown in excruciating detail.
.IP "fB-fefP file"
Rename fBerr.cfP to file.
.IP "fB-fhfP file"
Rename fBstdpccts.hfP header (turns on fB-ghfP) to file.
.IP "fB-flfP file"
Rename lexical output, fBparser.dlgfP, to file.
.IP "fB-fmfP file"
Rename file with lexical mode definitions, fBmode.hfP, to file.
.IP "fB-frfP file"
Rename file which remaps globally visible symbols, fBremap.hfP, to file.
.IP "fB-ftfP file"
Rename fBtokens.hfP to file.
.IP fB-gafP
Generate ANSI-compatible code (default case).  This has not been
rigorously tested to be ANSI XJ11 C compliant, but it is close.  The
normal output of fIantlrfP is currently compilable under both K&R,
ANSI C, and C++(emthis option does nothing because fIantlrfP
generates a bunch of #ifdef's to do the right thing depending on the
language.
.IP fB-gcfP
Indicates that fIantlrfP should generate no C code, i.e., only
perform analysis on the grammar.
.IP fB-gdfP
C code is inserted in each of the fIantlrfR generated parsing functions to
provide for user-defined handling of a detailed parse trace.  The inserted
code consists of calls to the user-supplied macros or functions called
fBzzTRACEINfR and fBzzTRACEOUTfP.  The only argument is a
fIchar *fR pointing to a C-style string which is the grammar rule
recognized by the current parsing function.  If no definition is given
for the trace functions, upon rule entry and exit, a message will be
printed indicating that a particular rule as been entered or exited.
.IP fB-gefP
Generate an error class for each non-terminal.
.IP fB-ghfP
Generate fBstdpccts.hfP for non-ANTLR-generated files to include.
This file contains all defines needed to describe the type of parser
generated by fIantlrfP (e.g. how much lookahead is used and whether
or not trees are constructed) and contains the fBheaderfP action
specified by the user.
.IP fB-gkfP
Generate parsers that delay lookahead fetches until needed.  Without
this option, fIantlrfP generates parsers which always have fIkfP
tokens of lookahead available.
.IP fB-glfP
Generate line info about grammar actions in C parser of the form
fB# fIlinefP "fIfilefP"fR which makes error messages from
the C/C++ compiler make more sense as they will *Qpoint*U into the
grammar file not the resulting C file.  Debugging is easier as well,
because you will step through the grammar not C file.
.IP fB-gsfR
Do not generate sets for token expression lists; instead generate a
fB||fP-separated sequence of fBLA(1)==fItoken_numberfR.  The
default is to generate sets.
.IP fB-gtfP
Generate code for Abstract-Syntax Trees.
.IP fB-gxfP
Do not create the lexical analyzer files (dlg-related).  This option
should be given when the user wishes to provide a customized lexical
analyzer.  It may also be used in fImakefR scripts to cause only the
parser to be rebuilt when a change not affecting the lexical structure
is made to the input grammars.
.IP "fB-k fInfR"
Set k of LL(k) to fInfR; i.e. set tokens of look-ahead (default==1).
.IP "fB-ofP dir
Directory where output files should go (default=".").  This is very
nice for keeping the source directory clear of ANTLR and DLG spawn.
.IP fB-pfP
The complete grammar, collected from all input grammar files and
stripped of all comments and embedded actions, is listed to
fBstdoutfP.  This is intended to aid in viewing the entire grammar
as a whole and to eliminate the need to keep actions concisely stated
so that the grammar is easier to read.  Hence, it is preferable to
embed even complex actions directly in the grammar, rather than to
call them as subroutines, since the subroutine call overhead will be
saved.
.IP fB-pafP
This option is the same as fB-pfP except that the output is
annotated with the first sets determined from grammar analysis.
.IP "fB-prc onfR
Turn on the computation and hoisting of predicate context.
.IP "fB-prc offfR
Turn off the computation and hoisting of predicate context.  This
option makes 1.10 behave like the 1.06 release with option fB-prfR
on.  Context computation is off by default.
.IP "fB-rl fInfR
Limit the maximum number of tree nodes used by grammar analysis to
fInfP.  Occasionally, fIantlrfP is unable to analyze a grammar
submitted by the user.  This rare situation can only occur when the
grammar is large and the amount of lookahead is greater than one.  A
nonlinear analysis algorithm is used by PCCTS to handle the general
case of LL(k) parsing.  The average complexity of analysis, however, is
near linear due to some fancy footwork in the implementation which
reduces the number of calls to the full LL(k) algorithm.  An error
message will be displayed, if this limit is reached, which indicates
the grammar construct being analyzed when fIantlrfP hit a
non-linearity.  Use this option if fIantlrfP seems to go out to
lunch and your disk start thrashing; try fInfP=10000 to start.  Once
the offending construct has been identified, try to remove the
ambiguity that fIantlrfP was trying to overcome with large lookahead
analysis.  The introduction of (...)? backtracking blocks eliminates
some of these problems (em fIantlrfP does not analyze alternatives
that begin with (...)? (it simply backtracks, if necessary, at run
time).
.IP fB-w1fR
Set low warning level.  Do not warn if semantic predicates and/or
(...)? blocks are assumed to cover ambiguous alternatives.
.IP fB-w2fR
Ambiguous parsing decisions yield warnings even if semantic predicates
or (...)? blocks are used.  Warn if predicate context computed and
semantic predicates incompletely disambiguate alternative productions.
.IP fB-fR
Read grammar from standard input and generate fBstdin.cfP as the
parser file.
.SH "SPECIAL CONSIDERATIONS"
.PP
fIAntlrfP works...  we think.  There is no implicit guarantee of
anything.  We reserve no fBlegalfP rights to the software known as
the Purdue Compiler Construction Tool Set (PCCTS) (em PCCTS is in the
public domain.  An individual or company may do whatever they wish
with source code distributed with PCCTS or the code generated by
PCCTS, including the incorporation of PCCTS, or its output, into
commercial software.  We encourage users to develop software with
PCCTS.  However, we do ask that credit is given to us for developing
PCCTS.  By "credit", we mean that if you incorporate our source code
into one of your programs (commercial product, research project, or
otherwise) that you acknowledge this fact somewhere in the
documentation, research report, etc...  If you like PCCTS and have
developed a nice tool with the output, please mention that you
developed it using PCCTS.  As long as these guidelines are followed,
we expect to continue enhancing this system and expect to make other
tools available as they are completed.
.SH FILES
.IP *.c
output C parser.
.IP *.cpp
output C++ parser when C++ mode is used.
.IP fBparser.dlgfP
output fIdlgfR lexical analyzer.
.IP fBerr.cfP
token string array, error sets and error support routines.  Not used in
C++ mode.
.IP fBremap.hfP
file that redefines all globally visible parser symbols.  The use of
the #parser directive creates this file.  Not used in
C++ mode.
.IP fBstdpccts.hfP
list of definitions needed by C files, not generated by PCCTS, that
reference PCCTS objects.  This is not generated by default.  Not used in
C++ mode.
.IP fBtokens.hfP
output fI#definesfR for tokens used and function prototypes for
functions generated for rules.
.SH "SEE ALSO"
.LP
dlg(1), pccts(1)