SPEC CPU2000 Benchmark Description File

Benchmark Name: 197.parser

Benchmark Author: Danny Sleator (sleator@cs.cmu.edu) and
                  Davy Temperley (dt3@columbia.edu)

Benchmark Program General Category: Word processing

Benchmark Description:

  The Link Grammar Parser is a syntactic parser of English, based on
  link grammar, an original theory of English syntax. Given a
  sentence, the system assigns to it a syntactic structure, which
  consists of set of labeled links connecting pairs of words.

  The parser has a dictionary of about 60000 word forms. It has
  coverage of a wide variety of syntactic constructions, including
  many rare and idiomatic ones. The parser is robust; it is able to
  skip over portions of the sentence that it cannot understand, and
  assign some structure to the rest of the sentence. It is able to
  handle unknown vocabulary, and make intelligent guesses from context
  about the syntactic categories of unknown words.

Input Description:

  The input is a sequence of proposed sentences, one per line.
  Punctuation and case matter.

Output Description:

  The output is an analysis of the proposed input sentence.  The
  analysis includes a set of links which capture the grammatical
  structure of the sentence, a labelling of each word with an
  appropriate part of speech tag, along with a judgement of the
  grammaticality of the input sentence.  Words in square brackets are
  ones that the parser deems superfluous.

Programming Language:

  The parser is written in ANSI C.

Known portability issues: None

Reference:

  See http://www.link.cs.cmu.edu/link for announcements of the latest
    version, detailed documentation, papers, source code, and rules for
    commercial use.