AIR/X - a Rule-Based Multistage Indexing System for Large Subject Fields

  • Citation-Key:
    Fuhr/etal:91
  • Title:
    AIR/X - a Rule-Based Multistage Indexing System for Large Subject Fields
  • Author(s):
    N. Fuhr
    S. Hartmann
    G. Knorz
    G. Lustig
    M. Schwantner
    K. Tzeras
  • In:
    Proceedings of the RIAO'91, Barcelona, Spain, April 2-5, 1991
  • Page(s):
    606--623
  • Year:
    1991

Abstract:


AIR/X is a rule-based system for indexing with terms (descriptors) from a prescribed vocabulary. For this task, an indexing dictionary with rules for mapping terms from the text onto descriptors is required, which can be derived automatically from a set of manually indexed documents. Based on the Darmstadt Indexing Approach, the indexing task is deivided into a description step and a decision step. First, terms (single words or phrases) are identified in the document text. With term-descriptor rules from the dictionary, descriptor indications are formed. The set of all indications from a document leading to the same descriptor is called a relevance description. A probabilistic classification procedure computes indexing weights for each relevance description. Since the whole system is rule-based, it can be adapted to different subject fields by appropriate modifications of the rule bases. A major application of AIR/X is the AIR/PHYS system developed for a large physics database. This application is described in more detail along with experimental results.
Classification(s):
H.3.1
Subject descriptor(s):
Indexing methods
Keywords:
DIA

Fulltext as PS