- Targeted audience
- DAI Hauptstudium with 12 credit points : Bereich "D"
- Kommedia Bachelor: Erste Hälfte der Vorlesung, mit Übungen
Die mündlichen Prüfungen finden in der Woche vom 8.-12.9.08 statt.
|Monday||16:00 - 17:30||LE/105|
|Thursday||12:00 - 13:30||LB/134|
|Wednesday||16:00 - 17:30||LB/134||Dr. rer. nat. Ingo Frommholz|
Information Retrieval (IR) deals with information search in purely structured data like e.g. fulltexts or multimedia databases. Popular applications are web search engines, digital libraries and multimedia archives (e.g. for images).
Due to the vagueness of the information need and the uncertain representation of the content of the stored objects, standard database techniques are not appropriate. Instead, the concepts have to be extended to deal with vagueness and uncertainty. As the major focus is on content-oriented search, special techniques for representing the content of text and multimedia objects are required.
This lecture introduces the underyling concepts of IR and illustrates them based on special application areas.
- A) Basic concepts (information cycle, evaluation)
- B) Representation of content (free text search, documentation languages, special logics)
- C) Models (classic models, models for multimedia documents)
- D) Implementation of IR systems (layer model, visualisation, access paths, algorithms)
- E) IR tasks (retrieval, filtering, categorisation, cross-language retrieval, text mining, summarization)
- F) Application areas (web search engines, multimedia digital libraries, IR and databases)
Besides the slides and the lecture notes, the following books and lecture notes are recommended:
Baeza-Yates, B. Ribeiro-Neto: Modern Information Retrieval.
(The chapter about user interfaces and visualisation is online.)
- R. Belew: Finding Out About. A Cognitive Perspective on Search Engine Technology and the WWW. Cambridge University Press.
- Reginald Ferber: Data Mining und Information Retrieval. dpunkt Verlag . (earlier version)
- C. J. van Rijsbergen: Information Retrieval (HTML version of the book from 1979, but still worth reading)
(The lecture notes only partialy cover the content of the lecture, some parts are available only as slides.)
- Lecture notes (parts in German, parts in English)
- Appelt/Israel: Introduction to Information Extraction Technology
- Gianni Amati, Cornelis Joost Van Rijsbergen Probabilistic models of information retrieval based on measuring the divergence from randomness ACM Transactions on Information Systems (TOIS) 20, (4), 2002, pp. 357-389
- Norbert Fuhr:A Decision-Theoretic Approach to Database Selection in Networked IR. ACM Transactions on Information Systems