Retrieval in text collections with historic spelling using linguistic and spelling variants
- Citation key: Ernst/Fuhr:07
- Title: Retrieval in text collections with historic spelling using linguistic and spelling variants
- Authors: Andrea Ernst-Gerlach, Norbert Fuhr
- In: JCDL
  - Citation key: JCDL:07
  - Title: ACM/IEEE Joint Conference on Digital Libraries, JCDL 2007, Vancouver, BC, Canada, June 18-23, 2007, Proceedings
  - Editors: Edie M. Rasmussen, Ray R. Larson, Elaine Toms, Shigeo Sugimoto
  - Publisher: ACM
  - Year: 2007
- Pages: 333-341
- Year: 2007
Abstract:
We present a new approach for retrieving texts with non-standard spelling, which is important for historic texts, e.g., in English or German. In this paper, we describe the overall architecture of our system and then its evaluation. Given a search term as a lemma, we use a dictionary of contemporary German to find all inflected and derived forms of the lemma. We then apply transformation rules (derived from training data) to generate historic spelling variants. For the evaluation, we measure the resulting retrieval quality. The experimental results show that our approach substantially improves retrieval quality for historic collections.
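The two-stage expansion described in the abstract (lemma to inflected forms, then inflected forms to historic spelling variants) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the toy dictionary `TOY_DICTIONARY`, the two rewrite rules in `TOY_RULES`, the `max_edits` cap, and the function names are hypothetical and not taken from the paper, whose rules are derived from training data.

```python
from itertools import combinations

# Hypothetical stand-in for a dictionary of contemporary German:
# lemma -> inflected and derived forms.
TOY_DICTIONARY = {
    "teil": ["teil", "teile", "teilen"],
}

# Hypothetical transformation rules (modern substring -> historic substring).
# The paper learns its rules from training data; these two are illustrative.
TOY_RULES = [("t", "th"), ("ei", "ey")]


def find_sites(word, rules):
    """Return sorted (start, end, replacement) triples for every rule match."""
    sites = []
    for modern, historic in rules:
        start = word.find(modern)
        while start != -1:
            sites.append((start, start + len(modern), historic))
            start = word.find(modern, start + 1)
    return sorted(sites)


def spelling_variants(word, rules, max_edits=2):
    """Generate variants by applying up to max_edits non-overlapping rules."""
    sites = find_sites(word, rules)
    variants = {word}
    for k in range(1, max_edits + 1):
        for chosen in combinations(sites, k):
            # Skip selections in which two match sites overlap.
            if any(a[1] > b[0] for a, b in zip(chosen, chosen[1:])):
                continue
            parts, pos = [], 0
            for start, end, historic in chosen:
                parts.append(word[pos:start])
                parts.append(historic)
                pos = end
            parts.append(word[pos:])
            variants.add("".join(parts))
    return variants


def expand_query(lemma, dictionary=TOY_DICTIONARY, rules=TOY_RULES):
    """Expand a lemma into all forms and their candidate historic spellings."""
    forms = dictionary.get(lemma, [lemma])
    expanded = set()
    for form in forms:
        expanded |= spelling_variants(form, rules)
    return sorted(expanded)


if __name__ == "__main__":
    print(expand_query("teil"))
```

The expanded term list would then be issued as a disjunctive query against the historic collection. Matching rule sites against the original modern word, rather than re-applying rules to already rewritten output, keeps a rule such as t to th from firing on its own result.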