Published:
May 2003
Proceedings:
Proceedings of the Sixteenth International Florida Artificial Intelligence Research Society Conference (FLAIRS 2003)
Volume
Issue:
Proceedings of the Sixteenth International Florida Artificial Intelligence Research Society Conference (FLAIRS 2003)
Track:
All Papers
Downloads:
Abstract:
Document retrieval in languages with a rich and complex morphology -- particularly in terms of derivation and (single-word) composition -- suffers from serious performance degradation with the stemming-only query-term-totext- word matching paradigm. We propose an alternative approach in which morphologically complex word forms are segmented into relevant subwords (such as stems, prefixes, suffixes), and subwords constitute the basic unit for indexing and retrieval. We evaluate our approach on a biomedical document collection.
FLAIRS
Proceedings of the Sixteenth International Florida Artificial Intelligence Research Society Conference (FLAIRS 2003)
ISBN 978-1-57735-177-1
Published by The AAAI Press, Menlo Park, California.