Published:
May 2002
Proceedings:
Proceedings of the Fifteenth International Florida Artificial Intelligence Research Society Conference (FLAIRS 2002)
Volume
Issue:
Proceedings of the Fifteenth International Florida Artificial Intelligence Research Society Conference (FLAIRS 2002)
Track:
All Papers
Downloads:
Abstract:
This paper addresses the problem of performing accurate semantic annotations in a large corpus. The task of creating a sense tagged corpus is different from the word sense disambiguation problem in that the semantic annotations have to be highly accurate, even if the price to be paid is lower coverage. While the state-of-the-art in word sense disambiguation does not exceed 70% precision, we want to find the means to perform semantic annotations with an accuracy close to 100%. We deal with this problem in the process of disambiguating the definitions in the WordNet dictionary. We propose in this paper a method that is able to tag words with high precision, using pattern extraction followed by pattern matching. This algorithm exploits the idiosyncratic nature of the corpus to be tagged, and achieves a precision of 99% with a coverage of 6%, measured on a WordNet subset, respectively more than 12.5% coverage estimated for the entire WordNet.
FLAIRS
Proceedings of the Fifteenth International Florida Artificial Intelligence Research Society Conference (FLAIRS 2002)
ISBN 978-1-57735-141-2
Published by The AAAI Press, Menlo Park, California