Abstract:
An important problem in Natural Language Processing is identifying the correct sense of a word in a particular context. Thus far, statistical methods have been considered the best techniques in word sense disambiguation. Unfortunately, these methods produce high-accuracy results only for a small number of preselected words. The reduced applicability of statistical methods is due mainly to the lack of widely available semantically tagged corpora. In this paper we present a method that enables the automatic acquisition of sense-tagged corpora. It is based on (1) the information provided in WordNet, particularly the word definitions found within the glosses, and (2) the information gathered from the Internet using existing search engines.

Published Date: May 1999
Registration: ISBN 978-1-57735-080-4
Copyright: Published by The AAAI Press, Menlo Park, California.