Automatic Acquisition of Sense Tagged Corpora

Rada F. Mihalcea, Dan I. Moldovan

An important problem in Natural Language Processing is identifying the correct sense of a word in a particular context. Thus far, statistical methods have been considered the best techniques for word sense disambiguation. Unfortunately, these methods produce high-accuracy results only for a small number of preselected words. The limited applicability of statistical methods is due mainly to the lack of widely available semantically tagged corpora. In this paper we present a method that enables the automatic acquisition of sense tagged corpora. It is based on (1) the information provided in WordNet, particularly the word definitions found within the glosses, and (2) the information gathered from the Internet using existing search engines.
