AAAI Publications, Twenty-Fourth International FLAIRS Conference

Font Size: 
Disambiguation and Filtering Methods in Using Web Knowledge for Coreference Resolution
Olga Uryupina, Massimo Poesio, Claudio Giuliano, Kateryna Tymoshenko

Last modified: 2011-03-20


We investigate two publicly available web knowledge bases, Wikipedia and Yago, in an attempt to leverage semantic information and increase the performance level of a state-of-the-art coreference resolution (CR) engine. We extract semantic compatibility and aliasing information from Wikipedia and Yago, and incorporate it into a CR system. We show that using such knowledge with no disambiguation and filtering does not bring any improvement over the baseline, mirroring the previous findings. We propose, therefore, a number of solutions to reduce the amount of noise coming from web resources: using disambiguation tools for Wikipedia, pruning Yago to eliminate the most generic categories and imposing additional constraints on affected mentions. Our evaluation experiments on the ACE-02 corpus show that the knowledge, extracted from Wikipedia and Yago, improves our system's performance by 2-3 percentage points.

Full Text: PDF