Corpus-Based Induction of Lexical Representation and Meaning

Maria Lapata, University of Edinburgh

The acquisition of linguistic knowledge, i.e., the identification, extraction, and encoding of linguistic information in a corpus, has been one of the main motivations for data-driven approaches to natural language. Methods have been developed for the acquisition of, for instance, parts of speech, noun compounds, collocations, support verbs, subcategorization frames, phrase structure rules, selectional restrictions and sense induction for an overview). Drawing on this body of research, I am investigating the acquisition of lexical semantic knowledge from corpora, thereby addressing the logical problem of language acquisition, one of the fundamental issues in linguistics and cognitive science.

This page is copyrighted by AAAI. All rights reserved. Your use of this site constitutes acceptance of all of AAAI's terms and conditions and privacy policy.