Beyond String Matching and Clue Phrases: Improving Efficiency and Coverage in Discourse Analysis

Simon Corston-Oliver

RASTA (Rhetorical Structure Theory Analyzer), a discourse analysis component within the Microsoft English Grammar, efficiently computes representations of the structure of written discourse using information available in syntactic and logical form analyses. RASTA heuristically scores the rhetorical relations that it hypothesizes, using those scores to guide it in producing more plausible discourse representations before less plausible ones. The heuristic scores also provide a genre-independent method for evaluating competing discourse analyses: the best discourse analyses are those constructed from the strongest hypotheses.

This page is copyrighted by AAAI. All rights reserved. Your use of this site constitutes acceptance of all of AAAI's terms and conditions and privacy policy.