Finding Photograph Captions Multimodally on the World Wide Web

Authors

Neil C. Rowe and Brian Frew

Proceedings:

Intelligent Integration and Use of Text, Image, Video, and Audio Corpora

Volume

Issue:

Papers from the 1997 AAAI Spring Symposium

Track:

Contents

Abstract:

Several software tools index the text of the World Wide Web, but little attention has been paid to its many valuable photographs. We present a relatively simple way to index them by localizing their likely explicit and implicit captions with a kind of expert system. We use multimodal clues from the general appearance of the image, the layout of the Web page, and the words near the image that are likely to describe it. Our MARIE-3 system avoids full image processing and full natural-language processing, but demonstrates a surprising degree of success, and can thus serve as a preliminary filter for such detailed content analysis. Experiments with a randomly chosen set of Web pages concerning the military showed 41% recall with 41% precision for individual caption identification, or 70% recall with 30% precision, although captions averaged only 1.4% of the page text.
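
The two recall/precision pairs quoted above can be read as two operating points of one caption detector: accepting more candidate text raises recall and lowers precision. The sketch below only illustrates that trade-off and is not the MARIE-3 implementation. The Candidate class, the score field (standing in for some combined clue rating), the threshold values, and the sample data are all hypothetical, and they do not reproduce the paper's figures.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A piece of page text that might be a photograph caption (hypothetical type)."""
    text: str
    score: float      # assumed combined clue rating in [0, 1], higher = more caption-like
    is_caption: bool  # hand-labeled ground truth


def precision_recall(candidates, threshold):
    """Return (precision, recall) when every candidate scoring >= threshold is accepted."""
    accepted = [c for c in candidates if c.score >= threshold]
    true_positives = sum(c.is_caption for c in accepted)
    total_captions = sum(c.is_caption for c in candidates)
    precision = true_positives / len(accepted) if accepted else 0.0
    recall = true_positives / total_captions if total_captions else 0.0
    return precision, recall


# Made-up example data, for illustration only.
CANDIDATES = [
    Candidate("F-18 landing on carrier deck", 0.9, True),
    Candidate("Click here for more", 0.8, False),
    Candidate("Crew photograph, 1996", 0.6, True),
    Candidate("Home | About | Contact", 0.5, False),
    Candidate("Last updated March 1997", 0.4, False),
    Candidate("Aerial view of the base", 0.3, True),
]

if __name__ == "__main__":
    # A strict threshold and a lenient one give two different operating points.
    for threshold in (0.6, 0.3):
        p, r = precision_recall(CANDIDATES, threshold)
        print(f"threshold={threshold}: precision={p:.2f}, recall={r:.2f}")
```

Lowering the threshold in this toy data admits every true caption but also every non-caption, which is the same kind of trade the abstract reports when moving from the first operating point to the second.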

