Comparable Datasets in Performance Benchmarking

Authors

David Steier

Proceedings:

Information Gathering from Heterogeneous, Distributed Environments

Volume

Issue:

Papers from the 1995 AAAI Spring Symposium

Track:

Contents


Abstract:

A number of tasks require gathering information about a collection of similar objects to perform a comparison. When the information needed to perform these tasks comes from a single database, the amount and the type of data retrieved about each object in the collection are likely to be very similar, and the task of comparison is relatively straightforward. But when information comes from many sources, information gatherers face the problem of producing a common comparable dataset for each object being compared. This problem is difficult because what should be in a comparable dataset (as we show in this paper) depends on the task for which the information is being gathered, the target collection of objects to report on, and the data available about each object. The purpose of this workshop paper is to highlight the importance of this problem in gathering information from heterogeneous sources, and to present some detail about a case study encountered in practice while doing a performance benchmarking study. Aspects of producing comparable datasets (compsets) have been studied in the database literature within the area of schema integration for heterogeneous databases [Batini et al., 1986], because of the shared concern for semantic comparability at the schematic level. For example, the theory of semantic values developed by Sciore et al. [1994] seems like a promising approach to computing comparable datasets because it explicitly represents contextual information for each value. We discuss a number of issues involved in using contextual information in this way.
