Improving the Learning Efficiencies of Realtime Search

Authors

Toru Ishida

Masashi Shimbo

Proceedings:

No. 1: Agents, AI in Art and Entertainment, Knowledge Representation, and Learning

Volume

Issue:

Proceedings of the AAAI Conference on Artificial Intelligence, 13

Track:

Search & Learning

Downloads:

Download PDF

Abstract:

The capability of learning is one of the salient features of realtime search algorithms such as LRTA*. The major impediment is, however, the instability of the solution quality during convergence: (1) they try to find all optimal solutions even after obtaining fairly good solutions, and (2) they tend to move towards unexplored areas thus failing to balance exploration and exploitation. We propose and analyze two new realtime search algorithms to stabilize the convergence process. E-search (weighted realtime search) allows suboptimal solutions with E error to reduce the total amount of learning performed. d-search (realtime search with upper bounds) utilizes the upper bounds of estimated costs, which become available after the problem is solved once. Guided by the upper bounds, d-search can better control the tradeoff between exploration and exploitation.

AAAI

Proceedings of the AAAI Conference on Artificial Intelligence, 13

ISBN 978-0-262-51091-2

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.