Agents in the Brickworld

Authors

Gregory D. Weber

Proceedings:

Eleventh Midwest Artificial intelligence and Cognitive Science Conference

Volume

Issue:

Eleventh Midwest Artificial intelligence and Cognitive Science Conference

Track:

Contents

Downloads:

Download PDF

Abstract:

Brickworld is a simulated environment which has been developed as a testbed for learning and planning--in particular, for learning and using knowledge of causal relations. The environment is both dynamic--there are other "agents" whose actions affect "the" agent’s performance--and stochastic---future states can be predicted only with uncertainty. The task, building and maintaining a wall, has been formulated as a reinforcement learning problem. The ultimate goal of the Brickworld project is to develop a relational reinforcement learning agent that will learn a causal model of the environment representing both its own causal powers and those of the other "agents." The term "agents" is used here in the broadest possible sense, including not only intelligent agents but brute animals and even natural forces such as wind and rain--anything that can be a cause of environmental change. This paper describes seven implemented agents-- a quasi-reactive agent, four non-learning rule-based agents, and two (non-relational) reinforcement learning agents--and compares their performance. The experiments show that a reasonable knowledge representation for the environment results in a state-value function which has local optima, making greedy and e-greedy policies inappropriate. Deeper search is required, leading to problems of inefficiency, which may be alleviated through hierarchical problem spaces. The paper raises questions about the legitimacy of programmerdesigned hierarchies in the framework of reinforcement learning and suggests a principled solution.

MAICS

Eleventh Midwest Artificial intelligence and Cognitive Science Conference

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.