Hippocampal formation breaks combinatorial explosion for reinforcement learning: A conjecture

Authors

Andras Lorincz

Track:

Contents

Downloads:

Abstract:

There is surmounting evidence that reinforcement learning (RL) is a good model for the dopamine system of the brain and the prefrontal cortex. RL is also promising from the algorithmic point of view, because recent factored RL algorithms have favorable convergence and scaling properties and can counteract the curse of dimensionality problem, the major obstacle of practical applications of RL methods. Learning in navigation tasks then separates (i) to the search and the encoding of the factors, such as position, direction, and speed, and (ii) to the optimization of RL decision making by using these factors. We conjecture that the main task of the hippocampal formation is to separate factors and encode into neocortical areas the different low-dimensional conjunctive representations of them to suit factored RL value estimation. The mathematical framework is sketched. It includes convergent factored RL model and autoregressive (AR) hidden process model that finds factors including the hidden causes. The AR model is mapped to the hippocampal formation.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.