Proceedings:
Proceedings of the International Symposium on Combinatorial Search, 13
Volume
Issue:
Vol. 13 No. 1 (2020): Thirteenth Annual Symposium on Combinatorial Search
Track:
Long Papers
Downloads:
Abstract:
MDPs with factored action spaces, i.e., where actions are described as assignments to a set of action variables, allow reasoning over action variables instead of action states, yet most algorithms only consider a grounded action representation. This includes algorithms that are instantiations of the Trial-based Heuristic Tree Search (THTS) framework, such as AO* or UCT. To be able to reason over factored action spaces, we propose a generalization of THTS where nodes that branch over all applicable actions are replaced with subtrees that consist of nodes that represent the decision for a single action variable. We show that many THTS algorithms retain their theoretical properties under the generalised framework, and show how to approximate any state-action heuristic to a heuristic for partial action assignments. This allows to guide a UCT variant that is able to create exponentially fewer nodes than the same algorithm that considers ground actions. An empirical evaluation on the benchmark set of the probabilistic track of the latest International Planning Competition validates the benefits of the approach.
DOI:
10.1609/socs.v11i1.18533
SOCS
Vol. 13 No. 1 (2020): Thirteenth Annual Symposium on Combinatorial Search