Proceedings:
Vol. 19 (2009): Nineteenth International Conference on Automated Planning and Scheduling
Volume
Issue:
Vol. 19 (2009): Nineteenth International Conference on Automated Planning and Scheduling
Track:
Long Papers
Downloads:
Abstract:
Explaining policies of Markov Decision Processes (MDPs) is complicated due to their probabilistic and sequential nature. We present a technique to explain policies for factored MDP by populating a set of domain-independent templates. We also present a mechanism to determine a minimal set of templates that, viewed together, completely justify the policy. Our explanations can be generated automatically at run-time with no additional effort required from the MDP designer. We demonstrate our technique using the problems of advising undergraduate students in their course selection and assisting people with dementia in completing the task of handwashing. We also evaluate our explanations for course-advising through a user study involving students.
DOI:
10.1609/icaps.v19i1.13365
ICAPS
Vol. 19 (2009): Nineteenth International Conference on Automated Planning and Scheduling