Robotics: Science and Systems XVIII
Meta Value Learning for Fast Policy-Centric Optimal Motion Planning
Siyuan Xu, Minghui ZhuAbstract:
This paper considers policy-centric optimal motion planning with limited reaction time. The motion planning queries are determined by their goal regions and cost functionals, and are generated over time from a distribution. Once a new query is requested, the robot needs to quickly generate a motion planner which can steer the robot to the goal region while minimizing a cost functional. We develop a meta-learning-based algorithm to compute a meta value function, which can be fast adapted using a small number of samples of a new query. Simulations on a unicycle are conducted to evaluate the developed algorithm and show the anytime property of the proposed algorithm.
Bibtex:
@INPROCEEDINGS{Xu-RSS-22,
AUTHOR = {Siyuan Xu AND Minghui Zhu},
TITLE = {{Meta Value Learning for Fast Policy-Centric Optimal Motion Planning}},
BOOKTITLE = {Proceedings of Robotics: Science and Systems},
YEAR = {2022},
ADDRESS = {New York City, NY, USA},
MONTH = {June},
DOI = {10.15607/RSS.2022.XVIII.061}
}
