Accelerating Robot Trajectory Learning for Stochastic Tasks. Vidakovic, J., Jerbić, B., Sekoranja, B., Švaco, M., & Suligoj, F. IEEE Access, 8:71993–72006, 2020.
Accelerating Robot Trajectory Learning for Stochastic Tasks [link]Paper  doi  abstract   bibtex   
Learning from demonstration provides ways to transfer knowledge and skills from humans to robots. Models based solely on learning from demonstration often have very good generalization capabilities but are not completely accurate when adapting to new scenarios. This happens especially when learning stochastic tasks because of the correspondence problem and unmodeled physical properties of tasks. On the other hand, reinforcement learning (RL) methods such as policy search have the capability to refine an initial skill through exploration, where the learning process is often very dependent on the initialization strategy and is efficient in finding only local solutions. These two approaches are, therefore, frequently combined. In this paper, we present how the iterative learning of tasks can be accelerated by a learning from demonstration (LfD) method based on the extraction of via-points. The paper provides an evaluation of the approach on two different primitive motion tasks.
@article{vidakovic_accelerating_2020,
	title = {Accelerating {Robot} {Trajectory} {Learning} for {Stochastic} {Tasks}},
	volume = {8},
	issn = {2169-3536},
	url = {https://ieeexplore.ieee.org/document/9062516/},
	doi = {10.1109/ACCESS.2020.2986999},
	abstract = {Learning from demonstration provides ways to transfer knowledge and skills from humans to robots. Models based solely on learning from demonstration often have very good generalization capabilities but are not completely accurate when adapting to new scenarios. This happens especially when learning stochastic tasks because of the correspondence problem and unmodeled physical properties of tasks. On the other hand, reinforcement learning (RL) methods such as policy search have the capability to refine an initial skill through exploration, where the learning process is often very dependent on the initialization strategy and is efficient in finding only local solutions. These two approaches are, therefore, frequently combined. In this paper, we present how the iterative learning of tasks can be accelerated by a learning from demonstration (LfD) method based on the extraction of via-points. The paper provides an evaluation of the approach on two different primitive motion tasks.},
	language = {en},
	urldate = {2020-06-12},
	journal = {IEEE Access},
	author = {Vidakovic, Josip and Jerbić, Bojan and Sekoranja, Bojan and Švaco, Marko and Suligoj, Filip},
	year = {2020},
	pages = {71993--72006},
}

Downloads: 0