| | |
| | |
Stat |
Members: 3645 Articles: 2'501'711 Articles rated: 2609
20 April 2024 |
|
| | | |
|
Article overview
| |
|
Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup | Devin Schwab
; Tobias Springenberg
; Murilo F. Martins
; Thomas Lampe
; Michael Neunert
; Abbas Abdolmaleki
; Tim Herkweck
; Roland Hafner
; Francesco Nori
; Martin Riedmiller
; | Date: |
13 Feb 2019 | Abstract: | We present a method for fast training of vision based control policies on
real robots. The key idea behind our method is to perform multi-task
Reinforcement Learning with auxiliary tasks that differ not only in the reward
to be optimized but also in the state-space in which they operate. In
particular, we allow auxiliary task policies to utilize task features that are
available only at training-time. This allows for fast learning of auxiliary
policies, which subsequently generate good data for training the main,
vision-based control policies. This method can be seen as an extension of the
Scheduled Auxiliary Control (SAC-X) framework. We demonstrate the efficacy of
our method by using both a simulated and real-world Ball-in-a-Cup game
controlled by a robot arm. In simulation, our approach leads to significant
learning speed-ups when compared to standard SAC-X. On the real robot we show
that the task can be learned from-scratch, i.e., with no transfer from
simulation and no imitation learning. Videos of our learned policies running on
the real robot can be found at
this https URL | Source: | arXiv, 1902.4706 | Services: | Forum | Review | PDF | Favorites |
|
|
No review found.
Did you like this article?
Note: answers to reviews or questions about the article must be posted in the forum section.
Authors are not allowed to review their own article. They can use the forum section.
browser Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)
|
| |
|
|
|
| News, job offers and information for researchers and scientists:
| |