| | |
| | |
Stat |
Members: 3645 Articles: 2'506'133 Articles rated: 2609
26 April 2024 |
|
| | | |
|
Article overview
| |
|
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment | Adrien Ali Taïga
; William Fedus
; Marlos C. Machado
; Aaron Courville
; Marc G. Bellemare
; | Date: |
7 Aug 2019 | Abstract: | This paper provides an empirical evaluation of recently developed exploration
algorithms within the Arcade Learning Environment (ALE). We study the use of
different reward bonuses that incentives exploration in reinforcement learning.
We do so by fixing the learning algorithm used and focusing only on the impact
of the different exploration bonuses in the agent’s performance. We use
Rainbow, the state-of-the-art algorithm for value-based agents, and focus on
some of the bonuses proposed in the last few years. We consider the impact
these algorithms have on performance within the popular game Montezuma’s
Revenge which has gathered a lot of interest from the exploration community,
across the the set of seven games identified by Bellemare et al. (2016) as
challenging for exploration, and easier games where exploration is not an
issue. We find that, in our setting, recently developed bonuses do not provide
significantly improved performance on Montezuma’s Revenge or hard exploration
games. We also find that existing bonus-based methods may negatively impact
performance on games in which exploration is not an issue and may even perform
worse than $epsilon$-greedy exploration. | Source: | arXiv, 1908.2388 | Services: | Forum | Review | PDF | Favorites |
|
|
No review found.
Did you like this article?
Note: answers to reviews or questions about the article must be posted in the forum section.
Authors are not allowed to review their own article. They can use the forum section.
browser Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)
|
| |
|
|
|
| News, job offers and information for researchers and scientists:
| |