| | |
| | |
Stat |
Members: 3645 Articles: 2'504'585 Articles rated: 2609
24 April 2024 |
|
| | | |
|
Article overview
| |
|
A Definition of Happiness for Reinforcement Learning Agents | Mayank Daswani
; Jan Leike
; | Date: |
18 May 2015 | Abstract: | What is happiness for reinforcement learning agents? We seek a formal
definition satisfying a list of desiderata. Our proposed definition of
happiness is the temporal difference error, i.e. the difference between the
value of the obtained reward and observation and the agent’s expectation of
this value. This definition satisfies most of our desiderata and is compatible
with empirical research on humans. We state several implications and discuss
examples. | Source: | arXiv, 1505.4497 | Services: | Forum | Review | PDF | Favorites |
|
|
No review found.
Did you like this article?
Note: answers to reviews or questions about the article must be posted in the forum section.
Authors are not allowed to review their own article. They can use the forum section.
browser Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)
|
| |
|
|
|
| News, job offers and information for researchers and scientists:
| |