Active Reinforcement Learning: Observing Rewards at a Cost

Active Reinforcement Learning: Observing Rewards at a Cost. Krueger, D, Leike, J, Evans, O, & Salvatier, J filmnips.com.
abstract bibtex

Abstract Active reinforcement learning (ARL) is a variant on reinforcement learning where the agent does not observe the reward unless it chooses to pay a query cost c> 0. The central question of ARL is how to quantify the long-term value of reward information. Even in multi-armed bandits, computing the value of this information is intractable and we have to rely on heuristics. We propose and evaluate several heuristic approaches for ARL in multi-.

@Article{Krueger,
author = {Krueger, D and Leike, J and Evans, O and Salvatier, J}, 
title = {Active Reinforcement Learning: Observing Rewards at a Cost}, 
journal = {filmnips.com}, 
volume = {}, 
number = {}, 
pages = {}, 
year = {}, 
abstract = {Abstract Active reinforcement learning (ARL) is a variant on reinforcement learning where the agent does not observe the reward unless it chooses to pay a query cost c\&gt; 0. The central question of ARL is how to quantify the long-term value of reward information. Even in multi-armed bandits, computing the value of this information is intractable and we have to rely on heuristics. We propose and evaluate several heuristic approaches for ARL in multi-.}, 
location = {}, 
keywords = {}}

Downloads: 0

{"_id":"faWjfba4XsQET7gy7","bibbaseid":"krueger-leike-evans-salvatier-activereinforcementlearningobservingrewardsatacost","authorIDs":[],"author_short":["Krueger, D","Leike, J","Evans, O","Salvatier, J"],"bibdata":{"bibtype":"article","type":"article","author":[{"propositions":[],"lastnames":["Krueger"],"firstnames":["D"],"suffixes":[]},{"propositions":[],"lastnames":["Leike"],"firstnames":["J"],"suffixes":[]},{"propositions":[],"lastnames":["Evans"],"firstnames":["O"],"suffixes":[]},{"propositions":[],"lastnames":["Salvatier"],"firstnames":["J"],"suffixes":[]}],"title":"Active Reinforcement Learning: Observing Rewards at a Cost","journal":"filmnips.com","volume":"","number":"","pages":"","year":"","abstract":"Abstract Active reinforcement learning (ARL) is a variant on reinforcement learning where the agent does not observe the reward unless it chooses to pay a query cost c> 0. The central question of ARL is how to quantify the long-term value of reward information. Even in multi-armed bandits, computing the value of this information is intractable and we have to rely on heuristics. We propose and evaluate several heuristic approaches for ARL in multi-.","location":"","keywords":"","bibtex":"@Article{Krueger,\nauthor = {Krueger, D and Leike, J and Evans, O and Salvatier, J}, \ntitle = {Active Reinforcement Learning: Observing Rewards at a Cost}, \njournal = {filmnips.com}, \nvolume = {}, \nnumber = {}, \npages = {}, \nyear = {}, \nabstract = {Abstract Active reinforcement learning (ARL) is a variant on reinforcement learning where the agent does not observe the reward unless it chooses to pay a query cost c\\> 0. The central question of ARL is how to quantify the long-term value of reward information. Even in multi-armed bandits, computing the value of this information is intractable and we have to rely on heuristics. We propose and evaluate several heuristic approaches for ARL in multi-.}, \nlocation = {}, \nkeywords = {}}\n\n\n","author_short":["Krueger, D","Leike, J","Evans, O","Salvatier, J"],"key":"Krueger","id":"Krueger","bibbaseid":"krueger-leike-evans-salvatier-activereinforcementlearningobservingrewardsatacost","role":"author","urls":{},"downloads":0},"bibtype":"article","biburl":"https://gist.githubusercontent.com/stuhlmueller/a37ef2ef4f378ebcb73d249fe0f8377a/raw/6f96f6f779501bd9482896af3e4db4de88c35079/references.bib","creationDate":"2020-01-27T02:13:34.863Z","downloads":0,"keywords":[],"search_terms":["active","reinforcement","learning","observing","rewards","cost","krueger","leike","evans","salvatier"],"title":"Active Reinforcement Learning: Observing Rewards at a Cost","year":null,"dataSources":["hEoKh4ygEAWbAZ5iy"]}