Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives

Abstract

We present a model selection framework for Learning Rate-Free Reinforcement Learning that selects the optimal learning rate on the fly during RL training. This approach of adaptive learning rate tuning neither depends on the underlying RL algorithm nor the optimizer and solely uses the reward feedback to select the learning rate; hence, the framework can input any RL algorithm and produce a learning rate-free version of it.

Publication
In Hugo Blox Builder Conference
Aida Afshar
Aida Afshar
PhD Student

My primary research interests are Sequential Decision-Making and Statistical Foundations of Continual Learning.