Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives

Abstract

We present a model selection framework for Learning Rate-Free Reinforcement Learning that selects the optimal learning rate on the fly during RL training. This approach of adaptive learning rate tuning neither depends on the underlying RL algorithm nor the optimizer and solely uses the reward feedback to select the learning rate; hence, the framework can input any RL algorithm and produce a learning rate-free version of it.

Publication
In Hugo Blox Builder Conference
Aida Afshar
Aida Afshar
PhD Student

My primary research interests are Sequential Decision Making and Reinforcement Learning. Currently, I’m working on using Foundation Models to improve the structure and practicality of Decision Making Frameworks.