Reinforcement Learning in Non-Stationary Environments

Author: Neural Intelligence Network
Published: Thu 26 Jun 2025
Episode Link: https://podcasters.spotify.com/pod/show/neuralintelpod/episodes/Reinforcement-Learning-in-Non-Stationary-Environments-e34mqe4

This academic paper introduces Non-Stationary Natural Actor-Critic (NS-NAC), a novel model-free, policy-based reinforcement learning algorithm designed for time-varying environments where rewards and transition probabilities change. Traditional reinforcement learning often assumes stationary environments, but real-world applications frequently involve dynamic systems. NS-NAC addresses this by incorporating restart-based exploration and adaptive learning rates to balance forgetting outdated information and learning new environmental dynamics. The paper also presents BORL-NS-NAC, a parameter-free extension that does not require prior knowledge of environmental variation, and provides theoretical guarantees for both algorithms through dynamic regret analysis, supported by empirical simulations.

Share to:

EachPod

EachPod

Reinforcement Learning in Non-Stationary Environments