Evolutionary Discovery of Developmental Reward Schedules in Deep Reinforcement Learning

arXiv:2606.20858v1 Announce Type: new Abstract: The temporal structure of reward composition in reinforcement learning (RL) is typically hand-designed and held fixed throughout training, leaving the progression of motivational priorities largely unexplored. In this work, we propose an evolutionary framework for discovering developmental reward schedules, in which three distinct biologically inspired motivational components -- agency, novelty, and reactivity -- are combined through time-varying w...

arXiv cs.LG ·Alan Nadelsticher Ruvalcaba ·
compartilhar: