Blog
Robótica & RL
Gauging, Measuring, and Controlling Critic Complexity in Actor-Critic Reinforcement Learning
arXiv:2607.00452v1 Announce Type: new Abstract: Actor-critic methods depend on learned critics, but critic quality is often evaluated only indirectly through return, temporal-difference error, or value loss. Critic complexity is introduced as an additional diagnostic and intervention dimension for actor-critic reinforcement learning. The analysis uses spectral effective-rank entropy, a rank-like summary of the singular-value distributions of critic weight matrices, to assess critic model complex...
arXiv cs.LG
·Konstantin Garbers
·
// relacionados