Blog Dados & Embeddings

DREG: A Layer-Wise Jacobian Regularization as a General-Purpose Penalty

arXiv:2606.23942v1 Announce Type: new Abstract: We present a large-scale empirical study isolating the contributions of the Derivative Regularization penalty (DREG). Across a fully-crossed factorial sweep of 960 experiments spanning 4 activations, 6 regularizers, 8 datasets, and 5 random seeds, we ask: when, where, and why does DREG work? Our results establish three principal findings. First, DREG achieves the highest overall and clean-regime accuracy among all regularizers evaluated (significan...

arXiv cs.LG ·Rowan Martnishn · 24 de janeiro de 2026

Ver no Hugging Face

// relacionados

DREG: A Layer-Wise Jacobian Regularization as a General-Purpose Penalty

Leia também

Gradium Launches stt-translate and s2s-translate, Real-Time Speech Translation Models Beating gpt-realtime-translate on Accuracy and Latency

How to Design an OpenHarness Style Agent Runtime with Tools, Memory, Permissions, Skills, and Multi-Agent Coordination

Snowflake CEO finds GLM-5.2 competitive with Opus 4.7 at a fraction of the cost

Talos: Scaling rare disease diagnosis with automated, iterative genomic reanalysis