Counterfactual Residual Data Augmentation for Regression

arXiv:2606.28460v1 Announce Type: new Abstract: Data-driven modeling in real-world regression tasks often suffers from limited training samples, high collection costs, and noisy observations. Inspired by the impact of data augmentation in vision and language, we propose a novel Counterfactual Residual Data Augmentation (CRDA) technique for tabular regression. Our key insight is that once a regressor has modeled the systematic component of the data, the remaining noise can be viewed as an invaria...

arXiv cs.LG ·Hossein Mohebbi, Oliver Schulte, Ke Li, Pascal Poupart ·
compartilhar: