Towards Robust Training in NNGPT AutoML Pipeline: A Loss-Optimizer Pairing Selection Study

arXiv:2606.20933v1 Announce Type: new Abstract: The choice of loss function and optimizer is an important decision that shapes subsequent model training. Yet automated architecture search (AutoML) pipelines benefit significantly more from optimal pairing selection, and vice versa. This paper investigates whether a single recipe is sufficient for heterogeneous architecture pools, or whether the optimal pairing varies across structurally diverse models. We conduct a systematic empirical study of...

arXiv cs.LG · Anton Abramochkin, Radu Timofte, Dmitry Ignatov