Overfitting in Bayesian optimization: An empirical study and early-stopping solution
Bayesian Optimization (BO) is a successful methodology for tuning the hyperparameters of machine learning models. The user defines a metric of interest, such as the validation error, and BO finds the hyperparameters that minimize it. However, metric improvements on the validation set may not translate to improvements on the test set, especially when tuning models trained on small datasets. While cross-validation can mitigate this, it comes at an increased computational cost. In this paper, we carry out the first systematic investigation of overfitting in BO and demonstrate that this issue is a serious, yet often overlooked, concern in practice. We propose the first problem-adaptive and interpretable criterion for early-stopping BO, reducing overfitting while mitigating the cost of cross-validation. Experimental results on real-world hyperparameter optimization tasks show that our approach can substantially reduce compute time with little to no loss of test accuracy, demonstrating a practical advantage over existing techniques.
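To make the early-stopping idea concrete, the following is a minimal, self-contained sketch of a hyperparameter search loop with a stopping check. It is illustrative only: the paper's criterion is problem-adaptive, whereas this stand-in uses a simple patience rule (stop when the validation metric has not improved for a fixed number of rounds), and the random proposal step stands in for a BO acquisition step. All names, thresholds, and the quadratic toy objective are assumptions, not the paper's method.

```python
import random

def search_with_early_stop(objective, space, max_iters=50, patience=5, tol=1e-3):
    """Toy hyperparameter search with a patience-based early stop.

    Illustrative stand-in for the paper's problem-adaptive criterion:
    we stop once `patience` consecutive proposals fail to improve the
    best validation value by more than `tol`.
    """
    best_val, best_cfg, stall = float("inf"), None, 0
    for _ in range(max_iters):
        # Stand-in for a BO proposal: sample each hyperparameter uniformly.
        cfg = {k: random.uniform(lo, hi) for k, (lo, hi) in space.items()}
        val = objective(cfg)
        if val < best_val - tol:
            best_val, best_cfg, stall = val, cfg, 0
        else:
            stall += 1
        if stall >= patience:
            # Validation metric has plateaued: stop before max_iters
            # to save compute and avoid chasing validation-set noise.
            break
    return best_cfg, best_val

# Usage: minimize a toy "validation error" over a single learning rate.
random.seed(0)
best_cfg, best_val = search_with_early_stop(
    lambda cfg: (cfg["lr"] - 0.1) ** 2,   # hypothetical objective
    {"lr": (0.0, 1.0)},
)
```

The patience rule is deliberately simple; the point is only that the stopping decision is made inside the tuning loop, based on observed validation progress, rather than after a fixed budget is exhausted.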