Efficient Hyperparameter Optimization with Adaptive Fidelity Identification

Jiantong Jiang, Zeyi Wen, Atif Mansoor, Ajmal Mian

Research output: Chapter in Book/Conference paperConference paperpeer-review

14 Downloads (Pure)

Abstract

Hyperparameter Optimization and Neural Architecture Search are powerful in attaining state-of-the-art machine learning models with Bayesian Optimization (BO) standing out as a mainstream method. Extending BO into the multi-fidelity setting has been an emerging research topic in this field but faces the challenge of determining an appropriate fidelity for each hyperparameter configuration to fit the surrogate model. To tackle the challenge we propose a multi-fidelity BO method named FastBO which excels in adaptively deciding the fidelity for each configuration and providing strong performance while ensuring efficient resource usage. These advantages are achieved through our proposed techniques based on the concepts of efficient point and saturation point for each configuration which can be obtained from the empirical learning curve of the configuration estimated from early observations. Extensive experiments demonstrate FastBO's superior anytime performance and efficiency in identifying high-quality configurations and architectures. We also show that our method provides a way to extend any single-fidelity method to the multi-fidelity setting highlighting the wide applicability of our approach.
Original languageEnglish
Title of host publicationIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
PublisherIEEEE
Pages26181
Number of pages10
Publication statusPublished - 19 Jun 2024
EventThe IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 - Seattle Convention Center, Seattle, United States
Duration: 17 Jun 202421 Jun 2024

Conference

ConferenceThe IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024
Abbreviated titleCVPR
Country/TerritoryUnited States
CitySeattle
Period17/06/2421/06/24

Fingerprint

Dive into the research topics of 'Efficient Hyperparameter Optimization with Adaptive Fidelity Identification'. Together they form a unique fingerprint.

Cite this