A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences
Optimizing discrete black-box functions is key in several domains, e.g. protein engineering and drug design. Due to the lack of gradient information and the need for sample efficiency, Bayesian optimization is an ideal candidate for these tasks. Several methods for high-dimensional continuous and ca...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Optimizing discrete black-box functions is key in several domains, e.g.
protein engineering and drug design. Due to the lack of gradient information
and the need for sample efficiency, Bayesian optimization is an ideal candidate
for these tasks. Several methods for high-dimensional continuous and
categorical Bayesian optimization have been proposed recently. However, our
survey of the field reveals highly heterogeneous experimental set-ups across
methods and technical barriers for the replicability and application of
published algorithms to real-world tasks. To address these issues, we develop a
unified framework to test a vast array of high-dimensional Bayesian
optimization methods and a collection of standardized black-box functions
representing real-world application domains in chemistry and biology. These two
components of the benchmark are each supported by flexible, scalable, and
easily extendable software libraries (poli and poli-baselines), allowing
practitioners to readily incorporate new optimization objectives or discrete
optimizers. Project website:
https://machinelearninglifescience.github.io/hdbo_benchmark |
---|---|
DOI: | 10.48550/arxiv.2406.04739 |