LassoBench

LassoBench is a library for high-dimensional hyperparameter optimization benchmarks based on Weighted Lasso regression.

Install the development version

Python 3.8 or later is required.

From a console or terminal, clone the repository and install LassoBench in editable mode:

git clone https://github.com/ksehic/LassoBench.git
cd LassoBench/
pip install -e .

Overview

The objective is to optimize a multi-dimensional hyperparameter vector, with one hyperparameter per feature, that balances the least-squares fit against the penalty term that promotes sparsity.
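
For reference, the trade-off described above can be written in the standard Weighted Lasso form. This is a sketch of the textbook formulation, not a statement of LassoBench's exact objective: the symbols $X$, $y$, $w$ and $\lambda_j$, the $1/(2n)$ scaling, and the way a point of the search space is mapped to the weights $\lambda_j$ are all assumptions made here for illustration.

```latex
\hat{w}(\lambda) \in \arg\min_{w \in \mathbb{R}^{d}}
  \frac{1}{2n} \lVert y - X w \rVert_2^2
  + \sum_{j=1}^{d} \lambda_j \lvert w_j \rvert,
\qquad \lambda_j \ge 0 .
```

The first term is the least-squares fit and the second is the sparsity-promoting penalty. Setting all $\lambda_j$ to the same value recovers the ordinary Lasso with a single hyperparameter.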

Each hyperparameter is bounded to the interval [-1, 1].
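
Optimizers that propose points by perturbing an existing configuration can step outside these bounds. A minimal sketch of projecting a proposal back into the box is given here, using plain Python lists so that it has no dependencies. The helper names `clip_config` and `perturb` are hypothetical and are not part of LassoBench.

```python
import random

LOWER, UPPER = -1.0, 1.0  # search-space bounds stated in this README


def clip_config(config, lower=LOWER, upper=UPPER):
    """Project each coordinate of a configuration back into [lower, upper]."""
    return [min(max(x, lower), upper) for x in config]


def perturb(config, scale, rng):
    """Propose a neighbour by adding Gaussian noise, then clip to the box."""
    proposal = [x + rng.gauss(0.0, scale) for x in config]
    return clip_config(proposal)


rng = random.Random(0)
d = 60  # dimensionality of synt_simple according to the benchmark table
start = [rng.uniform(LOWER, UPPER) for _ in range(d)]
neighbour = perturb(start, scale=0.5, rng=rng)
```

Clipping is only one way of handling the bounds; rejection or reflection at the boundary are common alternatives.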

LassoBench comes with two classes, SyntheticBenchmark and RealBenchmark. RealBenchmark is based on real-world applications from medicine and finance, while SyntheticBenchmark covers well-defined synthetic conditions. The user can select one of the predefined synthetic benchmarks or create a different benchmark. Synthetic benchmarks are noiseless by default (noise=False).

Each benchmark exposes .evaluate, which evaluates the objective function, and .test, which returns post-processing metrics (such as the MSE on the test data and, for synthetic benchmarks, the F-score). The constructor argument mf_opt selects the multi-fidelity framework, which is then evaluated via .fidelity_evaluate.
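
To make the two metrics named above concrete, here is a small dependency-free sketch of a mean squared error and of an F-score computed on the support (the set of nonzero coefficients) of a true and an estimated coefficient vector. Whether .test uses exactly these definitions is an assumption; the function names and the toy vectors are made up for illustration.

```python
def mse(y_true, y_pred):
    """Mean squared error between two equally long sequences."""
    return sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true)


def support_f_score(w_true, w_est, tol=0.0):
    """F1 score between the nonzero supports of two coefficient vectors."""
    true_support = {j for j, w in enumerate(w_true) if abs(w) > tol}
    est_support = {j for j, w in enumerate(w_est) if abs(w) > tol}
    true_positives = len(true_support & est_support)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(est_support)
    recall = true_positives / len(true_support)
    return 2 * precision * recall / (precision + recall)


# Toy vectors (illustrative only): two of three true features are recovered,
# and one irrelevant feature is selected by mistake.
w_true = [1.5, 0.0, 0.0, -2.0, 0.0, 0.7]
w_est = [1.2, 0.0, 0.3, -1.8, 0.0, 0.0]
f_score = support_f_score(w_true, w_est)
test_error = mse([1.0, 2.0, 3.0], [1.0, 2.5, 2.0])
```

An F-score of 1 means the estimated support matches the true one exactly, which is only checkable on synthetic problems where the true coefficients are known.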

Simple experiments are provided in example/example.py, where you can run random search on different benchmarks. The folder experiments/ contains the results reported in our reference.
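
The general shape of a random search over the [-1, 1] box can be sketched as follows. This is not the content of example/example.py: `random_search` and `toy_objective` are hypothetical names, and the toy loss stands in for a benchmark's .evaluate so that the sketch runs without LassoBench installed. With a real benchmark, the sampled list would presumably be converted to a NumPy array first, as in the snippets in this README.

```python
import random


def random_search(objective, dim, n_iter, seed=0):
    """Sample n_iter configurations uniformly from [-1, 1]^dim, keep the best."""
    rng = random.Random(seed)
    best_config, best_loss = None, float("inf")
    history = []
    for _ in range(n_iter):
        config = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        loss = objective(config)
        history.append(loss)
        if loss < best_loss:
            best_config, best_loss = config, loss
    return best_config, best_loss, history


def toy_objective(config):
    """Placeholder loss in which only the first three coordinates matter."""
    return sum((x - 0.5) ** 2 for x in config[:3])


best_config, best_loss, history = random_search(toy_objective, dim=60, n_iter=200)
```

Keeping the full `history` makes it easy to plot the best loss found so far against the number of evaluations when comparing optimizers.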

LassoBench baselines

LassoBench comes with baselines commonly used in the Lasso community. They are listed in the table and should be used for comparison.

LassoCV and AdaptiveLassoCV are the Lasso-based baselines, in which a single hyperparameter is optimized via grid search. The user can change the number of points in the grid following the provided documentation; the default is 100 points. The implementation of the Lasso-based baselines is derived from Celer.

Sparse-HO is a sparse hyperparameter optimizer based on coordinate descent. It applies to the 1D Lasso problem as well as to the Weighted Lasso problem. The user can change the number of steps, the gradient solver, and similar settings following the provided documentation.
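
The idea of a grid search over a single penalty can be illustrated without any solver. The sketch below is not the Celer-based implementation: it assumes an orthonormal design, for which the Lasso solution under the objective 1/2 ||y - Xw||^2 + lam ||w||_1 reduces to soft-thresholding each entry of z = X^T y. The grid range, the toy data and the use of a second noisy copy of z as validation data are illustrative assumptions; only the 100-point grid size comes from the text above.

```python
import math


def soft_threshold(z, lam):
    """Closed-form Lasso solution for one coordinate under an orthonormal design."""
    return math.copysign(max(abs(z) - lam, 0.0), z)


def log_grid(lo, hi, n_points=100):
    """Return n_points values spaced evenly on a log scale from lo to hi."""
    step = (math.log(hi) - math.log(lo)) / (n_points - 1)
    return [math.exp(math.log(lo) + i * step) for i in range(n_points)]


def grid_search(z_train, z_valid, grid):
    """Pick the penalty whose thresholded estimate best matches held-out data."""
    best_lam, best_err = None, float("inf")
    for lam in grid:
        w = [soft_threshold(z, lam) for z in z_train]
        err = sum((wj - zj) ** 2 for wj, zj in zip(w, z_valid)) / len(w)
        if err < best_err:
            best_lam, best_err = lam, err
    return best_lam, best_err


# Toy data: two strong coefficients and four entries that are mostly noise.
z_train = [3.1, -0.2, 0.1, -2.8, 0.3, -0.1]
z_valid = [2.9, 0.1, -0.2, -3.2, -0.1, 0.2]
grid = log_grid(1e-3, 1.0)
best_lam, best_err = grid_search(z_train, z_valid, grid)
```

A single penalty has to shrink the strong and the weak coordinates by the same amount, which is the limitation that the per-feature penalties of the Weighted Lasso are meant to address.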

Baseline	Status	Description	Command
LassoCV	Included	Standard 1D sparse regression approach	`.run_LASSOCV`
AdaptiveLassoCV	Not yet implemented; until then, see the "adaptivelassocv" branch of https://github.com/mathurinm/celer	Iterative LassoCV approach	N/A
Sparse-HO	Included	Sparse multi-dimensional optimizer	`.run_sparseho`

HPO Methods

The folder example/ shows how to use LassoBench with some well-known HPO algorithms for high-dimensional problems: hesbo_example.py, cma_example.py, turbo_example.py and alebo_example.py. Please refer to the docstrings and the table for more details.

HPO Methods	Install	Description	File
HeSBO	See the HeSBO library	Bayesian Optimization with dimensionality reduction	`hesbo_example.py`
ALEBO	Install prerequisites	Bayesian Optimization with dimensionality reduction	`alebo_example.py`
CMA-ES	`python -m pip install cma`	Evolutionary Strategy	`cma_example.py`
TuRBO	Follow https://github.com/uber-research/TuRBO	Local Bayesian Optimization	`turbo_example.py`
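
To give an idea of what "dimensionality reduction" means for a box-constrained search space, the sketch below shows a hashing-style embedding of the kind used by HeSBO: every high-dimensional coordinate copies one low-dimensional coordinate, possibly with a flipped sign. This is an illustrative reading of that idea, not code taken from hesbo_example.py or from the HeSBO library; the function name and the choice of 4 low dimensions are assumptions.

```python
import random


def make_hashing_embedding(high_dim, low_dim, seed=0):
    """Build a map from [-1, 1]^low_dim to [-1, 1]^high_dim."""
    rng = random.Random(seed)
    # Each high-dimensional coordinate is tied to one low-dimensional bucket...
    bucket = [rng.randrange(low_dim) for _ in range(high_dim)]
    # ...and copies its value with a random sign.
    sign = [rng.choice((-1.0, 1.0)) for _ in range(high_dim)]

    def embed(low_point):
        return [sign[i] * low_point[bucket[i]] for i in range(high_dim)]

    return embed, bucket, sign


embed, bucket, sign = make_hashing_embedding(high_dim=60, low_dim=4)
low_point = [0.9, -0.3, 0.0, -1.0]
high_point = embed(low_point)  # a valid 60-dimensional configuration
```

Because each output coordinate is a signed copy of an input coordinate, a low-dimensional point inside [-1, 1] always maps to a high-dimensional point inside [-1, 1], so no clipping is needed.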

License

LassoBench is distributed under the MIT license. See the LICENSE file in the repository for details.

Simple noiseless synthetic bench code

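import numpy as np
import LassoBench

# Load the predefined 60-dimensional synthetic benchmark (noiseless by default)
synt_bench = LassoBench.SyntheticBenchmark(pick_bench='synt_simple')
d = synt_bench.n_features  # one hyperparameter per feature
# Sample a random configuration from the search space [-1, 1]^d
random_config = np.random.uniform(low=-1.0, high=1.0, size=(d,))
# Evaluate the objective at that configuration
loss = synt_bench.evaluate(random_config)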

Simple noisy synthetic bench code

import numpy as np
import LassoBench

# noise=True selects the noisy variant of the same benchmark
synt_bench = LassoBench.SyntheticBenchmark(pick_bench='synt_simple', noise=True)
d = synt_bench.n_features
random_config = np.random.uniform(low=-1.0, high=1.0, size=(d,))
loss = synt_bench.evaluate(random_config)

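Which hyperparameters are important in synthetic benchmarks?

import numpy as np
import LassoBench

synt_bench = LassoBench.SyntheticBenchmark(pick_bench='synt_simple', noise=True)
# True regression coefficients of the synthetic problem
true_reg_coef = synt_bench.w_true
# Indices of the nonzero coefficients, i.e. the important hyperparameters
hpo_important = np.argwhere(true_reg_coef != 0)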
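
The same selection can be illustrated on a hand-made coefficient vector, without loading a benchmark. The values and positions below are invented for illustration and are not the coefficients of synt_simple; the assumption that the number of nonzero coefficients corresponds to the "Axis-aligned Subspace" column of the synthetic benchmark table is also a reading of this README rather than something it states.

```python
def important_indices(w_true):
    """Return the positions of the nonzero entries of a coefficient vector."""
    return [j for j, w in enumerate(w_true) if w != 0]


# Toy 60-dimensional vector with three nonzero coefficients (made-up values).
toy_w_true = [0.0] * 60
for j, value in ((4, 1.3), (17, -0.8), (42, 2.1)):
    toy_w_true[j] = value

important = important_indices(toy_w_true)
share = len(important) / len(toy_w_true)  # fraction of dimensions that matter
```

Note that np.argwhere applied to a one-dimensional array returns an array of shape (k, 1); np.flatnonzero returns the same indices as a flat array, which is often more convenient for indexing.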

Real-world bench code

import numpy as np
import LassoBench

# Load the real-world RCV1 benchmark
real_bench = LassoBench.RealBenchmark(pick_data='RCV1')
d = real_bench.n_features
random_config = np.random.uniform(low=-1.0, high=1.0, size=(d,))
loss = real_bench.evaluate(random_config)

Multi-information source bench code

import numpy as np
import LassoBench

# mf_opt='discrete_fidelity' enables the multi-fidelity interface
real_bench_mf = LassoBench.RealBenchmark(pick_data='RCV1', mf_opt='discrete_fidelity')
d = real_bench_mf.n_features
random_config = np.random.uniform(low=-1.0, high=1.0, size=(d,))
# Index of the fidelity level to evaluate
fidelity_pick = 0
loss = real_bench_mf.fidelity_evaluate(random_config, index_fidelity=fidelity_pick)
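
One common way to exploit several fidelity levels is to screen many configurations at a cheap level and re-evaluate only the most promising ones at an expensive level. The sketch below shows that pattern with a stand-in function. Everything about the fidelity levels here is an assumption: this README does not state how many levels exist or which index is the cheapest, so `toy_fidelity_evaluate`, the five levels, and the convention that a higher index is more accurate are all hypothetical.

```python
import math
import random

N_FIDELITIES = 5  # hypothetical number of levels, not taken from LassoBench


def toy_fidelity_evaluate(config, index_fidelity):
    """Stand-in loss: exact at the top level, increasingly biased below it."""
    exact = sum(x * x for x in config[:3])
    bias_scale = 0.5 * (N_FIDELITIES - 1 - index_fidelity) / (N_FIDELITIES - 1)
    return exact + bias_scale * math.sin(37.0 * sum(config))


def screen_then_refine(evaluate, configs, low, high, keep):
    """Rank all configs at fidelity `low`, re-score the best `keep` at `high`."""
    ranked = sorted(configs, key=lambda c: evaluate(c, low))
    survivors = ranked[:keep]
    scored = [(evaluate(c, high), c) for c in survivors]
    best_loss, best_config = min(scored, key=lambda pair: pair[0])
    return best_loss, best_config


rng = random.Random(1)
configs = [[rng.uniform(-1.0, 1.0) for _ in range(10)] for _ in range(20)]
best_loss, best_config = screen_then_refine(
    toy_fidelity_evaluate, configs, low=0, high=N_FIDELITIES - 1, keep=5
)
```

With a real benchmark, `evaluate` would wrap .fidelity_evaluate and pass the second argument as index_fidelity, after checking in the docstrings how the indices are ordered.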

List of synthetic benchmarks

Name	Dimensionality	Axis-aligned Subspace
synt_simple	60	3
synt_medium	100	5
synt_high	300	15
synt_hard	1000	50

List of real world benchmarks

Name	Dimensionality	Approx. Axis-aligned Subspace
Breast_cancer	10	3
Diabetes	8	5
Leukemia	7 129	22
DNA	180	43
RCV1	19 959	75

Cite

If you use this code, please cite:


Šehić Kenan, Gramfort Alexandre, Salmon Joseph and Nardi Luigi, "LassoBench: A High-Dimensional Hyperparameter Optimization Benchmark Suite for Lasso", Proceedings of the 1st International Conference on Automated Machine Learning, 2022.
