
hpobenchexperimentutils's Introduction

HPOBenchExperimentUtils

A small tool to easily run different optimizers on HPOBench benchmarks with the same settings. For each run, HPOBenchExperimentUtils extracts a runhistory as well as a trajectory.

Running a benchmark

An HPO run can be started either from within Python:

from HPOBenchExperimentUtils import run_benchmark
run_benchmark(optimizer='hpbandster_bohb_eta_3',
              benchmark='cartpolereduced',
              output_dir='path/to/output',
              rng=0)

or by using the commandline:

python run_benchmark.py --output_dir path/to/output \
                        --optimizer smac_hb_eta_2 \
                        --benchmark xgboost \
                        --task_id 167083 \
                        --rng 1

The tool automatically saves the evaluated configurations as well as the observed trajectory. Both files are stored in output_dir/<optimizer_string>-run-. Each line in both files is a JSON dict storing information about the evaluated configuration.

Note that in both cases you can pass benchmark-specific parameters to the call; here, the xgboost benchmark takes an OpenML task id. Please take a look at the benchmarks in HPOBench. By default, the containerized version of the benchmark is used, which requires Singularity 3.5. You can use locally installed benchmarks by adding use_local=True to the function call.

Validating configurations

The HPOBenchExperimentUtils tool also validates previously found trajectories. Validating means running each configuration again, but this time on the benchmark's test objective function with the highest budget. This step can take a long time.

The tool reads all configurations found in the specified path and validates them.

Call the validation function either from code:

from HPOBenchExperimentUtils import validate_benchmark
validate_benchmark(benchmark='cartpolereduced',
                   output_dir='path/to/output',
                   rng=0)

... or from the commandline:

python validate_benchmark.py --output_dir path/to/output \
                             --benchmark xgboost \
                             --task_id 167083 \
                             --rng 1

The validated trajectory is automatically saved in human-readable form to the output directory.

Settings

The benchmarks' settings are predefined in the file benchmark_settings.yaml. The settings for the optimizers, including time limits and cutoff times, are defined in the file optimizer_settings.yaml.

Available Optimizer settings

Optimizer                   Available options
SMAC - Hyperband            smac_hb_eta_1, smac_hb_eta_2_learna, smac_hb_eta_2, smac_hb_eta_3
SMAC - Successive Halving   smac_sh_eta_1, smac_sh_eta_2_learna, smac_sh_eta_2, smac_sh_eta_3
HpBandSter - BOHB           hpbandster_bohb_eta_2_learna, hpbandster_bohb_eta_2, hpbandster_bohb_eta_3
HpBandSter - Random Search  hpbandster_rs_eta_2_learna, hpbandster_rs_eta_2, hpbandster_rs_eta_3
HpBandSter - Hyperband      hpbandster_hb_eta_2_learna, hpbandster_hb_eta_2, hpbandster_hb_eta_3
HpBandSter - H2BO           hpbandster_h2bo_eta_2_learna, hpbandster_h2bo_eta_2, hpbandster_h2bo_eta_3
Dragonfly                   dragonfly_default, dragonfly_realtime

Available Benchmarks:

Benchmark                                       benchmark token
Cartpole - Full search space                    cartpolefull
Cartpole - Reduced search space                 cartpolereduced
Learna                                          learna
MetaLearna                                      metalearna
NasBench101 - Cifar10A                          NASCifar10ABenchmark
NasBench101 - Cifar10B                          NASCifar10BBenchmark
NasBench101 - Cifar10C                          NASCifar10CBenchmark
TabularBenchmarks - Naval Propulsion            NavalPropulsionBenchmark
TabularBenchmarks - Parkinsons Telemonitoring   ParkinsonsTelemonitoringBenchmark
TabularBenchmarks - Protein Structure           ProteinStructureBenchmark
TabularBenchmarks - Slice Localization          SliceLocalizationBenchmark
XGBoost Benchmark                               xgboost

How to contribute:

New Benchmark Settings:

If you want to add a new benchmark setting, add a YAML-conform entry to benchmark_settings.yaml.

Possible options are:

xgboost:
  # Mandatory options:
  # ##################
  time_limit_in_s: 4000
  cutoff_in_s: 1800
  mem_limit_in_mb: 4000
  
  # Import location in HPOBench
  import_from: ml.xgboost_benchmark
  import_benchmark: XGBoostBenchmark
  

  # Optional options: (Only need to be specified if used)
  # ####################
  # If the benchmark has multiple fidelities, you can specify a main fidelity. It is then used by the
  # SingleFidelityOptimizer.
  main_fidelity: subsample

  # If the benchmark is a surrogate or tabular benchmark (e.g. NasBench201), please set this option
  # to true. By default, this option is set to false. It changes the remaining-budget calculation in
  # the bookkeeper.
  is_surrogate: true
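Both settings files can be parsed with any YAML loader. A minimal sketch using PyYAML (assuming PyYAML is installed; the tool's actual loading code may differ):

```python
import yaml  # PyYAML

# Inline copy of a settings entry, for illustration only.
SETTINGS = """
xgboost:
  time_limit_in_s: 4000
  cutoff_in_s: 1800
  mem_limit_in_mb: 4000
  import_from: ml.xgboost_benchmark
  import_benchmark: XGBoostBenchmark
  is_surrogate: false
"""

def load_settings(text):
    """Parse a settings YAML document into a dict of benchmark -> options."""
    return yaml.safe_load(text)

settings = load_settings(SETTINGS)["xgboost"]
```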

New Optimizer Settings:

Analogously to the benchmark settings, you can add a new optimizer setting to the optimizer_settings.yaml.

# The name of the optimizer setting can be chosen freely.
hpbandster_hb_eta_3_test:
  # Specifies the optimizer to use. See table below for supported optimizers
  optimizer: hpbandster_hb
  
  # Optimizer dependent options:
  # ############################
  eta: 3

Available Optimizers:

Optimizer                   optimizer string
SMAC - Hyperband            smac_hb
SMAC - Successive Halving   smac_sh
HpBandSter - BOHB           hpbandster_bohb
HpBandSter - Random Search  hpbandster_rs
HpBandSter - Hyperband      hpbandster_hb
HpBandSter - H2BO           hpbandster_h2bo

Add new optimizer:

  • Inherit from the Base Optimizer
  • Implement the run method.
  • Add an optimizer setting to the optimizer_settings.yaml as described above. It's as simple as that 😉
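The steps above might look like the following minimal sketch. The Optimizer base class here is a simplified stand-in; the actual Base Optimizer in this repository has a different interface:

```python
import random

class Optimizer:
    """Stand-in for the repo's base optimizer class (illustrative only)."""
    def __init__(self, benchmark, settings, rng=0):
        self.benchmark = benchmark  # callable: configuration -> loss
        self.settings = settings
        self.rng = random.Random(rng)

    def run(self):
        raise NotImplementedError

class RandomSearchOptimizer(Optimizer):
    """Evaluates randomly sampled configurations and returns the best one."""
    def run(self):
        results = []
        for _ in range(self.settings.get("num_iterations", 10)):
            config = {"x": self.rng.uniform(0.0, 1.0)}  # toy 1-D search space
            results.append((config, self.benchmark(config)))
        return min(results, key=lambda t: t[1])
```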

Some optimizer-specific settings:

Optimizer   Setting name        Description
Dragonfly   init_iter_per_dim   An integer N such that, given that the benchmark's configuration space has D dimensions, N*D iterations will be used to randomly sample configurations to warm-start the optimizer. Makes Dragonfly use an internal budget type of 'num_evals'.
Dragonfly   init_capital_frac   A value f in the closed interval [0, 1] such that, given that a benchmark specifies a time limit of T seconds, f * T seconds will be used for initialization. Only comes into effect when 'init_iter_per_dim' is not given. Switches Dragonfly's internal budget type to 'realtime'.
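The two budget computations described above can be illustrated with a small hypothetical helper (not part of the tool's API):

```python
def dragonfly_init_budget(dims, init_iter_per_dim=None,
                          init_capital_frac=None, time_limit_in_s=None):
    """Illustrative budget arithmetic for the two Dragonfly init options.

    Returns ('num_evals', N*D) if init_iter_per_dim is given, otherwise
    ('realtime', f*T) using init_capital_frac and the benchmark time limit.
    """
    if init_iter_per_dim is not None:
        # N iterations per dimension of the configuration space
        return "num_evals", init_iter_per_dim * dims
    if init_capital_frac is not None and time_limit_in_s is not None:
        # fraction f of the benchmark's time limit T
        return "realtime", init_capital_frac * time_limit_in_s
    raise ValueError("No initialization budget specified")
```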

hpobenchexperimentutils's People

Contributors

keggensperger, mfeurer, neochaos12, phmueller


hpobenchexperimentutils's Issues

OSError: [Errno 39] Directory not empty: 'attribute_lock'

When running autogluon on some benchmarks, the following error occurs at the end of the optimization procedure (unfortunately before the trajectory is rewritten):

See also the complete log here:
run_NAS1SHOT1_autogluon_32_errlog.txt
run_NAS1SHOT1_autogluon_32.cmd_out.txt

@PhMueller: Can we fix this or safely try/except this error since the optimization completed?

[INFO] autogluon.core.searcher.bayesopt.tuning_algorithms.bo_algorithm at 2021-03-21 16:57:35,315 --- BO Algorithm: Selecting final set of candidates.
Exception ignored in: <function Bookkeeper.__del__ at 0x7fb3a8c30dd0>                                                                                                                                       
Traceback (most recent call last):
  File "/home/eggenspk/2020_Hpolib2/HPOBenchExperimentUtils/HPOBenchExperimentUtils/core/bookkeeper.py", line 328, in __del__    
    shutil.rmtree(self.lock_dir)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/shutil.py", line 494, in rmtree
    _rmtree_safe_fd(fd, path, onerror)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/shutil.py", line 436, in _rmtree_safe_fd
    onerror(os.rmdir, fullname, sys.exc_info())                                                       
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/shutil.py", line 434, in _rmtree_safe_fd
    os.rmdir(entry.name, dir_fd=topfd)                                                                
OSError: [Errno 39] Directory not empty: 'attribute_lock'                             
Exception ignored in: <function Bookkeeper.__del__ at 0x7fb3a8c30dd0>                                                                                                                                       
Traceback (most recent call last):
  File "/home/eggenspk/2020_Hpolib2/HPOBenchExperimentUtils/HPOBenchExperimentUtils/core/bookkeeper.py", line 328, in __del__
    shutil.rmtree(self.lock_dir)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/shutil.py", line 498, in rmtree                                
    onerror(os.rmdir, path, sys.exc_info())
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/shutil.py", line 496, in rmtree                               
    os.rmdir(path) 
OSError: [Errno 39] Directory not empty: '/home/eggenspk/2020_Hpolib2/HPOBenchExperimentUtils/exp_outputs/NASBench1shot1SearchSpace1Benchmark/autogluon/run-1/lock_dir'
[ERROR] autogluon.core.scheduler.hyperband at 2021-03-21 16:57:58,288 --- Traceback (most recent call last):
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/multiprocessing/managers.py", line 811, in _callmethod                                                                              
    conn = self._tls.connection
AttributeError: 'ForkAwareLocal' object has no attribute 'connection'  

During handling of the above exception, another exception occurred:
Traceback (most recent call last):
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/site-packages/autogluon/core/utils/custom_process.py", line 16, in run
    mp.Process.run(self)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/multiprocessing/process.py", line 99, in run
    self._target(*self._args, **self._kwargs)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/site-packages/autogluon/core/scheduler/scheduler.py", line 157, in _worker
    ret = fn(**args)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/site-packages/autogluon/core/decorator.py", line 60, in __call__
    output = self.f(args, **kwargs)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/site-packages/autogluon/core/decorator.py", line 143, in wrapper_call
    return func(*args, **kwargs)
  File "/home/eggenspk/2020_Hpolib2/HPOBenchExperimentUtils/HPOBenchExperimentUtils/optimizer/autogluon_optimizer.py", line 150, in objective_function
    **self.settings_for_sending)
  File "/home/eggenspk/2020_Hpolib2/HPOBenchExperimentUtils/HPOBenchExperimentUtils/core/bookkeeper.py", line 40, in wrapped
    self.increase_total_tae_used(1)
  File "/home/eggenspk/2020_Hpolib2/HPOBenchExperimentUtils/HPOBenchExperimentUtils/core/bookkeeper.py", line 290, in increase_total_tae_used
    self.total_tae_calls_proxy.value = self.total_tae_calls_proxy.value + total_tae_used
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/multiprocessing/managers.py", line 1138, in get
    return self._callmethod('get')
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/multiprocessing/managers.py", line 815, in _callmethod
    self._connect()
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/multiprocessing/managers.py", line 802, in _connect
    conn = self._Client(self._token.address, authkey=self._authkey)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/multiprocessing/connection.py", line 492, in Client
    c = SocketClient(address)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/multiprocessing/connection.py", line 620, in SocketClient
    s.connect(address)
FileNotFoundError: [Errno 2] No such file or directory
NoneType: None
Traceback (most recent call last):
  File ".//HPOBenchExperimentUtils/run_benchmark.py", line 195, in <module>
    run_benchmark(**vars(args), **benchmark_params) 
  File ".//HPOBenchExperimentUtils/run_benchmark.py", line 157, in run_benchmark
    and not tae_exceeds_limit(benchmark.get_total_tae_used(), settings['tae_limit']) \
  File "/home/eggenspk/2020_Hpolib2/HPOBenchExperimentUtils/HPOBenchExperimentUtils/core/bookkeeper.py", line 251, in get_total_tae_used
    with lock:
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/contextlib.py", line 112, in __enter__
    return next(self.gen)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/site-packages/oslo_concurrency/lockutils.py", line 270, in lock
    ext_lock.acquire(delay=delay)
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/site-packages/fasteners/process_lock.py", line 156, in acquire
    self._do_open()
  File "/home/eggenspk/miniconda3CLUSTER/envs/hpobench_37/lib/python3.7/site-packages/fasteners/process_lock.py", line 128, in _do_open
    self.lockfile = open(self.path, 'a')
FileNotFoundError: [Errno 2] No such file or directory: b'/home/eggenspk/2020_Hpolib2/HPOBenchExperimentUtils/exp_outputs/NASBench1shot1SearchSpace1Benchmark/autogluon/run-1/lock_dir/attribute_lock/attribute_lock'

Some confusion about the best-known values.

# for k in keys:
# t1 = a.data[(777, "valid_acc1es")][k][199]
# t2 = a.data[(888, "valid_acc1es")][k][199]
# t3 = a.data[(999, "valid_acc1es")][k][199]
# te.append(float(100 - np.mean([t1, t2, t3])))
# v1 = a.data[(777, "test_acc1es")][k]
# v2 = a.data[(888, "test_acc1es")][k]
# v3 = a.data[(999, "test_acc1es")][k]
# ve.append(float(100 - np.mean([v1, v2, v3])))
# best_test = np.min(te)
# best_valid = np.min(ve)
# print(b, best_test, best_valid)

I wonder why valid_acc1es is used to calculate best_test, while test_acc1es is used to calculate best_valid?

Move optimizer libraries to extra requirements

Given the nature of the repo, it should be possible (and well worth the effort) to structure the code so that there is no need to install all supported optimizers in order to run it; only the ones that are actually needed would have to be installed. This is of particular relevance now that I'm including Dragonfly code in the development branch. I believe it should suffice to shift the respective imports into the respective 'run' methods.
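The suggested deferred-import pattern could look like this; the class and its constructor are illustrative names, not code from this repository:

```python
import importlib

class LazyImportOptimizer:
    """Sketch of deferring a heavy optional dependency to run() time.

    'module_name' stands in for an optimizer library such as dragonfly;
    importing it only inside run() lets the rest of the package work
    without the library installed.
    """

    def __init__(self, module_name):
        self.module_name = module_name

    def run(self):
        try:
            module = importlib.import_module(self.module_name)
        except ImportError as err:
            raise ImportError(
                f"Running this optimizer requires '{self.module_name}'."
            ) from err
        return module  # real code would start the optimization here
```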

Save error and stack traces to file

Since we have now shifted to a multi-processing style setup for running the benchmarks, it makes a lot of sense to save stack traces and error logs to disk for each process launched by the bookkeeper. A step towards this has been taken in f67c00b, which caught a very important and common error during JSON serialization: in the cases that I was testing, the results generated by the benchmark could not be successfully serialized due to reasons which will need to be further investigated.

This error could not be caught and debugged normally because dragonfly itself calls the objective from within an asynchronous sub-process and has very poor error handling. I imagine it might devolve into a recurring issue where raised errors fly under the radar because the stdout and stderr of the currently executing process don't necessarily receive outputs from all running sub-processes. Stack traces are of particular importance here. At the very least, we should decide upon some conventions on how to add new stack traces as and where the need arises.

Python's traceback module seems very promising and easy to use for our purposes.

Path resolution into absolute paths

@PhMueller
I encountered this while working on the dragonfly branch, but we should make sure that any Path objects that we construct and that are expected to stay alive for more than a few lines of code are resolved into absolute paths as soon as they are constructed. This becomes very relevant when optimizers, which are technically black-boxes as far as the scope of this repo is concerned, do directory magic which may or may not be context-safe. Case in point, dragonfly uses file-based multi-process communication, and thus is going to be run in the /tmp directory so it doesn't cause other issues. If it is passed any relative paths from HPOlibExperimentUtils, or fails to return to the old working directory, bad things could happen.

I already did so for the output directory here: b4491c8
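The proposed eager resolution could be as simple as the following helper (the name is illustrative):

```python
from pathlib import Path

def make_absolute(p):
    """Resolve a possibly-relative path into an absolute one immediately,
    so later working-directory changes (e.g. by an optimizer running in
    /tmp) cannot change what the path refers to."""
    return Path(p).expanduser().resolve()
```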

Post-timeout hook/event

Proposal: Once the process running the optimizer is terminated externally, there should be a standardized way of handling optimizer shut-down procedures, if needed.

Rationale: This is the equivalent of having an explicit object destructor and carries with it almost all of the same rationales.
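One possible shape for such a hook, using a SIGTERM handler to run optimizer-specific cleanup before exit (the class and its wiring are hypothetical, not existing code in this repository):

```python
import signal

class GracefulShutdown:
    """Sketch of a post-timeout hook: when the process is terminated
    externally via SIGTERM, run a registered cleanup callback first."""

    def __init__(self, cleanup):
        self.cleanup = cleanup
        # Install the handler; must be called from the main thread.
        signal.signal(signal.SIGTERM, self._handle)

    def _handle(self, signum, frame):
        self.cleanup()       # optimizer shut-down procedure
        raise SystemExit(0)  # then exit as the external kill intended
```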
