import os
from math import inf
import warnings
"ignore") warnings.filterwarnings(
31 Hyperparameter Tuning with spotpython
and PyTorch
Lightning for the Diabetes Data Set
In this section, we will show how spotpython
can be integrated into the PyTorch
Lightning training workflow for a regression task. It demonstrates how easy it is to use spotpython
to tune hyperparameters for a PyTorch
Lightning model.
31.1 The Basic Setting
After importing the necessary libraries, the fun_control
dictionary is set up via the fun_control_init
function. The fun_control
dictionary contains
PREFIX
: a unique identifier for the experimentfun_evals
: the number of function evaluationsmax_time
: the maximum run time in minutesdata_set
: the data set. Here we use theDiabetes
data set that is provided byspotpython
.core_model_name
: the class name of the neural network model. This neural network model is provided byspotpython
.hyperdict
: the hyperparameter dictionary. This dictionary is used to define the hyperparameters of the neural network model. It is also provided byspotpython
._L_in
: the number of input features. Since theDiabetes
data set has 10 features,_L_in
is set to 10._L_out
: the number of output features. Since we want to predict a single value,_L_out
is set to 1.
The HyperLight
class is used to define the objective function fun
. It connects the PyTorch
and the spotpython
methods and is provided by spotpython
.
from spotpython.data.diabetes import Diabetes
from spotpython.hyperdict.light_hyper_dict import LightHyperDict
from spotpython.fun.hyperlight import HyperLight
from spotpython.utils.init import (fun_control_init, surrogate_control_init, design_control_init)
from spotpython.utils.eda import print_exp_table, print_res_table
from spotpython.spot import Spot
from spotpython.utils.file import get_experiment_filename
="601"
PREFIX
= Diabetes()
data_set
= fun_control_init(
fun_control =PREFIX,
PREFIX=inf,
fun_evals=1,
max_time= data_set,
data_set ="light.regression.NNLinearRegressor",
core_model_name=LightHyperDict,
hyperdict=10,
_L_in=1)
_L_out
= HyperLight().fun fun
module_name: light
submodule_name: regression
model_name: NNLinearRegressor
The method set_hyperparameter
allows the user to modify default hyperparameter settings. Here we modify some hyperparameters to keep the model small and to decrease the tuning time.
from spotpython.hyperparameters.values import set_hyperparameter
"optimizer", [ "Adadelta", "Adam", "Adamax"])
set_hyperparameter(fun_control, "l1", [3,4])
set_hyperparameter(fun_control, "epochs", [3,7])
set_hyperparameter(fun_control, "batch_size", [4,11])
set_hyperparameter(fun_control, "dropout_prob", [0.0, 0.025])
set_hyperparameter(fun_control, "patience", [2,3])
set_hyperparameter(fun_control,
= design_control_init(init_size=10)
design_control print_exp_table(fun_control)
| name | type | default | lower | upper | transform |
|----------------|--------|-----------|---------|---------|-----------------------|
| l1 | int | 3 | 3 | 4 | transform_power_2_int |
| epochs | int | 4 | 3 | 7 | transform_power_2_int |
| batch_size | int | 4 | 4 | 11 | transform_power_2_int |
| act_fn | factor | ReLU | 0 | 5 | None |
| optimizer | factor | SGD | 0 | 2 | None |
| dropout_prob | float | 0.01 | 0 | 0.025 | None |
| lr_mult | float | 1.0 | 0.1 | 10 | None |
| patience | int | 2 | 2 | 3 | transform_power_2_int |
| batch_norm | factor | 0 | 0 | 1 | None |
| initialization | factor | Default | 0 | 4 | None |
Finally, a Spot
object is created. Calling the method run()
starts the hyperparameter tuning process.
= Spot(fun=fun,fun_control=fun_control, design_control=design_control)
S S.run()
Experiment saved to 601_exp.pkl
train_model result: {'val_loss': 23075.166015625, 'hp_metric': 23075.166015625}
train_model result: {'val_loss': 3600.9345703125, 'hp_metric': 3600.9345703125}
train_model result: {'val_loss': 4820.02490234375, 'hp_metric': 4820.02490234375}
train_model result: {'val_loss': 24072.45703125, 'hp_metric': 24072.45703125}
train_model result: {'val_loss': 22559.53515625, 'hp_metric': 22559.53515625}
train_model result: {'val_loss': 4145.4853515625, 'hp_metric': 4145.4853515625}
train_model result: {'val_loss': 20313.955078125, 'hp_metric': 20313.955078125}
train_model result: {'val_loss': 4210.201171875, 'hp_metric': 4210.201171875}
train_model result: {'val_loss': 20617.552734375, 'hp_metric': 20617.552734375}
train_model result: {'val_loss': 23311.947265625, 'hp_metric': 23311.947265625}
train_model result: {'val_loss': 3265.11962890625, 'hp_metric': 3265.11962890625}
spotpython tuning: 3265.11962890625 [----------] 2.27%
train_model result: {'val_loss': 24083.953125, 'hp_metric': 24083.953125}
spotpython tuning: 3265.11962890625 [----------] 3.94%
train_model result: {'val_loss': 4196.44921875, 'hp_metric': 4196.44921875}
spotpython tuning: 3265.11962890625 [#---------] 5.66%
train_model result: {'val_loss': 4332.9111328125, 'hp_metric': 4332.9111328125}
spotpython tuning: 3265.11962890625 [#---------] 8.08%
train_model result: {'val_loss': 3560.48828125, 'hp_metric': 3560.48828125}
spotpython tuning: 3265.11962890625 [#---------] 9.47%
train_model result: {'val_loss': 3284.0498046875, 'hp_metric': 3284.0498046875}
spotpython tuning: 3265.11962890625 [#---------] 11.30%
train_model result: {'val_loss': 3254.800048828125, 'hp_metric': 3254.800048828125}
spotpython tuning: 3254.800048828125 [#---------] 12.92%
train_model result: {'val_loss': 12218.865234375, 'hp_metric': 12218.865234375}
spotpython tuning: 3254.800048828125 [##--------] 20.24%
train_model result: {'val_loss': 3224.18701171875, 'hp_metric': 3224.18701171875}
spotpython tuning: 3224.18701171875 [##--------] 22.13%
train_model result: {'val_loss': 3383.2724609375, 'hp_metric': 3383.2724609375}
spotpython tuning: 3224.18701171875 [##--------] 23.76%
train_model result: {'val_loss': 3486.415283203125, 'hp_metric': 3486.415283203125}
spotpython tuning: 3224.18701171875 [###-------] 25.66%
train_model result: {'val_loss': 3600.288330078125, 'hp_metric': 3600.288330078125}
spotpython tuning: 3224.18701171875 [###-------] 27.88%
train_model result: {'val_loss': 3584.28955078125, 'hp_metric': 3584.28955078125}
spotpython tuning: 3224.18701171875 [###-------] 31.49%
train_model result: {'val_loss': nan, 'hp_metric': nan}
train_model result: {'val_loss': nan, 'hp_metric': nan}
spotpython tuning: 3224.18701171875 [###-------] 34.94%
train_model result: {'val_loss': nan, 'hp_metric': nan}
train_model result: {'val_loss': 24276.884765625, 'hp_metric': 24276.884765625}
spotpython tuning: 3224.18701171875 [####------] 37.80%
train_model result: {'val_loss': 23527.43359375, 'hp_metric': 23527.43359375}
spotpython tuning: 3224.18701171875 [####------] 40.63%
train_model result: {'val_loss': 21606.408203125, 'hp_metric': 21606.408203125}
spotpython tuning: 3224.18701171875 [####------] 44.67%
train_model result: {'val_loss': nan, 'hp_metric': nan}
train_model result: {'val_loss': 3041.011474609375, 'hp_metric': 3041.011474609375}
spotpython tuning: 3041.011474609375 [#####-----] 49.70%
train_model result: {'val_loss': 3150.69775390625, 'hp_metric': 3150.69775390625}
spotpython tuning: 3041.011474609375 [#####-----] 53.92%
train_model result: {'val_loss': 6081.50927734375, 'hp_metric': 6081.50927734375}
spotpython tuning: 3041.011474609375 [######----] 60.49%
train_model result: {'val_loss': 4328.36962890625, 'hp_metric': 4328.36962890625}
spotpython tuning: 3041.011474609375 [#######---] 66.10%
train_model result: {'val_loss': 19815.8984375, 'hp_metric': 19815.8984375}
spotpython tuning: 3041.011474609375 [#######---] 69.46%
train_model result: {'val_loss': 4146.05419921875, 'hp_metric': 4146.05419921875}
spotpython tuning: 3041.011474609375 [#######---] 73.06%
train_model result: {'val_loss': 3251.239013671875, 'hp_metric': 3251.239013671875}
spotpython tuning: 3041.011474609375 [########--] 76.43%
train_model result: {'val_loss': 3085.599853515625, 'hp_metric': 3085.599853515625}
spotpython tuning: 3041.011474609375 [########--] 79.49%
train_model result: {'val_loss': 3205.931640625, 'hp_metric': 3205.931640625}
spotpython tuning: 3041.011474609375 [########--] 82.26%
train_model result: {'val_loss': 3022.802490234375, 'hp_metric': 3022.802490234375}
spotpython tuning: 3022.802490234375 [#########-] 85.32%
train_model result: {'val_loss': 3247.79443359375, 'hp_metric': 3247.79443359375}
spotpython tuning: 3022.802490234375 [#########-] 88.38%
train_model result: {'val_loss': 3001.1240234375, 'hp_metric': 3001.1240234375}
spotpython tuning: 3001.1240234375 [#########-] 90.88%
train_model result: {'val_loss': 2947.1845703125, 'hp_metric': 2947.1845703125}
spotpython tuning: 2947.1845703125 [#########-] 94.06%
train_model result: {'val_loss': 3039.773681640625, 'hp_metric': 3039.773681640625}
spotpython tuning: 2947.1845703125 [##########] 96.74%
train_model result: {'val_loss': 22987.359375, 'hp_metric': 22987.359375}
spotpython tuning: 2947.1845703125 [##########] 100.00% Done...
Experiment saved to 601_res.pkl
<spotpython.spot.spot.Spot at 0x108354320>
31.2 Looking at the Results
31.2.1 Tuning Progress
After the hyperparameter tuning run is finished, the progress of the hyperparameter tuning can be visualized with spotpython
’s method plot_progress
. The black points represent the performace values (score or metric) of hyperparameter configurations from the initial design, whereas the red points represents the hyperparameter configurations found by the surrogate model based optimization.
S.plot_progress()
31.2.2 Tuned Hyperparameters and Their Importance
Results can be printed in tabular form.
print_res_table(S)
| name | type | default | lower | upper | tuned | transform | importance | stars |
|----------------|--------|-----------|---------|---------|-------------------|-----------------------|--------------|---------|
| l1 | int | 3 | 3.0 | 4.0 | 3.0 | transform_power_2_int | 88.76 | ** |
| epochs | int | 4 | 3.0 | 7.0 | 7.0 | transform_power_2_int | 0.10 | . |
| batch_size | int | 4 | 4.0 | 11.0 | 5.0 | transform_power_2_int | 71.69 | ** |
| act_fn | factor | ReLU | 0.0 | 5.0 | ReLU | None | 0.10 | . |
| optimizer | factor | SGD | 0.0 | 2.0 | Adam | None | 100.00 | *** |
| dropout_prob | float | 0.01 | 0.0 | 0.025 | 0.025 | None | 0.96 | . |
| lr_mult | float | 1.0 | 0.1 | 10.0 | 3.258209479901434 | None | 0.10 | . |
| patience | int | 2 | 2.0 | 3.0 | 3.0 | transform_power_2_int | 0.50 | . |
| batch_norm | factor | 0 | 0.0 | 1.0 | 0 | None | 26.45 | * |
| initialization | factor | Default | 0.0 | 4.0 | kaiming_uniform | None | 0.10 | . |
A histogram can be used to visualize the most important hyperparameters.
=1.0) S.plot_importance(threshold
=3) S.plot_important_hyperparameter_contour(max_imp
l1: 88.75648818779752
epochs: 0.10279201167527423
batch_size: 71.6940227800961
act_fn: 0.10279201167527423
optimizer: 100.0
dropout_prob: 0.963416309760362
lr_mult: 0.10279201167527423
patience: 0.5032161575306312
batch_norm: 26.448552682305014
initialization: 0.10279201167527423
31.2.3 Get the Tuned Architecture
import pprint
from spotpython.hyperparameters.values import get_tuned_architecture
= get_tuned_architecture(S)
config pprint.pprint(config)
{'act_fn': ReLU(),
'batch_norm': False,
'batch_size': 32,
'dropout_prob': 0.025,
'epochs': 128,
'initialization': 'kaiming_uniform',
'l1': 8,
'lr_mult': 3.258209479901434,
'optimizer': 'Adam',
'patience': 8}
31.2.4 Test on the full data set
# set the value of the key "TENSORBOARD_CLEAN" to True in the fun_control dictionary and use the update() method to update the fun_control dictionary
"TENSORBOARD_CLEAN": True})
fun_control.update({"tensorboard_log": True}) fun_control.update({
from spotpython.light.testmodel import test_model
from spotpython.utils.init import get_feature_names
test_model(config, fun_control) get_feature_names(fun_control)
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓ ┃ Test metric ┃ DataLoader 0 ┃ ┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩ │ hp_metric │ 3044.058349609375 │ │ val_loss │ 3044.058349609375 │ └───────────────────────────┴───────────────────────────┘
test_model result: {'val_loss': 3044.058349609375, 'hp_metric': 3044.058349609375}
['age',
'sex',
'bmi',
'bp',
's1_tc',
's2_ldl',
's3_hdl',
's4_tch',
's5_ltg',
's6_glu']
31.3 Cross Validation With Lightning
- The
KFold
class fromsklearn.model_selection
is used to generate the folds for cross-validation. - These mechanism is used to generate the folds for the final evaluation of the model.
- The
CrossValidationDataModule
class [SOURCE] is used to generate the folds for the hyperparameter tuning process. - It is called from the
cv_model
function [SOURCE].
config
{'l1': 8,
'epochs': 128,
'batch_size': 32,
'act_fn': ReLU(),
'optimizer': 'Adam',
'dropout_prob': 0.025,
'lr_mult': 3.258209479901434,
'patience': 8,
'batch_norm': False,
'initialization': 'kaiming_uniform'}
from spotpython.light.cvmodel import cv_model
"k_folds": 2})
fun_control.update({"test_size": 0.6})
fun_control.update({ cv_model(config, fun_control)
k: 0
train_model result: {'val_loss': 2960.755126953125, 'hp_metric': 2960.755126953125}
k: 1
train_model result: {'val_loss': 3040.9697265625, 'hp_metric': 3040.9697265625}
3000.8624267578125
31.4 Extending the Basic Setup
This basic setup can be adapted to user-specific needs in many ways. For example, the user can specify a custom data set, a custom model, or a custom loss function. The following sections provide more details on how to customize the hyperparameter tuning process. Before we proceed, we will provide an overview of the basic settings of the hyperparameter tuning process and explain the parameters used so far.
31.4.1 General Experiment Setup
To keep track of the different experiments, we use a PREFIX
for the experiment name. The PREFIX
is used to create a unique experiment name. The PREFIX
is also used to create a unique TensorBoard folder, which is used to store the TensorBoard log files.
spotpython
allows the specification of two different types of stopping criteria: first, the number of function evaluations (fun_evals
), and second, the maximum run time in seconds (max_time
). Here, we will set the number of function evaluations to infinity and the maximum run time to one minute.
max_time
is set to one minute for demonstration purposes. For real experiments, this value should be increased. Note, the total run time may exceed the specified max_time
, because the initial design is always evaluated, even if this takes longer than max_time
.
31.4.2 Data Setup
Here, we have provided the Diabetes
data set class, which is a subclass of torch.utils.data.Dataset
. Data preprocessing is handled by Lightning
and PyTorch
. It is described in the LIGHTNINGDATAMODULE documentation.
The data splitting, i.e., the generation of training, validation, and testing data, is handled by Lightning
.
31.4.3 Objective Function fun
The objective function fun
from the class HyperLight
[SOURCE] is selected next. It implements an interface from PyTorch
’s training, validation, and testing methods to spotpython
.
31.4.4 Core-Model Setup
By using core_model_name = "light.regression.NNLinearRegressor"
, the spotpython
model class NetLightRegression
[SOURCE] from the light.regression
module is selected.
31.4.5 Hyperdict Setup
For a given core_model_name
, the corresponding hyperparameters are automatically loaded from the associated dictionary, which is stored as a JSON file. The JSON file contains hyperparameter type information, names, and bounds. For spotpython
models, the hyperparameters are stored in the LightHyperDict
, see [SOURCE] Alternatively, you can load a local hyper_dict. The hyperdict
uses the default hyperparameter settings. These can be modified as described in Section D.15.1.
31.4.6 Other Settings
There are several additional parameters that can be specified, e.g., since we did not specify a loss function, mean_squared_error
is used, which is the default loss function. These will be explained in more detail in the following sections.
31.5 Tensorboard
The textual output shown in the console (or code cell) can be visualized with Tensorboard, if the argument tensorboard_log
to fun_control_init()
is set to True
. The Tensorboard log files are stored in the runs
folder. To start Tensorboard, run the following command in the terminal:
tensorboard --logdir="runs/"
Further information can be found in the PyTorch Lightning documentation for Tensorboard.
31.6 Loading the Saved Experiment and Getting the Hyperparameters of the Tuned Model
To get the tuned hyperparameters as a dictionary, the get_tuned_architecture
function can be used.
from spotpython.utils.file import load_result
= load_result(PREFIX=PREFIX)
spot_tuner = get_tuned_architecture(spot_tuner)
config config
Loaded experiment from 601_res.pkl
{'l1': 8,
'epochs': 128,
'batch_size': 32,
'act_fn': ReLU(),
'optimizer': 'Adam',
'dropout_prob': 0.025,
'lr_mult': 3.258209479901434,
'patience': 8,
'batch_norm': False,
'initialization': 'kaiming_uniform'}
31.7 Using the spotgui
The spotgui
[github] provides a convenient way to interact with the hyperparameter tuning process. To obtain the settings from Section 31.1, the spotgui
can be started as shown in Figure 31.1.
31.8 Summary
This section presented an introduction to the basic setup of hyperparameter tuning with spotpython
and PyTorch
Lightning.