[[46]{.chapter-number}  [Early Stopping Explained: HPT with `spotpython` and `PyTorch` Lightning for the Diabetes Data Set]{.chapter-title}]{#sec-hpt-pytorch-early .quarto-section-identifier}

doi:10.48550/arXiv.2307.10262

46 Early Stopping Explained: HPT with `spotpython` and `PyTorch` Lightning for the Diabetes Data Set

In this section, we will show how early stopping can be integrated into the PyTorch Lightning training workflow for a regression task. We will use the setting described in Chapter 45, i.e., the Diabetes data set, which is provided by spotpython, and the HyperLight class to define the objective function.

46.1 The Basic Setting

import os
from math import inf
import warnings
warnings.filterwarnings("ignore")

After importing the necessary libraries, the fun_control dictionary is set up via the fun_control_init function. The fun_control dictionary contains the same parameters as in Chapter 45, i.e., it contains the following parameters:

PREFIX: a unique identifier for the experiment
fun_evals: the number of function evaluations
max_time: the maximum run time in minutes
data_set: the data set. Here we use the Diabetes data set that is provided by spotpython.
core_model_name: the class name of the neural network model. This neural network model is provided by spotpython.
hyperdict: the hyperparameter dictionary. This dictionary is used to define the hyperparameters of the neural network model. It is also provided by spotpython.
_L_in: the number of input features. Since the Diabetes data set has 10 features, _L_in is set to 10.
_L_out: the number of output features. Since we want to predict a single value, _L_out is set to 1.

In addition, the fun_control dictionary contains the following parameters that are specific to the early-stopping mechanism:

divergence_threshold: Stop training as soon as the monitored quantity becomes worse than this threshold.
check_finite: When set True, stops training when the monitor becomes NaN or infinite.
stopping_threshold: Stop training immediately once the monitored quantity reaches this threshold.

divergence_threshold

We will set the divergence_threshold to 25,000, because good values are in a range around 15,000. This means that the training will be stopped if the monitored quantity becomes worse than this threshold.

Furthermore, the patience parameter can be used as a hyperparameter to control the early stopping mechanism. It defines how many validation checks with no improvement are allowed before training is stopped.

patience

It must be noted that the patience parameter counts the number of validation checks with no improvement, and not the number of training epochs. Therefore, with parameters check_val_every_n_epoch=10 and patience=3, the trainer will perform at least 40 training epochs before being stopped.

The HyperLight class is used to define the objective function fun. It connects the PyTorch and the spotpython methods and is provided by spotpython.

from spotpython.data.diabetes import Diabetes
from spotpython.hyperdict.light_hyper_dict import LightHyperDict
from spotpython.fun.hyperlight import HyperLight
from spotpython.utils.init import (fun_control_init, surrogate_control_init, design_control_init)
from spotpython.utils.eda import print_exp_table, print_res_table
from spotpython.spot import Spot
from spotpython.utils.file import get_experiment_filename

PREFIX="601-early_stopping"

data_set = Diabetes()

fun_control = fun_control_init(
    PREFIX=PREFIX,
    fun_evals=inf,
    TENSORBOARD_CLEAN=True,
    tensorboard_log=True,
    max_time=1,
    data_set = data_set,
    core_model_name="light.regression.NNLinearRegressor",
    hyperdict=LightHyperDict,
    divergence_threshold=25_000,
    _L_in=10,
    _L_out=1)

fun = HyperLight().fun

Moving TENSORBOARD_PATH: runs/ to TENSORBOARD_PATH_OLD: runs_OLD/runs_2025_11_06_18_28_25_0
Created spot_tensorboard_path: runs/spot_logs/601-early_stopping_maans08_2025-11-06_18-28-25 for SummaryWriter()
module_name: light
submodule_name: regression
model_name: NNLinearRegressor

The method set_hyperparameter allows the user to modify default hyperparameter settings. Here we modify some hyperparameters to keep the model small and to decrease the tuning time.

from spotpython.hyperparameters.values import set_hyperparameter
set_hyperparameter(fun_control, "optimizer", [ "Adadelta", "Adam", "Adamax"])
set_hyperparameter(fun_control, "l1", [3,4])
set_hyperparameter(fun_control, "epochs", [3,7])
set_hyperparameter(fun_control, "batch_size", [4,11])
set_hyperparameter(fun_control, "dropout_prob", [0.0, 0.025])
set_hyperparameter(fun_control, "patience", [2,3])

design_control = design_control_init(init_size=10)
print_exp_table(fun_control)

| name           | type   | default   |   lower |   upper | transform             |
|----------------|--------|-----------|---------|---------|-----------------------|
| l1             | int    | 3         |     3   |   4     | transform_power_2_int |
| epochs         | int    | 4         |     3   |   7     | transform_power_2_int |
| batch_size     | int    | 4         |     4   |  11     | transform_power_2_int |
| act_fn         | factor | ReLU      |     0   |   5     | None                  |
| optimizer      | factor | SGD       |     0   |   2     | None                  |
| dropout_prob   | float  | 0.01      |     0   |   0.025 | None                  |
| lr_mult        | float  | 1.0       |     0.1 |  10     | None                  |
| patience       | int    | 2         |     2   |   3     | transform_power_2_int |
| batch_norm     | factor | 0         |     0   |   1     | None                  |
| initialization | factor | Default   |     0   |   4     | None                  |

Finally, a Spot object is created. Calling the method run() starts the hyperparameter tuning process.

S = Spot(fun=fun,fun_control=fun_control, design_control=design_control)
S.run()

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  202 K │ train │ 826 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 202 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 202 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 826 M

train_model result: {'val_loss': 23075.09765625, 'hp_metric': 23075.09765625}

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 6.6 M │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 6.6 M

train_model result: {'val_loss': 3466.626220703125, 'hp_metric': 3466.626220703125}

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  205 K │ train │ 103 M │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 205 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 205 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 103 M

train_model result: {'val_loss': 44729112.0, 'hp_metric': 44729112.0}

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 53.2 K │ train │ 52.5 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 53.2 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 53.2 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 52.5 M

train_model result: {'val_loss': 24005.33984375, 'hp_metric': 24005.33984375}

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  205 K │ train │ 51.6 M │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 205 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 205 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 51.6 M

train_model result: {'val_loss': 22881.826171875, 'hp_metric': 22881.826171875}

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  202 K │ train │ 12.9 M │ [32, 10] │   [32, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴──────────┴───────────┘

Trainable params: 202 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 202 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 12.9 M

train_model result: {'val_loss': 4215.99072265625, 'hp_metric': 4215.99072265625}

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 52.5 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 52.5 M

train_model result: {'val_loss': 20590.513671875, 'hp_metric': 20590.513671875}

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 53.2 K │ train │ 3.3 M │ [32, 10] │   [32, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 53.2 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 53.2 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 3.3 M

train_model result: {'val_loss': 3969.725830078125, 'hp_metric': 3969.725830078125}

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 13.1 M │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 13.1 M

train_model result: {'val_loss': 20699.109375, 'hp_metric': 20699.109375}

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  205 K │ train │ 413 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 205 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 205 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 413 M

train_model result: {'val_loss': 23871.84765625, 'hp_metric': 23871.84765625}

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  205 K │ train │ 206 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 205 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 205 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 206 M

train_model result: {'val_loss': 23622.703125, 'hp_metric': 23622.703125}
spotpython tuning: 3466.626220703125 [----------] 2.03%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  202 K │ train │ 103 M │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 202 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 202 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 103 M

train_model result: {'val_loss': 22528.830078125, 'hp_metric': 22528.830078125}
spotpython tuning: 3466.626220703125 [----------] 4.88%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  202 K │ train │ 826 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 202 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 202 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 826 M

train_model result: {'val_loss': 22143.693359375, 'hp_metric': 22143.693359375}
spotpython tuning: 3466.626220703125 [#---------] 7.13%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 53.2 K │ train │ 52.5 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 53.2 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 53.2 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 52.5 M

train_model result: {'val_loss': 23968.31640625, 'hp_metric': 23968.31640625}
spotpython tuning: 3466.626220703125 [#---------] 8.87%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 13.1 M │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 13.1 M

train_model result: {'val_loss': 21933.705078125, 'hp_metric': 21933.705078125}
spotpython tuning: 3466.626220703125 [#---------] 11.37%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 26.2 M │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 26.2 M

train_model result: {'val_loss': 23276.97265625, 'hp_metric': 23276.97265625}
spotpython tuning: 3466.626220703125 [#---------] 13.56%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  202 K │ train │ 51.6 M │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 202 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 202 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 51.6 M

train_model result: {'val_loss': 21300.08984375, 'hp_metric': 21300.08984375}
spotpython tuning: 3466.626220703125 [##--------] 16.15%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 53.2 K │ train │ 52.5 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 53.2 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 53.2 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 52.5 M

train_model result: {'val_loss': 23967.828125, 'hp_metric': 23967.828125}
spotpython tuning: 3466.626220703125 [##--------] 18.08%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 52.5 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 52.5 M

train_model result: {'val_loss': 3467.520263671875, 'hp_metric': 3467.520263671875}
spotpython tuning: 3466.626220703125 [##--------] 21.42%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 53.2 K │ train │ 52.5 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 53.2 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 53.2 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 52.5 M

train_model result: {'val_loss': 24110.126953125, 'hp_metric': 24110.126953125}
spotpython tuning: 3466.626220703125 [###-------] 25.16%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 52.5 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 52.5 M

train_model result: {'val_loss': 798487.5, 'hp_metric': 798487.5}
spotpython tuning: 3466.626220703125 [###-------] 27.83%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 52.5 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 52.5 M

train_model result: {'val_loss': 21710.76171875, 'hp_metric': 21710.76171875}
spotpython tuning: 3466.626220703125 [###-------] 31.99%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 52.5 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 52.5 M

train_model result: {'val_loss': 4556.01904296875, 'hp_metric': 4556.01904296875}
spotpython tuning: 3466.626220703125 [####------] 35.67%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  205 K │ train │ 6.5 M │ [16, 10] │   [16, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 205 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 205 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 6.5 M

train_model result: {'val_loss': 23870.84375, 'hp_metric': 23870.84375}
spotpython tuning: 3466.626220703125 [####------] 40.10%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  205 K │ train │ 413 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 205 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 205 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 413 M

train_model result: {'val_loss': 23618.93359375, 'hp_metric': 23618.93359375}
spotpython tuning: 3466.626220703125 [####------] 43.89%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  205 K │ train │ 6.5 M │ [16, 10] │   [16, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 205 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 205 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 6.5 M

train_model result: {'val_loss': 23767.53515625, 'hp_metric': 23767.53515625}
spotpython tuning: 3466.626220703125 [#####-----] 48.95%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 53.2 K │ train │ 3.3 M │ [32, 10] │   [32, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 53.2 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 53.2 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 3.3 M

train_model result: {'val_loss': 24076.083984375, 'hp_metric': 24076.083984375}
spotpython tuning: 3466.626220703125 [#####-----] 52.71%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 13.1 M │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 13.1 M

train_model result: {'val_loss': 22512.0, 'hp_metric': 22512.0}
spotpython tuning: 3466.626220703125 [######----] 57.22%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  205 K │ train │ 103 M │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 205 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 205 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 103 M

train_model result: {'val_loss': 23715.1171875, 'hp_metric': 23715.1171875}
spotpython tuning: 3466.626220703125 [######----] 64.43%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 51.9 K │ train │ 26.2 M │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 51.9 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 51.9 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 26.2 M

train_model result: {'val_loss': 23959.41015625, 'hp_metric': 23959.41015625}
spotpython tuning: 3466.626220703125 [#######---] 69.27%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  205 K │ train │ 51.6 M │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 205 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 205 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 51.6 M

train_model result: {'val_loss': 23177.203125, 'hp_metric': 23177.203125}
spotpython tuning: 3466.626220703125 [#######---] 74.25%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 53.2 K │ train │ 6.6 M │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 53.2 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 53.2 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 6.6 M

train_model result: {'val_loss': 24018.24609375, 'hp_metric': 24018.24609375}
spotpython tuning: 3466.626220703125 [########--] 80.56%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 53.2 K │ train │ 26.2 M │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 53.2 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 53.2 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 26.2 M

train_model result: {'val_loss': 23840.671875, 'hp_metric': 23840.671875}
spotpython tuning: 3466.626220703125 [########--] 84.42%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │ 53.2 K │ train │ 3.3 M │ [32, 10] │   [32, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 53.2 K                                                                                           
Non-trainable params: 0                                                                                            
Total params: 53.2 K                                                                                               
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 3.3 M

train_model result: {'val_loss': 20081.701171875, 'hp_metric': 20081.701171875}
spotpython tuning: 3466.626220703125 [#########-] 90.37%. Success rate: 0.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  202 K │ train │ 25.8 M │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴──────────┴───────────┘

Trainable params: 202 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 202 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 25.8 M

train_model result: {'val_loss': 2976.613525390625, 'hp_metric': 2976.613525390625}
spotpython tuning: 2976.613525390625 [#########-] 94.53%. Success rate: 4.00%

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  205 K │ train │ 206 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 205 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 205 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 24                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 206 M

train_model result: {'val_loss': 23504.173828125, 'hp_metric': 23504.173828125}
spotpython tuning: 2976.613525390625 [##########] 100.00%. Success rate: 3.85% Done...

Experiment saved to 601-early_stopping_res.pkl

<spotpython.spot.spot.Spot at 0x1198770e0>

46.2 Looking at the Results

46.2.1 Tuning Progress

After the hyperparameter tuning run is finished, the progress of the hyperparameter tuning can be visualized with spotpython’s method plot_progress. The black points represent the performance values (score or metric) of hyperparameter configurations from the initial design, whereas the red points represents the hyperparameter configurations found by the surrogate model based optimization.

S.plot_progress(log_y=True)

46.2.2 Tuned Hyperparameters and Their Importance

Results can be printed in tabular form.

print_res_table(S)

| name           | type   | default   |   lower |   upper | tuned                | transform             |   importance | stars   |
|----------------|--------|-----------|---------|---------|----------------------|-----------------------|--------------|---------|
| l1             | int    | 3         |     3.0 |     4.0 | 4.0                  | transform_power_2_int |         0.08 |         |
| epochs         | int    | 4         |     3.0 |     7.0 | 7.0                  | transform_power_2_int |         0.08 |         |
| batch_size     | int    | 4         |     4.0 |    11.0 | 6.0                  | transform_power_2_int |         0.08 |         |
| act_fn         | factor | ReLU      |     0.0 |     5.0 | LeakyReLU            | None                  |         0.08 |         |
| optimizer      | factor | SGD       |     0.0 |     2.0 | Adamax               | None                  |         0.08 |         |
| dropout_prob   | float  | 0.01      |     0.0 |   0.025 | 0.004766429873889258 | None                  |         0.51 | .       |
| lr_mult        | float  | 1.0       |     0.1 |    10.0 | 2.6506652267065136   | None                  |         0.08 |         |
| patience       | int    | 2         |     2.0 |     3.0 | 2.0                  | transform_power_2_int |         0.08 |         |
| batch_norm     | factor | 0         |     0.0 |     1.0 | 0                    | None                  |         0.08 |         |
| initialization | factor | Default   |     0.0 |     4.0 | xavier_uniform       | None                  |       100.00 | ***     |

A histogram can be used to visualize the most important hyperparameters.

S.plot_importance(threshold=1.0)

S.plot_important_hyperparameter_contour(max_imp=3)

l1:  0.08168308593570597
epochs:  0.08168308593570597
batch_size:  0.08168308593570597
act_fn:  0.08168308593570597
optimizer:  0.08168308593570597
dropout_prob:  0.5085867740038202
lr_mult:  0.08168308593570597
patience:  0.08168308593570597
batch_norm:  0.08168308593570597
initialization:  100.0

46.2.3 Get the Tuned Architecture

import pprint
from spotpython.hyperparameters.values import get_tuned_architecture
config = get_tuned_architecture(S)
pprint.pprint(config)

{'act_fn': LeakyReLU(),
 'batch_norm': False,
 'batch_size': 64,
 'dropout_prob': 0.004766429873889258,
 'epochs': 128,
 'initialization': 'xavier_uniform',
 'l1': 16,
 'lr_mult': 2.6506652267065136,
 'optimizer': 'Adamax',
 'patience': 4}

46.2.4 Test on the full data set

fun_control.update({"TENSORBOARD_CLEAN": True})
fun_control.update({"tensorboard_log": True})

from spotpython.light.testmodel import test_model
from spotpython.utils.init import get_feature_names

test_model(config, fun_control)
get_feature_names(fun_control)

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  202 K │ train │ 25.8 M │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴──────────┴───────────┘

Trainable params: 202 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 202 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 25.8 M

┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃        Test metric        ┃       DataLoader 0        ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│         hp_metric         │     3278.120849609375     │
│         val_loss          │     3278.120849609375     │
└───────────────────────────┴───────────────────────────┘

test_model result: {'val_loss': 3278.120849609375, 'hp_metric': 3278.120849609375}

['age',
 'sex',
 'bmi',
 'bp',
 's1_tc',
 's2_ldl',
 's3_hdl',
 's4_tch',
 's5_ltg',
 's6_glu']

46.3 Cross Validation With Lightning

The KFold class from sklearn.model_selection is used to generate the folds for cross-validation.
These mechanism is used to generate the folds for the final evaluation of the model.
The CrossValidationDataModule class [SOURCE] is used to generate the folds for the hyperparameter tuning process.
It is called from the cv_model function [SOURCE].

config

{'l1': 16,
 'epochs': 128,
 'batch_size': 64,
 'act_fn': LeakyReLU(),
 'optimizer': 'Adamax',
 'dropout_prob': 0.004766429873889258,
 'lr_mult': 2.6506652267065136,
 'patience': 4,
 'batch_norm': False,
 'initialization': 'xavier_uniform'}

from spotpython.light.cvmodel import cv_model
fun_control.update({"k_folds": 2})
fun_control.update({"test_size": 0.6})
cv_model(config, fun_control)

k: 0

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  202 K │ train │ 25.8 M │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴──────────┴───────────┘

Trainable params: 202 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 202 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 25.8 M

train_model result: {'val_loss': 2898.214111328125, 'hp_metric': 2898.214111328125}
k: 1

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  202 K │ train │ 25.8 M │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴──────────┴───────────┘

Trainable params: 202 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 202 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 17                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 25.8 M

train_model result: {'val_loss': 3062.371337890625, 'hp_metric': 3062.371337890625}

2980.292724609375

46.4 Extending the Basic Setup

This basic setup can be adapted to user-specific needs in many ways. For example, the user can specify a custom data set, a custom model, or a custom loss function. The following sections provide more details on how to customize the hyperparameter tuning process. Before we proceed, we will provide an overview of the basic settings of the hyperparameter tuning process and explain the parameters used so far.

46.4.1 General Experiment Setup

To keep track of the different experiments, we use a PREFIX for the experiment name. The PREFIX is used to create a unique experiment name. The PREFIX is also used to create a unique TensorBoard folder, which is used to store the TensorBoard log files.

spotpython allows the specification of two different types of stopping criteria: first, the number of function evaluations (fun_evals), and second, the maximum run time in seconds (max_time). Here, we will set the number of function evaluations to infinity and the maximum run time to one minute.

max_time is set to one minute for demonstration purposes. For real experiments, this value should be increased. Note, the total run time may exceed the specified max_time, because the initial design is always evaluated, even if this takes longer than max_time.

46.4.2 Data Setup

Here, we have provided the Diabetes data set class, which is a subclass of torch.utils.data.Dataset. Data preprocessing is handled by Lightning and PyTorch. It is described in the LIGHTNINGDATAMODULE documentation.

The data splitting, i.e., the generation of training, validation, and testing data, is handled by Lightning.

46.4.3 Objective Function `fun`

The objective function fun from the class HyperLight [SOURCE] is selected next. It implements an interface from PyTorch’s training, validation, and testing methods to spotpython.

46.4.4 Core-Model Setup

By using core_model_name = "light.regression.NNLinearRegressor", the spotpython model class NetLightRegression [SOURCE] from the light.regression module is selected.

46.4.5 Hyperdict Setup

For a given core_model_name, the corresponding hyperparameters are automatically loaded from the associated dictionary, which is stored as a JSON file. The JSON file contains hyperparameter type information, names, and bounds. For spotpython models, the hyperparameters are stored in the LightHyperDict, see [SOURCE] Alternatively, you can load a local hyper_dict. The hyperdict uses the default hyperparameter settings. These can be modified as described in Section 12.19.1.

46.4.6 Other Settings

There are several additional parameters that can be specified, e.g., since we did not specify a loss function, mean_squared_error is used, which is the default loss function. These will be explained in more detail in the following sections.

46.5 Tensorboard

The textual output shown in the console (or code cell) can be visualized with Tensorboard, if the argument tensorboard_log to fun_control_init() is set to True. The Tensorboard log files are stored in the runs folder. To start Tensorboard, run the following command in the terminal:

tensorboard --logdir="runs/"

Further information can be found in the PyTorch Lightning documentation for Tensorboard.

46.6 Loading the Saved Experiment and Getting the Hyperparameters of the Tuned Model

To get the tuned hyperparameters as a dictionary, the get_tuned_architecture function can be used.

from spotpython.utils.file import load_result
spot_tuner = load_result(PREFIX=PREFIX)
config = get_tuned_architecture(spot_tuner)
config

Loaded experiment from 601-early_stopping_res.pkl

{'l1': 16,
 'epochs': 128,
 'batch_size': 64,
 'act_fn': LeakyReLU(),
 'optimizer': 'Adamax',
 'dropout_prob': 0.004766429873889258,
 'lr_mult': 2.6506652267065136,
 'patience': 4,
 'batch_norm': False,
 'initialization': 'xavier_uniform'}

46.7 Using the `spotgui`

The spotgui [github] provides a convenient way to interact with the hyperparameter tuning process. To obtain the settings from Section 45.1, the spotgui can be started as shown in Figure 46.1.

46.8 Summary

This section presented an introduction to the basic setup of hyperparameter tuning with spotpython and PyTorch Lightning.