[56]{.chapter-number}  [Hyperparameter Tuning with `spotpython` and `PyTorch` Lightning for the Diabetes Data Set Using a User Specified ResNet Model]{.chapter-title}

doi:10.48550/arXiv.2307.10262

56 Hyperparameter Tuning with `spotpython` and `PyTorch` Lightning for the Diabetes Data Set Using a User Specified ResNet Model

After importing the necessary libraries, the fun_control dictionary is set up via the fun_control_init function. The fun_control dictionary contains

PREFIX: a unique identifier for the experiment
fun_evals: the number of function evaluations
max_time: the maximum run time in minutes
data_set: the data set. Here we use the Diabetes data set that is provided by spotpython.
core_model_name: the class name of the neural network model. This neural network model is provided by spotpython.
hyperdict: the hyperparameter dictionary. This dictionary is used to define the hyperparameters of the neural network model. It is also provided by spotpython.
_L_in: the number of input features. Since the Diabetes data set has 10 features, _L_in is set to 10.
_L_out: the number of output features. Since we want to predict a single value, _L_out is set to 1.

The HyperLight class is used to define the objective function fun. It connects the PyTorch and the spotpython methods and is provided by spotpython.

To access the user specified ResNet model, the path to the user model must be added to the Python path:

import sys
sys.path.insert(0, './userModel')
import my_resnet
import my_hyper_dict

In the following code, we do not specify the ResNet model in the fun_control dictionary. It will be added in a second step as the user specified model.

Note, the divergence_threshold is set to 5,000, which is based on some pre-experiments with the Diabetes data set.

from spotpython.data.diabetes import Diabetes
from spotpython.hyperdict.light_hyper_dict import LightHyperDict
from spotpython.fun.hyperlight import HyperLight
from spotpython.utils.init import (fun_control_init, surrogate_control_init, design_control_init)
from spotpython.utils.eda import print_exp_table
from spotpython.spot import Spot
from spotpython.utils.file import get_experiment_filename

PREFIX="606-user-resnet"

data_set = Diabetes()

fun_control = fun_control_init(
    PREFIX=PREFIX,
    fun_evals=inf,
    max_time=1,
    data_set = data_set,
    divergence_threshold=5_000,
    _L_in=10,
    _L_out=1)

fun = HyperLight().fun

In a second step, we can add the user specified ResNet model to the fun_control dictionary:

from spotpython.hyperparameters.values import add_core_model_to_fun_control
add_core_model_to_fun_control(fun_control=fun_control,
                              core_model=my_resnet.MyResNet,
                              hyper_dict=my_hyper_dict.MyHyperDict)

The method set_hyperparameter allows the user to modify default hyperparameter settings. Here we modify some hyperparameters to keep the model small and to decrease the tuning time.

from spotpython.hyperparameters.values import set_hyperparameter
set_hyperparameter(fun_control, "optimizer", [ "Adadelta", "Adam", "Adamax"])
set_hyperparameter(fun_control, "l1", [3,4])
set_hyperparameter(fun_control, "epochs", [3,7])
set_hyperparameter(fun_control, "batch_size", [4,11])
set_hyperparameter(fun_control, "dropout_prob", [0.0, 0.025])
set_hyperparameter(fun_control, "patience", [2,3])
set_hyperparameter(fun_control, "lr_mult", [0.1, 20.0])

design_control = design_control_init(init_size=10)

print_exp_table(fun_control)

| name           | type   | default   |   lower |   upper | transform             |
|----------------|--------|-----------|---------|---------|-----------------------|
| l1             | int    | 3         |     3   |   4     | transform_power_2_int |
| epochs         | int    | 4         |     3   |   7     | transform_power_2_int |
| batch_size     | int    | 4         |     4   |  11     | transform_power_2_int |
| act_fn         | factor | ReLU      |     0   |   5     | None                  |
| optimizer      | factor | SGD       |     0   |   2     | None                  |
| dropout_prob   | float  | 0.01      |     0   |   0.025 | None                  |
| lr_mult        | float  | 1.0       |     0.1 |  20     | None                  |
| patience       | int    | 2         |     2   |   3     | transform_power_2_int |
| initialization | factor | Default   |     0   |   4     | None                  |

Finally, a Spot object is created. Calling the method run() starts the hyperparameter tuning process.

spot_tuner = Spot(fun=fun,fun_control=fun_control, design_control=design_control)
res = spot_tuner.run()

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 95.7 K │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴───────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 95.7 K

train_model result: {'val_loss': 24029.263671875, 'hp_metric': 24029.263671875}
Milestones: [2, 4, 6]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 310 K │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 310 K

train_model result: {'val_loss': 23935.533203125, 'hp_metric': 23935.533203125}
Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 23620.296875, 'hp_metric': 23620.296875}
Milestones: [2, 4, 6]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 38.8 K │ [16, 10] │   [16, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴──────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 38.8 K

train_model result: {'val_loss': 23686.23828125, 'hp_metric': 23686.23828125}
Milestones: [32, 64, 96]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 23590.0078125, 'hp_metric': 23590.0078125}
Milestones: [32, 64, 96]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 1.5 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.5 M

train_model result: {'val_loss': 24738.689453125, 'hp_metric': 24738.689453125}
Milestones: [4, 8, 12]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 191 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 191 K

train_model result: {'val_loss': 24000.283203125, 'hp_metric': 24000.283203125}
Milestones: [4, 8, 12]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 382 K │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 382 K

train_model result: {'val_loss': 24577.533203125, 'hp_metric': 24577.533203125}
Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 23.9 K │ [32, 10] │   [32, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴──────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 23.9 K

train_model result: {'val_loss': 24449.7421875, 'hp_metric': 24449.7421875}
Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 155 K │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 155 K

train_model result: {'val_loss': 24279.90625, 'hp_metric': 24279.90625}
Anisotropic model: n_theta set to 9

Milestones: [4, 8, 12]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 310 K │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 310 K

train_model result: {'val_loss': 23912.20703125, 'hp_metric': 23912.20703125}
Anisotropic model: n_theta set to 9
spotpython tuning: 23590.0078125 [----------] 0.97%

Milestones: [32, 64, 96]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 5.0 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 5.0 M

train_model result: {'val_loss': 22983.38671875, 'hp_metric': 22983.38671875}
Anisotropic model: n_theta set to 9
spotpython tuning: 22983.38671875 [----------] 1.87%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 23250.544921875, 'hp_metric': 23250.544921875}
Anisotropic model: n_theta set to 9
spotpython tuning: 22983.38671875 [----------] 3.38%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 20473.287109375, 'hp_metric': 20473.287109375}
Anisotropic model: n_theta set to 9
spotpython tuning: 20473.287109375 [----------] 4.54%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 18676.08984375, 'hp_metric': 18676.08984375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#---------] 5.61%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 23920.427734375, 'hp_metric': 23920.427734375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#---------] 6.96%

Milestones: [32, 64, 96]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 5.0 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 5.0 M

train_model result: {'val_loss': 22713.513671875, 'hp_metric': 22713.513671875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#---------] 8.19%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 20514.65625, 'hp_metric': 20514.65625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#---------] 9.24%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 19462.26953125, 'hp_metric': 19462.26953125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#---------] 10.42%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 23231.2265625, 'hp_metric': 23231.2265625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#---------] 11.63%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 19630.44140625, 'hp_metric': 19630.44140625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#---------] 12.78%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 23600.146484375, 'hp_metric': 23600.146484375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#---------] 13.85%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 21278.68359375, 'hp_metric': 21278.68359375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#---------] 14.99%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 20509.60546875, 'hp_metric': 20509.60546875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##--------] 16.14%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 5.0 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 5.0 M

train_model result: {'val_loss': 22230.6796875, 'hp_metric': 22230.6796875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##--------] 17.32%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 21643.802734375, 'hp_metric': 21643.802734375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##--------] 18.54%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 24138.056640625, 'hp_metric': 24138.056640625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##--------] 19.68%

Milestones: [2, 4, 6]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 155 K │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 155 K

train_model result: {'val_loss': 23465.794921875, 'hp_metric': 23465.794921875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##--------] 21.09%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 155 K │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 155 K

train_model result: {'val_loss': 24515.06640625, 'hp_metric': 24515.06640625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##--------] 22.31%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 5.0 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 5.0 M

train_model result: {'val_loss': 24167.859375, 'hp_metric': 24167.859375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##--------] 23.58%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 765 K │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 765 K

train_model result: {'val_loss': 23823.00390625, 'hp_metric': 23823.00390625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [###-------] 25.14%

Milestones: [4, 8, 12]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 765 K │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 765 K

train_model result: {'val_loss': 23558.8203125, 'hp_metric': 23558.8203125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [###-------] 26.39%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 20187.87109375, 'hp_metric': 20187.87109375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [###-------] 27.63%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 20004.548828125, 'hp_metric': 20004.548828125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [###-------] 28.98%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 22654.873046875, 'hp_metric': 22654.873046875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [###-------] 30.28%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 21932.8828125, 'hp_metric': 21932.8828125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [###-------] 31.50%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 23942.806640625, 'hp_metric': 23942.806640625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [###-------] 32.82%

Milestones: [2, 4, 6]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 155 K │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 155 K

train_model result: {'val_loss': 24368.052734375, 'hp_metric': 24368.052734375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [###-------] 34.37%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 22168.896484375, 'hp_metric': 22168.896484375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [####------] 35.68%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 23556.5234375, 'hp_metric': 23556.5234375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [####------] 37.13%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 310 K │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 310 K

train_model result: {'val_loss': 20889.677734375, 'hp_metric': 20889.677734375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [####------] 38.37%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 621 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 621 K

train_model result: {'val_loss': 19179.884765625, 'hp_metric': 19179.884765625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [####------] 39.72%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 155 K │ [64, 10] │   [64, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴──────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 155 K

train_model result: {'val_loss': 23971.1953125, 'hp_metric': 23971.1953125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [####------] 41.19%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 621 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 621 K

train_model result: {'val_loss': 21662.771484375, 'hp_metric': 21662.771484375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [####------] 42.42%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 5.0 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 5.0 M

train_model result: {'val_loss': 21074.25390625, 'hp_metric': 21074.25390625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [####------] 43.67%

Milestones: [32, 64, 96]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 310 K │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 310 K

train_model result: {'val_loss': 23851.310546875, 'hp_metric': 23851.310546875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [####------] 44.80%

Milestones: [32, 64, 96]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 5.0 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 5.0 M

train_model result: {'val_loss': 24399.580078125, 'hp_metric': 24399.580078125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#####-----] 45.92%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 24144.05859375, 'hp_metric': 24144.05859375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#####-----] 47.19%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 22020.150390625, 'hp_metric': 22020.150390625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#####-----] 48.28%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 22826.787109375, 'hp_metric': 22826.787109375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#####-----] 49.31%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 19749.046875, 'hp_metric': 19749.046875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#####-----] 50.37%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 621 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 621 K

train_model result: {'val_loss': 21434.62890625, 'hp_metric': 21434.62890625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#####-----] 52.10%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 382 K │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 382 K

train_model result: {'val_loss': 23245.810546875, 'hp_metric': 23245.810546875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#####-----] 52.99%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 21583.15625, 'hp_metric': 21583.15625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#####-----] 54.07%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 23981.6015625, 'hp_metric': 23981.6015625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [######----] 55.21%

Milestones: [32, 64, 96]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 24047.78515625, 'hp_metric': 24047.78515625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [######----] 56.28%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 24063.236328125, 'hp_metric': 24063.236328125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [######----] 57.94%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 621 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 621 K

train_model result: {'val_loss': 20419.12109375, 'hp_metric': 20419.12109375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [######----] 59.10%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 621 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 621 K

train_model result: {'val_loss': 22024.41015625, 'hp_metric': 22024.41015625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [######----] 60.26%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 20101.5703125, 'hp_metric': 20101.5703125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [######----] 61.34%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 21690.080078125, 'hp_metric': 21690.080078125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [######----] 62.58%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 23184.138671875, 'hp_metric': 23184.138671875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [######----] 64.47%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 310 K │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 310 K

train_model result: {'val_loss': 23350.859375, 'hp_metric': 23350.859375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#######---] 65.97%

Milestones: [32, 64, 96]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 5.0 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 5.0 M

train_model result: {'val_loss': 20614.466796875, 'hp_metric': 20614.466796875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#######---] 67.21%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 24008.123046875, 'hp_metric': 24008.123046875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#######---] 68.41%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 21547.009765625, 'hp_metric': 21547.009765625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#######---] 70.22%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 21479.583984375, 'hp_metric': 21479.583984375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#######---] 71.39%

Milestones: [4, 8, 12]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 191 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 191 K

train_model result: {'val_loss': 23873.962890625, 'hp_metric': 23873.962890625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#######---] 72.38%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 20538.41796875, 'hp_metric': 20538.41796875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#######---] 73.59%

Milestones: [4, 8, 12]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 621 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 621 K

train_model result: {'val_loss': 23339.38671875, 'hp_metric': 23339.38671875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#######---] 74.80%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 21071.2265625, 'hp_metric': 21071.2265625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [########--] 76.08%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 310 K │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 310 K

train_model result: {'val_loss': 22538.693359375, 'hp_metric': 22538.693359375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [########--] 77.60%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 23864.25390625, 'hp_metric': 23864.25390625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [########--] 78.76%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 621 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 621 K

train_model result: {'val_loss': 21562.197265625, 'hp_metric': 21562.197265625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [########--] 79.97%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 21651.728515625, 'hp_metric': 21651.728515625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [########--] 81.04%

Milestones: [32, 64, 96]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 310 K │ [128, 10] │  [128, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 310 K

train_model result: {'val_loss': 22679.478515625, 'hp_metric': 22679.478515625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [########--] 82.31%

Milestones: [2, 4, 6]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 621 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 621 K

train_model result: {'val_loss': 23072.109375, 'hp_metric': 23072.109375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [########--] 83.42%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 21094.8828125, 'hp_metric': 21094.8828125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [########--] 84.51%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 621 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 621 K

train_model result: {'val_loss': 22167.552734375, 'hp_metric': 22167.552734375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#########-] 85.67%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 621 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 621 K

train_model result: {'val_loss': 21192.462890625, 'hp_metric': 21192.462890625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#########-] 86.78%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 12.0 K │ [16, 10] │   [16, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴──────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 12.0 K

train_model result: {'val_loss': 23562.796875, 'hp_metric': 23562.796875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#########-] 88.02%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 191 K │ [256, 10] │  [256, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 191 K

train_model result: {'val_loss': 23284.650390625, 'hp_metric': 23284.650390625}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#########-] 89.53%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 22356.51953125, 'hp_metric': 22356.51953125}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [#########-] 90.96%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃  FLOPs ┃ In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 12.0 K │ [16, 10] │   [16, 1] │
└───┴────────┴────────────┴────────┴───────┴────────┴──────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 12.0 K

train_model result: {'val_loss': 24031.701171875, 'hp_metric': 24031.701171875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##########] 95.19%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │    637 │ train │ 382 K │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 637                                                                                              
Non-trainable params: 0                                                                                            
Total params: 637                                                                                                  
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 63                                                                                          
Modules in eval mode: 0                                                                                            
Total FLOPs: 382 K

train_model result: {'val_loss': 22813.669921875, 'hp_metric': 22813.669921875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##########] 96.06%

Milestones: [8, 16, 24]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 2.5 M │ [1024, 10] │ [1024, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 2.5 M

train_model result: {'val_loss': 23904.787109375, 'hp_metric': 23904.787109375}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##########] 97.35%

Milestones: [4, 8, 12]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃   In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 5.0 M │ [2048, 10] │ [2048, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴────────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 5.0 M

train_model result: {'val_loss': 22157.591796875, 'hp_metric': 22157.591796875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##########] 99.17%

Milestones: [16, 32, 48]

┏━━━┳━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓
┃   ┃ Name   ┃ Type       ┃ Params ┃ Mode  ┃ FLOPs ┃  In sizes ┃ Out sizes ┃
┡━━━╇━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩
│ 0 │ layers │ Sequential │  1.8 K │ train │ 1.2 M │ [512, 10] │  [512, 1] │
└───┴────────┴────────────┴────────┴───────┴───────┴───────────┴───────────┘

Trainable params: 1.8 K                                                                                            
Non-trainable params: 0                                                                                            
Total params: 1.8 K                                                                                                
Total estimated model params size (MB): 0                                                                          
Modules in train mode: 101                                                                                         
Modules in eval mode: 0                                                                                            
Total FLOPs: 1.2 M

train_model result: {'val_loss': 23643.919921875, 'hp_metric': 23643.919921875}
Anisotropic model: n_theta set to 9
spotpython tuning: 18676.08984375 [##########] 100.00% Done...

Experiment saved to 606-user-resnet_res.pkl

56.1 Looking at the Results

56.1.1 Tuning Progress

After the hyperparameter tuning run is finished, the progress of the hyperparameter tuning can be visualized with spotpython’s method plot_progress. The black points represent the performace values (score or metric) of hyperparameter configurations from the initial design, whereas the red points represents the hyperparameter configurations found by the surrogate model based optimization.

spot_tuner.plot_progress()

56.1.2 Tuned Hyperparameters and Their Importance

Results can be printed in tabular form.

from spotpython.utils.eda import print_res_table
print_res_table(spot_tuner)

| name           | type   | default   |   lower |   upper | tuned              | transform             |   importance | stars   |
|----------------|--------|-----------|---------|---------|--------------------|-----------------------|--------------|---------|
| l1             | int    | 3         |     3.0 |     4.0 | 4.0                | transform_power_2_int |        13.47 | *       |
| epochs         | int    | 4         |     3.0 |     7.0 | 6.0                | transform_power_2_int |         0.00 |         |
| batch_size     | int    | 4         |     4.0 |    11.0 | 10.0               | transform_power_2_int |         0.03 |         |
| act_fn         | factor | ReLU      |     0.0 |     5.0 | LeakyReLU          | None                  |       100.00 | ***     |
| optimizer      | factor | SGD       |     0.0 |     2.0 | Adam               | None                  |         0.00 |         |
| dropout_prob   | float  | 0.01      |     0.0 |   0.025 | 0.025              | None                  |         0.00 |         |
| lr_mult        | float  | 1.0       |     0.1 |    20.0 | 19.682229123842966 | None                  |         0.00 |         |
| patience       | int    | 2         |     2.0 |     3.0 | 3.0                | transform_power_2_int |        13.37 | *       |
| initialization | factor | Default   |     0.0 |     4.0 | Default            | None                  |         1.93 | *       |

A histogram can be used to visualize the most important hyperparameters.

spot_tuner.plot_importance(threshold=1.0)

spot_tuner.plot_important_hyperparameter_contour(max_imp=3)

l1:  13.468155267972813
epochs:  0.001
batch_size:  0.03028681600390036
act_fn:  100.0
optimizer:  0.001
dropout_prob:  0.001
lr_mult:  0.002568325998315264
patience:  13.371722451762531
initialization:  1.9296192795437492

56.1.3 Get the Tuned Architecture

import pprint
from spotpython.hyperparameters.values import get_tuned_architecture
config = get_tuned_architecture(spot_tuner)
pprint.pprint(config)

{'act_fn': LeakyReLU(),
 'batch_size': 1024,
 'dropout_prob': 0.025,
 'epochs': 64,
 'initialization': 'Default',
 'l1': 16,
 'lr_mult': 19.682229123842966,
 'optimizer': 'Adam',
 'patience': 8}

56.2 Details of the User-Specified ResNet Model

The specification of a user model requires three files:

my_resnet.py: the Python file containing the user specified ResNet model
my_hyperdict.py: the Python file for loading the hyperparameter dictionary my_hyperdict.json for the user specified ResNet model
my_hyperdict.json: the JSON file containing the hyperparameter dictionary for the user specified ResNet model

56.2.1 `my_resnet.py`

import lightning as L
import torch
from torch import nn
from spotpython.hyperparameters.optimizer import optimizer_handler
import torchmetrics.functional.regression
import torch.optim as optim

class ResidualBlock(nn.Module):
    def __init__(self, input_dim, output_dim, act_fn, dropout_prob):
        super(ResidualBlock, self).__init__()
        self.fc1 = nn.Linear(input_dim, output_dim)
        self.bn1 = nn.BatchNorm1d(output_dim)
        self.ln1 = nn.LayerNorm(output_dim)  
        self.fc2 = nn.Linear(output_dim, output_dim)
        self.bn2 = nn.BatchNorm1d(output_dim)
        self.ln2 = nn.LayerNorm(output_dim)
        self.act_fn = act_fn
        self.dropout = nn.Dropout(dropout_prob)
        self.shortcut = nn.Sequential()

        if input_dim != output_dim:
            self.shortcut = nn.Sequential(
                nn.Linear(input_dim, output_dim),
                nn.BatchNorm1d(output_dim)
            )
    
    def forward(self, x):
        identity = self.shortcut(x)
        
        out = self.fc1(x)
        out = self.bn1(out)
        out = self.ln1(out)
        out = self.act_fn(out)
        out = self.dropout(out)
        out = self.fc2(out)
        out = self.bn2(out)
        out = self.ln2(out)
        out += identity  # Residual connection
        out = self.act_fn(out)
        return out

class MyResNet(L.LightningModule):
    def __init__(
        self,
        l1: int,
        epochs: int,
        batch_size: int,
        initialization: str,
        act_fn: nn.Module,
        optimizer: str,
        dropout_prob: float,
        lr_mult: float,
        patience: int,
        _L_in: int,
        _L_out: int,
        _torchmetric: str,
    ):
        super().__init__()
        self._L_in = _L_in
        self._L_out = _L_out
        if _torchmetric is None:
            _torchmetric = "mean_squared_error"
        self._torchmetric = _torchmetric
        self.metric = getattr(torchmetrics.functional.regression, _torchmetric)
        self.save_hyperparameters(ignore=["_L_in", "_L_out", "_torchmetric"])
        self.example_input_array = torch.zeros((batch_size, self._L_in))
        
        if self.hparams.l1 < 4:
            raise ValueError("l1 must be at least 4")
        
        # Get hidden sizes
        hidden_sizes = self._get_hidden_sizes()
        layer_sizes = [self._L_in] + hidden_sizes

        # Construct the layers with Residual Blocks and Linear Layer at the end
        layers = []
        for i in range(len(layer_sizes) - 1):
            layers.append(
                ResidualBlock(
                    layer_sizes[i], 
                    layer_sizes[i + 1], 
                    self.hparams.act_fn, 
                    self.hparams.dropout_prob
                )
            )
        layers.append(nn.Linear(layer_sizes[-1], self._L_out))
        
        self.layers = nn.Sequential(*layers)

        # Initialization (Xavier, Kaiming, or Default)
        self.apply(self._init_weights)

    def _init_weights(self, module):        
        if isinstance(module, nn.Linear):
            if self.hparams.initialization == "xavier_uniform":
                nn.init.xavier_uniform_(module.weight)
            elif self.hparams.initialization == "xavier_normal":
                nn.init.xavier_normal_(module.weight)
            elif self.hparams.initialization == "kaiming_uniform":
                nn.init.kaiming_uniform_(module.weight)
            elif self.hparams.initialization == "kaiming_normal":
                nn.init.kaiming_normal_(module.weight)
            else: # "Default"
                nn.init.uniform_(module.weight)
            if module.bias is not None:
                nn.init.zeros_(module.bias)
    
    def _generate_div2_list(self, n, n_min) -> list:
        result = []
        current = n
        repeats = 1
        max_repeats = 4
        while current >= n_min:
            result.extend([current] * min(repeats, max_repeats))
            current = current // 2
            repeats = repeats + 1
        return result

    def _get_hidden_sizes(self):
        n_low = max(2, int(self._L_in / 4))  # Ensure minimum reasonable size
        n_high = max(self.hparams.l1, 2 * n_low)
        hidden_sizes = self._generate_div2_list(n_high, n_low)
        return hidden_sizes

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.layers(x)
        return x

    def _calculate_loss(self, batch):
        x, y = batch
        y = y.view(len(y), 1)
        y_hat = self(x)
        loss = self.metric(y_hat, y)
        return loss

    def training_step(self, batch: tuple) -> torch.Tensor:
        val_loss = self._calculate_loss(batch)
        return val_loss

    def validation_step(self, batch: tuple, batch_idx: int, prog_bar: bool = False) -> torch.Tensor:
        val_loss = self._calculate_loss(batch)
        self.log("val_loss", val_loss, prog_bar=prog_bar)
        self.log("hp_metric", val_loss, prog_bar=prog_bar)
        return val_loss

    def test_step(self, batch: tuple, batch_idx: int, prog_bar: bool = False) -> torch.Tensor:
        val_loss = self._calculate_loss(batch)
        self.log("val_loss", val_loss, prog_bar=prog_bar)
        self.log("hp_metric", val_loss, prog_bar=prog_bar)
        return val_loss

    def predict_step(self, batch: tuple, batch_idx: int, prog_bar: bool = False) -> torch.Tensor:
        x, y = batch
        yhat = self(x)
        y = y.view(len(y), 1)
        yhat = yhat.view(len(yhat), 1)
        return (x, y, yhat)

    def configure_optimizers(self):
        optimizer = optimizer_handler(
            optimizer_name=self.hparams.optimizer,
            params=self.parameters(),
            lr_mult=self.hparams.lr_mult
        )

        # Dynamic creation of milestones based on the number of epochs.
        num_milestones = 3  # Number of milestones to divide the epochs
        milestones = [int(self.hparams.epochs / (num_milestones + 1) * (i + 1)) for i in range(num_milestones)]

        # Print milestones for debug purposes
        print(f"Milestones: {milestones}")

        # Create MultiStepLR scheduler with dynamic milestones and learning rate multiplier.
        scheduler = optim.lr_scheduler.MultiStepLR(
            optimizer, 
            milestones=milestones, 
            gamma=0.1  # Decay factor
        )

        # Learning rate scheduler configuration
        lr_scheduler_config = {
            "scheduler": scheduler,
            "interval": "epoch",  # Adjust learning rate per epoch
            "frequency": 1,      # Apply the scheduler at every epoch
        }
        
        return {"optimizer": optimizer, "lr_scheduler": lr_scheduler_config}

56.2.2 `my_hyperdict.py`

import json
from spotpython.data import base
import pathlib


class MyHyperDict(base.FileConfig):
    """User specified hyperparameter dictionary.

    This class extends the FileConfig class to provide a dictionary for storing hyperparameters.

    Attributes:
        filename (str):
            The name of the file where the hyperparameters are stored.
    """

    def __init__(
        self,
        filename: str = "my_hyper_dict.json",
        directory: None = None,
    ) -> None:
        super().__init__(filename=filename, directory=directory)
        self.filename = filename
        self.directory = directory
        self.hyper_dict = self.load()

    @property
    def path(self):
        if self.directory:
            return pathlib.Path(self.directory).joinpath(self.filename)
        return pathlib.Path(__file__).parent.joinpath(self.filename)

    def load(self) -> dict:
        """Load the hyperparameters from the file.

        Returns:
            dict: A dictionary containing the hyperparameters.

        Examples:
            # Assume the user specified file `my_hyper_dict.json` is in the `./hyperdict/` directory.
            >>> user_lhd = MyHyperDict(filename='my_hyper_dict.json', directory='./hyperdict/')
        """
        with open(self.path, "r") as f:
            d = json.load(f)
        return d

56.2.3 `my_hyperdict.json`

 "MyResNet": {
        "l1": {
            "type": "int",
            "default": 3,
            "transform": "transform_power_2_int",
            "lower": 3,
            "upper": 10
        },
        "epochs": {
            "type": "int",
            "default": 4,
            "transform": "transform_power_2_int",
            "lower": 4,
            "upper": 9
        },
        "batch_size": {
            "type": "int",
            "default": 4,
            "transform": "transform_power_2_int",
            "lower": 1,
            "upper": 6
        },
        "act_fn": {
            "levels": [
                "Sigmoid",
                "Tanh",
                "ReLU",
                "LeakyReLU",
                "ELU",
                "Swish"
            ],
            "type": "factor",
            "default": "ReLU",
            "transform": "None",
            "class_name": "spotpython.torch.activation",
            "core_model_parameter_type": "instance()",
            "lower": 0,
            "upper": 5
        },
        "optimizer": {
            "levels": [
                "Adadelta",
                "Adagrad",
                "Adam",
                "AdamW",
                "SparseAdam",
                "Adamax",
                "ASGD",
                "NAdam",
                "RAdam",
                "RMSprop",
                "Rprop",
                "SGD"
            ],
            "type": "factor",
            "default": "SGD",
            "transform": "None",
            "class_name": "torch.optim",
            "core_model_parameter_type": "str",
            "lower": 0,
            "upper": 11
        },
        "dropout_prob": {
            "type": "float",
            "default": 0.01,
            "transform": "None",
            "lower": 0.0,
            "upper": 0.25
        },
        "lr_mult": {
            "type": "float",
            "default": 1.0,
            "transform": "None",
            "lower": 0.1,
            "upper": 10.0
        },
        "patience": {
            "type": "int",
            "default": 2,
            "transform": "transform_power_2_int",
            "lower": 2,
            "upper": 6
        },
        "initialization": {
            "levels": [
                "Default",
                "kaiming_uniform",
                "kaiming_normal",
                "xavier_uniform",
                "xavier_normal"
            ],
            "type": "factor",
            "default": "Default",
            "transform": "None",
            "core_model_parameter_type": "str",
            "lower": 0,
            "upper": 4
        }
    }

56.3 Summary

This section presented an introduction to the basic setup of hyperparameter tuning with spotpython and PyTorch Lightning using a ResNet model for the Diabetes data set.