Skip to content

transform

cod_to_nat_X(cod_X, cod_type, min_X=None, max_X=None, mean_X=None, std_X=None)

Compute natural X-values from coded units based on the setting of the cod_type attribute. If cod_type is “norm”, the values are de-normalized from [0,1]. If cod_type is “std”, the values are de-standardized. Otherwise, the values are not modified.

Parameters:

Name Type Description Default
cod_X array

The coded X-values.

required
cod_type str

The type of coding (“norm”, “std”, or other).

required
min_X array

The minimum values of X. Defaults to None.

None
max_X array

The maximum values of X. Defaults to None.

None
mean_X array

The mean values of X. Defaults to None.

None
std_X array

The standard deviation of X. Defaults to None.

None

Returns:

Name Type Description
X array

The natural (physical or real world) X-values.

Source code in spotpython/utils/transform.py
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
def cod_to_nat_X(cod_X, cod_type, min_X=None, max_X=None, mean_X=None, std_X=None) -> np.ndarray:
    """
    Compute natural X-values from coded units based on the
    setting of the `cod_type` attribute. If `cod_type` is "norm", the values are
    de-normalized from [0,1]. If `cod_type` is "std", the values are de-standardized.
    Otherwise, the values are not modified.

    Args:
        cod_X (np.array):
            The coded X-values.
        cod_type (str):
            The type of coding ("norm", "std", or other).
        min_X (np.array):
            The minimum values of X. Defaults to None.
        max_X (np.array):
            The maximum values of X. Defaults to None.
        mean_X (np.array):
            The mean values of X. Defaults to None.
        std_X (np.array):
            The standard deviation of X. Defaults to None.

    Returns:
        X (np.array): The natural (physical or real world) X-values.
    """
    X_copy = copy.deepcopy(cod_X)
    # k is the number of columns in X, i.e., the dimension of the input space.
    k = cod_X.shape[1]
    if cod_type == "norm":
        # De-normalize X from [0,1] column-wise.
        for i in range(k):
            X_copy[:, i] = X_copy[:, i] * (max_X[i] - min_X[i]) + min_X[i]
        X = X_copy
    elif cod_type == "std":
        # De-standardize X column-wise.
        for i in range(k):
            X_copy[:, i] = X_copy[:, i] * std_X[i] + mean_X[i]
        X = X_copy
    else:
        X = X_copy
    return X

cod_to_nat_y(cod_y, cod_type, min_y=None, max_y=None, mean_y=None, std_y=None)

Compute natural y-values from coded units based on the setting of the cod_type attribute. If cod_type is “norm”, the values are de-normalized from [0,1]. If cod_type is “std”, the values are de-standardized. Otherwise, the values are not modified.

Parameters:

Name Type Description Default
cod_y array

The coded y-values.

required
cod_type str

The type of coding (“norm”, “std”, or other).

required
min_y array

The minimum values of y. Defaults to None.

None
max_y array

The maximum values of y. Defaults to None.

None
mean_y array

The mean values of y. Defaults to None.

None
std_y array

The standard deviation of y. Defaults to None.

None

Returns:

Name Type Description
y array

The natural (physical or real world) y-values.

Source code in spotpython/utils/transform.py
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
def cod_to_nat_y(cod_y, cod_type, min_y=None, max_y=None, mean_y=None, std_y=None) -> np.ndarray:
    """
    Compute natural y-values from coded units based on the
    setting of the `cod_type` attribute. If `cod_type` is "norm", the values are
    de-normalized from [0,1]. If `cod_type` is "std", the values are de-standardized.
    Otherwise, the values are not modified.

    Args:
        cod_y (np.array):
            The coded y-values.
        cod_type (str):
            The type of coding ("norm", "std", or other).
        min_y (np.array):
            The minimum values of y. Defaults to None.
        max_y (np.array):
            The maximum values of y. Defaults to None.
        mean_y (np.array):
            The mean values of y. Defaults to None.
        std_y (np.array):
            The standard deviation of y. Defaults to None.

    Returns:
        y (np.array): The natural (physical or real world) y-values.
    """
    y_copy = copy.deepcopy(cod_y)
    if cod_type == "norm":
        y = y_copy * (max_y - min_y) + min_y
    elif cod_type == "std":
        y = y_copy * std_y + mean_y
    else:
        y = y_copy
    return y

nat_to_cod_X(X, cod_type)

Compute coded X-values from natural (physical or real world) units based on the setting of the cod_type attribute. If cod_type is “norm”, the values are normalized to [0,1]. If cod_type is “std”, the values are standardized. Otherwise, the values are not modified.

Parameters:

Name Type Description Default
X array

The input array.

required
cod_type str

The type of coding (“norm”, “std”, or other).

required

Returns:

Name Type Description
cod_X array

The coded X-values.

min_X array

The minimum values of X.

max_X array

The maximum values of X.

mean_X array

The mean values of X.

std_X array

The standard deviation of X.

Source code in spotpython/utils/transform.py
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
def nat_to_cod_X(X, cod_type):
    """
    Compute coded X-values from natural (physical or real world) units based on the
    setting of the `cod_type` attribute. If `cod_type` is "norm", the values are
    normalized to [0,1]. If `cod_type` is "std", the values are standardized.
    Otherwise, the values are not modified.

    Args:
        X (np.array): The input array.
        cod_type (str): The type of coding ("norm", "std", or other).

    Returns:
        cod_X (np.array): The coded X-values.
        min_X (np.array): The minimum values of X.
        max_X (np.array): The maximum values of X.
        mean_X (np.array): The mean values of X.
        std_X (np.array): The standard deviation of X.
    """
    min_X = np.min(X, axis=0)
    max_X = np.max(X, axis=0)
    mean_X = np.mean(X, axis=0)
    # make std_X array similar to mean_X array
    std_X = np.zeros_like(mean_X)
    X_copy = copy.deepcopy(X)
    # k is the number of columns in X, i.e., the dimension of the input space.
    k = X.shape[1]
    if cod_type == "norm":
        # Normalize X to [0,1] column-wise. If the range is zero, set the value to 0.5.
        for i in range(k):
            if max_X[i] - min_X[i] == 0:
                X_copy[:, i] = 0.5
            else:
                X_copy[:, i] = (X_copy[:, i] - min_X[i]) / (max_X[i] - min_X[i])
        cod_X = X_copy
    elif cod_type == "std":
        # Standardize X column-wise. If the standard deviation is zero, do not divide.
        for i in range(k):
            if max_X[i] - min_X[i] == 0:
                X_copy[:, i] = 0
            else:
                std_X[i] = np.std(X_copy[:, i], ddof=1)
                X_copy[:, i] = (X_copy[:, i] - mean_X[i]) / std_X[i]
        cod_X = X_copy
    else:
        cod_X = X_copy
    return cod_X, min_X, max_X, mean_X, std_X

nat_to_cod_y(y, cod_type)

Compute coded y-values from natural (physical or real world) units based on the setting of the cod_type attribute. If cod_type is “norm”, the values are normalized to [0,1]. If cod_type is “std”, the values are standardized. Otherwise, the values are not modified.

Parameters:

Name Type Description Default
y array

The input array.

required
cod_type str

The type of coding (“norm”, “std”, or other).

required

Returns:

Name Type Description
cod_y array

The coded y-values.

min_y array

The minimum values of y.

max_y array

The maximum values of y.

mean_y array

The mean values of y.

std_y array

The standard deviation of y.

Source code in spotpython/utils/transform.py
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
def nat_to_cod_y(y, cod_type) -> np.ndarray:
    """
    Compute coded y-values from natural (physical or real world) units based on the
    setting of the `cod_type` attribute. If `cod_type` is "norm", the values are
    normalized to [0,1]. If `cod_type` is "std", the values are standardized.
    Otherwise, the values are not modified.

    Args:
        y (np.array): The input array.
        cod_type (str): The type of coding ("norm", "std", or other).

    Returns:
        cod_y (np.array):
            The coded y-values.
        min_y (np.array):
            The minimum values of y.
        max_y (np.array):
            The maximum values of y.
        mean_y (np.array):
            The mean values of y.
        std_y (np.array):
            The standard deviation of y.
    """
    mean_y = np.mean(y)
    std_y = None
    min_y = min(y)
    max_y = max(y)
    y_copy = copy.deepcopy(y)
    if cod_type == "norm":
        if (max_y - min_y) != 0:
            cod_y = (y_copy - min_y) / (max_y - min_y)
        else:
            cod_y = 0.5 * np.ones_like(y_copy)
    elif cod_type == "std":
        if (max_y - min_y) != 0:
            std_y = np.std(y, ddof=1)
            cod_y = (y_copy - mean_y) / std_y
        else:
            cod_y = np.zeros_like(y_copy)
    else:
        cod_y = y_copy
    return cod_y, min_y, max_y, mean_y, std_y

scale(X, lower, upper)

Sample scaling from unit hypercube to different bounds. Converts a sample from [0, 1) to [a, b). The following transformation is used: (b - a) * X + a

Note

equal lower and upper bounds are feasible.

Parameters:

Name Type Description Default
X array

Sample to scale.

required
lower array

lower bound of transformed data.

required
upper array

upper bounds of transformed data.

required

Returns:

Type Description
array

Scaled sample.

Examples:

Transform three samples in the unit hypercube to (lower, upper) bounds:

>>> import numpy as np
>>> from scipy.stats import qmc
>>> from spotpython.utils.transform import scale
>>> lower = np.array([6, 0])
>>> upper = np.array([6, 5])
>>> sample = np.array([[0.5 , 0.75],
>>>             [0.5 , 0.5],
>>>             [0.75, 0.25]])
>>> scale(sample, lower, upper)
Source code in spotpython/utils/transform.py
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
def scale(X: np.ndarray, lower: np.ndarray, upper: np.ndarray) -> np.ndarray:
    """
    Sample scaling from unit hypercube to different bounds. Converts a sample from `[0, 1)` to `[a, b)`.
    The following transformation is used:
    `(b - a) * X + a`

    Note:
        equal lower and upper bounds are feasible.

    Args:
        X (array):
            Sample to scale.
        lower (array):
            lower bound of transformed data.
        upper (array):
            upper bounds of transformed data.

    Returns:
        (array):
            Scaled sample.

    Examples:
        Transform three samples in the unit hypercube to (lower, upper) bounds:

        >>> import numpy as np
        >>> from scipy.stats import qmc
        >>> from spotpython.utils.transform import scale
        >>> lower = np.array([6, 0])
        >>> upper = np.array([6, 5])
        >>> sample = np.array([[0.5 , 0.75],
        >>>             [0.5 , 0.5],
        >>>             [0.75, 0.25]])
        >>> scale(sample, lower, upper)

    """
    # Checking that X is within (0,1) interval
    if (X.max() > 1.0) or (X.min() < 0.0):
        raise ValueError("Sample is not in unit hypercube")
    # Vectorized scaling operation
    X = (upper - lower) * X + lower
    # Handling case where lower == upper
    X[:, lower == upper] = lower[lower == upper]
    return X

transform_hyper_parameter_values(fun_control, hyper_parameter_values)

Transform the values of the hyperparameters according to the transform function specified in fun_control if the hyperparameter is of type “int”, or “float” or “num”. Let fun_control = {“core_model_hyper_dict”:{ “leaf_prediction”: { “levels”: [“mean”, “model”, “adaptive”], “type”: “factor”, “default”: “mean”, “core_model_parameter_type”: “str”}, “max_depth”: { “type”: “int”, “default”: 20, “transform”: “transform_power_2”, “lower”: 2, “upper”: 20}}} and v = {‘max_depth’: 20,’leaf_prediction’: ‘mean’} and def transform_power_2(x): return 2**x. The function takes fun_control and v as input and returns a dictionary with the same structure as v. The function transforms the values of the hyperparameters according to the transform function specified in fun_control if the hyperparameter is of type “int”, or “float” or “num”. For example, transform_hyper_parameter_values(fun_control, v) returns {‘max_depth’: 1048576, ‘leaf_prediction’: ‘mean’}.

Parameters:

Name Type Description Default
fun_control dict

A dictionary containing the information about the core model and the hyperparameters.

required
hyper_parameter_values dict

A dictionary containing the values of the hyperparameters.

required

Returns:

Type Description
dict

A dictionary containing the values of the hyperparameters.

Examples:

>>> import copy
    from spotpython.utils.prepare import transform_hyper_parameter_values
    fun_control = {
    "core_model_hyper_dict": {
        "leaf_prediction": {
            "levels": ["mean", "model", "adaptive"],
            "type": "factor",
            "default": "mean",
            "core_model_parameter_type": "str"},
        "max_depth": {"type": "int",
                      "default": 20
                      "transform": "transform_power_2",
                      "lower": 2,
                      "upper": 20}}}
    hyper_parameter_values = {'max_depth': 20,
                              'leaf_prediction': 'mean'}
    transform_hyper_parameter_values(fun_control, hyper_parameter_values)
    {'max_depth': 1048576,
     'leaf_prediction': 'mean'}
Source code in spotpython/utils/transform.py
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
def transform_hyper_parameter_values(fun_control, hyper_parameter_values):
    """
    Transform the values of the hyperparameters according to the transform function specified in fun_control
    if the hyperparameter is of type "int", or "float" or "num".
    Let fun_control = {"core_model_hyper_dict":{ "leaf_prediction":
    { "levels": ["mean", "model", "adaptive"], "type": "factor", "default": "mean", "core_model_parameter_type": "str"},
    "max_depth": { "type": "int", "default": 20, "transform": "transform_power_2", "lower": 2, "upper": 20}}}
    and v = {'max_depth': 20,'leaf_prediction': 'mean'} and def transform_power_2(x): return 2**x.
    The function takes fun_control and v as input and returns a dictionary with the same structure as v.
    The function transforms the values of the hyperparameters according to the transform function
    specified in fun_control if the hyperparameter is of type "int", or "float" or "num".
    For example, transform_hyper_parameter_values(fun_control, v) returns
     {'max_depth': 1048576, 'leaf_prediction': 'mean'}.

    Args:
        fun_control (dict):
            A dictionary containing the information about the core model and the hyperparameters.
        hyper_parameter_values (dict):
            A dictionary containing the values of the hyperparameters.

    Returns:
        (dict):
            A dictionary containing the values of the hyperparameters.

    Examples:
        >>> import copy
            from spotpython.utils.prepare import transform_hyper_parameter_values
            fun_control = {
            "core_model_hyper_dict": {
                "leaf_prediction": {
                    "levels": ["mean", "model", "adaptive"],
                    "type": "factor",
                    "default": "mean",
                    "core_model_parameter_type": "str"},
                "max_depth": {"type": "int",
                              "default": 20
                              "transform": "transform_power_2",
                              "lower": 2,
                              "upper": 20}}}
            hyper_parameter_values = {'max_depth': 20,
                                      'leaf_prediction': 'mean'}
            transform_hyper_parameter_values(fun_control, hyper_parameter_values)
            {'max_depth': 1048576,
             'leaf_prediction': 'mean'}
    """
    hyper_parameter_values = copy.deepcopy(hyper_parameter_values)
    for key, value in hyper_parameter_values.items():
        if fun_control["core_model_hyper_dict"][key]["type"] in ["int", "float", "num", "factor"] and fun_control["core_model_hyper_dict"][key]["transform"] != "None":
            hyper_parameter_values[key] = eval(fun_control["core_model_hyper_dict"][key]["transform"])(value)
    return hyper_parameter_values

transform_multby2_int(x)

Transformations for hyperparameters of type int.

Parameters:

Name Type Description Default
x int

input, will be multiplied by 2

required

Returns:

Type Description
int

The result of multiplying x by 2.

Examples:

>>> from spotpython.utils.transform import transform_multby2_int
>>> transform_multby2_int(3)
6
Source code in spotpython/utils/transform.py
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
def transform_multby2_int(x: int) -> int:
    """Transformations for hyperparameters of type int.

    Args:
        x (int):
            input, will be multiplied by 2

    Returns:
        (int):
            The result of multiplying x by 2.

    Examples:
        >>> from spotpython.utils.transform import transform_multby2_int
        >>> transform_multby2_int(3)
        6
    """
    return int(2 * x)

transform_none_to_None(x)

Transformations for hyperparameters of type None.

Parameters:

Name Type Description Default
x str

The string to transform.

required

Returns:

Type Description
str

The transformed string.

Examples:

>>> from spotpython.utils.transform import transform_none_to_None
>>> transform_none_to_None("none")
None
Note

Needed for sklearn.linear_model.LogisticRegression

Source code in spotpython/utils/transform.py
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
def transform_none_to_None(x):
    """
    Transformations for hyperparameters of type None.

    Args:
        x (str): The string to transform.

    Returns:
        (str): The transformed string.

    Examples:
        >>> from spotpython.utils.transform import transform_none_to_None
        >>> transform_none_to_None("none")
        None

    Note:
        Needed for sklearn.linear_model.LogisticRegression
    """
    if x == "none":
        return None
    else:
        return x

transform_power(base, x, as_int=False)

Raises a given base to the power of x.

Parameters:

Name Type Description Default
base int

The base to raise to the power of x.

required
x int

The exponent.

required
as_int bool

If True, returns the result as an integer.

False

Returns:

Type Description
float

The result of raising the base to the power of x.

Examples:

>>> from spotpython.utils.transform import transform_power
>>> transform_power(2, 3)
8
Source code in spotpython/utils/transform.py
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
def transform_power(base: int, x: int, as_int: bool = False) -> float:
    """
    Raises a given base to the power of x.

    Args:
        base (int):
            The base to raise to the power of x.
        x (int):
            The exponent.
        as_int (bool):
            If True, returns the result as an integer.

    Returns:
        (float):
            The result of raising the base to the power of x.

    Examples:
        >>> from spotpython.utils.transform import transform_power
        >>> transform_power(2, 3)
        8
    """
    result = base**x
    if as_int:
        result = int(result)
    return result

transform_power_10(x)

Transformations for hyperparameters of type float.

Parameters:

Name Type Description Default
x float

The exponent.

required

Returns:

Type Description
float

The result of raising 10 to the power of x.

Examples:

>>> from spotpython.utils.transform import transform_power_10
>>> transform_power_10(3)
1000
Source code in spotpython/utils/transform.py
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
def transform_power_10(x):
    """Transformations for hyperparameters of type float.

    Args:
        x (float): The exponent.

    Returns:
        (float): The result of raising 10 to the power of x.

    Examples:
        >>> from spotpython.utils.transform import transform_power_10
        >>> transform_power_10(3)
        1000
    """
    return 10**x

transform_power_10_int(x)

Transformations for hyperparameters of type int. Args: x (int): The exponent.

Returns:

Type Description
int

The result of raising 10 to the power of x.

Examples:

>>> from spotpython.utils.transform import transform_power_10_int
>>> transform_power_10_int(3)
1000
Source code in spotpython/utils/transform.py
86
87
88
89
90
91
92
93
94
95
96
97
98
99
def transform_power_10_int(x: int) -> int:
    """Transformations for hyperparameters of type int.
    Args:
        x (int): The exponent.

    Returns:
        (int): The result of raising 10 to the power of x.

    Examples:
        >>> from spotpython.utils.transform import transform_power_10_int
        >>> transform_power_10_int(3)
        1000
    """
    return int(10**x)

transform_power_2(x)

Transformations for hyperparameters of type float.

Parameters:

Name Type Description Default
x float

The exponent.

required

Returns:

Type Description
float

The result of raising 2 to the power of x.

Examples:

>>> from spotpython.utils.transform import transform_power_2
>>> transform_power_2(3)
8
Source code in spotpython/utils/transform.py
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
def transform_power_2(x):
    """Transformations for hyperparameters of type float.

    Args:
        x (float): The exponent.

    Returns:
        (float): The result of raising 2 to the power of x.

    Examples:
        >>> from spotpython.utils.transform import transform_power_2
        >>> transform_power_2(3)
        8
    """
    return 2**x

transform_power_2_int(x)

Transformations for hyperparameters of type int.

Parameters:

Name Type Description Default
x int

The exponent.

required

Returns:

Type Description
int

The result of raising 2 to the power of x.

Examples:

>>> from spotpython.utils.transform import transform_power_2_int
>>> transform_power_2_int(3)
8
Source code in spotpython/utils/transform.py
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
def transform_power_2_int(x: int) -> int:
    """Transformations for hyperparameters of type int.

    Args:
        x (int): The exponent.

    Returns:
        (int): The result of raising 2 to the power of x.

    Examples:
        >>> from spotpython.utils.transform import transform_power_2_int
        >>> transform_power_2_int(3)
        8
    """
    return int(2**x)