convert
add_logical_columns(X, arity=2, operations=['and', 'or', 'xor'])
¶
Combines all features in a dataframe with each other using bitwise operations
Parameters:
Name | Type | Description | Default |
---|---|---|---|
X |
DataFrame
|
dataframe with features |
required |
arity |
int
|
the number of columns to combine at once |
2
|
operations |
list of str
|
the operations to apply. Possible values are ‘and’, ‘or’ and ‘xor’ |
['and', 'or', 'xor']
|
Returns:
Name | Type | Description |
---|---|---|
X |
DataFrame
|
dataframe with new features |
Examples:
>>> X = pd.DataFrame({"a": [True, False, True], "b": [True, True, False], "c": [False, False, True]})
>>> add_logical_columns(X)
a b c a_and_b a_and_c b_and_c a_or_b a_or_c b_or_c a_xor_b a_xor_c b_xor_c
0 True True False True False False True True True False True True
1 False True False False False False True False True True True False
2 True False True False True False True True True True False True
Source code in spotpython/utils/convert.py
83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 |
|
check_type(value)
¶
Check the type of the input value and return the type as a string.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
value |
object
|
The input value. |
required |
Returns:
Name | Type | Description |
---|---|---|
str |
str
|
The type of the input value as a string. Possible values are “int”, “float”, “str”, “bool”, or None. Checks for numpy types as well, i.e., np.integer, np.floating, np.str_, np.bool_. |
Examples:
>>> from spotpython.utils.convert import check_type
>>> check_type(5)
"int"
Source code in spotpython/utils/convert.py
185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 |
|
class_for_name(module_name, class_name)
¶
Returns a class for a given module and class name.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
module_name |
str
|
The name of the module. |
required |
class_name |
str
|
The name of the class. |
required |
Returns:
Name | Type | Description |
---|---|---|
object |
object
|
The class. |
Examples:
>>> from spotpython.utils.convert import class_for_name
from scipy.optimize import rosen
bounds = [(0,2), (0, 2), (0, 2), (0, 2), (0, 2)]
shgo_class = class_for_name("scipy.optimize", "shgo")
result = shgo_class(rosen, bounds)
Source code in spotpython/utils/convert.py
8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 |
|
get_Xy_from_df(df, target_column)
¶
Get X and y from a dataframe.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
df |
DataFrame
|
The input dataframe. |
required |
target_column |
str
|
The name of the target column. |
required |
Returns:
Name | Type | Description |
---|---|---|
tuple |
tuple
|
The tuple (X, y). |
Examples:
>>> from spotpython.utils.convert import get_Xy_from_df
>>> import pandas as pd
>>> df = pd.DataFrame({"a": [1, 2, 3], "b": [4, 5, 6], "c": [7, 8, 9]})
>>> X, y = get_Xy_from_df(df, "c")
Source code in spotpython/utils/convert.py
30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 |
|
get_shape(x)
¶
Get the shape of a numpy array x
.
This function returns the number of rows and columns of the input array x
.
If x
is a 1D array (shape (n,)
), it returns (n, None)
.
If x
is a 2D array (shape (n, k)
), it returns (n, k)
.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
x |
ndarray
|
The input numpy array. |
required |
Returns:
Name | Type | Description |
---|---|---|
tuple |
tuple
|
A tuple |
Examples:
>>> import numpy as np
>>> from spotpython.utils.convert import get_shape
>>> x1 = np.array([1, 2, 3])
>>> get_shape(x1)
(3, None)
>>> x2 = np.array([[1, 2], [3, 4], [5, 6]])
>>> get_shape(x2)
(3, 2)
Source code in spotpython/utils/convert.py
252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 |
|
map_to_True_False(value)
¶
Map the string value to a boolean value. If the value is “True” or “true”, return True. Otherwise, return False.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
value |
str
|
The string to be mapped to a boolean value. |
required |
Returns: bool: True if the value is “True” or “true”, False otherwise.
Examples:
from spotpython.utils.convert import map_to_True_False map_to_True_False(“True”) True
Source code in spotpython/utils/convert.py
121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 |
|
series_to_array(series)
¶
Converts a pandas series to a numpy array. Args: series (pandas.Series): The input series.
Returns:
Type | Description |
---|---|
ndarray
|
The output array. |
Examples:
>>> from spotpython.utils.convert import series_to_array
>>> import pandas as pd
>>> series = pd.Series([1, 2, 3])
>>> series_to_array(series)
array([1, 2, 3])
Source code in spotpython/utils/convert.py
62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 |
|
set_dataset_target_type(dataset, target='y')
¶
Set the target column to 0 and 1 for boolean and string values.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
dataset |
DataFrame
|
The input dataset. |
required |
target |
str
|
The name of the target column. Default is “y”. |
'y'
|
Returns:
Type | Description |
---|---|
DataFrame
|
pd.DataFrame: The dataset with boolean and string target column values set to 0 and 1. |
Examples:
from spotpython.utils.convert import set_dataset_target_type import pandas as pd dataset = pd.DataFrame({“a”: [1, 2, 3], “b”: [4, 5, 6], “c”: [7, 8, 9], “y”: [True, False, True]}) print(dataset) dataset = set_dataset_target_type(dataset) print(dataset) a b c y 0 1 4 7 True 1 2 5 8 False 2 3 6 9 True a b c y 0 1 4 7 1 1 2 5 8 0 2 3 6 9 1
Source code in spotpython/utils/convert.py
215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 |
|
set_shape(x, target_shape)
¶
Adjust the shape of a numpy array x
to match the target shape (n, k)
.
If the target shape is (n, None)
, the array is reshaped to 1D with n
elements.
If the target shape is (n, k)
, the array is reshaped to 2D with n
rows and k
columns.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
x |
ndarray
|
The input numpy array. |
required |
target_shape |
tuple
|
The target shape |
required |
Returns:
Type | Description |
---|---|
ndarray
|
numpy.ndarray: The reshaped numpy array. |
Raises:
Type | Description |
---|---|
ValueError
|
If the target shape is incompatible with the size of the array. |
Examples:
>>> import numpy as np
>>> from spotpython.utils.convert import set_shape
>>> x = np.array([1, 2, 3, 4])
>>> set_shape(x, (4, None))
array([1, 2, 3, 4])
>>> x = np.array([1, 2, 3, 4])
>>> set_shape(x, (2, 2))
array([[1, 2],
[3, 4]])
>>> x = np.array([[1, 2], [3, 4]])
>>> set_shape(x, (4, None))
array([1, 2, 3, 4])
Source code in spotpython/utils/convert.py
287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 |
|
sort_by_kth_and_return_indices(array, k)
¶
Sorts an array of arrays based on the k-th values in descending order and returns the indices of the original array entries.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
array |
list of lists
|
The array to be sorted. Each sub-array should have at least
|
required |
k |
int
|
The index (zero-based) of the element within each sub-array to sort by. |
required |
Returns:
Type | Description |
---|---|
list
|
list of int: Indices of the original array entries after sorting by the k-th value. |
Raises:
Type | Description |
---|---|
ValueError
|
If the input array is empty, None, or any sub-array does not have at least
|
Examples:
>>> from spotpython.utils.convert import sort_by_kth_and_return_indices
try:
array = [['x0', 85.50983192204619], ['x1', 100.0], ['x2', 81.35712613549178]]
k = 1 # Sort by the second element in each sub-array
indices = sort_by_kth_and_return_indices(array, k)
print("Indices of the sorted elements using the k-th value:", indices)
except ValueError as error:
print(f"Sorting failed due to: {error}")
Source code in spotpython/utils/convert.py
145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 |
|