repair
remove_nan(X, y)
¶
Remove rows from X and y where y contains NaN values.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
X |
ndarray
|
X array |
required |
y |
ndarray
|
y array |
required |
Returns:
Type | Description |
---|---|
Tuple[ndarray, ndarray]
|
Tuple[numpy.ndarray, numpy.ndarray]: X and y arrays with rows containing NaN values in y removed |
Examples:
>>> X = np.array([[1, 2], [3, 4], [5, 6]])
>>> y = np.array([1, np.nan, 2])
>>> remove_nan(X, y)
(array([[1, 2],
[5, 6]]), array([1., 2.]))
Source code in spotPython/utils/repair.py
29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 |
|
repair_non_numeric(X, var_type)
¶
Round non-numeric values to integers. This applies to all variables except for “num” and “float”.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
X |
ndarray
|
X array |
required |
var_type |
list
|
list with type information |
required |
Returns:
Type | Description |
---|---|
ndarray
|
numpy.ndarray: X array with non-numeric values rounded to integers |
Examples:
>>> X = np.array([[1.2, 2.3], [3.4, 4.5]])
>>> var_type = ["num", "factor"]
>>> repair_non_numeric(X, var_type)
array([[1., 2.],
[3., 4.]])
Source code in spotPython/utils/repair.py
5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
|