data.diabetes.get_diabetes_dataloaders(
test_size=0.2,
batch_size=32,
scale_features=True,
shuffle_train=True,
shuffle_test=False,
random_state=42,
num_workers=0,
pin_memory=False,
)
Returns train and test dataloaders for the Diabetes dataset.
Parameters
| test_size |
float |
Fraction of data to use for testing. |
0.2 |
| batch_size |
int |
Batch size. |
32 |
| scale_features |
bool |
Whether to standardize features using StandardScaler. |
True |
| shuffle_train |
bool |
Whether to shuffle the training data. |
True |
| shuffle_test |
bool |
Whether to shuffle the test data. |
False |
| random_state |
int |
Random seed for splitting. |
42 |
| num_workers |
int |
Number of subprocesses to use for data loading. |
0 |
| pin_memory |
bool |
If True, the data loader will copy Tensors into CUDA pinned memory before returning them. |
False |
Returns
| tuple |
Tuple[DataLoader, DataLoader, Optional[StandardScaler]] |
(train_loader, test_loader, scaler) scaler is the StandardScaler implementation if scale_features=True, else None. |
Examples
from spotoptim.data.diabetes import get_diabetes_dataloaders
train_loader, test_loader, scaler = get_diabetes_dataloaders()
print(train_loader)
print(test_loader)
print(scaler)
<torch.utils.data.dataloader.DataLoader object at 0x12151c2f0>
<torch.utils.data.dataloader.DataLoader object at 0x12125a0d0>
StandardScaler()