PreprocessResponse
code_loader.contract.datasetclasses.PreprocessResponse
An object that holds the samples data and length. This object is generated in the Preprocessing Function for each dataset slice. Then passed to the input encoders, ground_truth encoders and metadata functions as an argument.
from code_loader.contract.enums import DataStateType
@dataclass
PreprocessResponse:
length: Optional[int] = None,
data: Any = None,
sample_ids: Optional[Union[List[str], List[int]]] = None,
state: Optional[DataStateType] = None,
sample_id_type: Optional[Union[Type[str], Type[int]]] = None
For more on PreprocessResponse
:
length
(int) value with the amount of samples in the data.
data
(Any) Dictionary / pandas.DataFrame / List or any object that describes the dataset features. The data
parameter is later passed to the input encoders, ground_truth encoders, and metadata functions.
sample_ids
(optional) a list of unique identifiers for the input, of the same length as your dataset. IDs should be either a list of ints or strings
state
(DataStateType, optional). Defaults to None. If this is not supplied the state is infered according to its order an is estimated to be [training, validatin, test, unlabeled]
sample_id_type
(str/int, optional). What is the type of the sample_id if provided.
Examples
Basic Usage
from code_loader.contract.datasetclasses import PreprocessResponse
preprocess_response = PreprocessResponse(length=len(x_df.index), data=x_df)
Within the Preprocess Function
Full examples can be found at the Dataset Integration section of the following guides:
Last updated
Was this helpful?