Dataset
A simple adaptor class to make the data in required format that can be easily consumed and processd by the models.
Data
Data (df:pandas.core.frame.DataFrame, reduce_datatype:bool=True, encode_category:str=None, add_intercept:bool=False, na_treatment:str='allow', copy_data:bool=False, digits:int=None, n_category:Union[int,float,NoneType]=None)
Dataset Adaptor Class
This class is meant to make dataset possible which would be consumed by models further
Type | Default | Details | |
---|---|---|---|
df | DataFrame | Dataframe that needs to be converted | |
reduce_datatype | bool | True | Shall we try to reduce datatype to make is smaller |
encode_category | str | None | Do encoding of categories default to None as no encoding |
add_intercept | bool | False | Add a constant value intercept to data. This might be needed for Model based Trees. |
na_treatment | str | allow | How to work with nas. Default: ‘allow’ |
copy_data | bool | False | Keep a self copy of original data |
digits | int | None | To round float to certain digits or not, Default: None means no rounding |
n_category | Union | None | How many different level shoud be treated as category. If a value less than one the number of levels is defined aas % oft total rows |
Returns | None |
How to work on data
Please refer Examples