Generator node for dataset splitting.
The splitter is configured with the name of an attribute. When its generate() method is called with a dataset, it successively yields all possible subsets of this dataset, one per unique attribute value, each containing the dataset samples/features that carry that value.
Dataset splitting is possible by sample attribute, or by feature attribute. The maximum number of splits can be limited, and custom attribute values may be provided.
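The splitting behaviour described above can be illustrated with a minimal pure-Python sketch. It is not the library's implementation: the function split_by_attribute, the dict-based samples, and the rule that a count limit keeps the first splits are all assumptions made for illustration. Only the parameter names attr, attr_values, and count are taken from this page.

```python
def split_by_attribute(samples, attr, attr_values=None, count=None):
    """Yield (value, subset) pairs, one per attribute value.

    Hypothetical helper: `samples` is a list of dicts, `attr` names the
    key to split on. This only mimics the idea, not the real class.
    """
    if attr_values is None:
        # Unique values, kept in order of first appearance.
        attr_values = list(dict.fromkeys(s[attr] for s in samples))
    for i, value in enumerate(attr_values):
        # Assumption: a count limit simply stops after the first `count` splits.
        if count is not None and i >= count:
            break
        # Select every sample whose attribute matches the current value.
        yield value, [s for s in samples if s[attr] == value]


samples = [
    {"data": 0.1, "chunks": 0},
    {"data": 0.4, "chunks": 1},
    {"data": 0.7, "chunks": 0},
    {"data": 0.9, "chunks": 2},
]

# One split per unique value of "chunks".
splits = list(split_by_attribute(samples, "chunks"))
# Limit the number of splits.
limited = list(split_by_attribute(samples, "chunks", count=2))
# Provide custom attribute values (and their order) instead of the unique ones.
custom = list(split_by_attribute(samples, "chunks", attr_values=[2, 0]))
```

Because the helper is a generator function, no subset is built until the caller iterates, which mirrors how generate() yields splits one at a time.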
Notes
Available conditional attributes:
(Conditional attributes enabled by default suffixed with +)
Methods
generate(ds) | Yield dataset splits. |
get_postproc() | Returns the post-processing node or None. |
get_space() | Query the processing space name of this node. |
reset() | |
set_postproc(node) | Assigns a post-processing node. Set to None to disable postprocessing. |
set_space(name) | Set the processing space name of this node. |
Parameters: | attr : str
attr_values : tuple
count : None or int
noslicing : bool
reverse : bool
ignore_values : tuple
enable_ca : None or list of str
disable_ca : None or list of str
space : str, optional
postproc : Node instance, optional
descr : str
|
---|
generate(ds)
Yield dataset splits.
Parameters: | ds : Dataset |
Returns: | generator |