MRMRFeatureSelectionTransform¶
- class MRMRFeatureSelectionTransform(relevance_table: etna.analysis.feature_relevance.relevance.RelevanceTable, top_k: int, features_to_use: Union[List[str], Literal['all']] = 'all', relevance_aggregation_mode: str = AggregationMode.mean, redundancy_aggregation_mode: str = AggregationMode.mean, atol: float = 1e-10, **relevance_params)[source]¶
Bases:
etna.transforms.feature_selection.base.BaseFeatureSelectionTransform
Transform that selects features according to MRMR variable selection method adapted to the timeseries case.
Notes
Transform works with any type of features, however most of the models works only with regressors. Therefore, it is recommended to pass the regressors into the feature selection transforms.
Init MRMRFeatureSelectionTransform.
- Parameters
relevance_table (etna.analysis.feature_relevance.relevance.RelevanceTable) – method to calculate relevance table
top_k (int) – num of features to select; if there are not enough features, then all will be selected
features_to_use (Union[List[str], Literal['all']]) – columns of the dataset to select from if “all” value is given, all columns are used
relevance_aggregation_mode (str) – the method for relevance values per-segment aggregation
redundancy_aggregation_mode (str) – the method for redundancy values per-segment aggregation
atol (float) – the absolute tolerance to compare the float values
- Inherited-members
Methods
fit
(df)Fit the method and remember features to select.
fit_transform
(df)May be reimplemented.
inverse_transform
(df)Inverse transforms dataframe.
transform
(df)Select top_k features.
- fit(df: pandas.core.frame.DataFrame) etna.transforms.feature_selection.feature_importance.MRMRFeatureSelectionTransform [source]¶
Fit the method and remember features to select.
- Parameters
df (pandas.core.frame.DataFrame) – dataframe with all segments data
- Returns
result – instance after fitting
- Return type