kaishi.tabular.filters.duplicate_rows_each_dataframe

Class definition for filtering duplicate rows in each dataframe.

Module Contents

class kaishi.tabular.filters.duplicate_rows_each_dataframe.FilterDuplicateRowsEachDataframe

Bases: kaishi.core.pipeline_component.PipelineComponent

Filter duplicate rows in each dataframe of a tabular dataset.

__call__(self, dataset)

Perform the filter operation on a given tabular dataset.

Parameters

dataset (kaishi.tabular.dataset.TabularDataset) – dataset to perform operation on