rfwtools.data_set

A module that manages the high level workflow from label files to FeatureSet

This module contains a single class DataSet. It is intended to manage a standard workflow from reading and parsing label files to generating a validated set of examples (ExampleSet) to creating a set of features that represent the examples for model building purposes (FeatureSet).

Typical usage example:

ds = DataSet()
ds.produce_example_set()
ds.produce_feature_set(my_feature_extract_func)
ds.save_example_set_csv("my_examples.csv")
ds.save_feature_set_csv("my_features.csv")

It is also possible to specify the type of Example that should be contained in the loaded ExampleSet.

ds = DataSet()
ds.load_example_set_csv("my_windowed_examples.csv", e_type=ExampleType.WINDOWED_EXAMPLE,
                        example_kwargs={'start': -1000.0, 'n_samples': 100})

Classes

DataSet([label_files, example_validator, ...])

This class manages a standard workflow from label files on disk to the generation of features