ZeMA dataset API

An API for accessing the data in the ZeMA remaining-useful life dataset

class zema_emc_annotated.dataset.ExtractionDataType(value)[source]

Identifiers of data types in ZeMA dataset

VALUES

with value qudt:value

Type

str

UNCERTAINTIES

with value qudt:standardUncertainty

Type

str

class zema_emc_annotated.dataset.ZeMASamples(sample_size: SampleSize = SampleSize(idx_first_cycle=0, n_cycles=1, datapoints_per_cycle=1), normalize: bool = False, skip_hash_check: bool = False)[source]

Extracts requested number of samples of values with associated uncertainties

The underlying dataset is the annotated “Sensor data set of one electromechanical cylinder at ZeMA testbed (ZeMA DAQ and Smart-Up Unit)” by Dorst et al. [Dorst2021]. Each extracted sample will be cached in the download directory of the file, which is handled by pooch.os_cache(), where <AppName> evaluates to pooch. That way the concurrent retrieval of the same data is as performant as possible and can simply be left to zema_emc_annotated. Where ever the result of ZeMASamples is needed in an external code base, it should be safe to call it over and over without causing unnecessary extractions or even downloads. The underlying mechanism is Python’s built-in pickle.

Parameters
  • sample_size (SampleSize, optional) – tuple containing information about which samples to extract, defaults to default of SampleSize

  • normalize (bool, optional) – if True, then values are centered around zero and values and uncertainties are scaled to values’ unit std, defaults to False

  • skip_hash_check (bool, optional) – allow to circumvent strict hash checking during the retrieve of dataset file, to speed up concurrent calls as each check for the large file might take several seconds, defaults to False

uncertain_values

The collection of samples of values with associated uncertainties, will be of shape (sample_size.n_cycles, 11 x sample_size.datapoints_per_cycle)

Type

UncertainArray

property uncertainties: ndarray[Any, dtype[float64]]

The uncertainties of the stored UncertainArray object

property values: ndarray[Any, dtype[float64]]

The values of the stored UncertainArray object