Input data set properties file format

Juan Lao Tebar edited this page Jan 21, 2017 · 4 revisions

This document specifies the format used for defining combinations of input data set properties.

Introduction

This format uses JSON notation, extended with support for comments written as // ... and /* ... */.
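Standard JSON parsers reject comments, so a file in this format usually has to be cleaned before parsing. The following is a minimal Python sketch of one way to do that; it is not taken from the project, and the function names strip_comments and load_properties are hypothetical. It assumes comments never appear inside string values, and it protects string literals so that text such as "a // b" is left intact.

```python
import json
import re

# One alternation: a JSON string literal (captured) or a comment (not captured).
# Matching strings first prevents "//" inside a string from being treated
# as the start of a comment.
_TOKEN = re.compile(
    r'("(?:\\.|[^"\\])*")'   # string literal, kept as-is
    r'|//[^\n]*'             # line comment, up to the end of the line
    r'|/\*.*?\*/',           # block comment, possibly spanning lines
    re.DOTALL,
)


def strip_comments(text):
    """Remove // and /* */ comments, leaving string literals untouched."""
    return _TOKEN.sub(lambda m: m.group(1) or "", text)


def load_properties(text):
    """Parse the contents of a properties file (hypothetical helper)."""
    return json.loads(strip_comments(text))


if __name__ == "__main__":
    sample = """
    [
        // first combination
        {"seed": 1, "target": "originalPatterns" /* no scaling */}
    ]
    """
    print(load_properties(sample))
```

The regular expression approach keeps the sketch short; a project that already depends on a comment-tolerant JSON library could use that instead.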

An example file that follows this format is available at config/input/final.json.

Format

The root element of the file must be an array ([...]) containing one object ({...}) for each combination of input data set properties to be tested.

The file has a structure like this:

[
    {...},
    {...},
    ...
]
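The structural rule above (an array at the root, with one object per combination) can be checked after parsing. This is a small Python sketch under that reading of the rule; check_root is a hypothetical name, not part of the project.

```python
import json


def check_root(data):
    """Raise ValueError unless data is an array whose elements are all objects."""
    if not isinstance(data, list):
        raise ValueError("root element must be an array")
    for index, item in enumerate(data):
        if not isinstance(item, dict):
            raise ValueError(f"element {index} must be an object")
    return data


if __name__ == "__main__":
    # Two combinations, each represented by one object.
    check_root(json.loads('[{"seed": 1}, {"seed": 2}]'))
```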

Each object in the array has the following properties:

  • seed (integer) defines a seed for the random generator, to reproduce experiments.
  • inputsPerPattern (integer) defines the number of inputs that must be generated for each pattern.
  • target ("originalPatterns", "transformedPatterns") defines whether the input data set is created from the original patterns or from the transformed ones (after scaling and adding extra bits to them).
  • minDistance (object) optionally defines a constraint applied to each input. It contains the following keys:
    • mean (float >= 0) indicates the mean of the minimum distance between an input and the pattern from which it originates.
    • stdev (float >= 0) indicates the standard deviation of the minimum distance between an input and the pattern from which it originates.
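
To make the property list concrete, the Python sketch below embeds an illustrative file with two combinations and checks each object against the types listed above. The values are made up for illustration and are not taken from config/input/final.json. The function validate_combination is hypothetical, and the sketch assumes that seed, inputsPerPattern and target are required and that minDistance, when present, must contain both mean and stdev; the format description only states that minDistance is optional.

```python
import json

# Illustrative content only; the numbers are arbitrary.
EXAMPLE = """
[
    {
        "seed": 1,
        "inputsPerPattern": 10,
        "target": "originalPatterns"
    },
    {
        "seed": 2,
        "inputsPerPattern": 10,
        "target": "transformedPatterns",
        "minDistance": {"mean": 3.0, "stdev": 0.5}
    }
]
"""

TARGETS = ("originalPatterns", "transformedPatterns")


def _is_int(value):
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool)


def _is_non_negative_number(value):
    # JSON numbers without a fraction parse as int, so accept both types.
    return (
        isinstance(value, (int, float))
        and not isinstance(value, bool)
        and value >= 0
    )


def validate_combination(combo):
    """Return a list of problems found in one combination object."""
    problems = []
    # Assumption: these two keys are required.
    for key in ("seed", "inputsPerPattern"):
        if not _is_int(combo.get(key)):
            problems.append(f"{key} must be an integer")
    if combo.get("target") not in TARGETS:
        problems.append("target must be one of " + ", ".join(TARGETS))
    # minDistance is optional; validate it only when present.
    if "minDistance" in combo:
        min_distance = combo["minDistance"]
        if not isinstance(min_distance, dict):
            problems.append("minDistance must be an object")
        else:
            for key in ("mean", "stdev"):
                if not _is_non_negative_number(min_distance.get(key)):
                    problems.append(f"minDistance.{key} must be a float >= 0")
    return problems


if __name__ == "__main__":
    for index, combo in enumerate(json.loads(EXAMPLE)):
        print(index, validate_combination(combo) or "ok")
```

Returning a list of problems instead of raising on the first one lets a caller report every issue in a file at once.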