ENH: Adds id to support output caching #83

thomasjpfan · 2018-07-01T21:09:25Z

Fixes #39

This PR adds an optional id field to data dictionary. When cache_output is set to True, theid field is appended to step.nameto distinguish between output caches produced by different data dictionaries.

For example:

data_train = {
    'id': 'data_train'
    'input': {
        'features': np.array([
            [1, 6],
            [2, 5],
            [3, 4]
        ]),
        'labels': np.array([2, 5, 3]),
    }
}
step = Step(
    name='test_cache_output_with_key',
    transformer=IdentityOperation(),
    input_data=['input'],
    experiment_directory='/exp_dir',
    cache_output=True
)
step.fit_transform(data_train)

This will produce a output cache file at /exp_dir/cache/test_cache_output_with_key__data_train.

jakubczakon · 2018-07-13T12:57:19Z

@thomasjpfan Sorry for late answer.

It's a very interesting idea from the production point of view where your training/dev/test data can easily change. Having an Id here could save you time and trouble.

We're gonna think it through shortly and get back to you.

kamil-kaczmarek · 2018-07-24T10:31:47Z

@thomasjpfan As mentioned in issue I will take a closer look at it next week. Thank you for your PR, here!

thomasjpfan changed the title ~~ENH: Adds auto output caching~~ ENH: Adds id to support output caching Jul 1, 2018

ENH: Adds auto output caching

d1b8b8a

thomasjpfan force-pushed the caching branch from 42e59a6 to d1b8b8a Compare July 1, 2018 21:15

thomasjpfan closed this Oct 15, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Adds id to support output caching #83

ENH: Adds id to support output caching #83

thomasjpfan commented Jul 1, 2018 •

edited

Loading

jakubczakon commented Jul 13, 2018

kamil-kaczmarek commented Jul 24, 2018

ENH: Adds id to support output caching #83

ENH: Adds id to support output caching #83

Conversation

thomasjpfan commented Jul 1, 2018 • edited Loading

jakubczakon commented Jul 13, 2018

kamil-kaczmarek commented Jul 24, 2018

thomasjpfan commented Jul 1, 2018 •

edited

Loading