Datasets

If a pre-trained model for the task you want to perform is not available, you can train Luminoth with an existing open dataset, or your own.

The first step in training Luminoth is converting your dataset to TensorFlow's .tfrecords format. This ensures that no matter what image or annotation formats the original dataset uses, it will be transformed to something that Luminoth can understand and process efficiently, either while training locally or in the cloud.

For this purpose, Luminoth provides the lumi dataset transform command, which includes support for some of the most well-known datasets for object detection and classification tasks.

Supported datasets

Pascal VOC2012

$ lumi dataset transform --type pascalvoc --data-dir ~/dataset/pascalvoc/ --output-dir ~/dataset/pascalvoc/tf/

ImageNet

$ lumi dataset transform --type imagenet --data-dir ~/dataset/imagenet/ --output-dir ~/dataset/imagenet/tf/

Limiting the dataset

During development, it is often useful to verify that the model can actually overfit a small dataset.

You can use the --limit-examples and --limit-classes` options for this.

For more information, try lumi dataset transform --help.

Supporting your own dataset

TODO guidelines on how to write your own conversion tool

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DATASETS.md

DATASETS.md

Datasets

Supported datasets

Limiting the dataset

Supporting your own dataset

Files

DATASETS.md

Latest commit

History

DATASETS.md

File metadata and controls

Datasets

Supported datasets

Limiting the dataset

Supporting your own dataset