Gonzalez-Diaz, R., Gutiérrez-Naranjo, M.A. & Paluzo-Hidalgo, E. Topology-based representative datasets to reduce neural network training resources. Neural Comput & Applic 34, 14397–14413 (2022). (Paper)
Three experiments were carried out:
- Iris dataset experiment
- Digits dataset experiment
- Experiments on several synthetic datasets
In each experiment, three sets were considered: the original dataset, the dominating dataset, and a random dataset. In addition, the algorithm based on proximity graphs and dominating sets was implemented and can be found in auxiliary_fun.py.
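To illustrate the idea of a dominating dataset, here is a minimal sketch. It is not the code in auxiliary_fun.py. It assumes an epsilon-proximity graph on Euclidean distances and a simple greedy heuristic for the dominating set. The function names, the value of epsilon, the placeholder data, and the choice of a random subset of the same size are all assumptions made for this example.

```python
import numpy as np


def proximity_graph(points, epsilon):
    """Boolean adjacency matrix: i and j are linked when their
    Euclidean distance is at most epsilon (assumed graph definition)."""
    diffs = points[:, None, :] - points[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    adjacency = dists <= epsilon
    np.fill_diagonal(adjacency, False)  # no self-loops
    return adjacency


def greedy_dominating_set(adjacency):
    """Greedy heuristic: repeatedly pick the vertex whose closed
    neighbourhood covers the most still-uncovered vertices."""
    n = adjacency.shape[0]
    closed = adjacency | np.eye(n, dtype=bool)  # each vertex covers itself
    uncovered = np.ones(n, dtype=bool)
    chosen = []
    while uncovered.any():
        gains = (closed & uncovered).sum(axis=1)
        best = int(np.argmax(gains))
        chosen.append(best)
        uncovered &= ~closed[best]
    return chosen


# Placeholder data standing in for a real dataset (hypothetical values).
rng = np.random.default_rng(0)
points = rng.normal(size=(150, 4))

adjacency = proximity_graph(points, epsilon=1.0)
dominating = greedy_dominating_set(adjacency)

# Random baseline, assumed here to have the same size as the dominating set.
random_subset = rng.choice(len(points), size=len(dominating), replace=False)

print(len(points), len(dominating), len(random_subset))
```

Every point of the original set is within epsilon of some selected point, so the dominating dataset keeps at least one representative near each original sample. The random subset carries no such guarantee.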
The code depends on the following Python libraries:
- ripser
- keras
- gudhi
- giotto-tda
An earlier preprint version of the paper is available (here).