Visual similarity image finder and cleaner (image deduplication tool).
pip install imgdup
or clone the repo and run imgdup.py file directly
Should be run in the images folder.
It will create a duplicates
folder containing similar file pairs indicating which file was kept and which one is gone. You can later review similar files in the duplicates
folder and decide if you delete or restore each _GONE_
marked file.
usage: imgdup.py [-h] [-c CMP] [-s SENSITIVITY] [-i] [-d] [-u]
Compare images base on perceptual similarity.
optional arguments:
-h, --help show this help message and exit
-c CMP, --cmp CMP compare images by function and keep higher
(resolution, size [resolution])
-s SENSITIVITY, --sensitivity SENSITIVITY
how similar images must be to be considered duplicates
(0 - very similar, 5 - shomehow similar)
-i, --invert invert the compartison function (keep lower)
-d, --dry_run just print the pairs
-u, --undo put the moved files back
Watch example terminal cast here
Backup the image set before running this script!