image-deduplication
Here are 17 public repositories matching this topic...
Language:All
Sort:Most stars
😎 Finding duplicate images made easy!
- Updated
Jun 30, 2025 - Python
Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.
- Updated
Nov 22, 2022 - Python
Image similarity in Golang. Version 4 (LATEST)
- Updated
Apr 6, 2024 - Go
Tool to detect (and get rid of) similar images using perceptual hashing (pHash lib)
- Updated
Nov 6, 2016 - Python
A utility for locating near duplicate photos irrespective of image resolution, compression settings or file format.
- Updated
Feb 25, 2024 - Rust
A Python tool to identify and remove similar-looking images from a dataset. Utilizes image preprocessing and hashing techniques for efficient comparison.
- Updated
Jul 29, 2023 - Python
Downloader with custom wildcard system: create new with asterisks for HTML or right-carets for API, whether it's for time-critical website moments or just for laziness. Features HTML build and serve, alarm (essentially in-stock tracker), file sorter (organizer), image duplicate finder and tools for naked eyes.
- Updated
Jun 26, 2025 - Batchfile
🏍️ A clustering tool providing exact and near de-duplication of images using vector embeddings.
- Updated
Jun 18, 2025 - Python
a Python command-line tool that identifies and groups similar images using average hashing. It supports single-level and recursive directory scanning, adjustable similarity threshold, and presents results in JSON format. Ideal for image deduplication, organization, and content-based retrieval tasks.
- Updated
Jul 9, 2024 - Python
The extended version of simhash supports fingerprint extraction of documents and images.
- Updated
Aug 22, 2022 - Python
A CLI tool for images analysis: checking image integrity, images deduplication, image retrieval.
- Updated
Mar 27, 2024 - Rust
This Python script helps in identifying and moving duplicate images within a specified directory to a designated duplicates folder.
- Updated
Jul 1, 2024 - Python
A utility for testing the performance of de-duplication algorithms by randomly generating “noisy” images in a dataset.
- Updated
Jan 15, 2025 - Rust
Sort duplicate images into separate folders
- Updated
Oct 16, 2019 - PHP
A python program to detect duplicate images in a specified folder.
- Updated
May 18, 2020 - Python
A Python notebook combining MD5 and perceptual hashing to detect exact-duplicate images
- Updated
May 19, 2025 - Jupyter Notebook
Get Similarity adalah alat berbasis Python dengan antarmuka GUI yang memungkinkan pengguna menyaring gambar berkualitas rendah dan mengelompokkan gambar serupa secara otomatis menggunakan embedding CLIP + DINOv2 dan evaluasi kualitas berbasis MusIQ.
- Updated
Jun 5, 2025 - Python
Improve this page
Add a description, image, and links to theimage-deduplication topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theimage-deduplication topic, visit your repo's landing page and select "manage topics."