duplicates-removed
Here are 32 public repositories matching this topic...
Language:All
Sort:Most stars
Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.
- Updated
Nov 22, 2022 - Python
Advanced Duplicate File Finder for Python
- Updated
Nov 23, 2020 - Python
🍰 A library for creating n-grams, skip-grams, bag of words, bag of n-grams, bag of skip-grams.
- Updated
Mar 8, 2022 - Java
Command line utility to remove exact duplicate files.
- Updated
Nov 24, 2018 - C++
File duplicate remover for Synology DSM 213j+
- Updated
Jun 15, 2018 - Java
Program to scan and search for file duplicates. (~300MB/s)
- Updated
Oct 21, 2022 - Java
Function that removes duplicate items and objects based on a key from an array of objects.
- Updated
Jul 17, 2017 - JavaScript
Command Line Interface for deplicate
- Updated
Sep 3, 2017 - Python
Takes an input CSV and produces a CSV of duplicate records. Then the input CSV is cleansed to remove duplicates.
- Updated
Jul 31, 2018 - Python
Sort, uniq, reverse, and randomize data
- Updated
Jul 8, 2020 - JavaScript
A no-nonsense .NET Core 2.1 CLI duplicate files remover
- Updated
Apr 4, 2021 - C#
rm-dup is a script to remove duplicate files
- Updated
Jan 2, 2019 - Shell
powerful data preprocessing application that simplifies the task of preparing data for machine learning models.
- Updated
Jun 14, 2023 - Python
A tool that deduplicates lines of a textfile with the speed of ram and scales nicely on all cores concurrently.
- Updated
Apr 11, 2018 - Go
Searches for duplicates in two separate folders allowing removing duplicated files from one and keeping another intact.
- Updated
Mar 19, 2020 - Python
Conducting EDA on Instacart orders
- Updated
May 14, 2023 - HTML
Created modified Levenshtein distance algorithms, to match strings by deletion and capitalization only and does not allow replacement or insertion of characters
- Updated
Apr 4, 2019 - Java
Improve this page
Add a description, image, and links to theduplicates-removed topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theduplicates-removed topic, visit your repo's landing page and select "manage topics."