
data-matching

Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding records in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining datasets based on entities that may or may not share a common identifier (e.g., database key, URI, national identification number); identifiers can be missing or inconsistent because of differences in record shape, storage location, or curator style or preference.
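As a rough illustration of joining two sources that share no common identifier, here is a minimal sketch using only Python's standard library. The record names, the `normalize` and `link_records` helpers, and the 0.85 threshold are all hypothetical choices for this example; the repositories listed on this page use far more robust techniques (probabilistic models, blocking, learned embeddings).

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase, drop punctuation, and collapse whitespace."""
    cleaned = "".join(ch for ch in name.lower() if ch.isalnum() or ch.isspace())
    return " ".join(cleaned.split())


def similarity(a: str, b: str) -> float:
    """String similarity in [0, 1] computed on the normalized names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def link_records(left: dict, right: dict, threshold: float = 0.85) -> list:
    """For each left record, keep its best-scoring right record
    if the score reaches the (assumed) threshold."""
    links = []
    for left_id, left_name in left.items():
        best_id, best_score = None, 0.0
        for right_id, right_name in right.items():
            score = similarity(left_name, right_name)
            if score > best_score:
                best_id, best_score = right_id, score
        if best_id is not None and best_score >= threshold:
            links.append((left_id, best_id, round(best_score, 3)))
    return links


# Two made-up sources with different keys and different spelling conventions.
customers = {"c1": "Acme Corp.", "c2": "Globex Corporation"}
suppliers = {"s1": "ACME corp", "s2": "Initech LLC"}

links = link_records(customers, suppliers)
print(links)
```

Comparing every pair is quadratic in the number of records, so this approach only suits small inputs; larger workloads usually restrict comparisons to candidate pairs first (blocking or indexing).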

Here are 33 public repositories matching this topic...

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

  • Updated Jul 6, 2025
  • Python

recordlinkage

A powerful and modular toolkit for record linkage and duplicate detection in Python

  • Updated Feb 21, 2024
  • Python

Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4

  • Updated Aug 9, 2022
  • Python

🔎 Finds fuzzy matches between CSV files

  • Updated Mar 26, 2025
  • Python

PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.

  • Updated Nov 18, 2022
  • Jupyter Notebook

Resources for tackling record linkage / deduplication / data matching problems

  • Updated Feb 22, 2024

pyJedAI

An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.

  • Updated Jul 14, 2025
  • Python

A browser user interface for manual labeling of record pairs.

  • Updated Jun 23, 2023
  • JavaScript

snowman

Welcome to Snowman App – a Data Matching Benchmark Platform.

  • Updated Feb 9, 2023
  • TypeScript

A maximum-strength name parser for record linkage.

  • Updated Jun 15, 2025
  • Python

Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).

  • Updated May 24, 2024
  • R

Compound AI toolchain for fast and accurate entity matching, powered by LLMs.

  • Updated Mar 25, 2025
  • Python

🔎 Finds fuzzy matches between datasets

  • Updated Jun 1, 2025
  • Python

WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.

  • Updated Jul 12, 2025
  • Java

Emulates the methods the US Census Bureau uses to link people across multiple data sources, using open-source software (Splink) and simulated data (from pseudopeople).

  • Updated Oct 14, 2024
  • HTML

Created by Halbert L. Dunn

Released 1946

Followers: 44
Organization: entity-resolution
Website: github.com/topics/entity-resolution

Related Topics

artificial-intelligence nlp
