simhash
Here are 63 public repositories matching this topic...
Language:All
Sort:Most stars
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
- Updated
May 11, 2021 - Go
A simple implementation of simhash algorithm by java.
- Updated
Oct 10, 2020 - Java
Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.
- Updated
Jul 20, 2023 - Go
Dynatrace hash library for Java
- Updated
Mar 17, 2025 - Java
Locality Sensitive Hashing
- Updated
Jul 12, 2023 - Rust
Simhash implementation in Javascript
- Updated
Jun 29, 2017 - JavaScript
A fast python implementation of the SimHash algorithm.
- Updated
Oct 27, 2021 - Python
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
- Updated
Jul 25, 2024 - Python
基于springboot和Google开源simhash算法实现的作业查重/抄袭检测/文本相似度分析可视化系统,,集成jplag、MOSS、singleCloud工具套件进行多方位查重 Ref:https://github.com/ALuShu/checksystem
- Updated
Mar 9, 2023 - JavaScript
SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex
- Updated
Nov 18, 2022 - Python
Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.
- Updated
Aug 28, 2023 - Python
A library for cosine similarity & simhash calculation
- Updated
Jul 20, 2024 - Elixir
Improve this page
Add a description, image, and links to thesimhash topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thesimhash topic, visit your repo's landing page and select "manage topics."