This repository was archived by the owner on Nov 17, 2020. It is now read-only.

Huffon/sentence-similarityPublic archive

NotificationsYou must be signed in to change notification settings
Fork34
Star198

This repository contains various ways to calculate sentence vector similarity using NLP models

198 stars 34 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
img		img
models		models
utils		utils
.gitignore		.gitignore
README.md		README.md
corpus.txt		corpus.txt
download.sh		download.sh
requirements.txt		requirements.txt
sensim.py		sensim.py

Repository files navigation

Sentence Similarity Calculator

This repo contains various ways to calculate the similarity between source and target sentences. You can choosethe pre-trained models you want to use such asELMo,BERT andUniversal Sentence Encoder (USE).

And you can also choosethe method to be used to get the similarity:

1. Cosine similarity2. Manhattan distance3. Euclidean distance4. Angular distance5. Inner product6. TS-SS score7. Pairwise-cosine similarity8. Pairwise-cosine similarity + IDF

You can experiment with (The number of models) x (The number of methods) combinations!

Installation

This project is developed underconda enviroment
After cloning this repository, you can simply install all the dependent libraries described inrequirements.txt withbash install.sh

conda create -n sensim python=3.7conda activate sensimgit clone https://github.com/Huffon/sentence-similarity.gitcd sentence-similaritybash install.sh

Usage

Totest your own sentences, you should fill outcorpus.txt with sentences as below:

I ate an apple.I went to the Apple.I ate an orange....

Then,choose themodel andmethod to be used to calculate the similarity between source and target sentences

python sensim.py    --model    MODEL_NAME  [use, bert, elmo]    --method   METHOD_NAME [cosine, manhattan, euclidean, inner,                            ts-ss, angular, pairwise, pairwise-idf]    --verbose  LOG_OPTION (bool)

Examples

In this section, you can see the example result ofsentence-similarity
As you know, there is a nosilver-bullet which can calculateperfect similarity between sentences
You should conduct various experiments with your dataset
- Caution:TS-SS score might not fit withsentence similarity task, since this method originally devised to calculate the similarity between long documents
Result:

References

Papers

Libraries

Articles

About

This repository contains various ways to calculate sentence vector similarity using NLP models

Releases

No releases published

Packages

No packages published

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Sentence Similarity Calculator

Installation

Usage

Examples

References

Papers

Libraries

Articles

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Contributors3

Uh oh!

Languages

Movatterモバイル変換

Huffon/sentence-similarity

Folders and files

Latest commit

History

Repository files navigation

Sentence Similarity Calculator

Installation

Usage

Examples

References

Papers

Libraries

Articles

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Contributors3

Uh oh!

Languages

Packages