miserman/lingmatchPublic

NotificationsYou must be signed in to change notification settings
Fork0
Star11

An all-in-one R package for the assessment of linguistic similarity

miserman.github.io/lingmatch

11 stars 0 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 148 Commits
R		R
docs		docs
man		man
pkgdown		pkgdown
src		src
tests		tests
vignettes		vignettes
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
.lintr		.lintr
DESCRIPTION		DESCRIPTION
NAMESPACE		NAMESPACE
NEWS.md		NEWS.md
README.md		README.md
build.R		build.R
cran-comments.md		cran-comments.md

Repository files navigation

lingmatch

An all-in-one R package for the assessment of linguistic matching and/or accommodation.

features

Input raw text, a document-term matrix (DTM), or LIWC output.
Apply various weighting functions to a DTM.
Measure similarity and/or accommodation with various metrics.
Calculate standard forms of Language Style Matching (LSM) and Latent Semantic Similarity (LSS).

resources

Documentation and guides:miserman.github.io/lingmatch
Dictionary repository:osf.io/y6g5b
Latent semantic space repository:osf.io/489he
Dictionary builder:miserman.github.io/dictionary_builder

installation

Download R fromr-project.org, then install the package from an R console:

Release (version 1.0.7)

install.packages("lingmatch")

Development (version 1.0.8)

# install.packages("remotes")remotes::install_github("miserman/lingmatch")

And load the package:

library(lingmatch)

examples

Can make a quick comparison between two bits of text; by default this will give the cosine similarity between rawword-count vectors:

lingmatch("First text to look at.","Text to compare that text with.")

Or, given a vector of texts:

text= c("Why, hello there! How are you this evening?","I am well, thank you for your inquiry!","You are a most good at social interactions person!","Why, thank you! You're not all bad yourself!")

Process the texts in one step:

# with a dictionaryinquirer_cats= lma_process(text,dict="inquirer",dir="~/Dictionaries")# with a latent semantic spaceglove_vectors= lma_process(text,space="glove",dir="~/Latent Semantic Spaces")

Or process the texts step by step, then measure similarity between each:

dtm= lma_dtm(text)dtm_weighted= lma_weight(dtm)dtm_categorized= lma_termcat(dtm_weighted, lma_dict(1:9))similarity= lma_simets(dtm_categorized,metric="canberra")

Or do that within a single function call:

similarity= lingmatch(text,weight="frequency",dict= lma_dict(1:9),metric="canberra")$sim

Or, if you want a standard form (as in this example), specify a default:

similarity= lingmatch(text,type="lsm")$sim

About

An all-in-one R package for the assessment of linguistic similarity

miserman.github.io/lingmatch

Releases5

lingmatch 1.0.7 Latest

May 3, 2024

+ 4 releases

Languages

R91.5%
C++8.1%
Other0.4%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

lingmatch

features

resources

installation

examples

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases5

Languages