idiolect: An R package for forensic authorship analysis

andreanini/idiolect

The idiolect R package is designed to provide a comprehensive suite of tools for performing comparative authorship analysis within a forensic context using the Likelihood Ratio Framework (e.g. Ishihara 2021; Nini 2023). The package contains a set of authorship analysis functions that take a set of texts as input and output scores that can then be calibrated into likelihood ratios. The package is dependent on quanteda (Benoit et al. 2018) for all Natural Language Processing functions.

Installation

You can install idiolect from CRAN:

install.packages("idiolect")

Workflow

The main functions contained in the package reflect the typical workflow for authorship analysis for forensic problems:

  1. Input data using create_corpus();

  2. Optionally mask the content/topic of the texts using contentmask();

  3. Launch an analysis (e.g. delta(), ngram_tracing(), impostors());

  4. Test the performance of the method on ground truth data using performance();

  5. Finally, apply the method to the questioned text and generate a likelihood ratio with calibrate_LLR().

Check the website and the vignette for examples.
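
To make the idea of calibrating scores into likelihood ratios (steps 4 and 5) more concrete, here is a minimal sketch in base R. It does not use idiolect and is not necessarily how calibrate_LLR() is implemented; it assumes logistic-regression calibration, a common approach for score-based likelihood ratios. The scores are simulated, and the names `validation`, `fit` and `score_to_llr` are hypothetical.

```r
# Simulated validation scores (an assumption for illustration only):
# higher scores are meant to indicate greater similarity to the candidate author.
set.seed(1)
same_author <- rnorm(50, mean = 1, sd = 1)   # ground truth: same author
diff_author <- rnorm(50, mean = -1, sd = 1)  # ground truth: different authors

validation <- data.frame(
  score  = c(same_author, diff_author),
  target = rep(c(1, 0), each = 50)
)

# Fit a logistic regression that maps a score to posterior log-odds
# of the same-author hypothesis.
fit <- glm(target ~ score, data = validation, family = binomial())

# Convert a score to a log10 likelihood ratio by removing the prior
# log-odds implied by the class proportions in the validation data.
score_to_llr <- function(score) {
  log_odds <- predict(fit, newdata = data.frame(score = score), type = "link")
  p_same <- mean(validation$target)
  prior_log_odds <- log(p_same / (1 - p_same))
  unname((log_odds - prior_log_odds) / log(10))
}

# Hypothetical scores for three questioned texts.
llr <- score_to_llr(c(-2, 0, 2))
print(llr)
```

A positive value supports the same-author hypothesis and a negative value supports the different-author hypothesis; the magnitude indicates the strength of that support.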

References

Benoit, Kenneth, Kohei Watanabe, Haiyan Wang, Paul Nulty, Adam Obeng, Stefan Müller, and Akitaka Matsuo. 2018. “Quanteda: An R Package for the Quantitative Analysis of Textual Data.” Journal of Open Source Software 3 (30). https://doi.org/10.21105/joss.00774.

Ishihara, Shunichi. 2021. “Score-Based Likelihood Ratios for Linguistic Text Evidence with a Bag-of-Words Model.” Forensic Science International 327: 110980. https://doi.org/10.1016/j.forsciint.2021.110980.

Nini, Andrea. 2023. A Theory of Linguistic Individuality for Authorship Analysis. Elements in Forensic Linguistics. Cambridge, UK: Cambridge University Press.
