A dataset with an uneven number of cases in each class is said to be unbalanced. Many models produce a subpar performance on unbalanced datasets. A dataset can be balanced by increasing the number of minority cases using SMOTE 2011 <doi:10.48550/arXiv.1106.1813>, BorderlineSMOTE 2005 <doi:10.1007/11538059_91> and ADASYN 2008 <https://ieeexplore.ieee.org/document/4633969>. Or by decreasing the number of majority cases using NearMiss 2003 <https://www.site.uottawa.ca/~nat/Workshop2003/jzhang.pdf> or Tomek link removal 1976 <https://ieeexplore.ieee.org/document/4309452>.
| Version: | 1.0.3 |
| Depends: | R (≥ 3.6),recipes (≥ 1.1.0) |
| Imports: | cli,gower,lifecycle (≥ 1.0.3),dplyr,generics (≥ 0.1.0),purrr,RANN,rlang (≥ 1.1.0),ROSE,tibble,withr,glue,hardhat,vctrs |
| Suggests: | covr,dials (≥ 1.2.0),ggplot2,modeldata,testthat (≥3.0.0) |
| Published: | 2025-01-23 |
| DOI: | 10.32614/CRAN.package.themis |
| Author: | Emil Hvitfeldt |
| Maintainer: | Emil Hvitfeldt <emil.hvitfeldt at posit.co> |
| BugReports: | https://github.com/tidymodels/themis/issues |
| License: | MIT + fileLICENSE |
| URL: | https://github.com/tidymodels/themis,https://themis.tidymodels.org |
| NeedsCompilation: | no |
| Materials: | README,NEWS |
| CRAN checks: | themis results |
| Reference manual: | themis.html ,themis.pdf |
| Package source: | themis_1.0.3.tar.gz |
| Windows binaries: | r-devel:themis_1.0.3.zip, r-release:themis_1.0.3.zip, r-oldrel:themis_1.0.3.zip |
| macOS binaries: | r-release (arm64):themis_1.0.3.tgz, r-oldrel (arm64):themis_1.0.3.tgz, r-release (x86_64):themis_1.0.3.tgz, r-oldrel (x86_64):themis_1.0.3.tgz |
| Old sources: | themis archive |
| Reverse imports: | BioMoR,sae.projection |
| Reverse suggests: | caret,mlr3pipelines,mrIML,orbital,RStoolbox |
Please use the canonical formhttps://CRAN.R-project.org/package=themisto link to this page.