- Notifications
You must be signed in to change notification settings - Fork0
A Shiny app and functions for visual exploration of hierarchical clustering.
License
rhenkin/visxhclust
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
visxhclust is a package that includes a Shiny application forvisualexploration ofhierarchicalclustering. It is aimed atfacilitating iterative workflows of hierarchical clustering on numericdata. For that, the app allows users to quickly change parameters andanalyse and evaluate results with typical heatmaps with dendrograms andother charts. Additionally, it includes lightweight data overview plotssuch as correlation heatmaps, annotated MDS and PCA plots. On theevaluation side, it builds on existing packages to compute internalvalidation scores and Gap statistic, as well as Dunn’s test to evaluatesignificant differences between clusters.
The app includes multiple help points in the interface and a vignettedocumenting how to load data and the basic clustering loop – seevignette("visxhclust"). The package exports various functions to helpwith documenting and reproducing a clustering workflow with R or RMarkdown – seevignette("clusterworkflow") andvignette("clusterevaluation") for examples.
The latest release can be installed from CRAN:
install.packages("visxhclust")The latest development version can be installed from GitHub:
remotes::install_github("rhenkin/visxhclust")
Most dependencies are found in CRAN. However, the heatmap drawingpackage is part ofBioconductor and mayrequire a separate installation:
install.packages("BiocManager")BiocManager::install("ComplexHeatmap")
To use your data with the tool, you can save a data frame or tibble inan RDS file, or use comma or tab-delimited files, with .csv, .tsv or.txt extensions. The clustering method supported by the tool works onlyon numeric values; columns containing text will be set aside to annotatethe heatmap if so desired. If a column namedID exists, it will beused as an internal identifier for rows.
Clustering requires complete datasets with no missing values, NULLs orNAs. If any column contains missing values, it will be set aside to beused as a heatmap annotation. Badly formatted data will also lead tounexpected results in the tool. As an alternative, imputation packagescan be used to fill missing data and faulty rows (e.g. text in numericcolumns) should be removed before loading the file into the tool. Thetool provides limited abilities to help with diagnosing issues andpreprocessing data.
To run the app once the package is installed:
library(visxhclust)# Increases max file size to 30 MBoptions(shiny.maxRequestSize=30*1024^2)run_app()
About
A Shiny app and functions for visual exploration of hierarchical clustering.
Topics
Resources
License
Contributing
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Uh oh!
There was an error while loading.Please reload this page.
Contributors2
Uh oh!
There was an error while loading.Please reload this page.