Movatterモバイル変換

[0]ホーム

metasnf 2.1.2

Wrap additional examples indonttest

metasnf 2.1.1

Remove excessively sized mock_sim_mats_list.rda

metasnf 2.1.0

Improvements

calc_nmis now supports parallel processing, progress reportedthrough progressr
batch_snf_subsamples re-written to parallelize along subsamplesrather than cluster solutions, now uses progressr for progress insteadof verbose cat statements
speed up parallelization test

New data

New mock data objects in the format ofmock_(class name), e.g.,mock_data_list andmock_ext_solutions_df

New functions

Add several new S3 methods for plot, rbind, str, summary, t, c,extraction, merge, assignment, and type-coercion

Bug fixes

auto_plot output data frame doesn’t duplicate clustercolumn
error catching: data list sub-item name checking improvement
double transposingext_solutions_df no longer losessim_mats_list attribute

Other

Typo fixes
Code formatting
Computationally intensive examples are now wrapped indonttest rather than commented out
observations(),summary_features(),features(),uids() marked as internal

metasnf 2.0.6

Bug fixes

fixedrbind for classessolutions_df andext_solutions_df not preserving the class type of thecontainedweights_matrix

Print formatting

printingsolutions_df orext_solutions_dfrestricts output to 10 line max by default

Other

update for CRAN resubmission

metasnf 2.0.5

Bug fixes

calc_aris (as of v2, v1 is still fine) incorrectly excluded thefirst observation from ARI calculations.
merge.data_list wasn’t properly integrating updated parameternames
prevent solutions_df and ext_solutions_df from having 0 rows
usesolution column inmc_manhattan_plot()when extended solutions data frame has no MC labels

Code formatting

print.solutions_df title was set as print method forweights matrix
replace dl_1/dl_2 with x&y for consistency inmerge.data_list()

New functions

addedas.list() fordist_fns_list,clust_fns_list, anddata_list objects

Performance improvements

convert weights matrix to a regular matrix prior to printing reducesprint time
same as last commit
weights matrix rbinding is faster when treated as a matrix

Print formatting

deprecated message ongenerate_settings_matrix neededpaste0
solutions data frame printing above 10 rows will default to 10rows
print.solutions_df() misprinted the number ofobservations in the solutions data frame

metasnf 2.0.4

OOP

merge_dls() is superseded bymerge.data_lists()

Bug fixes

ext_solutions_df manipulation won’t dropsummary_features andfeatures attributes
estimate_nclust_given_graph has more resiliency tofloating point errors through tryCatch statement during eigengap qualityassignment

metasnf 2.0.3

bugfix:estimate_nclust_given_graph has more resiliencyto floating point errors through tryCatch loop updating eigenvaluescaling
added functions: addeddplyr_row_slice() functions forclassessolutions_df andext_solutions_df

metasnf 2.0.2

Formatting

removed debugging dash lines fromextend_solutions()

metasnf 2.0.1

Bug fixes

extend_solutions was not assigning feature typesproperly during p-value calculations
rbind.ext_solutions_df now takes...parameter beforereset_indices parameter to avoid errorduring calls with unnamed parameters.
rbind.solutions_df now takes... parameterbeforereset_indices parameter to avoid error during callwithout named parameters.
slicingsnf_config object made weights matrix lose itsclass

metasnf 2.0.0

Breaking changes

Extensive changes as a result of a transition to making use of R’sS3 OOP system.

Name changes and new classes

data list (classlist) -> (classdata_list,list)
solutions matrix (classdata.frame) -> solutionsdata frame (classsolutions_df,data.frame)
extended solutions matrix (classdata.frame) ->extended solutions data frame (classext_solutions_df,data.frame)
settings matrix -> settings data frame (classdata.frame) -> (classext_solutions_df,data.frame)
distance metrics list (classlist) -> distancefunctions list (classdist_fns_list,list)
clustering algorithms list (classlist) ->clustering functions list (classclust_fns_list,list)
weights matrix (classmatrix,array) ->(classweights_matrix,matrix,array)

Function changes

generate_data_list() ->data_list()
Functions related to converting a solutions matrix into a data frameof cluster solutions (get_cluster_df(),get_clusters(),get_cluster_solutions()) nowall superseded by custom transposition ofsolutions_dfclass objects (i.e., simply callt())

Workflow changes

Functionality offered by the settings matrix, distance metrics list,clustering algorithms list, weights matrix, and corresponding functions(generate_settings_matrix(),generate_distance_metrics_list(),generate_weights_matrix(),generate_clust_algs_list()) now all superseded by singlefunctionsnf_config() and thesnf_config classobject it produces
Following derivation of asplit_vector, either byadjusted_rand_index_heatmap() orshiny_annotator(),solutions_df andext_solutions_df class objects can be annotated with theirmeta cluster labels using the functionlabel_meta_clusters(). This is necessary prior to usage ofget_representative_solutions().
Functions that convert non-data frame objects, like a data list, toa data frame, have been replaced withas.data.frame()
Requesting similarity matrices are returned duringbatch_snf no longer changes the output structure from asolutions data frame to a list of a solutions data frame and asimilarity matrix list. Instead, the similarity matrix list is added tothe solutions data frame as an attribute and can be extracted using thefunctionsim_mats_list().

Improvements

Significant speed improvement tocalculate_coclustering() function
The p-value heatmap now follows a uni-color palette.
Customizedprint() functions have been defined for allmajor metasnf objects.
Examples have been added to all major metasnf functions.

metasnf 1.1.2

update settings matrix vignette to avoid convergence error on someseeds

metasnf 1.1.1

inclusion column bugfixes from 1.1.0

metasnf 1.1.0

Verbose parameter added to printing functions. By default set toFALSE.
CRAN compliant@return values in documentation.

metasnf 1.0.0

Last update before CRAN submission.

Breaking changes

Changing seed during settings matrix generation has been deprecated.Please manually callset.seed prior togenerate_settings_matrix instead.

Other

Package size reduced by downscaling vignette images

metasnf 0.7.2

Bug fix

Functionestimate_nclust_given_graph() occasionallyyielded incorrect number of cluster estimates as a result of improperscaling in metasnf v0.7.0. The scaling should be corrected now.

Breaking changes

Considerable changes have been made to the co-clustering workflow,including new heatmap and density plot.

metasnf 0.7.1

Possible breaking changes

Occasionally, spectral clustering results may yield an n-clustersolution where n differed from the number of clusters requested as aparameter in the spectral clustering function itself. Now, the spectralclustering functions provided in metasnf have been updated to report theactual number of clusters in the generated solution, rather than thenumber of clusters that was requested

metasnf 0.7.0

Minor changes

warnings provided when generating a data list with duplicate featurenames
warnings provided when usingmc_manhattan_plot() with adata list containing duplicate feature names
mc_manhattan_plot() parameterrep_solutionreplaced with more accurate nameextended_solutions_matrix(solutions matrix with _pval columns)

Bug fix

SNFtool::estimateNumberOfClustersGivenGraph() couldoccasionally error out on the basis of calculating eigenvectors(eigengap heuristic) for a Laplacian with floating point values thatwere too small. Adapted functionestimate_nclust_given_graph() slightly scales up Laplacianto reduce the risk of encountering this error (presumably without anychange to resulting cluster number estimate)

metasnf 0.6.8

New functionality

get_matrix_order has arguments allowing users tocontrol which distance metric and agglomerative hierarchical clusteringmethods are used to sort matrices

metasnf 0.6.7

Minor changes

More consistent usage of “feature” over “variable” acrossdocumentation.
New mock ABCD dataframes - like the old ones, but without the“abcd_” prefix and with a more accurate “unique_id” UID column ratherthan “patient”

metasnf 0.6.6

New functionality

get_complete_uids quickly pulls UIDs of observationswith complete data from a list of dataframes

metasnf 0.6.5

Bug fix

extend_solutions doesn’t crash on multi-feature targetlists

metasnf 0.6.4

Minor changes

Warning message provided when subjects are dropped duringgenerate_data_list()
Newremove_missing parameter forgenerate_data_list allowing subjects with incomplete datato remain in the data list

metasnf 0.6.3

Bug fixes

ensure cluster variable is treated as factor duringautoplotting
bugfix on autoplots built from tibbles rather than dataframes

Improvements

Added clarity tolp_solutions_matrix error message whentraining set is not subset of full data list
generate_data_list list elements now are named aftertheir components
added heatmap parameters to increase plotting flexibility

New functionality

added generic save_plot function and option to pass cluster_dfdirectly into auto_plot (useful for label propagation)
addmerge_data_lists functionality to horizontallymerge data lists

metasnf 0.6.2

Bug fixes

extend_solutions() will no longer crash when adata_list has the UID column in non-first position.
generate_data_list() enforces the UID column to be infirst position of each dataframe.

metasnf 0.6.1

New functionality

auto_plot() will automatically generate bar and/orjitter plots showing how features in a data_list/target_list aredistributed across a single cluster solution

metasnf 0.6.0

New functionality

shiny_annotator() function can be used to identifyindices of meta clusters within anadjusted_rand_index_heatmap
adjusted_rand_index_heatmap() now has asplit_vector parameter that will slice a heatmap into metaclusters
rename_dl() can be used to rename features in adata_list
manhattan_plot has been split intovar_manhattan_plot (key variable - all variables),esm_manhattan_plot (cluster solutions in an extendedsolutions matrix to all variables), andmc_manhattan_plot(likeesm_manhattan_plot, but at the meta-clusterlevel)
get_representative_solutions extracts max-ARI solutionsfrom an extended solutions matrix based on asplit_vectorcontaining meta cluster boundaries
batch_nmi calculates NMI scores (seehttps://branchlab.github.io/metasnf/articles/nmi_scores.html)
extend_solutions will only calculate p-value summarymeasures (min/max/mean) for data_list passed in as atarget_list parameter, but will also accept and calculatep-values for a data_list passed in through thedata_listparameter
heatmap functionadjusted_rand_index_heatmap andassoc_pval_heatmap have updated parameters to improve easeof use and flexibility (including easier colour control)

Deprecated functions

get_clustered_subs has been removed (does the samething asget_cluster_df)
get_cluster_pval deprecated forcalc_assoc_pval
All functions related to target_lists specifically have beendeprecated in favour of simply usinggenerate_data_list()and its corresponding functions

Name changes

remove_signal has been renamed tolinear_adjust to better reflect its function
summarize_distance_metrics_list has been shortened tosummarize_dml
correlation_pval_heatmap has been renamed toassoc_pval_heatmap
calc_om_aris has been renamed tocalc_aris

New vignettes

NMI scores:https://branchlab.github.io/metasnf/articles/nmi_scores.html
Imputations:https://branchlab.github.io/metasnf/articles/imputations.html

Other changes

Vignettes have been updated
Warnings are raised if spectral clustering does not generate acluster solution matching the number of clusters requested
Chi-squared andextend_solutions p-value calculationwarnings are now suppressed

metasnf 0.5.0

Breaking changes

All variables and values referencing p-values have been rephrased toend in_pval instead of a mix ofp_val,pval, andp.
Removal of deprecated functionspval_select,p_val_select,top_oms_per_cluster,check_subj_orders_for_lp,get_p,chi_sq_pval,
Functionpval_summaries, which would calculatemin/max/mean p-values, has been replaced withsummarize_pvals
train_test_assign now provides results as named list ofsubject vectors instead of a data.frame.keep_splitfunction has been removed accordingly.

Other changes

sort_subjects parameter added togenerate_data_list to allow for sorting of subjects in thedata_list

metasnf 0.4.6

fix bug in extend_solutions that incorrectly assigns p-values tovariable columns through grep (substring instead of exact match)

metasnf 0.4.5

extend_solutions can now also be parallelized (see?extend_solutions)
remove_signal function hassig_digsparameter that can be used to restrict how many significant figures arereturned in the resulting residuals

metasnf 0.4.4

calc_om_aris is now MUCH faster after removingexcessive calls toas.numeric and enabling parallelprocessing withfuture.apply. Thanks for the idea,Alper.

metasnf 0.4.3

Reformatting ofextend_solutions to better handleextreme p-values (e.g. infinity)
Replacement ofp_val_select withpval_select which can also return negative-logp-values

metasnf 0.4.2

Bug fixes

generate_data_list correctly errors when components areonly partially named (resolveshttps://github.com/BRANCHlab/metasnf/issues/10)

metasnf 0.4.1

Breaking changes

lp_row function has been replaced bylp_solutions_matrix. The new function is order agnostic:full data lists can be constructed without any restriction on howtraining and testing set subjects are sorted. Subjects present in theprovided solutions matrix to propagate are assumed to be the trainingsubjects.

New functionality

calc_om_aris now hasprogress parameter.When set to true and used in conjunction withprogressr::with_progress(), a progress bar is shown for thecalculations. Learn more with?calc_om_aris.

Bug fixes

grepl instead ofgrep used inextend_solutions to reduce errors when no chi-squaredwarning occurs

Other changes

A vignette specifically for label propagation has been added
Full removal of several previously deprecated functions
Minor source code reformatting

metasnf 0.4.0

New functionality

Parallel processing is now available! Check out the vignette here:https://branchlab.github.io/metasnf/articles/parallel_processing.html

metasnf 0.3.3

Breaking changes

input_wt and domain_wt are removed from settings_matrix and rest ofpackage - weighting at this level is no longer planned. This will resultin altered settings matrices, but only superficially - the columns“input_wt” and “domain_wt” will be missing, but had no effect on the SNFprior to this patch anyway.

metasnf 0.3.2

keep_split will preserve observations who were assigneda split but were not present in the dataframe being split. Instead ofbeing removed, those observations will have NA values.

metasnf 0.3.1

Bug fixes

fixedfraction_clustered_together crashing when acluster was assigned to only a single observation
fixedfraction_clustered_together not running due tobracket typo when evaluating length of the data_list

New functionality

correlation_pval_heatmap function can have significancestars disabled withsignificance_stars parameter

Other changes

pkgdown site now has google site verification code

metasnf 0.3.0

Breaking changes

The original SNFtool functionestimateNumberOfClustersGivenGraph has been used up to thispoint without specifying a parameter forNUMC.Consequently, final similarity matrices clustered with the defaultmethods (spectral clustering based on eigen-gap or rotation costheuristics) were not capable of resulting in more than 5 clusters. Thedefault functions have been updated to span 2 clusters to 10 clusters.Users will likely see different clustering results as a result of thischange. To replicate the behaviour of default spectral clustering priorto v0.3.0, users should copy the following code prior to the batch_snfcommand:

clust_algs_list <- generate_clust_algs_list(    "spectral_eigen" = spectral_eigen_classic,    "spectral_rot" = spectral_rot_classic)# Adapt below as necessarysolutions_matrix <- batch_snf(    data_list,    settings_matrix,    clust_algs_list = clust_algs_list)

Added “workspace=2e7” parameter tofisher_exact_pvalfunction to avoid “FEXACT” error (like herehttps://github.com/Lagkouvardos/Rhea/issues/17). Impact on results isexpected to be negligible.

New functionality

Functionremove_signal() enables correcting a data_listlinearly for confounders / unwanted signal. Vignette is available:https://branchlab.github.io/metasnf/articles/confounders.html.
batch_snf() has new parameterautomatic_standard_normalize to switch out the defaultnumeric distance measures (euclidean) with standard normalizedvariants.

Other changes

Added aNEWS.md file to track changes to thepackage.

[8]ページ先頭