bigstatsr 1.6.2

Check that parameters ind.row/ind.col are not NULL.

bigstatsr 1.6.1

Remove {dplyr} dependency for internal function any_near0().

bigstatsr 1.6.0

Fix conversion from NA_real_ to FBM type integer on new Macs.

bigstatsr 1.5.14

Error when variables with a zero scaling are used in e.g. big_randomSVD() and big_crossprodSelf() (#52).

bigstatsr 1.5.13

Add parameter backingfile to big_crossprodSelf() and big_cor() (#170).

bigstatsr 1.5.11

Make sure not to use two levels of parallelism in big_univLogReg() (#137).

bigstatsr 1.5.10

Check out-of-bounds ind.col in big_prodMat() (#154).

bigstatsr 1.5.9

Add global option FBM.dir (that defaults to tempdir() as before). This can be used to change the default directory used to create FBMs when calling either FBM(), FBM.code256(), as_FBM(), big_copy(), or big_transpose(). Note that, if not using the temporary directory anymore, you must clean up the files you do not want to keep.
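As a minimal sketch of how this option might be used (the directory name my_fbms and the variable names are made up for illustration):

```r
library(bigstatsr)

# Hypothetical directory for this sketch; any writable directory would do.
fbm_dir <- file.path(tempdir(), "my_fbms")
dir.create(fbm_dir, showWarnings = FALSE)

# Change the default directory and remember the previous setting.
old_opts <- options(FBM.dir = fbm_dir)

X <- FBM(10, 10)       # the backing file is now created under fbm_dir
print(X$backingfile)

# Files created outside the session's temporary directory are not
# cleaned up automatically, so remove what you do not want to keep.
file.remove(X$backingfile)

# Restore the previous default.
options(old_opts)
```

Saving and restoring the old value with options() keeps the change local to the code that needs it.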

bigstatsr 1.5.8

Enable ARMA_64BIT_WORD.

bigstatsr 1.5.7

New strategy for $add_columns().

bigstatsr 1.5.6

Add convenience function as_scaling_fun() to create your own fun.scaling parameters.

bigstatsr 1.5.4

Now automatically discard covariates with no variation in pcor() (with a warning).

bigstatsr 1.5.3

pcor() now returns NAs (instead of 0s) for singular systems.

bigstatsr 1.5.0

Recode some parallel algorithms with OpenMP. For now, functions big_prodVec(), big_cprodVec(), big_colstats() and big_univLinReg() have been recoded.

bigstatsr 1.4.0

Now detects and errors if there is not enough disk space to create an FBM.

bigstatsr 1.3.3

Fix pcor() for singular systems, e.g. when x has all the same values.

bigstatsr 1.3.2

Fix summary() and plot() for old (< v1.3) big_sp_list models.

bigstatsr 1.3.1

Add function pcor() to compute partial correlations.

bigstatsr 1.3.0

Add two options in big_spLinReg() and big_spLogReg(): power_scale for using a different scaling for LASSO, and power_adaptive for using adaptive LASSO (where larger marginal effects are penalized less). See documentation for details.
big_(c)prodVec() and big_(c)prodMat() (re)gain an ncores parameter. Note that for big_(c)prodMat(), it might be beneficial to use the BLAS parallelism (with bigparallelr::set_blas_ncores()) instead of this parameter, especially when the matrix A is large-ish.

bigstatsr 1.2.2

Function big_colstats() can now be run in parallel (added parameter ncores).

bigstatsr 1.2.1

It is now possible to use C++ FBM accessors without linking to {RcppArmadillo}.

bigstatsr 1.2.0

Functions big_(c)prodMat() and big_(t)crossprodSelf() now use much less memory, and may be faster.
Add covar_from_df() to convert a data frame with factors/characters to a numeric matrix using one-hot encoding.
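A brief sketch of covar_from_df(), assuming the built-in iris data frame as input (chosen here only because it mixes numeric columns with a factor):

```r
library(bigstatsr)

# iris has four numeric columns and one factor column (Species).
mat <- covar_from_df(iris)

# The result is a numeric matrix with one row per row of the data frame;
# the factor is expanded into indicator columns.
str(mat)
head(mat)
```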

bigstatsr 1.1.4

Remove some ‘Suggests’ dependencies.

bigstatsr 1.1.3

Add a new column $all_conv to output of summary() for big_spLinReg() and big_spLogReg() to check whether all models have stopped because of “no more improvement”. Also add a new parameter sort to summary().
Now warn (enabled by default) if some models may not have reached a minimum when using big_spLinReg() and big_spLogReg().

bigstatsr 1.1.1

Fix warning "In .self$nrow * .self$ncol : NAs produced by integer overflow".

bigstatsr 1.1.0

Make two different memory-mappings: one that is read-only (using $address) and one where it is possible to write (using $address_rw). This makes it possible to use file permissions to prevent modifying data.
Also add a new field $is_read_only to be used to prevent modifying data (at least with <-) even when you have write permissions to it. Functions creating an FBM now gain a parameter is_read_only.
Make vector accessors (e.g. X[1:10]) faster.

bigstatsr 1.0.0

Move some code to new packages {bigassertr} and {bigparallelr}.
big_randomSVD() gains arguments related to matrix-vector multiplication.
assert_noNA() is faster.

bigstatsr 0.9.10

Addbig_increment().

bigstatsr 0.9.9

In plot.big_SVD():

Can now plot many PCA scores (more than two) at once.
Use coord_fixed() when plotting PCA scores because it is good practice.
Use log-scale in scree plot to better see small differences in singular values.
Reexport cowplot::plot_grid() to merge multiple ggplots.

bigstatsr 0.9.6

AUCBoot() is now 6-7 times faster.

bigstatsr 0.9.5

Add parameters center and scale to products.

bigstatsr 0.9.3

Fix a bug in big_univLogReg() for variables with no variation. IRLS was not converging, so glm() was used instead. The problem is that glm() drops dimensions causing singularities, so the Z-score of the first covariate (or intercept) was used instead of a missing value.

bigstatsr 0.9.0

Use mio instead of boost for memory-mapping.
Add a parameter base.row to predict.big_sp_list() and automatically detect if needed (as well as for covar.row).
Possibility to subset a big_sp_list without losing attributes, so that one can access one model (corresponding to one alpha) even if it is not the ‘best’.
Add parameters pf.X and pf.covar in big_sp***Reg() to provide different penalization for each variable (possibly no penalization at all).

bigstatsr 0.8.4

Add %*%, crossprod and tcrossprod operations for ‘double’ FBMs.

bigstatsr 0.8.3

Now also returns the number of non-zero variables ($nb_active) and the number of candidate variables ($nb_candidate) for each step of the regularization paths of big_spLinReg() and big_spLogReg().

bigstatsr 0.8.0

Parameters warn and return.all of big_spLinReg() and big_spLogReg() are deprecated; now always return the maximum information. Now provide two methods (summary and plot) to get a quick assessment of the fitted models.

bigstatsr 0.7.3

Check for missing values in input vectors (indices and targets) and matrices (covariables).
AUC() is now stricter: it accepts only 0s and 1s for target.

bigstatsr 0.7.1

$bm() and $bm.desc() have been added in order to get an FBM as a filebacked.big.matrix. This enables using {bigmemory} functions.

bigstatsr 0.7.0

Type float added.

bigstatsr 0.6.2

big_write added.

bigstatsr 0.6.1

big_read now has a filter argument to filter rows, and argument nrow has been removed because it is now determined when reading the first block of data.
Removed the save argument from FBM (and others); now, you must use FBM(...)$save() instead of FBM(..., save = TRUE).
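The new pattern can be sketched as follows. The dimensions and variable names are arbitrary, and big_attach() is not part of this entry; it is used here only as one way to reload the saved object:

```r
library(bigstatsr)

# Create an FBM and save its R object next to the backing file.
# This replaces the former FBM(5, 3, init = 0, save = TRUE).
X <- FBM(5, 3, init = 0)$save()

# Path of the .rds file written by $save().
print(X$rds)

# Reattach the FBM from that file, e.g. in a later session.
X2 <- big_attach(X$rds)
dim(X2)
```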

bigstatsr 0.6.0

You can now fill an FBM using a data frame. Note that factorswill be used as integers.
Package {bigreadr} has been developed and is now used by big_read.

bigstatsr 0.5.0

There have been some changes regarding how conversion between types is checked. Before, you would get a warning for any possible loss of precision (without actually checking it). Now, any loss of precision due to conversion between types is reported as a warning, and only in this case. If you want to disable this feature, you can use options(bigstatsr.downcast.warning = FALSE), or you can use without_downcast_warning() to disable this warning for one call.
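The following sketch illustrates the behaviour described above, assuming a small integer FBM created only for the example:

```r
library(bigstatsr)

X <- FBM(2, 2, type = "integer", init = 0)

# 2 can be stored exactly as an integer, so no warning is expected.
X[1, 1] <- 2

# 1.5 cannot be stored exactly in an integer FBM: this should warn.
# The warning is caught here and shown as a message.
tryCatch(X[1, 2] <- 1.5,
         warning = function(w) message("Warning caught: ", conditionMessage(w)))

# Disable the warning for one call only.
without_downcast_warning(X[2, 1] <- 1.5)

# Disable the warning globally, then restore the previous setting.
old_opts <- options(bigstatsr.downcast.warning = FALSE)
X[2, 2] <- 1.5
options(old_opts)
```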

bigstatsr 0.4.1

Change big_read so that it is faster (corresponding vignette updated).

bigstatsr 0.4.0

Possibility to add a “base predictor” for big_spLinReg and big_spLogReg.
Don’t store the whole regularization path (as a sparse matrix) in big_spLinReg and big_spLogReg anymore because it caused major slowdowns.
Directly average the K predictions in predict.big_sp_best_list.
Only use the “PSOCK” type of cluster because “FORK” can leave zombies behind. You can change this with options(bigstatsr.cluster.type = "PSOCK").

bigstatsr 0.3.4

Fix a bug in big_spLinReg related to the computation of summaries.
Now provides function plus to be used as the combine argument in big_apply and big_parallelize instead of '+'.

bigstatsr 0.3.3

Before, this package used only the “PSOCK” type of cluster, which has some significant overhead. Now, it uses the “FORK” type on non-Windows systems. You can change this with options(bigstatsr.cluster.type = "PSOCK"). Note: “PSOCK” is used again as of 0.4.0.

bigstatsr 0.3.2

You can now provide multiple $\alpha$ values (as a numeric vector) in big_spLinReg and big_spLogReg. One will be chosen by grid-search.

bigstatsr 0.3.1

Fixed a bug in big_prodMat when using a dimension of 1 or 0.

bigstatsr 0.3.0

Package {bigstatsr} is published in Bioinformatics.

bigstatsr 0.2.6

No scaling is used by default for big_crossprod, big_tcrossprod, big_SVD and big_randomSVD (before, there was no default at all).

bigstatsr 0.2.4

Integrate Cross-Model Selection and Averaging (CMSA) directly in big_spLinReg and big_spLogReg, a procedure that automatically chooses the value of the $\lambda$ hyper-parameter.
Speed up big_spLinReg and big_spLogReg (issue #12).

bigstatsr 0.2.3

Speed up AUC computations.

bigstatsr 0.2.0

No longer use the big.matrix format of package bigmemory.
