Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

The multivariate MArginal ePIstasis Test

License

NotificationsYou must be signed in to change notification settings

lcrawlab/mvMAPIT

Repository files navigation

R CMD checkDocker Image CICRAN downloadsCRAN_Status_Badge

Find the full package documentation including examples and articles here:Multivariate MAPIT Documentation.

The multivariate MArginal ePIstasis Test (mvMAPIT)

This R package is a generalization of theMAPITimplementation by Crawford etal. (2017)1 for any number of traits as described by Stamp et al. (2023)2.The univariate MAPIT test for marginal epistasis is implemented as the specialcase of running multivariate MAPIT with a single trait.

mvMAPIT is implemented as a set of R and C++ routines, which can becarried out within an R environment.

Introduction

Epistasis, commonly defined as the interaction between genetic loci, is known toplay an important role in the phenotypic variation of complex traits. As aresult, many statistical methods have been developed to identify genetic variantsthat are involved in epistasis, and nearly all of these approaches carry outthis task by focusing on analyzing one trait at a time. However, because of thelarge combinatorial search space of interactions, most epistasis mappingmethods face enormous computational challenges and often suffer from lowstatistical power.

Previous studies have shown that jointly modeling multiple phenotypes can oftendramatically increase statistical power for association mapping. Therefore, herewe present themultivariate MArginal ePIstasis Test (mvMAPIT) – amulti-outcome generalization of a recently proposed epistatic detection methodwhich seeks to detectmarginal epistasis or the combined pairwise interactioneffects between a given variant and all other variants. By searching for marginalepistatic effects, one can identify genetic variants that are involved inepistasis without the need to identify the exact partners with which the variantsinteract – thus, potentially alleviating much of the statistical and computationalburden associated with conventional explicit search based methods. Our proposedmvMAPIT builds upon this strategy by leveraging correlation structures betweentraits to improve the identification of variants involved in epistasis. Weformulate mvMAPIT as a multivariate linear mixed model and develop a multi-traitvariance component estimation algorithm for efficient parameter inference andP-value computation. Together with reasonable model approximations, our proposedapproach is scalable to moderately sized GWA studies.

The Method

Themultivariate MArginal ePIstasis Test is a multi-outcome extension of thestatistical framework MAPIT which aims to identify variants that are involved inepistatic interactions by leveraging the correlation structure of non-additivegenetic variation that is shared between multiple traits. The key idea behind theconcept of marginal epistasis is to identify variants that are involved inepistasis while avoiding the need to explicitly conduct an exhaustive search overall possible pairwise interactions. As an overview of mvMAPIT and itscorresponding software implementation, we will assume that we have access to aGWA study onN individuals denoted asD = {X,Y} whereX is anN x J matrixof genotypes withJ denoting the number of SNPs (each of which is encoded as{0,1,2} copies of a reference allele at each locusj) andY denoting aN x Dmatrix holdingD different traits that are measured for each of theNindividuals.

The goal of mvMAPIT is to identify variants that have non-zero interaction effectswith any other variant in the data. To accomplish this, we examine each SNP inturn and assess the null hypothesis that its corresponding variance component is zero. Inpractice, we use a computationally efficient method of moments algorithm called MQS from Zhou (2017)3to estimate model parameters and to carry out calibrated statistical tests withinmvMAPIT.

Installation

The package needs compilation but the released version can be installed fromCRAN.

install.packages("mvMAPIT")

The R Environment

R is a widely used, free, and open source software environment forstatistical computing and graphics. The most recent version of R can bedownloaded from theComprehensive R Archive Network(CRAN). CRAN provides precompiled binaryversions of R for Windows, macOS, and select Linux distributions thatare likely sufficient for many users' needs. Users can also install Rfrom source code; however, this may require a significant amount ofeffort. For specific details on how to compile, install, and manage Rand R-packages, refer to the manualR Installation andAdministration.

R Packages Required for mvMAPIT

mvMAPIT requires the installation of the following R libraries:

The easiest method to install these packages is with the followingexample command entered in an R shell:

install.packages(c( 'checkmate',                     'CompQuadForm',                     'dplyr',                     'foreach',                     'harmonicmeanp',                     'logging',                     'mvtnorm',                     'Rcpp',                     'RcppAlgos',                     'RcppArmadillo',                     'RcppParallel',                     'RcppProgress',                     'RcppSpdlog',                     'testthat',                     'tidyr',                     'truncnorm'),                     dependencies = TRUE);

Alternatively, one can alsoinstall R packages from thecommand-line.

Installing mvMAPIT from Sources

The easiest way to install the package from sources is to change intothe directory of mvMAPIT and runR CMD INSTALL . --preclean. The--preclean flag makes sure that the latest state is run.

C++ Functions Required for MAPIT

The code in this repository assumes that basic Fortran and C++ libraries and compilers are already set up on the running personal computer orcluster. If not, the mvMAPIT functions and necessary Rcpp packages willnot work properly. A simple option is to usegcc. macOS users may use this collection byinstalling theHomebrew package manager andthen typing the following into the terminal:

brew install gcc

OpenMP

Note that mvMAPIT takes advantage ofOpenMP, anAPI for multi-platform shared-memory parallel programming in C/C++. Thisis to speed up the computational time of the modeling algorithm.Unfortunately, macOS does not currently support OpenMP under the defaultcompiler. A work around to use OpenMP in R on macOS can be foundhere.mvMAPIT can be compiled without OpenMP, but we recommend using it ifapplicable for scalability.

Known Issues


Questions and Feedback

For questions or concerns with the MAPIT functions, please contactLorin Crawford orJulian Stamp.

We appreciate any feedback you may have with our repository and instructions.

References

Footnotes

  1. L. Crawford, P. Zeng, S. Mukherjee, X. Zhou (2017). Detectingepistasis with the marginal epistasis test in genetic mappingstudies of quantitative traits.PLoS Genet.13(7): e1006869.https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1006869

  2. J. Stamp, A. DenAdel, D. Weinreich, L. Crawford (2023). Leveraging theGenetic Correlation between Traits Improves the Detection of Epistasis inGenome-wide Association Studies.G3 Genes|Genomes|Genetics,13(8), jkad118. doi:https://doi.org/10.1093/g3journal/jkad118

  3. X. Zhou (2017). A unified framework for variance component estimation with summary statisticsin genome-wide association studies.Ann Appl Stat.11(4): 2027-2051.https://projecteuclid.org/journals/annals-of-applied-statistics/volume-11/issue-4/A-unified-framework-for-variance-component-estimation-with-summary-statistics/10.1214/17-AOAS1052.full


[8]ページ先頭

©2009-2025 Movatter.jp