biobakery/melonnpanPublic

NotificationsYou must be signed in to change notification settings
Fork8
Star37

Model-based Genomically Informed High-dimensional Predictor of Microbial Community Metabolic Profiles

License

MIT license

37 stars 8 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.github		.github
R		R
data		data
man		man
vignettes		vignettes
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
LICENSE.txt		LICENSE.txt
NAMESPACE		NAMESPACE
NEWS		NEWS
README.md		README.md
predict_metabolites.R		predict_metabolites.R
train_metabolites.R		train_metabolites.R

Repository files navigation

MelonnPan - Model-based Genomically Informed High-dimensional Predictor of Microbial Community Metabolic Profiles

Himel Mallick2023-02-14

Introduction

MelonnPan is a computational method for predicting metabolite compositions from microbiome sequencing data.

MelonnPan is composed of two high-level workflows:MelonnPan-Predict andMelonnPan-Train.

TheMelonnPan-Predict workflow takes a table of microbial sequence features (i.e., taxonomic or functional abundances on a per sample basis) as input, and outputs a predicted metabolomic table (i.e., relative abundances of metabolite compounds across samples).

TheMelonnPan-Train workflow creates an weight matrix that links an optimal set of sequence features to a subset of predictable metabolites following rigorous internal validation, which is then used to generate a table of predicted metabolite compounds (i.e., relative abundances of metabolite compounds per sample). When sufficiently accurate, these predicted metabolite relative abundances can be used for downstream statistical analysis and end-to-end biomarker discovery.

How to Install

There are two options for installingMelonnPan:

From R

In R, you can installMelonnPan using thedevtools package as follows (execute from within a fresh R session):

install.packages('devtools')# Install devtools if not installed alreadylibrary(devtools)# Load devtoolsdevtools::install_github("biobakery/melonnpan")# Install MelonnPan

From the command line

Clone the repository usinggit clone, which downloads the package as its own directory calledmelonnpan.

git clone https://github.com/biobakery/melonnpan.git

Then, install MelonnPan usingR CMD INSTALL.

R CMD INSTALL melonnpan

Usage

MelonnPan can be run from the command line or from within R. Both methods require the same arguments, have the same options, and use the same default settings. Check out theMelonnPan tutorial for an example application.

The defaultMelonnPan-Predict function can be run by executing the scriptpredict_metabolites.R from the command line or within R using the functionmelonnpan.predict(). Currently it uses apre-trained model from the human gut based on UniRef90 gene families (functionally profiled byHUMAnN2), as described in Franzosa et al. (2019) and the original MelonnPan paper (Mallick et al., 2019), which is included in the package and can also be downloaded from thedata/ sub-directory (melonnpan.trained.model.txt).
If you have paired metabolite and microbial sequencing data (possibly measured from the same biospecimen), you can also train a MelonnPan model by running the scripttrain_metabolites.R from the command line or within R using the functionmelonnpan.train().
MelonnPan currently requires input data that is specified using UniRef90 gene families (functionally profiled byHUMAnN2). If you do not have functionally profiled UniRef90 gene families from the human gut or other environments, you may need to first train a MelonnPan model using theMelonnPan-Train workflow and supply the resulting weights to theMelonnPan-Predict module to get the relevant predictions.

Input

MelonnPan-Predict workflow requires the following input:
- a table of microbial sequence features' relative abundances (samples in rows)
MelonnPan-Train workflow requires the following inputs:
- a table of metabolite relative abundances (samples in rows)
- a table of microbial sequence features' relative abundances (samples in rows)
For a complete description of the possible parameters for specificMelonnPan functions and their default values and output, run the help within R with the? operator.

Output

TheMelonnPan-Predict workflow outputs the following:
- MelonnPan_Predicted_Metabolites.txt: Predicted relative abundances of metabolites as determined byMelonnPan-Predict.
- MelonnPan_RTSI.txt: Table summarizing RTSI scores per sample.
Similarly, theMelonnPan-Train workflow outputs the following:
- MelonnPan_Training_Summary.txt: Significant compounds list with per-compound prediction accuracy (correlation coefficient) and the associated p-value and q-value.
- MelonnPan_Trained_Metabolites.txt: Predicted relative abundances of statisticially significant metabolites as determined byMelonnPan-Train.
- MelonnPan_Trained_Weights.txt: Table summarizing coefficient estimates (weights) per compound.

References

Zou H, Hastie T (2005). Regularization and Variable Selection via the Elastic Net.Journal of the Royal Statistical Society. Series B (Methodological) 67(2):301–320.

Franzosa EA et al. (2019).Gut microbiome structure and metabolic activity in inflammatory bowel disease.Nature Microbiology 4(2):293–305.

Citation

Mallick H, Franzosa EA, McIver LJ, Banerjee S, Sirota-Madi A, Kostic AD, Clish CB, Vlamakis H, Xavier R, Huttenhower C (2019).Predictive metabolomic profiling of microbial communities using amplicon or metagenomic sequences.Nature Communications 10(1):3136-3146.

About

Model-based Genomically Informed High-dimensional Predictor of Microbial Community Metabolic Profiles

huttenhower.sph.harvard.edu/melonnpan

Languages

R100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

MelonnPan - Model-based Genomically Informed High-dimensional Predictor of Microbial Community Metabolic Profiles

Introduction

How to Install

From R

From the command line

Usage

Input

Output

References

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Contributors3

Uh oh!

Languages

Movatterモバイル変換

License

biobakery/melonnpan

Folders and files

Latest commit

History

Repository files navigation

MelonnPan - Model-based Genomically Informed High-dimensional Predictor of Microbial Community Metabolic Profiles

Introduction

How to Install

From R

From the command line

Usage

Input

Output

References

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Contributors3

Uh oh!

Languages

Packages