Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Fast metagenome analysis toolkit

License

NotificationsYou must be signed in to change notification settings

ctlab/metafast

Repository files navigation

Build

MetaFast

MetaFast

MetaFast (METAgenome FAST analysis toolkit) is a toolkit for calculating a number of statistics ofmetagenome sequences and building the distance matrix between them. It also provides a functionalityto extract features from metagenomic samples.

Authors:

  • Software:Artem Ivanov,Sergey Kazakov andVladimir Ulyantsev,
    ITMO University, Saint-Petersburg, Russia.
  • Testing:Veronika Dubinkina andAlexandr Tyakht,
    SRI of Physical-Chemical Medicine, Moscow, Russia.
  • Idea, supervisor:Dmitry Alexeev,
    SRI of Physical-Chemical Medicine, Moscow, Russia.

MetaFast documentation is available on the GitHubwiki page.
Here is a short version of it.

Content

Installation

To run MetaFast you need to have JRE 1.6 or higher installed and only one script –metafast.sh,metafast.bat ormetafast.jar.

You can download it from the last stable release in the GitHub'Releases' section.

  • ForLinux andMac OS: downloadmetafast.sh, run the commandchmod a+x metafast.sh, then run./metafast.sh from the command line.
  • ForWindows: downloadmetafast.bat and run it from the command line.
  • For other OS: downloadmetafast.jar and run it via commandjava -jar metafast.jar.

Alternatively, you can build the newest version of the MetaFast from the repository:

git clone https://github.com/ctlab/metafast.gitcd metafastant./out/metafast.sh --version

MetaFast 1.3

A new version of MetaFast software is being prepared for the release. New pipelines for comparative metagenomics data analysis have been implemented. Four recommended use cases (including the original one) and a detailed description of available tools are presented inPipelines.md

Running instructions

To run MetaFast use the following syntax:

  • metafast.sh [<Launch options>] [<Input parameters>]
  • metafast.bat [<Launch options>] [<Input parameters>]
  • java -jar metafast.jar [<Launch options>] [<Input parameters>]

To view help for launch options and input parameters runmetafast.sh --help ormetafast.sh --help-all.

By running MetaFast a working directory is created (by default./workDir/).All intermidiate files, log file and final results are saved in it.

Fileoutput_description.txt is created after every run in the current and working directories.It contains the description of every output file produced by the MetaFast.

Metafast run script also allows you to run subtools of whole process or different tools, that was included into the package.To see the list of available additional tools, runmetafast.sh --tools.

Examples

Generatedin-silico dataset

Downloadmeta_test_1.fa,meta_test_2.fa andmeta_test_3.fa and run the command:

./metafast.sh -i meta_test_1.fa meta_test_2.fa meta_test_3.fa

After it has finished, a distance matrix can be found inworkDir/matrices/dist_matrix_<date>_<time>_original_order.txt:

#       meta_test_1     meta_test_2     meta_test_3meta_test_1     0.0000  0.5691  0.2981meta_test_2     0.5691  0.0000  0.8448meta_test_3     0.2981  0.8448  0.0000

The elementmatrix[i][j] is a distance betweensample i andsample j.

K-mers frequency statistics is saved inworkDir/kmer-counter-many/stats/<in-file>.stat.txt;
image file with heatmap and dendrogram is saved inworkDir/matrices/dist_matrix_<date>_<time>_heatmap.png:
Test heatmap

Mock community example

For testing on realistic metagenomic communities, the CAMI Challenge dataset was used. The detailed example description is available in fileExample.md.

FAQ

Q Does MetaFast works with paired-end reads?

A Yes, MetaFast can process paired-end reads. For correct detection, files should be named with suffixes "_R1"&"_R2" or "_r1"&"_r2" after sample name before extension. For example, sample_r1.fastq & sample_r2.fastq, or reads_R1.fq.gz & reads_R2.fq.gz

Q Can I compare samples with different read lengths or the number of reads?

A Yes, you can do this without any specific preprocessing. MetaFast uses k-mers for comparing metagenomic sequences, so it will automatically normalize all values by the total number of k-mers per sample.

Q Can I compare metagenomes obtained from different sequencing platforms (i.e. Illumina vs 454-seq)

A Yes, you can. MetaFast extract features from each metagenome independently, so you can compare samples from different sequencing platforms.

Citation

If you use MetaFast in your research, please cite the following publication:

Ulyantsev V.I., Kazakov S.V., Dubinkina V.B., Tyakht A.V. & Alexeev D.G. (2016).MetaFast: fast reference-free graph-based comparison of shotgun metagenomic data.Bioinformatics, 32(18), 2760-2767.doi: 10.1093/bioinformatics/btw312

Contact

Please report any problems directly to the GitHubissue tracker.
Also, you can send your feedback toabivanov@itmo.ru.

License

The MIT License (MIT)

Publications using MetaFast

There are several papers about bioinformatics projects, which used various MetaFast pipelines for data analysis:

  • Analysis of human gut microbiota of patients with Crohn's disease, ulcerative colitis and healthy controls
    Khachatryan, L., Xiang, Y., Ivanov, A., Glaab, E., Graham, G., Granata, I., ... & Poussin, C. (2023). Results and lessons learned from the sbv IMPROVER metagenomics diagnostics for inflammatory bowel disease challenge. Scientific Reports, 13(1),doi: 10.1038/s41598-023-33050-0
  • Analysis of human gut microbiota of patients undergoing melanoma immunotherapy
    Olekhnovich, E. I., Ivanov, A. B., Babkina, A. A., Sokolov, A. A., Ulyantsev, V. I., Fedorov, D. E., & Ilina, E. N. (2023). Consistent Stool Metagenomic Biomarkers Associated with the Response To Melanoma Immunotherapy. Msystems, 8(2), e01023-22.doi: 10.1128/msystems.01023-22
  • Analysis of gut microbiota time-series samples from patients undergoing microbiome transplantationOlekhnovich, E. I., Ivanov, A. B., Ulyantsev, V. I., & Ilina, E. N. (2021). Separation of donor and recipient microbial diversity allows determination of taxonomic and functional features of gut microbiota restructuring following fecal transplantation. Msystems, 6(4), e00811-21.doi: 10.1128/msystems.00811-21

See also

  • MetaCherchant – a tool for analysing genomic environment within a metagenome.
  • RECAST – a tool for sorting reads per their origin in metagenomic time series.
  • khmer – a toolkit to split reads.
  • crAss – Cross-Assembly of Metagenomes.
  • MaryGold – Variation analysis of metagenomic samples.

About

Fast metagenome analysis toolkit

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors9

Languages


[8]ページ先頭

©2009-2025 Movatter.jp