Movatterモバイル変換


[0]ホーム

URL:


Publisher LogoGenome Research

Skip to main page content

Advanced Search
  • ONT_LB_GRRNA_RealTime

An efficient and scalable analysis framework for variant extraction and refinement from population-scale DNA sequence data

  1. Hyun Min Kang2
  1. 1Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas 77030, USA;
  2. 2Center for Statistical Genetics and Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, USA
  1. Corresponding author:hmkang{at}umich.edu

Abstract

The analysis of next-generation sequencing data is computationally and statistically challenging because of the massive volume of data and imperfect data quality. We present GotCloud, a pipeline for efficiently detecting and genotyping high-quality variants from large-scale sequencing data. GotCloud automates sequence alignment, sample-level quality control, variant calling, filtering of likely artifacts using machine-learning techniques, and genotype refinement using haplotype information. The pipeline can process thousands of samples in parallel and requires less computational resources than current alternatives. Experiments with whole-genome and exome-targeted sequence data generated by the 1000 Genomes Project show that the pipeline provides effective filtering against false positive variants and high power to detect true variants. Our pipeline has already contributed to variant detection and genotyping in several large-scale sequencing projects, including the 1000 Genomes Project and the NHLBI Exome Sequencing Project. We hope it will now prove useful to many medical sequencing studies.

Footnotes

  • ReceivedMarch 31, 2014.
  • AcceptedApril 13, 2015.

This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (seehttp://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described athttp://creativecommons.org/licenses/by-nc/4.0/.

  • Add to CiteULikeCiteULike
  • Add to DeliciousDelicious
  • Add to DiggDigg
  • Add to FacebookFacebook
  • Add to Google+Google+
  • Add to RedditReddit
  • Add to TwitterTwitter

What's this?

« Previous |Next Article »Table of Contents

This Article

  1. »AbstractFree
  2. Full TextFree
  3. Full Text (PDF)Free
  4. Supplemental Material
  5. All Versions of this Article:
    1. gr.176552.114v1
    2. gr.176552.114v2
    3. 25/6/918most recent

Article Category

Services

  1. Alert me when this article is cited
  2. Alert me if a correction is posted
  3. Similar articles in this journal
  4. Similar articles in Web of Science
  5. Article Metrics
  6. Similar articles in PubMed
  7. Download to citation manager
  8. Permissions

Citing Articles

  1. Load citing article information
  2. Citing articles via Web of Science
  3. Citing articles via Google Scholar

Google Scholar

  1. Articles by Jun, G.
  2. Articles by Kang, H. M.
  3. Search for related content

PubMed/NCBI

  1. PubMed citation
  2. Articles by Jun, G.
  3. Articles by Kang, H. M.

Share

    • Add to CiteULikeCiteULike
    • Add to DeliciousDelicious
    • Add to DiggDigg
    • Add to FacebookFacebook
    • Add to Google+Google+
    • Add to RedditReddit
    • Add to TwitterTwitter

    What's this?

Preprint Server



Current Issue

  1. February 2026, 36 (2)
  1. Current Issue
  1. Alert me to new issues of Genome Research

Copyright © 2026 by Cold Spring Harbor Laboratory Press

  • Print ISSN:1088-9051
  • Online ISSN:1549-5469

[8]ページ先頭

©2009-2026 Movatter.jp