Movatterモバイル変換


[0]ホーム

URL:


Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
Thehttps:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

NIH NLM Logo
Log inShow account info
Access keysNCBI HomepageMyNCBI HomepageMain ContentMain Navigation
pubmed logo
Advanced Clipboard
User Guide

Full text links

PeerJ, Inc. full text link PeerJ, Inc. Free PMC article
Full text links

Actions

Share

doi: 10.7717/peerj.431. eCollection 2014.

dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms

Affiliations

dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms

Jonathan B Puritz et al. PeerJ..

Abstract

Restriction-site associated DNA sequencing (RADseq) has become a powerful and useful approach for population genomics. Currently, no software exists that utilizes both paired-end reads from RADseq data to efficiently produce population-informative variant calls, especially for non-model organisms with large effective population sizes and high levels of genetic polymorphism. dDocent is an analysis pipeline with a user-friendly, command-line interface designed to process individually barcoded RADseq data (with double cut sites) into informative SNPs/Indels for population-level analyses. The pipeline, written in BASH, uses data reduction techniques and other stand-alone software packages to perform quality trimming and adapter removal, de novo assembly of RAD loci, read mapping, SNP and Indel calling, and baseline data filtering. Double-digest RAD data from population pairings of three different marine fishes were used to compare dDocent with Stacks, the first generally available, widely used pipeline for analysis of RADseq data. dDocent consistently identified more SNPs shared across greater numbers of individuals and with higher levels of coverage. This is due to the fact that dDocent quality trims instead of filtering, incorporates both forward and reverse reads (including reads with INDEL polymorphisms) in assembly, mapping, and SNP calling. The pipeline and a comprehensive user guide can be found at http://dDocent.wordpress.com.

Keywords: Bioinformatics; Molecular ecology; Next-generation sequencing; Population genomics; RADseq.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Levels of coverage for each unique read in the red snapper data set.
The horizontal axis represents the minimal level of coverage, while the vertical axis represents the number of unique paired reads in thousands.
Figure 2
Figure 2. SNP results averaged across the three different run parameters fordDocent andStacks.
(A) Red snapper, (B) Red drum, (C) Silk snapper (see Methods or Table 1 for SNP categories description). Error bars represent one standard error.
See this image and copyright information in PMC

Similar articles

See all similar articles

Cited by

See all "Cited by" articles

References

    1. Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, Selker EU, Cresko WA, Johnson EA. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE. 2008;3:e431. doi: 10.1371/journal.pone.0003376. - DOI - PMC - PubMed
    1. Catchen JM, Amores A, Hohenlohe P, Cresko W, Postlethwait JH. Stacks: building and genotyping Loci de novo from short-read sequences. G3. 2011;1:171–182. doi: 10.1534/g3.111.000240. - DOI - PMC - PubMed
    1. Catchen J, Hohenlohe PA, Bassham S, Amores A, Cresko WA. Stacks: an analysis tool set for population genomics. Molecular Ecology. 2013;22:3124–3140. doi: 10.1111/mec.12354. - DOI - PMC - PubMed
    1. Chong Z, Ruan J, Wu C-I. Rainbow: an integrated tool for efficient clustering and assembling RAD-seq reads. Bioinformatics. 2012;28.21:2732–2737. doi: 10.1093/bioinformatics/bts482. - DOI - PubMed
    1. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R. The variant call format and VCFtools. Bioinformatics. 2011;27:2156–2158. doi: 10.1093/bioinformatics/btr330. - DOI - PMC - PubMed

Grants and funding

This work was supported by Award #NA10NMF4270199 from the Saltonstall-Kennedy Program of the National Marine Fisheries Service (Department of Commerce/National Oceanic and Atmospheric Administration), Award #NA12NMF4330093 from the Marfin Program of the National Marine Fisheries Service, Award #NA12NMF4540082 from the Cooperative Research Program of the National Marine Fisheries Service, and Award # NA10OAR4170099 from the National Oceanic and Atmospheric Administration to Texas Sea Grant, and by TexasAgriLife under Project H-6703. The statements, findings, conclusions, and recommendations are those of the author(s) and do not necessarily reflect the views of the National Marine Fisheries Service, the National Oceanic and Atmospheric Administration (NOAA), the U.S. Department of Commerce, Texas Sea Grant, or Texas AgriLife. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

LinkOut - more resources

Full text links
PeerJ, Inc. full text link PeerJ, Inc. Free PMC article
Cite
Send To

NCBI Literature Resources

MeSHPMCBookshelfDisclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.


[8]ページ先頭

©2009-2025 Movatter.jp