- Letter
- Published:
A sequence-based variation map of 8.27 million SNPs in inbred mouse strains
- Kelly A. Frazer1,
- Eleazar Eskin2,
- Hyun Min Kang3,
- Molly A. Bogue4,
- David A. Hinds1,
- Erica J. Beilharz1,
- Robert V. Gupta1,
- Julie Montgomery1,
- Matt M. Morenzoni1,
- Geoffrey B. Nilsen1,
- Charit L. Pethiyagoda1,
- Laura L. Stuve1,
- Frank M. Johnson5,
- Mark J. Daly6,7,
- Claire M. Wade6,7 &
- …
- David R. Cox1
Naturevolume 448, pages1050–1053 (2007)Cite this article
4197Accesses
22Altmetric
Abstract
A dense map of genetic variation in the laboratory mouse genome will provide insights into the evolutionary history of the species1 and lead to an improved understanding of the relationship between inter-strain genotypic and phenotypic differences. Here we resequence the genomes of four wild-derived and eleven classical strains. We identify 8.27 million high-quality single nucleotide polymorphisms (SNPs) densely distributed across the genome, and determine the locations of the high (divergent subspecies ancestry) and low (common subspecies ancestry) SNP-rate intervals2,3,4,5,6 for every pairwise combination of classical strains. Using these data, we generate a genome-wide haplotype map containing 40,898 segments, each with an average of three distinct ancestral haplotypes. For the haplotypes in the classical strains that are unequivocally assigned ancestry, the genetic contributions of theMus musculus subspecies—M. m. domesticus, M. m. musculus, M. m. castaneus and the hybridM. m. molossinus—are 68%, 6%, 3% and 10%, respectively; the remaining 13% of haplotypes are of unknown ancestral origin. The considerable regional redundancy of the SNP data will facilitate imputation of the majority of these genotypes in less-densely typed classical inbred strains to provide a complete view of variation in additional strains.
This is a preview of subscription content,access via your institution
Access options
Subscription info for Japanese customers
We have a dedicated website for our Japanese customers. Please go tonatureasia.com to subscribe to this journal.
Prices may be subject to local taxes which are calculated during checkout


Similar content being viewed by others

Inbred lab mice are not isogenic: genetic variation within inbred strains used to infer the mutation rate per nucleotide site
References
Beck, J. A. et al. Genealogies of mouse inbred strains.Nature Genet.24, 23–25 (2000)
Wade, C. M. & Daly, M. J. Genetic variation in laboratory mice.Nature Genet.37, 1175–1180 (2005)
Bonhomme, F., Guénet, J.-L., Dod, B., Moriwaki, K. & Bulfield, G. The polyphyletic origin of laboratory inbred mice and their rate of evolution.Biol.J. Linnean Soc.30, 51–58 (1987)
Petkov, P. M. et al. An efficient SNP system for mouse genome scanning and elucidating strain relationships.Genome Res.14, 1806–1811 (2004)
Ideraabdullah, F. Y. et al. Genetic and haplotype diversity among wild-derived mouse inbred strains.Genome Res.14, 1880–1887 (2004)
Frazer, K. A. et al. Segmental phylogenetic relationships of inbred mouse strains revealed by fine-scale analysis of sequence variation across 4.6 Mb of mouse genome.Genome Res.14, 1493–1500 (2004)
Wade, C. M. et al. The mosaic structure of variation in the laboratory mouse genome.Nature420, 574–578 (2002)
Wiltshire, T. et al. Genome-wide single-nucleotide polymorphism analysis defines haplotype patterns in mouse.Proc. Natl Acad. Sci. USA100, 3380–3385 (2003)
Yalcin, B. et al. Unexpected complexity in the haplotypes of commonly used inbred strains of laboratory mice.Proc. Natl Acad. Sci. USA101, 9734–9739 (2004)
Zhang, J. et al. A high-resolution multistrain haplotype analysis of laboratory mouse genome reveals three distinctive genetic variation patterns.Genome Res.15, 241–249 (2005)
Waterston, R. H. et al. Initial sequencing and comparative analysis of the mouse genome.Nature420, 520–562 (2002)
Chee, M. et al. Accessing genetic information with high-density DNA arrays.Science274, 610–614 (1996)
Hinds, D. A. et al. Whole-genome patterns of common DNA variation in three human populations.Science307, 1072–1079 (2005)
Patil, N. et al. Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.Science294, 1719–1723 (2001)
Mural, R. J. et al. A comparison of whole-genome shotgun-derived mouse chromosome 16 and the human genome.Science296, 1661–1671 (2002)
Silver, L. M.Mouse Genetics: Concepts and Applications (Oxford University Press, New York, 1995)
Stein, L. D. et al. The generic genome browser: a building block for a model organism system database.Genome Res.12, 1599–1610 (2002)
Singer, J. B. et al. Genetic dissection of complex traits with chromosome substitution strains of mice.Science304, 445–448 (2004)
Grubb, S. C., Churchill, G. A. & Bogue, M. A. A collaborative database of inbred mouse strain characteristics.Bioinformatics20, 2857–2859 (2004)
Bogue, M. A. & Grubb, S. C. The Mouse Phenome Project.Genetica122, 71–74 (2004)
Bogue, M. Mouse Phenome Project: understanding human biology through mouse genetics and genomics.J. Appl. Physiol.95, 1335–1337 (2003)
Wahlsten, D., Metten, P. & Crabbe, J. C. Survey of 21 inbred mouse strains in two laboratories reveals that BTBR T/+ tf/tf has severely reduced hippocampal commissure and absent corpus callosum.Brain Res.971, 47–54 (2003)
Moy, S. S. et al. Mouse behavioral tasks relevant to autism: phenotypes of 10 inbred strains.Behav. Brain Res.176, 4–20 (2007)
Nakamura, M. A diabetic strain of the mouse.Proc. Jap. Acad.38, 348–352 (1962)
Ewing, B. & Green, P. Base-calling of automated sequencer traces usingPhred. II. Error probabilities.Genome Res.8, 186–194 (1998)
Viterbi, A. J. Error bounds for convolutional codes and an asymptotically optimal decoding algorithm.IEEE Trans. Information Theory13, 260–269 (1967)
Acknowledgements
Work was supported by funding from the NIEHS. H.M.K. and E.E. are partially supported by the NSF. H.M.K. is partially supported by a Samsung Scholarship. E.E. is partially supported by the NIH. At Perlegen Sciences, we thank A. Kloek for assistance with manuscript preparation; B. Nguyen, X. Chen, P. Chu, R. Patel, P.-E. Jiao, R. Irikat and J. Kwon for assistance with DNA sample preparation and hybridization of the high-density oligonucleotide arrays; R, Vergara for primer handling; H. Huang and W. Barrett for designing the high-density arrays; T. Genschoreck and J. Sheehan for data quality control; and S. Osborn for assistance with website development and data delivery. At NIEHS, we thank D. A. Schwartz, K. Olden, S. Wilson, L. Birnbaumer, J. Bucher, W. T. Schrader and D. M. Klotz for constructive scientific discussions, and J. A. Lewis and T. Hardee for administrative support. At The Jackson Laboratory, we thank S. Deveau and JAX DNA Resources for DNA sample preparation.
Author Contributions L.L.S., J.M., C.L.P. and K.A.F. supervised the experiments. K.A.F., D.R.C., M.J.D., F.M.J., E.J.B. and M.A.B designed the study. H.M.K., E.E, C.M.W., D.A.H., G.B.N., R.V.G. and M.M.M. performed data analysis. K.A.F., with help from E.J.B., D.A.H., E.E. and C.M.W., wrote the manuscript.
Author information
Authors and Affiliations
Perlegen Sciences, 2021 Stierlin Court, Mountain View, California 94043, USA,
Kelly A. Frazer, David A. Hinds, Erica J. Beilharz, Robert V. Gupta, Julie Montgomery, Matt M. Morenzoni, Geoffrey B. Nilsen, Charit L. Pethiyagoda, Laura L. Stuve & David R. Cox
Department of Computer Science and Department of Human Genetics, University of California, Los Angeles, Los Angeles, California 90095, USA,
Eleazar Eskin
Department of Computer Science and Engineering, University of California, San Diego, La Jolla, California 92093, USA,
Hyun Min Kang
The Jackson Laboratory, Bar Harbor, Maine 04609, USA,
Molly A. Bogue
Toxicology Operations Branch, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA,
Frank M. Johnson
Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA,
Mark J. Daly & Claire M. Wade
Center for Human Genetic Research, Massachussets General Hospital, 185 Cambridge St, Boston, Massachusetts 02114, USA,
Mark J. Daly & Claire M. Wade
- Kelly A. Frazer
You can also search for this author inPubMed Google Scholar
- Eleazar Eskin
You can also search for this author inPubMed Google Scholar
- Hyun Min Kang
You can also search for this author inPubMed Google Scholar
- Molly A. Bogue
You can also search for this author inPubMed Google Scholar
- David A. Hinds
You can also search for this author inPubMed Google Scholar
- Erica J. Beilharz
You can also search for this author inPubMed Google Scholar
- Robert V. Gupta
You can also search for this author inPubMed Google Scholar
- Julie Montgomery
You can also search for this author inPubMed Google Scholar
- Matt M. Morenzoni
You can also search for this author inPubMed Google Scholar
- Geoffrey B. Nilsen
You can also search for this author inPubMed Google Scholar
- Charit L. Pethiyagoda
You can also search for this author inPubMed Google Scholar
- Laura L. Stuve
You can also search for this author inPubMed Google Scholar
- Frank M. Johnson
You can also search for this author inPubMed Google Scholar
- Mark J. Daly
You can also search for this author inPubMed Google Scholar
- Claire M. Wade
You can also search for this author inPubMed Google Scholar
- David R. Cox
You can also search for this author inPubMed Google Scholar
Corresponding author
Correspondence toKelly A. Frazer.
Ethics declarations
Competing interests
D.A.H., R.V.G., J.M., M.M.M., G.B.N., C.L.P., L.L.S. and D.R.C. are employed by Perlegen Sciences. K.A.F. is a former employee of Perlegen Sciences
Supplementary information
Supplementary Information
This file contains Supplementary Figures 1-7, Supplementary Tables S2-S16 and S21, Supplementary Discussion and additional references. (PDF 515 kb)
Supplementary Table S1
This file contains Supplementary Table S1 indicating sequences tiled on the arrays that were lightly-masked or unmasked. (XLS 1082 kb)
Supplementary Table S17
This file contains Supplementary Table S17 listing SNPs that produce nonsense mutations. (XLS 175 kb)
Supplementary Table S18
This file contains Supplementary Table S18 listing SNPs that produce altered translation start sites. (XLS 70 kb)
Supplementary Table S19
This file contains Supplementary Table S19 listing SNPs that produce altered translation stop sites. (XLS 61 kb)
Supplementary Table S20
This file contains Supplementary Table S20 listing SNPs that produce altered splice sites. (XLS 139 kb)
Rights and permissions
About this article
Cite this article
Frazer, K., Eskin, E., Kang, H.et al. A sequence-based variation map of 8.27 million SNPs in inbred mouse strains.Nature448, 1050–1053 (2007). https://doi.org/10.1038/nature06067
Received:
Accepted:
Published:
Issue Date: