- Short Review
- Published:
Combining population genomics and quantitative genetics: finding the genes underlying ecologically important traits
Heredityvolume 100, pages158–170 (2008)Cite this article
14kAccesses
557Citations
6Altmetric
Abstract
A central challenge in evolutionary biology is to identify genes underlying ecologically important traits and describe the fitness consequences of naturally occurring variation at these loci. To address this goal, several novel approaches have been developed, including ‘population genomics,’ where a large number of molecular markers are scored in individuals from different environments with the goal of identifying markers showing unusual patterns of variation, potentially due to selection at linked sites. Such approaches are appealing because of (1) the increasing ease of generating large numbers of genetic markers, (2) the ability to scan the genome without measuring phenotypes and (3) the simplicity of sampling individuals without knowledge of their breeding history. Although such approaches are inherently applicable to non-model systems, to date these studies have been limited in their ability to uncover functionally relevant genes. By contrast, quantitative genetics has a rich history, and more recently, quantitative trait locus (QTL) mapping has had some success in identifying genes underlying ecologically relevant variation even in novel systems. QTL mapping, however, requires (1) genetic markers that specifically differentiate parental forms, (2) a focus on a particular measurable phenotype and (3) controlled breeding and maintenance of large numbers of progeny. Here we present current advances and suggest future directions that take advantage of population genomics and quantitative genetic approaches – in both model and non-model systems. Specifically, we discuss advantages and limitations of each method and argue that a combination of the two provides a powerful approach to uncovering the molecular mechanisms responsible for adaptation.
Similar content being viewed by others
Introduction
Understanding the genetic basis of ecologically important traits – traits that increase an organism's ability to survive and reproduce in natural environments – has been and continues to be a central goal for ecological and evolutionary genetics (Feder and Mitchell-Olds, 2003). Identifying the genes for ecologically relevant traits will allow a host of important geneticand ecological questions to be answered: how many genes influence ecologically important traits, and what are their relative effect sizes (Orr and Coyne, 1992;Orr, 1998)? Do these genes show evidence of non-neutral evolution at the sequence level (Stahl et al., 1999;Tian et al., 2002;Mauricio et al., 2003)? What ecological and evolutionary forces lead to the maintenance of variation at these loci (Mitchell-Olds and Schmitt, 2006)? Do ecologically similar environments favor the same genes (Calboli et al., 2003;Colosimo et al., 2004,2005;Protas et al., 2006), or is it possible to achieve a similar phenotype with different genetic mechanisms (Hoekstra and Nachman, 2003;Hoekstra et al., 2006)? Answering these questions is not trivial, yet to begin to make progress on them, identifying the genes that influence ecologically important traits is a prerequisite. In addition, these questions must be answered in a number of organisms, including and extending beyond traditional model systems – representing diverse taxonomic groups, life histories and ecological roles – before a clear picture of the ecology and genetics of adaptation emerges. Here we review recent contributions of a relatively new approach, population genomics and an old-mainstay, quantitative genetics, to the challenge of finding genes that underlie ecologically important traits. We argue that combining these approaches provides a powerful and promising way to move from chromosomal regions to genes and even to mutations underlying adaptive phenotypic variation.
What is population genomics?
At its core, population genomics is simply population genetics writ large – that is, population genetic analyses of a large number of loci, distributed throughout the genome (Black et al., 2001;Luikart et al., 2003;Schlotterer, 2003). Population genomics can be narrowly defined as separating locus-specific effects (recombination, selection, mutation and so on) that affect one or a few loci at a time from genome-wide demographic effects (genetic bottlenecks, founder events, inbreeding and so on). By utilizing a large number of loci spread throughout the genome, the effects of selection on a beneficial mutation and neutral variation at flanking sites (genetic hitch-hiking;Maynard Smith and Haigh, 1974) can be compared to genome-wide demographic effects, which are not locus specific. As such, the population genomic approach can be described in four phases (Luikart et al., 2003): (1) sample many individuals, (2) genotype this large population sample for many independent loci, (3) identify statistical ‘outlier’ loci and (4) either estimate demographic parameters and statistics (e.g., FST, phylogeographic structure, evidence of past bottlenecks) in a large data set with outlier loci removed, or alternatively, study the outlier loci specifically in an attempt to infer potential selective mechanisms underlying them.
At its core, population genomics relies on two key factors. First, it requires genotyping of a large number of loci, whether through amplified fragment length polymorphism (AFLP)'s, microsatellites, single-nucleotide polymorphism (SNP)'s or sequences. The current explosion of molecular techniques and genomic tools available suggests that this is unlikely to be a rate-limiting step, even for non-model species. One key working assumption of population genomics approaches, particularly important for studies using anonymous markers, is that the loci are independent. Even with markers of known locations in the genome, as the number of markers increases, a degree of auto-correlation will be introduced, potentially resulting in misleading inferences (seeHahn, 2006). Second, the population genomics approach requires a reliable means to detect outlier loci that may indicate regions that have been under selection – either to remove these loci to study genome-wide effects, or to identify such loci as the focus of study (Figure 1a). Because local adaptation and directional selection should have locus-specific effects of reducing genetic variability within populations and increasing differentiation between populations, loci that are outliers for these characteristics are strong candidate regions for involvement in adaptation. Determining whether an individual locus behaves as an outlier can be statistically evaluated with a battery of approaches, among them: testing whether FST is significantly different from either zero or neutral expectations (Lewontin and Krakauer, 1973;Beaumont and Nichols, 1996;Vitalis et al., 2001;Beaumont and Balding, 2004); the lnRV and lnRH statistics (natural log of the ratio of the variance and heterozygosity of alleles between two populations; (Schlotterer, 2002)); and the Ewens–Watterson test (Ewens, 1972;Watterson, 1978;Vigouroux et al., 2002); seeStorz (2005) for a recent review of such tests. Importantly, the statistical significance of these estimates can be determined from the genome-wide empirical distributions of the test statistics (Akey et al., 2002), or by comparing observed statistics to a distribution generated by neutral coalescent simulations (Beaumont and Nichols, 1996;Beaumont and Balding, 2004), or neutral, non-equilibrium simulations using parameters estimated from the data (Thornton and Andolfatto, 2006).
Conceptual model for the integration of population genomics and quantitative genetics. In (a), using population genomics approaches, outlier loci can be identified either by identifying loci with FST values that exceed confidence limits or intervals based on neutral coalescent simulations (dashed line, left panel) or are in the tails of the empirical, genome-wide distributions (filled portions of distribution, right panel). Once these outlier loci (shown in red), which are some unknown distance from the causal mutations, have been identified statistically, the next step of identifying the causal gene and mutation(s) can be pursued using genetic mapping techniques common to quantitative genetics (b). These mapping approaches can entail genetic crosses, or identification of homologous regions/candidate loci in related model organisms, or both.
Applications: estimating genome-wide effects
One consequence of gathering anonymous genome-wide polymorphism data is the potential it offers for investigators to separate locus- and region-specific selective effects from genome-wide effects such as demography. Several recent studies provide new methods to distinguish the effects of demography and selection in shaping genome-wide levels of polymorphism. These studies also caution that for candidate genes or loci linked with so-called outlier loci (see below), the challenge of distinguishing between purely demographic factors and the combined effects of demography and selection will be difficult (Przeworski et al., 2005;Teshima et al., 2006).
Because many demographic factors can affect patterns of nucleotide polymorphism in a way similar to the effects of selection, methods that can differentiate the effects of these forces are necessary before inferences can be made about their relative importance. Indeed, several recent studies have detected genome-wide departures from predictions of equilibrium neutral models in standard tests of selection (seeFord, 2002 for a review of such tests), presumably because of the effects of population genetic structure and demography (Andolfatto and Przeworski, 2000;Nordborg et al., 2005;Schmid et al., 2005,2006). Although application in more traditional ecological settings is limited, three recent papers have used alternative approaches to distinguish between demographic and selective forces in shaping human polymorphism levels (Nielsen et al., 2005;Stajich and Hahn, 2005;Williamson et al., 2005), and some generalizations appear to be emerging. First, purely demographic factors can generate much of the observed variation in the amount and frequency of polymorphism in human populations. Based on this result, it seems likely that demography can have a large effect on genetic variability in many species that have similar ecological, demographic and genetic histories. Second, against this backdrop of demographic factors, it is still possible to detect loci that appear to have been under natural selection, either because patterns of variation at individual loci show a poor fit to a purely demographic model (Stajich and Hahn, 2005), or models incorporating selection provided a better fit to the data than demographic models parameterized with putatively neutral non-coding SNPs (Williamson et al., 2005), or because individual regions of the genome show allele frequency distributions that differ from global, genome-wide allele frequency distributions (Nielsen et al., 2005). Importantly, an ongoing challenge will be to distinguish whether patterns of variation at these loci truly show evidence of natural selection, or could as easily be explained by slightly more complicated (yet still realistic) demographic models.
In situations in which an ancestor-descendant relationship exists between different species or samples within a species (e.g., colonization of an island or novel habitat, domestication), it is possible to gain additional information by utilizing data from the ancestral population (Ometto et al., 2005;Wright et al., 2005;Yamasaki et al., 2005). In the case of maize and its wild ancestor, teosinte,Wright et al. (2005) used a simulation approach to partition selective and demographic effects on polymorphism levels at 774 genes. By running coalescent simulations conditioned on the simulated data fitting multiple summaries of teosinte data, the authors were able to control for the shared history of demography, mutation, and recombination of the maize and teosinte lineages before domestication. Within this context, the severity of the bottleneck that accompanied domestication was estimated for each locus to arrive at a multilocus (genome-wide) estimate of the bottleneck severity. By comparing these models to other models that allowed a fraction of loci to show evidence of a more severe bottleneck that is indicative of artificial selection,Wright et al. (2005) estimated that approximately 2–4% of genes in the maize genome were targets of artificial selection. Importantly, these candidate loci were then aligned with published linkage and quantitative trait locus (QTL) maps, showing a significant clustering between candidate loci and QTL for morphological differences between teosinte and maize.
Applications: detecting outlier loci
Many applications of the population genomics approach have concentrated on attempts to detect outlier loci, either by screening a large number of anonymous loci or by comparing test statistics between candidate genes and a random sample of unlinked loci. There have been numerous applications of both approaches utilizing data from humans (Payseur et al., 2002;Akey et al., 2004;Hahn et al., 2004;Rockman et al., 2004,2005;Storz et al., 2004;Voight et al., 2006),Drosophila (Harr et al., 2002;Glinka et al., 2003;Kauer et al., 2003;Orengo and Aguade, 2004;Schofl and Schlotterer, 2004;Pool et al., 2006),Mus musculus (Ihle et al., 2006) andArabidopsis thaliana (Cork and Purugganan, 2005). However, because in most of these cases neither the ecological context in which selection occurred nor the potential selective agent are known (but seeCork and Purugganan, 2005), here we focus on other recent applications.
Two clear cases in which the ‘ecological’ context and agent of selection are known are artificial selection/domestication and pesticide use. These cases provide a test for population genomics methods, at least in cases in which selection is strong and recent. To date, the population genomics approach has been used successfully to confirm loci that might have undergone a selective sweep in maize during domestication (Vigouroux et al., 2002), genes for coat color and shortened limbs in dog breed formation (Pollinger et al., 2005), chloroquine resistance in the malaria-inducing parasitePlasmodium falciparum (Wootton et al., 2002) and warfarin resistance in rats (Kohn et al., 2003). However, it is important to note that in all of these cases, strong artificial rather than natural selection is driving phenotypic divergence. In a ‘proof of concept’ paper,Anderson et al. (2005) compared FST for 10 non-synonymous mutations in four loci known to be involved in antimalarial drug resistance to FST for 10 synonymous mutations in housekeeping genes or genes of unknown function. They found that not only was FST higher for non-synonymous mutations in drug resistance loci than for synonymous mutations at other loci, but that it was higher than neutral coalescent simulations that had been based on their putatively neutral loci, confirming that in this case loci subject to natural selection indeed exhibit higher FST relative to neutral loci.
In more traditional ecological settings, the population genomics approach has been applied in several cases in which species show clinal variation or ecotypic differentiation. Although not at a genomic scale,Storz and Dubach (2004) showed a clear example of detecting outlier loci: the albumin (Alb) locus in the deer mousePeromyscus maniculatus showed significant altitudinal differentiation that exceeded neutral expectations based on 18 other allozyme markers, although the precise selective agent remains unclear. Studies that implicate an environmental gradient as the selective force producing differentiation are clearly strengthened by multiple tests (e.g., multiple altitudinal or latitudinal transects), and preferably using multiple statistical approaches (Campbell and Bernatchez, 2004;Storz et al., 2004;Vasemagi and Primmer, 2005). However, identifying truly independent tests may prove to be a challenge because before population divergence, individual loci will share mutational environment and coalescent histories, potentially introducing some degree of correlation between populations.
To date, four studies, using anonymous genome-wide markers, have used multiple comparisons to test for consistent or repeatable outlier loci, using a variety of species (Table 1). For example, the common frog (Rana temporaria) exhibits altitudinal clines in a host of life history traits in Europe.Bonin et al. (2006) showed that approximately 2% of the AFLP loci they screened also exhibited elevated altitudinal differentiation; to guard against false positives, the authors only considered true outlier loci to be those that showed elevated differentiation in multiple tests. Regions in linkage with these AFLP loci would be strong candidates to contain genes underlying life history traits in this species that have been subject to altitudinally varying selection. Results from these studies (Table 1) suggest that <5–10% of loci screened show significantly elevated FST between differentiated ecotypes or populations, although the small number of examples available means generalizations are tentative.
Limitations of population genomics
Despite the appeal of these methods, especially for non-model organisms, they suffer from three glaring weaknesses from the standpoint of ecological and evolutionary functional genomics when applied in isolation. First, and perhaps most importantly, in cases where anonymous genetic markers are used to scan the genome, it is extremely likely that any anonymous locus showing ‘outlier’ behavior is not the causal locus itself, but is either physically linked or in linkage disequilibrium (LD) with the selected site(s). The extent of LD between the marker locus and the functionally relevant mutation can vary dramatically across the genome and also study systems, and will be affected by population history, mating system, recombination rate, the age of the selected allele, the strength of selection and many other factors (Nordborg and Tavare, 2002), making it difficult to localize the functionally relevant mutation. Similarly, the size and position of the genomic regions that show differentiation will be unknown, at least for species without detailed linkage maps (see below). On their own, most population genomic studies in natural populations have been limited to detecting a few statistical outlier loci, often in regions of unknown position in the genome. Therefore, the next and most important step of moving from anonymous marker to functional gene/mutation is unclear.
Second, population genomic studies are usually carried out in the absence of any information about phenotype. Thus, although genetic loci that show significant differentiation may be indicative of the effects of natural selection and local adaptation, in many cases it is unclear which traits may differ between samples, and if any correspond to the differentiated loci. The absence of knowledge about the phenotype under selection limits both ecological investigation about the putative selective agents as well as any knowledge or future use of candidate genes (see below).
The third potential weakness of the approach is with the logical inference that loci showing patterns of high differentiation (or reduced variation) have been subject to selection, whereas loci that do not show these patterns have not. Existing evidence suggests that it is possible and even probable for some loci to show high levels of differentiation (or reduced within population variation) without having been targets of selection either owing to chance alone, or for instance, due to incorrect models of demographic history used in estimating parameters like FST (e.g., island versus stepping stone models; seeAkey et al., 2004) or ascertainment bias (Thornton and Jensen, in press). Similarly, it is also possible for loci to be under selection without yielding statistically significant results in tests for selection (Gallavotti et al., 2004;McVean et al., 2005;Przeworski et al., 2005;Teshima et al., 2006). Simulation studies byTeshima et al. (2006) suggest that a sizable proportion of loci under selection will be missed in empirical genome-wide scans, especially if the loci selected had previously been neutral. In addition, requiring loci to show outlier behavior in independent population comparisons or transects, while helpful in guarding against false positives, implicitly assumes that the same loci will be fixed in response to similar environmental conditions (Bonin et al., 2006). Existing evidence demonstrates that this may not be the case even when both phenotypes and selective environments are very similar (Hoekstra and Nachman, 2003;Hoekstra et al., 2006), suggesting that this criterion will lead investigators to miss some loci involved with adaptation.
New contributions from quantitative genetics
Unlike population genomics, quantitative genetics is not a novel approach, but is instead rooted in a long history (Galton, 1869,1889). More recently, molecular tools have reinvigorated quantitative genetics through LD and QTL mapping. Like population genomics approaches, both LD and QTL mapping require the survey of a large number of genome-wide molecular markers (Figure 2). Specifically, LD mapping relies on surveys of genetic polymorphism data from a collection of samples (inbred lines, accessions, individuals and populations) to test for statistical associations between these genetic markers and particular phenotypes, again based on the premise that the marker(s) is in LD with the causal locus, or less likely, is in fact the causal mutation itself (Box 1; seeMackay, 2001;Clark, 2003;Mitchell-Olds and Schmitt, 2006). By contrast, in a QTL mapping approach, statistical analyses of genome-wide molecular markers and phenotypes measured in progeny of controlled crosses are used to identify chromosomal regions contributing to phenotypic differentiation (reviewed inMackay, 2001;Erickson et al., 2004).
Schematic illustration of the relationships between population genomics, LD mapping and QTL mapping, emphasizing the different types of data required.
LD mapping and related methods (Box 1) offer the prospect of identifying genes for ecologically important traits. By utilizing naturally occurring variation sampled in wild populations that have accumulated hundreds to thousands of recombination events over time (compared to a few generations in laboratory crosses), LD mapping is expected to (1) necessitate more markers than traditional QTL studies to provide complete coverage of the genome, but (2) have substantially higher resolution for fine-scale mapping of genomic regions. This approach offers great potential, especially if candidate genes are available for association tests. However, one of the major hurdles facing LD mapping is the need to control for cryptic population structure or stratification, which can lead to false positives (seePritchard et al., 2000a,2000b;Cardon and Palmer, 2003;Marchini et al., 2004;Yu et al., 2005). The LD mapping approach has successfully been applied inDrosophila and maize (Long et al., 1998;Thornsberry et al., 2001;Palsson and Gibson, 2004), and is starting to be applied in ecological settings. For example,Stinchcombe et al. (2004,2005) showed that accessions ofArabidopsis thaliana with putatively functionalFRIGIDA alleles exhibited significant latitudinal clines for flowering time and vernalization sensitivity, as would be predicted based onFRIGIDA's role in the vernalization flowering time pathway (Simpson and Dean, 2002). In like fashion,Aranzana et al. (2005) showed that genome-wide association tests could successfully identify known flowering time and pathogen resistance genes inArabidopsis thaliana, despite appreciable population structure. At present, most success stories in non-model organisms are limited to associations between a phenotype (inherited in a simple Mendelian manner) and one or a few candidate genes. For example, allelic variation at the melanocortin-1 receptor (Mc1r) was perfectly associated with coat color phenotype (melanic versus wild-type dorsal pelage) within populations (Nachman et al., 2003) and with environmental variation (dark-colored lava versus light-colored granitic habitat) among populations (Hoekstra et al., 2004); similar statistical associations were not observed at neutral mtDNA markers.
Unlike LD mapping, QTL approaches require the breeding of a large number of progeny, but thereby skirt the complications associated with genetic structure in natural populations. The genetic architecture of one phenotype, bristle number inDrosophila, has perhaps been the most intensively studied in a QTL context (reviewed byMackay, 1995,1996), and after tireless work, genes underlying bristle variation have been identified (Lai et al., 1994;Long et al., 1995). Although the precise molecular mechanisms remain elusive and the ecological relevance of bristle number is unclear, the progress in identifying the genes underlying bristle number suggest that moving from QTL to gene can be daunting even in model systems. Moreover, available data from bothDrosophila melanogaster andArabidopsis thaliana suggests that considerable heterogeneity exists in the causal mutations for ecologically important traits, either because of different loci affecting traits in natural populations than in mapping crosses (McDonald and Long, 2004), or because of genotype × environment interactions lead to different loci being identified in field versus laboratory settings (Weinig et al., 2002).
For these reasons, most QTL studies have been limited to describing the genetic architecture of traits, with little progress in reaching the level of genes and mutations (Flint et al., 2005), especially in non-model systems. Nonetheless, a small but growing number of exceptions exist (e.g.,Johanson et al., 2000;El-Assal et al., 2001;Shapiro et al., 2004;Colosimo et al., 2005;Balasubramanian et al., 2006;Protas et al., 2006), suggesting that QTL mapping is a feasible method of identifying the genes for ecologically important traits. And, although time intensive, costly and challenging, QTL approaches arguably represent the most comprehensive way to identify genomic regions and ultimately genes contributing to adaptive variation, especially for multigenic traits (Price, 2006).
There are three major ways in which genetic mapping approaches can interface with population genomics approaches in natural populations. First, data from genetic mapping studies (such as QTL studies) can be applied to population genomics studies. By scoring genetic markers in controlled crosses or pedigrees, genetic linkage maps can be generated, allowing for the possibility of linking outlier loci detected using population genomics approaches to ‘real’ chromosomal positions in the genome – representing a first step in localizing the genes of interest (Figure 1b). Second, by providing a large number of anonymous markers for study, the data gathered for population genomics approaches can do ‘double duty’ and be used to test and control for population genetic structure in subsequent studies using an LD mapping approach. Finally, population genomics approaches can be used to fine-scale map within the large chromosomal regions identified by lab-based QTL studies.
Applications: linkage map development and QTL mapping
A prerequisite for QTL mapping is the development of a linkage map, which allows investigators to associate phenotypes with specific identifiable regions of genome. Although the development of a linkage map and QTL mapping are clearly distinct issues, and the development of linkage maps is no longer necessary in many model systems with complete genome sequences, generating linkage maps can remain a challenge in many novel systems. Recently much effort has been spent generating linkage maps in non-model species, using a variety of experimental approaches and a diversity of molecular markers, with great potential for identifying genes underlying ecologically relevant variation (Table 2). Species that can be maintained in captivity, bred in the lab, and have relatively large brood sizes are often ideal for generating linkage maps using traditional crosses (e.g., butterflies (Heliconius,Bicyclus), sticklebacks (Gasterous), deermice (Peromyscus), monkeyflowers (Mimulus) and columbines (Aqueligia)). In other cases, linkage maps can be generated by following large pedigrees in natural populations (e.g., red deer (Cervus elaphus), soay sheep (Ovis aries), great reed warblers (Acrocephalus arundinaceus)); such long-term studies are time intensive and are only applicable to species that can be easily followed over time.
It is clear that many systems of ecological interest are not easily manipulated in the laboratory (i.e., genetic crosses are not feasible or generation times are prohibitively long). In many cases, ecological systems can take advantage of either closely related genetic model systems with genetic linkage maps or even complete genome sequences (e.g.,Dawson et al., 2006;Windsor et al., 2006). For example, a recent study generated a predicted linkage map for passerine birds by taking advantage of the sequence similarity of available microsatellites and the draft chicken genome sequence (Dawson et al., 2006), and then evaluated the accuracy of the predicted linkage map by comparing it to a previously published map for the great reed warbler (Acrocephalus arundinaceus). Despite the fact that chickens and warblers are diverged by millions of years, 24 microsatellite markers were conserved between the linkage maps, and synteny was maintained across genomes, highlighting the utility of the chicken genome for generating genomic resources for other avian species. Similar levels of conserved linkage have been reported between model organisms and non-model relatives, includingDrosophila and the apple maggot fly (Rhagoletis;Roethele et al., 2001),Mus and deer mice (Peromyscus;Steiner et al., in review), and zebrafish and salamanders (Voss et al., 2001). The availability of linkage maps for non-model species can be extremely useful for two primary reasons: (1) evenly spaced markers representing even coverage of the genome can be chosen for use in population genomic scans of the genome, or (2) alternatively, once regions of interest are identified, homologous regions in a closely related species (either a model with a complete genome sequence or one more amenable to controlled crosses and breeding) can be used to either design additional markers for fine-scale mapping or to search for candidate loci.
The benefits of combining the population genomics approach with traditional linkage maps can be seen in two studies that focused on closely related plant species (maize and teosinte:Vigouroux et al., 2002; pedunculate and sessile oak:Scotti-Saintagne et al., 2004). BothVigouroux et al. (2002) andScotti-Saintagne et al. (2004) detected loci that behaved as outliers in comparisons of population samples between closely related species. Because linkage maps have been made from experimental crosses, it is possible to determine (1) the genomic position in which these outliers occur, and (2) in some cases, test if loci showing elevated differentiation are also the loci closest to QTL for traits that are differentiated between the species. In the maize example, two of the outlier loci were located near known QTL for ear structure and endosperm weight, two traits that differ dramatically between maize and teosinte and could have been past targets of artificial selection (Vigouroux et al., 2002). In fact, because even the largest QTL mapping populations are limited by the number of recombination events, population genomic approaches may be useful in this context to fine-scale map genes.
Utilizing knowledge of candidate genes
One appeal of both population genomics and quantitative genetic approaches is that anonymous markers can easily be generated in non-model species and then scored in a large number of individuals without anya priori knowledge of the genetic or developmental mechanisms responsible for ecological differentiation. However, the use of candidate genes, although not necessary, can certainly aid in moving from the identification of a genomic region (a QTL) to a single gene or even a nucleotide mutation (a QTN). The vast majority of successes in identifying genes responsible for adaptive phenotypic variation arguably have involved either candidate loci in the initial genomic scan or the identification of candidate loci within a genomic region of interest. For example, population genomic approaches need not be restricted to completely anonymous markers (e.g., AFLPs or microsatellites), and instead can include markers in candidate loci themselves or a subset of loci chosen to include possible candidate genes (e.g., markers based on expressed sequence tags developed in an appropriate tissue type or from microarray experiments). Similarly, association studies in natural populations can include candidate loci; for example,Olsen et al. (2004) used this approach to assess how allelic variation at the photoperiod receptor geneCRY2 contributes to variation in flower timing in 95 wild accessions ofArabidopsis.
Even in large genetic crosses, candidate genes have played a major role in the success stories of linking adaptive phenotypic variation to genes. For example, in three-spine sticklebacks (Gasterosteus aculeatus), a QTL approach identified a 10 Mb region containing a large effect region contributing to adaptive variation in pelvic morphology between oceanic and lake populations (Shapiro et al., 2004). A candidate gene, thePitx1 gene, was identified in this region based on its knockout phenotype in laboratory mice, which affects pelvic morphology. When interrogated in sticklebacks,Pitx1 expression differences were associated with a reduced pelvis in lake populations, although the precise molecular change is yet to be identified. Additional phenotypes, like pigmentation variation, have been well explored in vertebrate systems, in part because the wealth of genetic and developmental information on pigmentation provides an extensive list of well-characterized candidate loci (Hoekstra, 2006). First, mutations in the tyrosine-related protein 1 (Tyrp-1) gene have been mapped in a pedigreed population of Soay sheep (Ovis aries), and are associated with a naturally segregating light/dark coat color polymorphism (Gratten et al., 2007). Second, genetic crosses and exploration of candidate genes in Mexican tetra (Astyanax mexicanus) led to the discovery that multiple independent deletions in the ocular and cutaneous albinism-2 (Oca2) gene were responsible for parallel loss of pigmentation in cave-dwelling tetra populations (Protas et al., 2006). Finally, a QTL study of adaptive color pattern in beach mice (Peromyscus polionotus) identified several regions of major effect (Steiner et al., in review), one of which contained the candidate gene,Mc1r. A single amino-acid change in theMc1r coding region is associated with between 10 and 36% of the variation in several adaptive pigment traits and the functional effects of this amino-acid change was verified in pharmacological assays (Hoekstra et al., 2006).
Future directions: combining data from laboratory crosses and natural populations
Both population genomics and quantitative genetic approaches have limitations, especially in non-model systems, which often lack complete genome sequences. Although population genomic studies have been largely successful in generating large-scale genomic data for comparisons between populations, disentangling the effects of demography and sifting through false positives have been major challenges. Beyond the statistical challenges, the next step of moving from anonymous markers to known genetic regions and eventually to genes is perhaps even more daunting. Whereas QTL studies have successfully identified chromosomal regions contributing to phenotypic variation for those species which are amenable to genetic crossing experiments, narrowing these regions to genes, especially for traits with limited candidate loci requires enormous sample sizes and a plethora of genetic markers to detect rare recombination events (Flint et al., 2005;Slate, 2005). Because of the limitations of each respective method, combining these approaches has the potential to be extremely powerful for identifying genes responsible for ecologically relevant variation.
Here we provide a powerful example of how combining multiple approaches can yield more insight than a single method applied in isolation.Rogers and Bernatchez (2005) combined population genomics scans of the genome for outlier loci with QTL mapping to examine the genetic basis of growth rate differences between dwarf (limnetic) and normal (benthic) ecotypes of whitefish (Coregonus clupeaformis). By constructing a linkage map and performing QTL mapping using AFLP loci that had previously been used in population genomics scans (Campbell and Bernatchez, 2004), they were able to determine whether the loci closest to growth rate QTL were the same as loci showing elevated differentiation in genome-wide scans of natural populations. They found that eight loci closest to QTL for growth rate showed FST values outside the empirically determined 95% confidence limits estimated from 440 AFLP loci, suggesting that differentiation at these loci was due to selection on nearby growth rate loci. Moreover, because benthic and limnetic fish were sampled from four lakes, the authors were able to show that one AFLP locus corresponding to a growth rate QTL exhibited significantly higher levels of genetic differentiation between ecotypes than expected by neutrality in three of the four lakes, suggesting genetic parallelism in how growth rate differences have evolved in lakefish. By combining QTL mapping, population genomics and surveys of multiple populations, this study illustrates the potential utility of combining approaches to (1) link markers identified in population genomics scans to phenotype and (2) test for parallel evolution using comparative genomic scans. However, it is important to note that additional work in both natural and lab-based populations will be needed to narrow these genomic regions to genes and mutations.
Conclusions
Population genomics provides an alluring first glimpse into the genome of previously unexplored organisms. In isolation, this approach can provide estimates of the proportion of the genome that are inconsistent with simple patterns of neutrality and hints of the possibility of parallel evolution, but it is thus far limited in its ability to point us directly to genes underlying adaptive phenotypic variation. The recent explosion of genome-wide linkage maps in novel systems highlights the ease by which large-scale genomic markers can be generated, and represents a clear way in which population genomic data can be linked to genome/chromosomal position, bringing us one step closer to the adaptive alleles themselves. It is clear from recent studies that combining data from natural populations (e.g., population genomics approaches or LD mapping) with information from lab-based experiments (e.g., linkage maps and QTL) provides a powerful approach for identifying the genes responsible for adaptive phenotypes (e.g.,Colosimo et al., 2005).
Importantly, the identification of genes underlying ecologically relevant traits does not represent a scientific end point, but rather the beginning of a new set of questions! Are adaptations to similar environments due to the same genes or mutations either within or between species? Do adaptive alleles emerge from standing genetic variation or as new mutations? How does the strength of selection affect the genetic architecture of adaptive traits? How do demographic and stochastic factors affect the ability of organisms to adapt to changing environments? Although the tools for non-model systems will by definition lag behind model systems, the ecological and evolutionary questions that can be answered in a diversity of novel systems will often be unique. These questions and others can be more directly addressed once ecologically relevant genes are in hand for a diversity of systems and will together provide important insight into both the ecology and evolution of adaptation.
References
Akey JM, Eberle MA, Rieder MJ, Carlson CS, Shriver MD, Nickerson DAet al. (2004). Population history and natural selection shape patterns of genetic variation in 132 genes.PLoS Biol2: 1591–1599.
Akey JM, Zhang G, Zhang K, Jin L, Shriver MD (2002). Interrogating a high-density SNP map for signatures of natural selection.Genome Res12: 1805–1814.
Anderson TJC, Nair S, Sudimack D, Williams JT, Mayxay M, Newton PNet al. (2005). Geographical distribution of selected and putatively neutral SNPs in Southeast Asian malaria parasites.Mol Biol Evol22: 2362–2374.
Andolfatto P, Przeworski M (2000). A genome-wide departure from the standard neutral model in natural populations ofDrosophila.Genetics156: 257–268.
Aranzana MJ, Kim JAS, Zhao K, Bakker E, Horton M, Jakob Ket al. (2005). Genome-wide association mapping inArabidopsis identifies previously known flowering time and pathogen resistance genes.PLoS Genet1: 531–539.
Assuncao AGL, Pieper B, Vromans J, Lindhout P, Aarts MGM, Schat H (2006). Construction of a genetic linkage map ofThlaspi caerulescens and quantitative trait loci analysis of zinc accumulation.New Phytol170: 21–32.
Balasubramanian S, Sureshkumar S, Agrawal M, Michael TP, Wessinger C, Maloof JNet al. (2006). ThePHYTOCHROME C photoreceptor gene mediates natural variation in flowering and growth responses ofArabidopsis thaliana.Nat Genet38: 711–715.
Beaumont MA, Balding DJ (2004). Identifying adaptive genetic divergence among populations from genome scans.Mol Ecol13: 969–980.
Beaumont MA, Nichols RA (1996). Evaluating loci for use in the genetic analysis of population structure.Proc R Soc Lond B Biol Sci263: 1619–1626.
Beraldi D, McRae AF, Gratten J, Slate J, Visscher PM, Pemberton JM (2006). Development of a linkage map and mapping of phenotypic polymorphisms in a free-living population of Soay Sheep (Ovis aries).Genetics173: 1521–1537.
Bernatzky R, Tanksley SD (1986). Toward a saturated linkage map in tomato based on isozymes and random cDNA sequences.Genetics112: 887–898.
Black WC, Baer CF, Antolin MF, DuTeau NM (2001). Population genomics: genome-wide sampling of insect populations.Annu Rev Entomol46: 441–469.
Boivin K, Acarkan A, Mbulu RS, Clarenz O, Schmidt R (2004). TheArabidopsis genome sequence as a tool for genome analysis in Brassicaceae. A comparison of theArabidopsis andCapsella rubella genomes.Plant Physiol135: 735–744.
Bonin A, Taberlet P, Miaud C, Pompanon F (2006). Explorative genome scan to detect candidate loci for adaptation along a gradient of altitude in the common frog (Rana temporaria).Mol Biol Evol23: 773–783.
Bouck A, Peeler R, Arnold ML, Wessler SR (2005). Genetic mapping of species boundaries in Louisiana Irises using IRRE retrotransposon display markers.Genetics171: 1289–1303.
Bradshaw HD, Otto KG, Frewen BE, McKay JK, Schemske DW (1998). Quantitative trait loci affecting differences in floral morphology between two species of monkeyflower (Mimulus).Genetics149: 367–382.
Bratteler M, Lexer C, Widmer A (2006). A genetic linkage map ofSilene vulgaris based on AFLP markers.Genome49: 320–327.
Calboli FCF, Kennington WJ, Partridge L (2003). QTL mapping reveals a striking coincidence in the positions of genomic regions associated with adaptive variation in body size in parallel clines ofDrosophila melanogaster on different continents.Evolution57: 2653–2658.
Campbell D, Bernatchez L (2004). Generic scan using AFLP markers as a means to assess the role of directional selection in the divergence of sympatric whitefish ecotypes.Mol Biol Evol21: 945–956.
Cardon LR, Palmer LJ (2003). Population stratification and spurious allelic association.Lancet361: 598–604.
Cervera M-T, Storme V, Ivens B, Gusmao J, Liu BH, Hostyn Vet al. (2001). Dense genetic linkage maps of threePopulus species (Populus deltoides, P. nigra andP. trichocarpa) based on AFLP and microsatellite markers.Genetics158: 787–809.
Chu J, Howard DJ (1998). Genetic linkage maps of the ground cricketsAllonemobius fasciatus andAllonemobius socius using RAPD and allozyme markers.Genome41: 841–847.
Clark AG (2003). Finding genes underlying risk of complex disease by linkage disequilibrium mapping.Curr Opin Genet Dev13: 296–302.
Colosimo PF, Hosemann KE, Balabhadra S, Villarreal Jr G, Dickson M, Grimwood Jet al. (2005). Widespread parallel evolution in sticklebacks by repeated fixation of ectodysplasin alleles.Science307: 1928–1933.
Colosimo PF, Peichel CL, Nereng K, Blackman BK, Shapiro MD, Schluter Det al. (2004). The genetic architecture of parallel armor plate reduction in threespine sticklebacks.PLoS Biol2: 635–641.
Cork JM, Purugganan MD (2005). High-diversity genes in theArabidopsis genome.Genetics170: 1897–1911.
Dawson DA, Burke T, Hansson B, Pandhal J, Hale MC, Hinten GNet al. (2006). A predicted microsatellite map of the passerine genome based on chicken-passerine sequence similarity.Mol Ecol15: 1299–1320.
Dopman EB, Bogdanowicz SM, Harrison RG (2004). Genetic mapping of sexual isolation between E and Z pheromone strains of the european corn borer (Ostrinia nubilalis).Genetics167: 301–309.
El-Assal SED, Alonso-Blanco C, Peeters AJM, Raz V, Koornneef M (2001). A QTL for flowering time inArabidopsis reveals a novel allele ofCRY2.Nat Genet29: 435–440.
Erickson DL, Fenster CB, Stenoien HK, Price D (2004). Quantitative trait locus analyses and the study of evolutionary process.Mol Ecol13: 2505–2522.
Ewens WJ (1972). The sampling theory of selectively neutral alleles.Theoret Popul Biol3: 87–112.
Feder ME, Mitchell-Olds T (2003). Evolutionary and ecological functional genomics.Nat Rev Genet4: 649–655.
Fishman L, Kelly AJ, Morgan E, Willis JH (2001). A genetic map in theMimulus guttatus species complex reveals transmission ratio distortion due to heterospecific interactions.Genetics159: 1701–1716.
Flint J, Valdar W, Shifman S, Mott R (2005). Strategies for mapping and cloning quantitative trait genes in rodents.Nat Rev Genet6: 271–286.
Ford MJ (2002). Applications of selective neutrality tests to molecular ecology.Mol Ecol11: 1245–1262.
Gallavotti A, Zhao Q, Kyozuka J, Meeley RB, Ritter M, Doebley JFet al. (2004). The role ofbarren stalk1 in the architecture of maize.Nature432: 630–635.
Galton F (1869).Hereditary Genius. Reprinted 1962, Meridian Books: NY.
Galton F (1889).Natural Inheritance. Macmillan: London.
Gharbi K, Gautier A, Danzmann RG, Gharbi S, Sakamoto T, Hoyheim Bet al. (2006). A linkage map for brown trout (Salmo trutta): chromosome homologies and comparative genome organization with other salmonid fish.Genetics172: 2405–2419.
Glinka S, Ometto L, Mousset S, Stephan W, De Lorenzo D (2003). Demography and natural selection have shaped genetic variation inDrosophila melanogaster: a multi-locus approach.Genetics165: 1269–1278.
Goodwillie C, Ritland C, Ritland K (2006). The genetic basis of floral traits associated with mating system evolution inLeptosiphon (Polemoniaceae): an analysis of quantitative trait loci.Evolution60: 491–504.
Gratten J, Beraldi D, Lowder BV, McRae AF, Visscher PM, Pemberton JMet al. (2007). Compelling evidence that a single nucleotide polymorphism inTYRP1 is responsible for coat colour polymorphism in a free-living population of Soay sheep.Proc R Soc Lond B274: 619–626.
Hahn MW (2006). Accurate inference and estimation in population genomics.Mol Biol Evol23: 911–918.
Hahn MW, Rockman MV, Soranzo N, Goldstein DB, Wray GA (2004). Population genetic and phylogenetic evidence for positive selection on regulatory mutations at the factor VII locus in humans.Genetics167: 867–877.
Hansson B, Akesson M, Slate J, Pemberton JM (2005). Linkage mapping reveals sex-dimorphic map distances in a passerine bird.Proc R Soc Lond B Biol Sci272: 2289–2298.
Harr B, Kauer M, Schlotterer C (2002). Hitchhiking mapping: a population-based fine-mapping strategy for adaptive mutations inDrosophila melanogaster.Proc Natl Acad Sci USA99: 12949–12954.
Hawthorne DJ (2001). AFLP-based genetic linkage map of the Colorado potato beetleLeptinotarsa decemlineata: sex chromosomes and a pyrethroid-resistance candidate gene.Genetics158: 695–700.
Hawthorne DJ, Via S (2001). Genetic linkage of ecological specialization and reproductive isolation in pea aphids.Nature412: 904–907.
Hirschhorn JN, Daly MJ (2005). Genome-wide association studies for common diseases and complex traits.Nat Rev Genet6: 95–108.
Hodges SA, Whittall JB, Fulton M, Yang JY (2002). Genetics of floral traits influencing reproductive isolation betweenAquilegia formosa andAquilegia pubescens.Am Nat159: S51–S60.
Hoekstra HE (2006). Genetics, development, and evolution of adaptive pigmentation in vertebrates.Heredity97: 222–234.
Hoekstra HE, Hirschmann RJ, Bundey RA, Insel PA, Crossland JP (2006). A single amino acid mutation contributes to adaptive beach mouse color pattern.Science313: 101–104.
Hoekstra HE, Drumm KE, Nachman MW (2004). Ecological genetics of adaptive color polymorphism in pocket mice: geographic variation in selected and neutral genes.Evolution58: 1329–1341.
Hoekstra HE, Nachman MW (2003). Different genes underlie adaptive melanism in different populations of rock pocket mice.Mol Ecol12: 1185–1194.
Hubert S, Hedgecock D (2004). Linkage maps of microsatellite DNA markers for the pacific oysterCrassostrea gigas.Genetics168: 351–362.
Ihle S, Ravaoarimanana I, Tautz D (2006). An analysis of signatures of selective sweeps in natural populations of the house mouse.Mol Biol Evol23: 790–794.
Jiggins CD, Mavarez J, Beltran M, McMillan WO, Johnston JS, Bermingham E (2005). A genetic linkage map of the mimetic butterflyHeliconius melpomene.Genetics171: 557–570.
Johanson U, West J, Lister C, Michaels S, Amasino R, Dean C (2000). Molecular analysis ofFRIGIDA, a major determinant of natural variation inArabidopsis flowering time.Science290: 344–347.
Kauer MO, Dieringer D, Schlotterer C (2003). A microsatellite variability screen for positive selection associated with the ‘Out of Africa’ habitat expansion ofDrosophila melanogaster.Genetics165: 1137–1148.
Kocher TD, Lee W-J, Sobolewska H, Penman D, McAndrew B (1998). A genetic linkage map of a Cichlid Fish, the Tilapia (Oreochromis niloticus).Genetics148: 1225–1232.
Kohn MH, Pelz HJ, Wayne RK (2003). Locus-specific genetic differentiation atRw among warfarin-resistant rat (Rattus norvegicus) populations.Genetics164: 1055–1070.
Kuittinen H, de Haan AA, Vogl C, Oikarinen S, Leppala J, Koch Met al. (2004). Comparing the linkage maps of the close relativesArabidopsis lyrata andAhaliana.Genetics168: 1575–1584.
Lai CG, Lyman RF, Long AD, Langley CH, Mackay TFC (1994). Naturally-occurring variation in bristle number and DNA polymorphisms at thescabrous locus ofDrosophila melanogaster.Science266: 1697–1702.
Lee B-Y, Lee W-J, Streelman JT, Carleton KL, Howe AE, Hulata Get al. (2005). A second-generation genetic linkage map of Tilapia (Oreochromis spp).Genetics170: 237–244.
Lewontin RC, Krakauer J (1973). Distribution of gene frequency as a test of theory of selective neutrality of polymorphisms.Genetics74: 175–195.
Lin JZ, Ritland K (1996). Construction of a genetic linkage map in the wild plantMimulus using RAPD and isozyme markers.Genome39: 63–70.
Linde M, Diel S, Neuffer B (2001). Flowering ecotypes ofCapsella bursa-pastoris (L.). Medik. (Brassicaceae) analysed by a cosegregation of phenotypic characters (QTL) and molecular markers.Ann Bot87: 91–99.
Long AD, Langley CH (1999). The power of association studies to detect the contribution of candidate genetic loci to variation in complex traits.Genome Res9: 720–731.
Long AD, Lyman RF, Langley CH, Mackay TFC (1998). Two sites in the Delta gene region contribute to naturally occurring variation in bristle number inDrosophila melanogaster.Genetics149: 999–1017.
Long AD, Mullaney SL, Reid LA, Fry JD, Langley CH, Mackay TFC (1995). High-resolution mapping of genetic-factors affecting abdominal bristle number inDrosophila melanogaster.Genetics139: 1273–1291.
Luikart G, England PR, Tallmon D, Jordan S, Taberlet P (2003). The power and promise of population genomics: from genotyping to genome typing.Nat Rev Genet4: 981–994.
Mackay TFC (1995). The genetic-basis of quantitative variation – numbers of sensory bristles ofDrosophila melanogaster as a model system.Trends Genet11: 464–470.
Mackay TFC (1996). The nature of quantitative genetic variation revisited: lessons fromDrosophila bristles.Bioessays18: 113–121.
Mackay TFC (2001). Quantitative trait loci inDrosophila.Nat Rev Genet2: 11–20.
Marchini J, Cardon LR, Phillips MS, Donnelly P (2004). The effects of human population structure on large genetic association studies.Nat Genet36: 512–517.
Mauricio R, Stahl EA, Korves T, Tian D, Kreitman M, Bergelson J (2003). Natural selection for polymorphism in the disease resistance geneRps2 ofArabidopsis thaliana.Genetics163: 735–746.
Maynard Smith J, Haigh J (1974). The hitchhiking effect of a favorable gene.Genet Res23: 23–35.
McDonald SJ, Long AD (2004). A potential regulatory polymorphism upstream ofhairy is not associated with bristle number variation in wild-caughtDrosophila.Genetics167: 2127–2131.
McVean G, Spencer CCA, Chaix R (2005). Perspectives on human genetic variation from the HapMap Project.PLoS Genet1: 413–418.
Mitchell-Olds T, Schmitt J (2006). Genetic mechanisms and evolutionary significance of natural variation inArabidopsis.Nature441: 947–952.
Moen T, Hoyheim B, Munck H, Gomez-Raya L (2004). A linkage map of Atlantic salmon (Salmo salar) reveals an uncommonly large difference in recombination rate between the sexes.Anim Genet35: 81–92.
Nachman MW, Hoekstra HE, D'Agostino SL (2003). The genetic basis of adaptive melanism in pocket mice.Proc Natl Acad Sci USA100: 5268–5273.
Nielsen R, Williamson S, Kim Y, Hubisz MJ, Clark AG, Bustamante C (2005). Genomic scans for selective sweeps using SNP data.Genome Res15: 1566–1575.
Nordborg M, Hu TT, Ishino Y, Jhaveri J, Toomajian C, Zheng Het al. (2005). The pattern of polymorphism inArabidopsis thaliana.PLoS Biol3: 1289–1299.
Nordborg M, Tavare S (2002). Linkage disequilibrium: what history has to tell us.Trends Genet18: 83–90.
Nürnberger B, Hofman S, Forg-Brey B, Praetzel G, Maclean A, Szymura JMet al. (2003). A linkage map for the hybridising toadsBombina bombina andB. variegata (Anura: Discoglossidae).Heredity91: 136–142.
Olsen KM, Halldorsdottir SS, Stinchcombe JR, Weinig C, Schmitt J, Purugganan MD (2004). Linkage disequilibrium mapping ofArabidopsisCRY2 flowering time alleles.Genetics167: 1361–1369.
Ometto L, Glinka S, De Lorenzo D, Stephan W (2005). Inferring the effects of demography and selection onDrosophila melanogaster populations from a chromosome-wide scan of DNA variation.Mol Biol Evol22: 2119–2130.
Orengo DJ, Aguade M (2004). Detecting the footprint of positive selection in a European population ofDrosophila melanogaster: multilocus pattem of variation and distance to coding regions.Genetics167: 1759–1766.
Orr HA (1998). The population genetics of adaptation: the distribution of factors fixed during adaptive evolution.Evolution52: 935–949.
Orr HA, Coyne JA (1992). The genetics of adaptation: a reassessment.Am Nat140: 725–742.
Palsson A, Gibson G (2004). Association between nucleotide variation inEgfr and wing shape inDrosophila melanogaster.Genetics167: 1187–1198.
Parsons YM, Shaw KL (2002). Mapping unexplored genomes: a genetic linkage map of the Hawaiian cricketLaupala.Genetics162: 1275–1282.
Payseur BA, Cutter AD, Nachman MW (2002). Searching for evidence of positive selection in the human genome using patterns of microsatellite variability.Mol Biol Evol19: 1143–1153.
Peichel CL, Nereng KS, Ohgi KA, Cole BLE, Colosimo PF, Buerkle CAet al. (2001). The genetic architecture of divergence between threespine stickleback species.Nature414: 901–905.
Pekkinen M, Varvio S, Kulju KKM, Kärkkäinen H, Smolander S, Viherä-Aarnio Aet al. (2005). Linkage map of birch,Betula pendula Roth, based on microsatellites and amplified fragment length polymorphisms.Genome48: 619–625.
Pollinger JP, Bustamante CD, Fledel-Alon A, Schmutz S, Gray MM, Wayne RK (2005). Selective sweep mapping of genes with large phenotypic effects.Genome Res15: 1809–1819.
Pool JE, DuMont VB, Mueller CJL, Aquadro F (2006). A scan of molecular variation leads to the narrow localization of a selective sweep affecting both afrotropical and cosmopolitan populations ofDrosophila melanogaster.Genetics172: 1093–1105.
Price AH (2006). Believe it or not, QTLs are accurate!Trends Plant Sci11: 213–216.
Pritchard JK, Stephens M, Donnelly P (2000a). Inference of population structure using multilocus genotype data.Genetics155: 945–959.
Pritchard JK, Stephens M, Rosenberg NA, Donnelly P (2000b). Association mapping in structured populations.Am J Hum Genet67: 170–181.
Protas ME, Hersey C, Kochanek D, Zhou Y, Wilkens H, Jeffery WRet al. (2006). Genetic analysis of cavefish reveals molecular convergence in the evolution of albinism.Nat Genet38: 107–111.
Przeworski M, Coop G, Wall JD (2005). The signature of positive selection on standing genetic variation.Evolution59: 2312–2323.
Reed KM, Chaves LD, Hall MK, Knutson TP, Harry DE (2005). A comparative genetic map of the turkey genome.Cytogenet Genome Res111: 118–127.
Reich D, Patterson N, Jager PLD, McDonald GJ, Waliszewska A, Tandon Aet al. (2005). A whole-genome admixture scan finds a candidate locus for multiple sclerosis susceptibility.Nat Genet37: 1113–1118.
Rieseberg LH, Raymond O, Rosenthal DM, Lai Z, Livingstone K, Nakazato Tet al. (2003). Major ecological transitions in wild sunflowers facilitated by hybridization.Science301: 1211–1216.
Rockman MV, Hahn MW, Soranzo N, Loisel DA, Goldstein DB, Wray GA (2004). Positive selection on MMP3 regulation has shaped heart disease risk.Curr Biol14: 1531–1539.
Rockman MV, Hahn MW, Soranzo N, Zimprich F, Goldstein DB, Wray GA (2005). Ancient and recent positive selection transformed opioid cis-regulation in humans.PLoS Biol3: 2208–2219.
Roethele JB, Feder JL, Berlocher SH, Kreitman ME, Lashkari DA (1997). Toward a molecular genetic linkage map for the apple maggot fly (Diptera:Tephritidae): comparison of alternative strategies.Ann Entomol Soc Am90: 470–479.
Roethele JB, Romero-Severson J, Feder JL (2001). Evidence for broad-scale conservation of linkage map relationships betweenRhagoletis pomonella (Diptera:Tephritidae) andDrosophila melanogaster (Diptera:Drosophilidae).Ann Entomol Soc Am94: 936–947.
Rogers J, Mahaney MC, Witte SM, Nair S, Newman D, Wedel Set al. (2000). A genetic linkage map of the Baboon (Papio hamadryas) genome based on human microsatellite polymorphisms.Genomics67: 237–247.
Rogers SM, Bernatchez L (2005). Integrating QTL mapping and genome scans towards the characterization of candidate loci under parallel selection in the lake whitefish (Coregonus clupeaformis).Mol Ecol14: 351–361.
Samollow PB, Kammerer CM, Mahaney SM, Schneider JL, Westenberger SJ, VandeBerg JLet al. (2004). First-generation linkage map of the gray, short-tailed opossum,Monodelphis domestica, reveals genome-wide reduction in female recombination rates.Genetics166: 307–329.
Schlotterer C (2002). A microsatellite-based multilocus screen for the identification of local selective sweeps.Genetics160: 753–763.
Schlotterer C (2003). Hitchhiking mapping – functional genomics from the population genetics perspective.Trends Genet19: 32–38.
Schmid K, Törjék O, Meyer R, Schmuths H, Hoffmann M, Altmann T (2006). Evidence for a large-scale population structure ofArabidopsis thaliana from genome-wide single nucleotide polymorphism markers.Theor Appl Genet112: 1104–1114.
Schmid KJ, Ramos-Onsins S, Ringys-Beckstein H, Weisshaar B, Mitchell-Olds T (2005). A multilocus sequence survey inArabidopsis thaliana reveals a genome-wide departure from a neutral model of DNA sequence polymorphism.Genetics169: 1601–1615.
Schofl G, Schlotterer C (2004). Patterns of microsatellite variability among X chromosomes and autosomes indicate a high frequency of beneficial mutations in non-AfricanD. simulans.Mol Biol Evol21: 1384–1390.
Scotti-Saintagne C, Mariette S, Porth I, Goicoechea PG, Barreneche T, Bodenes Cet al. (2004). Genome scanning for interspecific differentiation between two closely related Oak species [Quercus robur L. andQ.petraea (Matt.) Liebl.].Genetics168: 1615–1626.
Shapiro MD, Marks ME, Peichel CL, Blackman BK, Nereng BJ, Schluter Det al. (2004). Genetic and developmental basis of evolutionary pelvic reduction in threespine sticklebacks.Nature428: 717–723.
Simpson GG, Dean C (2002).Arabidopsis, the rosetta stone of flowering time?Science296: 285–289.
Slate J (2005). Quantitative trait locus mapping in natural populations: progress, caveats and future directions.Mol Ecol14: 363–379.
Slate J, Visscher PM, MacGregor S, Stevens D, Tate ML, Pemberton JM (2002). A genome scan for quantitative trait loci in a wild population of Red Deer (Cervus elaphus).Genetics162: 1863–1873.
Smith JJ, Kump DK, Walker JA, Parichy DM, Voss SR (2005). A comprehensive expressed sequence tag linkage map for Tiger Salamander and Mexican Axolotl: enabling gene mapping and comparative genomics inAmbystoma.Genetics171: 1161–1171.
Smith MW, O'Brien SJ (2005). Mapping by admixture linkage disequilibrium: advances, limitations, and guidelines.Nat Rev Genet6: 623–632.
Srinivasan J, Sinz W, Lanz C, Brand A, Nandakumar R, Raddatz Get al. (2002). A bacterial artificial chromosome-based genetic linkage map of the nematodePristionchus pacificus.Genetics162: 129–134.
Stahl EA, Dwyer G, Mauricio R, Kreitman M, Bergelson J (1999). Dynamics of disease resistance polymorphism at theRPM1 locus ofArabidopsis.Nature400: 667–671.
Stajich JE, Hahn MW (2005). Disentangling the effects of demography and selection in human history.Mol Biol Evol22: 63–73.
Staten R, Schully SD, Noor MAF (2004). A microsatellite linkage map ofDrosophila mojavensis.BMC Genet5: 12–12.
Steiner CC, Weber JN, Hoekstra HE . Two interacting pigmentation genes underlie adaptive variation in beach mice. (in review).
Stinchcombe JR, Caicedo AL, Hopkins R, Mays C, Boyd EW, Purugganan MDet al. (2005). Vernalization sensitivity inArabidopsis thaliana (Brassicaceae): the effects of latitude andFLC variation.Am J Bot92: 1701–1707.
Stinchcombe JR, Weinig C, Ungerer M, Olsen KM, Mays C, Halldorsdottir SSet al. (2004). A latitudinal cline in flowering time inArabidopsis thaliana modulated by the flowering time geneFRIGIDA.Proc Natl Acad Sci USA101: 4712–4717.
Storz JF (2005). Using genome scans of DNA polymorphism to infer adaptive population divergence.Mol Ecol14: 671–688.
Storz JF, Dubach JM (2004). Natural selection drives altitudinal divergence at the albumin locus in deer mice,Peromyscus maniculatus.Evolution58: 1342–1352.
Storz JF, Payseur BA, Nachman MW (2004). Genome scans of DNA variability in humans reveal evidence for selective sweeps outside of Africa.Mol Biol Evol21: 1800–1811.
Teshima KM, Coop G, Przeworski M (2006). How reliable are empirical genomic scans for selective sweeps?Genome Res16: 702–712.
Thornsberry JM, Goodman MM, Doebley J, Kresovich S, Nielsen D, Buckler ES (2001). Dwarf8 polymorphisms associate with variation in flowering time.Nat Genet28: 286–289.
Thornton K, Andolfatto P (2006). Approximate Bayesian inference reveals evidence for a recent, severe bottleneck in a Netherlands population ofDrosophila melanogaster.Genetics172: 1607–1619.
Tian DC, Araki H, Stahl E, Bergelson J, Kreitman M (2002). Signature of balancing selection inArabidopsis.Proc Natl Acad Sci USA99: 11525–11530.
Tobler A, Kapan D, Flanagan NS, Gonzalez C, Peterson E, Jiggins CDet al. (2005). First-generation linkage map of the warningly colored butterflyHeliconius erato.Heredity94: 408–417.
Vasemagi A, Nilsson J, Primmer CR (2005). Expressed sequence tag-linked microsatellites as a source of gene-associated polymorphisms for detecting signatures of divergent selection in Atlantic salmon (Salmo salar L.).Mol Biol Evol22: 1067–1076.
Vasemagi A, Primmer CR (2005). Challenges for identifying functionally important genetic variation: the promise of combining complementary research strategies.Mol Ecol14: 3623–3642.
Vigouroux Y, McMullen M, Hittinger CT, Houchins K, Schulz L, Kresovich Set al. (2002). Identifying genes of agronomic importance in maize by screening microsatellites for evidence of selection during domestication.Proc Natl Acad Sci USA99: 9650–9655.
Vitalis R, Dawson K, Boursot P (2001). Interpretation of variation across marker loci as evidence of selection.Genetics158: 1811–1823.
Voight BF, Kudaravalli S, Wen X, Pritchard JK (2006). A map of recent positive selection in the human genome.PLoS Biol4: 446–458.
Voss SR, Shaffer HB (1997). Adaptive evolution via a major gene effect: paedomorphosis in the Mexican axolotl.Proc Natl Acad Sci USA94: 14185–14189.
Voss SR, Smith JJ, Gardiner DM, Parichy DM (2001). Conserved vertebrate chromosome segments in the large salamander genome.Genetics158: 735–746.
Wang BQ, Porter AH (2004). An AFLP-based interspecific linkage map of sympatric, hybridizingColias butterflies.Genetics168: 215–225.
Watterson GA (1978). The homozygosity test of neutrality.Genetics88: 405–417.
Weinig C, Ungerer MC, Dorn LA, Halldorsdottir SS, Toyonaga Y, Mackay TFCet al. (2002). Novel loci control variation in reproductive timing inArabidopsis thaliana in natural environments.Genetics162: 1875–1884.
Wilding CS, Butlin RK, Grahame J (2001). Differential gene exchange between parapatric morphs ofLittorina saxatilis detected using AFLP markers.J Evol Biol14: 611–619.
Williamson SH, Hernandez R, Fledel-Alon A, Zhu L, Nielsen R, Bustamante CD (2005). Simultaneous inference of selection and population growth from patterns of variation in the human genome.Proc Natl Acad Sci USA102: 7882–7887.
Wilson LM, Whitt SR, Ibanez AM, Rocheford TR, Goodman MM, Buckler IV ES (2004). Dissection of maize kernel composition and starch production by candidate gene association.Plant Cell16: 2719–2733.
Windsor AJ, Schranz ME, Formanova N, Gebauer-Jung S, Bishop JG, Schnabelrauch Det al. (2006). Partial shotgun sequencing of theBoechera stricta genome reveals extensive microsynteny and promoter conservation withArabidopsis.Plant Physiol140: 1169–1182.
Wootton JC, Feng XR, Ferdig MT, Cooper RA, Mu JB, Baruch DIet al. (2002). Genetic diversity and chloroquine selective sweeps inPlasmodium falciparum.Nature418: 320–323.
Wright SI, Bi IV, Schroeder SG, Yamasaki M, Doebley JF, McMullen MDet al. (2005). The effects of artificial selection on the maize genome.Science308: 1310–1314.
Yamasaki M, Tenaillon MI, Bi IV, Schroeder SG, Sanchez-Villeda H, Doebley JFet al. (2005). A large-scale screen for artificial selection in maize identifies candidate agronomic loci for domestication and crop improvement.Plant Cell17: 2859–2872.
Yu J, Pressoir G, Briggs WH, Vroh Bi I, Yamasaki M, Doebley JFet al. (2005). A unified mixed-model method for association mapping that accounts for multiple levels of relatedness.Nat Genet38: 203–208.
Zenger KR, McKenzie LM, Cooper DW (2002). The first comprehensive genetic linkage map of a marsupial: the Tammar Wallaby (Macropus eugenii).Genetics162: 321–330.
Zheng LB, Benedict MO, Cornel AJ, Collins FH, Kafatos FC (1996). An integrated genetic map of the African human malaria vector mosquito,Anopheles gambiae.Genetics143: 941–952.
Zhong D, Pai A, Yan G (2004). AFLP-based genetic linkage map for the red flour beetle (Tribolium castaneum).J Hered95: 53–61.
Acknowledgements
We thank Molly Przeworski, Bret Payseur, Jeff Jensen, Stephen Wright, Patrick Nosil, Matt Hahn and two anonymous reviewers for helpful discussion and comments on this manuscript. The Stinchcombe Lab is supported by funds from the NSERC Canada and the University of Toronto Connaught Fund. Research in the Hoekstra Lab is funded by grants from the National Science Foundation, the National Institutes of Health and the Arnold and Mabel Beckman Foundation.
Author information
Authors and Affiliations
Department of Ecology and Evolutionary Biology, Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario, Canada
J R Stinchcombe
Department of Organismic and Evolutionary Biology and the Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
H E Hoekstra
- J R Stinchcombe
Search author on:PubMed Google Scholar
- H E Hoekstra
Search author on:PubMed Google Scholar
Corresponding author
Correspondence toJ R Stinchcombe.
Rights and permissions
About this article
Cite this article
Stinchcombe, J., Hoekstra, H. Combining population genomics and quantitative genetics: finding the genes underlying ecologically important traits.Heredity100, 158–170 (2008). https://doi.org/10.1038/sj.hdy.6800937
Received:
Revised:
Accepted:
Published:
Issue date:
Share this article
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
Keywords
This article is cited by
Comparison of genotyping by sequencing procedures to determine population genetic structure
- Dilini K. Abeyrama
- Brian Boyle
- Theresa M. Burg
Functional & Integrative Genomics (2023)
Combining QTL mapping and gene co-expression network analysis for prediction of candidate genes and molecular network related to yield in wheat
- Jun Wei
- Yu Fang
- Yong-xiu Liu
BMC Plant Biology (2022)
Genomic architecture of phenotypic extremes in a wild cervid
- S. J. Anderson
- S. D. Côté
- A. B. A. Shafer
BMC Genomics (2022)
Dissecting the loci underlying maturation timing in Atlantic salmon using haplotype and multi-SNP based association methods
- Marion Sinclair-Waters
- Torfinn Nome
- Nicola J. Barson
Heredity (2022)
Göte Turesson’s research legacy to Hereditas: from the ecotype concept in plants to the analysis of landraces’ diversity in crops
- Rodomiro Ortiz
Hereditas (2020)




