Movatterモバイル変換


[0]ホーム

URL:


Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Nature
Download PDF

The selection landscape and genetic legacy of ancient Eurasians

Naturevolume 625pages312–320 (2024)Cite this article

Subjects

Abstract

The Holocene (beginning around 12,000 years ago) encompassed some of the most significant changes in human evolution, with far-reaching consequences for the dietary, physical and mental health of present-day populations. Using a dataset of more than 1,600 imputed ancient genomes1, we modelled the selection landscape during the transition from hunting and gathering, to farming and pastoralism across West Eurasia. We identify key selection signals related to metabolism, including that selection at the FADS cluster began earlier than previously reported and that selection near theLCT locus predates the emergence of the lactase persistence allele by thousands of years. We also find strong selection in the HLA region, possibly due to increased exposure to pathogens during the Bronze Age. Using ancient individuals to infer local ancestry tracts in over 400,000 samples from the UK Biobank, we identify widespread differences in the distribution of Mesolithic, Neolithic and Bronze Age ancestries across Eurasia. By calculating ancestry-specific polygenic risk scores, we show that height differences between Northern and Southern Europe are associated with differential Steppe ancestry, rather than selection, and that risk alleles for mood-related phenotypes are enriched for Neolithic farmer ancestry, whereas risk alleles for diabetes and Alzheimer’s disease are enriched for Western hunter-gatherer ancestry. Our results indicate that ancient selection and migration were large contributors to the distribution of phenotypic diversity in present-day Europeans.

Similar content being viewed by others

Main

One of the central goals of human evolutionary genetics is to understand how natural selection has shaped the genomes of present-day people in response to changes in culture and environment. The transition from hunter-gatherers to farmers, and subsequently pastoralists, during the Holocene in Eurasia, involved some of the most dramatic changes in diet, health and social organization experienced during recent human evolution. These changes represent big shifts in environmental exposure, impacting the evolutionary forces acting on the human gene pool and imposing a series of heterogeneous selection pressures. As human lifestyles changed, close contact with domestic animals and higher population densities are likely to have increased exposure to infectious diseases, introducing new challenges to our immune system2,3.

Our understanding of the genetic architecture of complex traits in humans has been substantially advanced by genome-wide association studies (GWAS), which have identified many genetic variants associated with phenotypes of interest4,5. However, the extent to which these variants have been under directional selection during recent human evolution remains unclear. Although signatures of selection can be identified from patterns of genetic diversity in extant populations6, this can be challenging in humans as they have been exposed to highly diverse and dynamic local environments through time and space. In the complex mosaic of genetic affinities that constitute a present-day human genome, any putative signatures of selection may misrepresent the timing and magnitude of the selective process. For example, episodes of admixture between ancestral populations can result in present-day haplotypes that contain no evidence of selective processes occurring further back in time. Ancient DNA (aDNA) provides the potential to resolve these issues, by directly observing changes in trait-associated allele frequencies over time.

Whilst numerous previous studies have used aDNA to infer patterns of selection in Eurasia during the Holocene (for example, refs.7,8,9), many key questions remain unanswered. To what extent are present-day genetic differences due to natural selection or to differential patterns of admixture? What are the genetic legacies of Mesolithic, Neolithic and Bronze Age populations in present-day complex traits? How has the complex admixture history of Holocene Eurasia affected our ability to detect natural selection in genetic data? To investigate these questions, we tested for traces of divergent selection in health- and lifestyle-related genetic variants using three broad approaches. First, we looked for evidence of selection by identifying strong differentiation in allele frequencies between ancient populations. Second, we reconstructed the allele frequency trajectories and selection coefficients of tens of thousands of trait-associated variants, using a new chromosome painting technique to model ancestry-specific allele frequency trajectories through time. This allowed us to identify many trait-associated variants with new evidence for directional selection and to answer long-standing questions about the timing of selection for key health-, dietary- and pigmentation-associated loci. Finally, we used ancient genomes to infer local ancestry tracts in more than 400,000 present-day samples from the UK Biobank (UKB)5 and calculated ancestry-specific polygenic risk scores for 35 complex traits. This allowed us to characterize the genetic legacy of Mesolithic, Neolithic and Bronze Age populations in present-day phenotypes.

Samples and data

Our analyses are undertaken on a large collection of shotgun-sequenced ancient genomes presented in ref.1. This dataset comprises 1,664 imputed diploid ancient genomes and more than 8.5 million single nucleotide polymorphisms (SNPs), with an estimated imputation error rate of 1.9% and a phasing switch error rate of 2.0% for 1X genomes. Full details of the validation and benchmarking of the imputation and phasing of this dataset are provided in ref.10. These samples represent a considerable transect of Eurasia, ranging longitudinally from the Atlantic coast to Lake Baikal and latitudinally from Scandinavia to the Middle East (Fig.1). The included genomes constitute a thorough temporal sequence from 11,000 to 1,000 cal bp. This dataset allowed us to characterize in fine detail the changes in selective pressures exerted by major transitions in human culture and environment.

Fig. 1: Geographic and temporal distribution of the 1,015 ancient genomes from West Eurasia.
figure 1

a, Map of West Eurasia showing sampling locations and ages of the ancient samples.b, Raincloud plot of the sample ages, grouped by sampling region: Western Europe (n = 156), Central/Eastern Europe (n = 268), Southern Europe (n = 136), Northern Europe (n = 432) and Central/Western Asia (n = 23). Boxplot shows the median and first and third quartiles of the sample ages and whiskers extend to the largest value no further than 1.5× the interquartile range.

Genetic legacy of ancient Eurasians

We began our analysis by inferring local ancestry tracts in present-day populations by chromosome ‘painting’11 the UKB with Mesolithic, Neolithic and Bronze Age individuals as tract sources. We used a pipeline adapted from GLOBETROTTER12 and estimated admixture proportions by means of non-negative least squares (Supplementary Note 2). In total, we painted 433,395 present-day samples, including 24,511 from individuals born outside the United Kingdom, from 126 countries (Supplementary Note 1). Our results show that none of the Mesolithic, Neolithic or Bronze Age ancestries are homogeneously distributed among present-day Eurasian populations (Fig.2). Western hunter-gatherer (WHG)-related ancestries are highest in present-day individuals from the Baltic States, Belarus, Poland and Russia; Eastern hunter-gatherer (EHG)-related ancestries are highest in Mongolia, Finland, Estonia and Central Asia; and Caucasus hunter-gatherer (CHG)-related ancestries are highest in countries east of the Caucasus, in Pakistan, India, Afghanistan and Iran, in accordance with previous results13. The CHG-related ancestries probably reflect affinities to both CHG and Iranian Neolithic individuals, explaining the relatively high levels in South Asia14. Consistent with expectations15, Neolithic Anatolian-related farmer ancestries are concentrated around the Mediterranean basin, with high levels in southern Europe, the Near East and North Africa, including the Horn of Africa, but are less frequent in Northern Europe. This is in direct contrast to the Steppe-related ancestries, which are found in high levels in northern Europe, peaking in Ireland, Iceland, Norway and Sweden and decreasing further south. There is also evidence for their spread into southern Asia. Overall, these results refine global patterns of spatial distributions of ancient ancestries amongst present-day individuals. We caution, however, that absolute admixture proportions should be interpreted with caution in regions where our ancient source populations are less directly related to present-day individuals, such as in Africa and East Asia. Although these values are dependent on the reference samples used, as well as the treatment of pre- or post-admixture drift, the relative geographical variation and associations should remain consistent.

Fig. 2: The genetic legacy of ancient Eurasian ancestries in present-day populations.
figure 2

ae, Maps showing the average ancestry of: WHG (a); EHG (b); CHG (c); Neolithic farmer (d); and Steppe pastoralist (e) ancestry components per country (left) and per county or unitary authority within Great Britain and per country for the Republic of Ireland and Northern Ireland (right). Estimation was performed using ChromoPainter and NNLS, on samples of a ‘typical ancestral background’ for each non-UK country (n = 24,511) and Northern Ireland. For Great Britain, an average of self-identified ‘white British’ samples was used to represent each UK county and unitary authority, based on place of birth (n = 408,884). Countries with less than 4 and counties with less than 15 samples are shown in grey. Map uses ArcGIS layers World Countries Generalized and World Terrain.

The availability of many present-day samples (n = 408,884) from self-identified ‘white British’ individuals who share similar positions on a principal component analysis5 allowed us to further examine the distribution of ancient ancestries at high resolution in present-day Britain (Supplementary Note 2). Although regional ancestry distributions differ by only a few percentage points, we find clear evidence of geographical heterogeneity across the United Kingdom. This can be visualized by averaging ancestry proportions per county on the basis of place of birth (Fig.2). The proportion of Neolithic farmer ancestries is highest in southern and eastern England today and lower in Scotland, Wales and Cornwall. Steppe-related ancestries are inversely distributed, peaking in the Outer Hebrides and Ireland, a pattern only previously described for Scotland16. This regional pattern was already evident in the Pre-Roman Iron Age and persists to the present day even though immigrating Anglo-Saxons had relatively less affinities to Neolithic farmers than the Iron Age individuals of southwest Britain. Although this Neolithic farmer/Steppe-related dichotomy mirrors the modern ‘Anglo-Saxon’/‘Celtic’ ethnic divide, its origins are older, resulting from continuous migration from a continental population relatively enriched in Neolithic farmer ancestries, starting as early as the Late Bronze Age17,18. By measuring haplotypes from these ancestries in present-day individuals, we show that these patterns differentiate Wales and Cornwall as well as Scotland from England. We also find higher levels of WHG-related ancestries in central and northern England. These results demonstrate clear ancestry differences within an ‘ethnic group’ (white British), highlighting the need to account for subtle population structure when using resources such as the UKB19.

Ancestry-stratified selective sweeps

Having identified that significant differences in ancestries persist in seemingly homogeneous present-day populations, we sought to disentangle these effects by developing a chromosome painting technique that allows us to label haplotypes on the basis of their genetic affinities to ancient individuals. To achieve this, we built a quantitative admixture graph model (Fig.3 and Supplementary Note 3) that represents the four main ancestry flows contributing to present-day European genomes over the last 50,000 years20. We used this model to simulate genomes at time periods and in sample sizes equivalent to our empirical dataset and inferred tree sequences using Relate21,22. We trained a neural network classifier to estimate the path backwards in time through the population structure taken by each simulated individual, at each position in the genome. Our trained classifier was then used to infer the ancestral paths taken at each site, using 1,015 imputed ancient genomes from West Eurasia that passed quality filters. Using simulations, we show that our chromosome painting method has an average accuracy of 94.6% for the four ancestral paths leading to present-day Europeans and is robust to model misspecification.

Fig. 3: A schematic of the model of population structure in Europe.
figure 3

Quantitative admixture model used to simulate genomes to train the local ancestry neural network classifier. The model begins with the Out-of-Africa population, before splitting into basal Northern Europeans (NE) and West Asians (WA), who further split into EHG, WHG, CHG and ANA. These then admix to form Steppe pastoralist (Yam) and Neolithic farmer (Neo) populations. Moving down the figure is forwards in time and the population split times and admixture times are given in generations ago. Each branch is labelled with the effective population size of the population. Coloured lines represent the populations declared in the simulation that extend through time.

We then adapted CLUES23 to model aDNA time-series data (Supplementary Notes 4 and5) and used it to infer allele frequency trajectories and selection coefficients for 33,341 quality-controlled trait-associated variants from the GWAS Catalog24. An equal number of putatively neutral, frequency-paired variants were used as a control set (Supplementary Note 4). To control for possible confounders, we built a causal model to distinguish direct effects of age on allele frequency from indirect effects mediated by read depth, read length and/or error rates (Supplementary Note 6) and developed a mapping bias test used to evaluate systematic differences between data from ancient and present-day populations (Supplementary Note 4). Because admixture between groups with differing allele frequencies can confound interpretation of allele frequency changes through time, we used the local ancestry paths from our chromosome painting model to stratify haplotypes in our selection tests. By conditioning on these path labels, we are able to infer selection trajectories while controlling for changes in admixture proportions through time.

Our analysis identified no genome-wide significant (P < 5 × 10−8) selective sweeps when using genomes from present-day individuals alone (1000 Genomes Project populations GBR, FIN and TSI25), although trait-associated variants were enriched for evidence of selection compared to the control group (P < 7.29 × 10−35, Wilcoxon signed-rank test). By contrast, when using imputed aDNA genotype probabilities, we identified 11 genome-wide significant selective sweeps in the GWAS group (n = 476 SNPs withP < 5 × 10−8) and no sweeps in the control group, despite some SNPs exhibiting evidence of selection (n = 51). These results are consistent with selection preferentially acting on trait-associated variants. We then conditioned our selection analysis on each of our four local ancestry pathways—that is, local ancestry tracts passing through WHG, EHG, CHG or Anatolian farmers (ANA)—and identified 21 genome-wide significant selection peaks (Fig.4 and Extended Data Figs.110). This suggests that admixture between ancestral populations has masked evidence of selection at many trait-associated loci in Eurasian populations26.

Fig. 4: Genome-wide selection scan for trait-associated variants.
figure 4

a, Manhattan plot ofP values from selection scan with CLUES, based on a time-series of imputed aDNA genotype probabilities. Twenty-one genome-wide significant selection peaks highlighted in grey and labelled with the gene closest to the most significant SNP within each locus. Within each sweep, SNPs are positioned on they axis and coloured by their most significant marginal ancestry. Outside of the sweeps, SNPs showP values from the pan-ancestry analysis and are coloured grey. Red dotted lines indicate genome-wide significance (P < 5 × 10−8).b, Detailed plots for three genome-wide significant sweep loci: (1)MCM6, lactase persistence; (2)SLC45A2, skin pigmentation; and (3)FADS2, lipid metabolism. Rows show results for the pan-ancestry analysis (ALL) plus the four marginal ancestries: WHG, EHG, CHG and ANA. The first column of each locus shows zoomed Manhattan plots of theP values for each ancestry and column two shows allele frequency trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Selection on diet-associated loci

We find strong changes in selection associated with lactose digestion after the introduction of farming but before the expansion of the Steppe pastoralists into Europe around 5,000 years ago27,28, the timing of which is a long-standing controversy29,30,31,32. The strongest overall signal of selection in the pan-ancestry analysis is observed at theMCM6/LCT locus (rs4988235: A;P = 1.68 × 10−59;s = 0.0194), where the derived allele results in lactase persistence33. The trajectory inferred from the pan-ancestry analysis indicates that the lactase persistence allele began increasing in frequency about 6,000 years ago and has continued to increase up to the present (Fig.4). In the ancestry-stratified analyses, this signal is driven primarily by sweeps in two of the ancestral backgrounds, associated with EHG and CHG. We also observed that many selected SNPs within this locus exhibited earlier evidence of selection than atrs4988235, suggesting that selection at theMCM6/LCT locus is more complex than previously thought. To investigate this further, we expanded our selection scan to include all SNPs within the ~2.6 megabase (Mb)-wide sweep locus (n = 5,608) and checked for the earliest evidence of selection. We observed that most genome-wide significant SNPs at this locus began rising in frequency earlier thanrs4988235, indicating that strong positive selection at this locus predates the emergence of the lactase persistence allele by thousands of years. Among the alleles showing much earlier frequency rises wasrs1438307: T (P = 9.77 × 10−24;s = 0.0146), which began rising in frequency about 12,000 years ago (Fig.4). This allele has been shown to regulate energy expenditure and contribute to metabolic disease and it has been suggested to be an ancient adaptation to famine34. The high linkage disequilibrium betweenrs1438307 andrs4988235 in present-day individuals (R2 = 0.89 in GBR) may explain the recently observed correlation between frequency rises in the lactase persistence allele and archaeological proxies for famine and increased pathogen exposure35. To control for potential bias introduced by imputation, we replicated these results using genotype likelihoods, called directly from the aDNA sequencing reads, and with publicly available 1240k capture array data from the Allen Ancient DNA Resource v.52.2 (ref.36) (Supplementary Note 4).

We also found strong selection in the FADS gene cluster—FADS1 (rs174546: C;P = 4.41 × 10−19;s = 0.0126) andFADS2 (rs174581: G;P = 2.21 × 10−19;s = 0.0138)—which are associated with fatty acid metabolism and known to respond to changes in diet from a more/less vegetarian to a more/less carnivorous diet37,38,39,40,41. In contrast to previous results39,40,41, we find that much of the selection associated with a more vegetarian diet occurred in Neolithic populations before they arrived in Europe, then continued during the Neolithic (Fig.4). The strong signal of selection in this region in the pan-ancestry analysis is driven primarily by a sweep occurring across the EHG, WHG and ANA haplotypic backgrounds (Fig.4). Interestingly, we do not find statistically significant evidence of selection at this locus in the CHG background but most of the allele frequency rise in the EHG background occurs after their admixture with CHG (around 8,000 years ago42), within whom the selected alleles were already close to present-day frequencies. This suggests that the selected alleles may already have existed at substantial frequencies in early farmer populations in the Middle East and among Caucasus hunter-gatherers (associated with the ANA and CHG backgrounds, respectively) and were subject to continued selection as eastern groups moved northwards and westwards during the late Neolithic and Bronze Age periods.

When specifically comparing selection signatures differentiating ancient hunter-gatherer and farmer populations43, we also observe many regions associated with lipid and sugar metabolism and various metabolic disorders (Supplementary Note 7). These include, for example, a region in chromosome 22 containingPATZ1, which regulates the expression ofFADS1 andMORC2, which plays an important role in cellular lipid metabolism44. Another region in chromosome 3 overlaps withGPR15, which is both related to immune tolerance and to intestinal homoeostasis45,46. Finally, in chromosome 18, we recover a selection candidate region spanningSMAD7, which is associated with inflammatory bowel diseases such as Crohn’s disease47. Taken together these results indicate that the transition to agriculture imposed a substantial amount of selection for humans to adapt to a new diet and lifestyle and that the prevalence of some diseases observed today may be a consequence of these selective processes.

Selection on immunity-associated loci

We also observe evidence of strong selection in several loci associated with immunity and autoimmune disease (Supplementary Note 4). Some of these putative selection events occurred earlier than previously claimed and are probably associated with the transition to agriculture, which may help explain the high prevalence of autoimmune diseases today. Most notably, we detect an 8 Mb-wide selection sweep signal in chromosome 6 (chr6: 25.4–33.5 Mb), spanning the full length of the human leukocyte antigen (HLA) region. The selection trajectories of the variants within this locus support several independent sweeps, occurring at different times and with differing intensities. The strongest signal of selection at this locus in the pan-ancestry analysis is at an intergenic variant, located betweenHLA-A andHLA-W (rs7747253: A;P = 7.56 × 10−32;s = −0.0178), associated with protection against chickenpox (odds ratio (OR) 0.888; ref.5), increased risk of intestinal infections (OR 1.08; ref.48) and decreased heel bone mineral density (OR 0.98; ref.49). This allele rapidly decreased in frequency, beginning about 8,000 years ago (Extended Data Fig.3), reducing the risk of intestinal infections, at the cost of increasing the risk of chickenpox. By contrast, the signal of selection atC2 (rs9267677: C;P = 6.60 × 10−26;s = 0.0441), also found within this sweep, shows a gradual increase in frequency beginning around 4,000 years ago, before rising more rapidly about 1,000 years ago. In this case, the favoured allele is associated with protection against some sexually transmitted diseases (STDs) (OR 0.786; ref.48), primarily those caused by human papillomavirus, and with increased psoriasis risk (OR 2.2; ref.5). This locus provides a good example of the possibility that the high prevalence of autoimmune diseases in present-day populations may, in part, be due to genetic trade-offs; by which selection increased protection against pathogens with the pleiotropic effect of increased susceptibility to autoimmune diseases50.

These results also highlight the complex temporal dynamics of selection at the HLA locus, which not only plays a role in the regulation of the immune system but is also associated with many non-immune-related phenotypes. The high pleiotropy in this region makes it difficult to determine which selection pressures may have driven these increases in frequencies at different periods of time. However, profound shifts in lifestyle in Eurasian populations during the Holocene have been suggested to be drivers for strong selection on loci involved in immune response. These include a change in diet and closer contact with domestic animals, combined with higher mobility and increasing population density. We further explore the complex pattern of ancestry-specific selection at the HLA locus in our companion paper51.

We also identify selection signals at theSLC22A4 (rs35260072: C;P = 8.49 × 10−20;s = 0.0172) locus, associated with increased itch intensity from mosquito bites (OR 1.049; ref.52), protection against childhood and adult asthma (OR 0.902 and 0.909; ref.48) and asthma-related infections (OR 0.913; ref.48) and we find that the derived variant has been steadily rising in frequency since around 9,000 years ago (Extended Data Fig.9). However, in the sameSLC22A4 candidate region as rs35260072, we find that the frequency of the previously reported allelers1050152: T (which also protects against asthma (OR 0.90; ref.48) and related infections) plateaued about 1,500 years ago, contrary to previous reports suggesting a recent rise in frequency7. Similarly, we detect selection at theHECTD4 (rs11066188: A; P = 9.51 × 10−31s = 0.0198) andATXN2 (rs653178: C;P = 3.73 × 10−29;s = 0.0189) loci, both of which have been rising in frequency for about 9,000 years (Extended Data Fig.4), also contrary to previous reports of a more recent rise in frequency7. These SNPs are associated with protection against urethritis and urethral syndrome (OR 0.769 and 0.775; ref.48), which are often caused by STDs or accumulated urethral damage from having more than five births. Both SNPs are also linked to increased risk of intestinal infectious diseases (OR 1.03 and 1.04), several non-specific parasitic diseases (OR 1.44 and 1.59; ref.48), schistosomiasis (OR 1.13 and 1.32; ref.48), helminthiases (OR 1.29 and 1.28; ref.48), spirochaetes (OR 1.14 and 1.12; ref.48), pneumonia (OR 1.03 and 1.03; ref.48) and viral hepatitis (OR 1.15 and 1.15; ref.5). These SNPs also increase the risk of coeliac disease and rheumatoid arthritis53. Thus, several highly pleiotropic disease-associated loci, which were previously thought to be the result of recent adaptation, may have been subject to selection for a much longer period of time.

Selection on the 17q21.31 locus

We further detect signs of strong selection in a 2 Mb sweep on chromosome 17 (chr. 17: 44.0-46.0 Mb), spanning a locus on 17q21.3, implicated in neurodegenerative and developmental disorders. The locus includes an inversion and other structural polymorphisms with indications of a recent positive selection sweep in some human populations54,55. Specifically, partial duplications of theKANSL1 gene probably occurred independently on the inverted (H2) and non-inverted (H1) haplotypes (Extended Data Fig.11a) and both are found in high frequencies (15–25%) among current European and Middle Eastern populations, but are much rarer in Sub-Saharan African and East Asian populations. We used both SNP genotypes and WGS read depth information to determine inversion (H1/H2) andKANSL1 duplication (d) status in the ancient individuals studied here (Supplementary Note 8).

The H2 haplotype is observed in two of three previously published genomes56 of Anatolian aceramic-associated Neolithic individuals (Bon001 and Bon004) from around 10,000 bp but data were insufficient to identifyKANSL1 duplications. The oldest evidence forKANSL1 duplications is observed in an early Neolithic individual (AH1 from 9,900 bp; ref.57) from present-day Iran, followed by two Mesolithic individuals (NEO281 from 9,724 bp and KK1 (ref.58) from 9,720 bp), from present-day Georgia, all of whom are heterozygous for the inversion and carry the inverted duplication. TheKANSL1 duplications are also detected in two Neolithic individuals, from present-day Russia (NEO560 from 7,919 bp (H1d) and NEO212 from 7,390 bp (H2d)). With both H1d and H2d having spread to large parts of Europe with Anatolian Neolithic farmer ancestries, their frequency seems unchanged in most of Europe as Steppe-related ancestries become dominant in large parts of the subcontinent (Extended Data Fig.11c). The fact that both H1d and H2d are found in apparently high frequencies in both early Anatolian farmers and the earliest Steppe-related ancestry groups suggests that any selective sweep acting on the H1d and H2d variants would probably have occurred in populations ancestral to both.

We note that the strongest signal of selection observed in the pan-ancestry analysis at this locus is atMAPT (rs4792897: G;P = 1.33 × 10−18;s = 0.0299 (Extended Data Fig.8 and Supplementary Note 4), which codes for the tau protein59 and is associated with protection against mumps (OR 0.776; ref.48) and increased risk of snoring (OR 1.04; ref.60). More generally, polymorphisms inMAPT have been associated with increased risk of several neurodegenerative disorders, including Alzheimer’s disease and Parkinson’s disease61. However, we caution that this region is also enriched for evidence of reference bias in our dataset—especially around theKANSL1 gene—due to complex structural polymorphisms (Supplementary Note 10).

Selection on pigmentation loci

Our results identify strong selection for lighter skin pigmentation in groups moving northwards and westwards, consistent with the idea that selection is caused by reduced ultraviolet exposure and resulting vitamin D deficiency. We find that the most strongly selected alleles reached near-fixation several thousand years ago, suggesting that this process was not associated with recent sexual selection as previously proposed62. In the pan-ancestry analysis, we detect strong selection at theSLC45A2 locus (rs35395: C;P = 1.60 × 10−44;s = 0.0215)8,63, with the selected allele (responsible for lighter skin) increasing in frequency from around 13,000 years ago, until plateauing around 2,000 years ago (Fig.4). The predominant hypothesis is that high melanin levels in the skin are important in equatorial regions owing to its protection against ultraviolet radiation, whereas lighter skin has been selected for at higher latitudes (where ultraviolet radiation is less intense) because some ultraviolet penetration is required for cutaneous synthesis of vitamin D64,65. Our findings confirm pigmentation alleles as key targets of selection during the Holocene7,66, particularly on a small proportion of loci with large effect sizes8.

Our results also provide detailed information about the duration and geographic spread of these processes (Fig.4), suggesting that an allele associated with lighter skin was selected for repeatedly, probably as a consequence of similar environmental pressures occurring at different times in different regions. In the ancestry-stratified analysis, all marginal ancestries show broad agreement at theSLC45A2 locus (Fig.4) but differ in the timing of their frequency shifts. The ANA-associated ancestry background shows the earliest evidence for selection at rs35395, followed by EHG and WHG around 10,000 years ago and CHG about 2,000 years later. In all ancestry backgrounds, except ANA, the selected haplotypes plateau at high frequency by about 2,000 years ago, whilst the ANA haplotype background reaches near-fixation 1,000 years earlier. We also detect strong selection at theSLC24A5 locus (rs1426654: A;P = 2.28 × 10−16;s = 0.0185), which is also associated with skin pigmentation63,67. At this locus, the selected allele increased in frequency even earlier thanSLC45A2 and reached near-fixation around 3,500 years ago. Selection on this locus thus seems to have occurred early on in groups that were moving northwards and westwards and only later in the WHG background after these groups encountered and admixed with the incoming populations.

Selection among major axes of variation

Beyond patterns of genetic change at the Mesolithic–Neolithic transition, much genetic variability observed today reflects high genetic differentiation in the hunter-gatherer groups that eventually contributed to present-day European genetic diversity43. Indeed, many loci associated with cardiovascular disease, metabolism and lifestyle diseases trace their genetic variability before the Neolithic transition to ancient differential selection in ancestry groups occupying different parts of the Eurasian continent (Supplementary Note 7). These may represent selection episodes that preceded the admixture events described above and led to differentiation between ancient hunter-gatherer groups in the late Pleistocene and early Holocene. One of these overlaps with theSLC24A3 gene, which is a salt-sensitivity gene significantly expressed in obese individuals68. Another spansROPN1 andKALRN, two genes involved in vascular disorders69. A further region containsSLC35F3, which codes for a thiamine transport70 and has been associated with hypertension in a Han Chinese cohort71. Finally, there is a candidate region containing several genes (CH25H andFAS) associated with obesity and lipid metabolism72,73 and another peak with several genes (ASXL2, RAB10, HADHA andGPR113) involved in glucose homoeostasis and fatty acid metabolism74,75,76,77. These loci predominantly reflect ancient patterns of extreme differentiation between Eastern and Western Eurasian genomes and may be candidates for selection after the separation of the Pleistocene populations that occupied different environments across the continent (roughly 45,000 years ago13).

Pathogenic structural variants

Rare, recurrent copy-number variants (CNVs) are known to cause neurodevelopmental disorders and are associated with a range of psychiatric and physical traits with variable expressivity and incomplete penetrance78,79. To understand the prevalence of pathogenic structural variants over time we examined 50 genomic regions susceptible to recurrent CNVs, known to be the most prevalent drivers of human developmental pathologies80. The analysis included 1,442 ancient shotgun genomes passing quality control for CNV analysis (Supplementary Note 10) and 1,093 present-day human genomes for comparison81,82. We identified CNVs in ancient individuals at ten loci using a read depth-based approach and digital comparative genomic hybridization83. Although most of the observed CNVs (including duplications at 15q11.2 andCHRNA7 and CNVs spanning parts of the TAR locus and 22q11.2 distal) have not been unambiguously associated with disease in large studies, the identified CNVs include deletions and duplications that have been associated with developmental delay, dysmorphic features and neuropsychiatric abnormalities such as autism (most notably at 1q21.1, 3q29, 16p12.1 and the DiGeorge/VCFS locus but also deletions at 15q11.2 and duplications at 16p13.11). Overall, the carrier frequency in the ancient individuals is similar to that reported in the UKB genomes (1.25% versus 1.6% at 15q11.2 andCHRNA7 combined and 0.8% versus 1.1% across the remaining loci combined)84. These results indicate that large, recurrent CNVs, which can lead to several pathologies, were present at similar frequencies in the ancient and present-day populations included in this study.

Phenotypic legacy of ancient Eurasians

In addition to identifying evidence of selection for trait-associated variants, we also estimated the contribution from different genetic ancestries (associated with EHG, CHG, WHG, Steppe pastoralists and Neolithic farmers) to variation in complex traits in present-day individuals. We calculated ancestry-specific polygenic risk score—hereafter ancestral risk scores (ARS)—on the basis of chromosome painting of over 400,000 UKB samples using ChromoPainter11 (Fig.5 and Supplementary Note 9). This allowed us to identify which ancient ancestry components are over-represented in present-day UK populations at loci significantly associated with a given trait and is analogous to the genetic risk that a present-day individual would possess if they were composed entirely of one of the ancestry groupings defined in this study. This analysis avoids issues related to the portability of polygenic risk scores between populations85, as our ARSs are calculated from the same individuals used to estimate the effect sizes. Working with many imputed ancient genomes provides high statistical power to use ancient populations as ancestral sources. We focused on 35 phenotypes whose polygenic scores were significantly overdispersed among the ancient populations (Supplementary Note 9), as well as well as three large effect alleles at theAPOE gene (ApoE2, ApoE3 and ApoE4) known to significantly mediate risk of developing Alzheimer’s disease86. We emphasize that this approach makes no direct reference to ancient phenotypes but instead describes how these genetic ancestry components contributed to the present-day phenotypic landscape.

Fig. 5: ARSs for 35 complex traits.
figure 5

Showing the genetic risk that a present-day individual would possess if they were composed entirely of one ancestry. On the basis of chromosome painting of the UKB, for 35 complex traits found to be significantly overdispersed in ancient populations. Confidence intervals (95%) are estimated by bootstrapping present-day samples (n = 408,884) and centred on the mean estimate.

We find that for many anthropometric traits—such as trunk predicted mass, forced expiratory volume in 1-second (FEV1) and basal metabolic rate—the ARS for Steppe ancestry was the highest, followed by EHG and CHG/WHG, whilst Neolithic farmer ancestry consistently scored the lowest for these measurements. Consistent with previous studies, hair and skin pigmentation also showed significant differences, with scores for skin colour for WHG, EHG and CHG higher (that is, darker) than for Neolithic farmer and Steppe-associated ancestries8,9,27,28; and scores for traits related to malignant neoplasms of the skin were elevated in Neolithic farmer-associated ancestries. Both Neolithic farmer- and Steppe-associated ancestries have higher scores for blonde and light brown hair, whereas the hunter-gatherer-associated ancestries have higher scores for dark brown hair and CHG-associated ancestries had the highest score for black hair.

In terms of genetic contributions to risk for diseases, the WHG ancestral component had strikingly high scores for traits related to cholesterol, blood pressure and diabetes. The Neolithic farmer component scored the highest for anxiety, guilty feelings and irritability; CHG and WHG ancestry components consistently scored the lowest for these three traits. We found the ApoE4 allele (rs429358: C and rs7412: C, which increases risk of Alzheimer’s disease) preferentially painted with a WHG/EHG haplotypic background, suggesting it was probably brought into Western Eurasia by early hunter-gatherers (Supplementary Note 9). This result is in line with the present-day European distribution of this allele, which is highest in northeastern Europe, where the proportion of these ancestries is larger than in other regions of the continent87. By contrast, we found the ApoE2 allele (rs429358: T and rs7412: T, which decreases the risk for Alzheimer’s disease) on a haplotypic background with affinities to Steppe pastoralists. Our pan-ancestry analysis identified positive selection favouring ApoE2 (P = 6.99 × 10−3;s = 0.0130), beginning about 7,000 years ago and plateauing around 2,500 years ago (Supplementary Note 4). However, we did not identify evidence of selection for either ApoE3 (rs429358: T and rs7412: C) or ApoE4, contrary to a recent study with a smaller sample size and unphased genotypes88. The selective forces probably favouring ApoE2 in Steppe pastoralists may be associated with protective immune responses against infectious challenges, such as protection against malaria or an unknown viral infection (Supplementary Note 9).

In light of the ancestry gradients within the United Kingdom and across Eurasia (Fig.2), these results support the suggestion that migration-mediated geographic variation in phenotypes and disease risk is commonplace, and points to a way forward for explaining geographically structured disease prevalence through differential admixture processes between present-day populations. These results also help to clarify the famous discussion of selection in Europe relating to height7,89. Our finding that the Steppe- and EHG-associated ancestral components have elevated genetic values for height in the UKB demonstrates that height differences between Northern and Southern Europe may be a consequence of differential ancestry, rather than selection, as claimed in many previous studies90. However, our results do not preclude the possibility that height has been selected for in specific populations91,92.

Discussion

The fundamental changes in diet resulting from the transitions from hunting and gathering to farming and subsequently to pastoralism, precipitated far-reaching consequences for the physical and mental health of present-day Eurasian populations. These dramatic cultural changes created a heterogeneous mix of selection pressures, probably related to changes in diet and increased population densities, including selection for resistance to new infectious challenges. Owing to the highly pleiotropic nature of each sweep region, it is difficult to ascribe causal factors to any of our selection signals and we did not exhaustively test all non-trait-associated variants. However, our results show that selection during the Holocene has had a substantial impact on present-day genetic disease risk, as well as the distribution of genetic factors affecting metabolic and anthropometric traits. Our analyses have also shown that the ability to detect signatures of natural selection in present-day human genomes is drastically limited by conflicting selection pressures in different ancestral populations masking the signals. Developing methods to trace selection while accounting for differential admixture allowed us to effectively double the number of genome-wide significant selection peaks and helped clarify the trajectories of several variants related to diet and lifestyle. Furthermore, we have shown that numerous complex traits thought to have been under local selection are better explained by differing genetic contributions of ancient individuals to present-day variation. Overall, our results emphasize how the interplay between ancient selection and the main admixture events occurring in the Mesolithic, Neolithic and Bronze Age have profoundly shaped the patterns of genetic variation observed in present-day humans across Eurasia.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All ancient genomic data used in this study are already published and listed in Supplementary Table1. Data were aligned to the human reference GRCh37. Modern human genomes were obtained from the 1000 Genomes Project25, the Simons Genome Diversity Project81 and the Human Genome Diversity Project82. GWAS data were obtained from the GWAS Catalog24, the FinnGen Study48 and the UKB5.

Code availability

The scripts used to run the chromosome painting (Supplementary Note 2) and calculate ARS in the UKB (Supplementary Note 9) are available athttps://github.com/will-camb/mesoneo_selection_paper (https://doi.org/10.5281/zenodo.8301166). The software to perform the ancestral path chromosome painting described in Supplementary Note3 is available on GitHub athttps://github.com/AliPearson/AncestralPaths (https://doi.org/10.5281/zenodo.8319452) and the demographic model is available in the stdpopsim library (https://popsim-consortium.github.io/stdpopsim-docs/stable/catalog.html#sec_catalog_homsap_models_ancienteurope_4a21). The analysis pipeline and ‘conda’ environment necessary to replicate the analysis of allele frequency trajectories of trait-associated variants in Supplementary Note4 are available athttps://github.com/ekirving/mesoneo_paper (https://doi.org/10.5281/zenodo.8289755). The modified version of CLUES used in this study is available fromhttps://github.com/standard-aaron/clues (https://doi.org/10.5281/zenodo.8228252). The pipeline to replicate the analyses for Supplementary Note7 can be found athttps://github.com/albarema/neo (https://doi.org/10.5281/zenodo.8301253). All other analyses relied on available software which has been fully referenced in the manuscript and detailed in the relevantSupplementary notes.

References

  1. Allentoft, M. E. et al. Population genomics of post-glacial western Eurasia.Naturehttps://doi.org/10.1038/s41586-023-06865-0 (2024).

  2. Page, A. E. et al. Reproductive trade-offs in extant hunter-gatherers suggest adaptive mechanism for the Neolithic expansion.Proc. Natl Acad. Sci. USA113, 4694–4699 (2016).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  3. Marciniak, S., Bergey, C., Silva, A. M. & Hałuszko, A. An integrative skeletal and paleogenomic analysis of prehistoric stature variation suggests relatively reduced health for early European farmers.Proc. Natl Acad. Sci. USA119, e2106743119 (2022).

  4. Visscher, P. M. et al. 10 years of GWAS discovery: biology, function and translation.Am. J. Hum. Genet.101, 5–22 (2017).

    Article CAS PubMed PubMed Central  Google Scholar 

  5. Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data.Nature562, 203–209 (2018).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  6. Vitti, J. J., Grossman, S. R. & Sabeti, P. C. Detecting natural selection in genomic data.Annu. Rev. Genet.47, 97–120 (2013).

    Article CAS PubMed  Google Scholar 

  7. Mathieson, I. et al. Genome-wide patterns of selection in 230 ancient Eurasians.Nature528, 499–503 (2015).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  8. Ju, D. & Mathieson, I. The evolution of skin pigmentation-associated variation in West Eurasia.Proc. Natl Acad. Sci. USA118, e2009227118 (2021).

    Article CAS PubMed  Google Scholar 

  9. Wilde, S. et al. Direct evidence for positive selection of skin, hair and eye pigmentation in Europeans during the last 5,000 y.Proc. Natl Acad. Sci. USA111, 4832–4837 (2014).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  10. Sousa da Mota, B. et al. Imputation of ancient human genomes.Nat. Commun.14, 3660 (2023).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  11. Lawson, D. J., Hellenthal, G., Myers, S. & Falush, D. Inference of population structure using dense haplotype data.PLoS Genet.8, e1002453 (2012).

    Article CAS PubMed PubMed Central  Google Scholar 

  12. Hellenthal, G. et al. A genetic atlas of human admixture history.Science343, 747–751 (2014).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  13. Sikora, M. et al. The population history of northeastern Siberia since the Pleistocene.Nature570, 182–188 (2019).

    Article ADS CAS PubMed  Google Scholar 

  14. Shinde, V. et al. An ancient Harappan genome lacks ancestry from Steppe pastoralists or Iranian farmers.Cell179, 729–735 (2019).

    Article CAS PubMed PubMed Central  Google Scholar 

  15. Hofmanová, Z. et al. Early farmers from across Europe directly descended from Neolithic Aegeans.Proc. Natl Acad. Sci. USA113, 6886–6891 (2016).

    Article ADS PubMed PubMed Central  Google Scholar 

  16. Galinsky, K. J., Loh, P.-R., Mallick, S., Patterson, N. J. & Price, A. L. Population structure of UK Biobank and ancient Eurasians reveals adaptation at genes influencing blood pressure.Am. J. Hum. Genet.99, 1130–1139 (2016).

    Article CAS PubMed PubMed Central  Google Scholar 

  17. Patterson, N. et al. Large-scale migration into Britain during the Middle to Late Bronze Age.Nature601, 588–594 (2022).

    Article ADS CAS PubMed  Google Scholar 

  18. Olalde, I. et al. The Beaker phenomenon and the genomic transformation of northwest Europe.Nature555, 190–196 (2018).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  19. Zaidi, A. A. & Mathieson, I. Demographic history mediates the effect of stratification on polygenic scores.eLife9, e61548 (2020).

    Article CAS PubMed PubMed Central  Google Scholar 

  20. Jones, E. R. et al. Upper Palaeolithic genomes reveal deep roots of modern Eurasians.Nat. Commun.6, 8912 (2015).

    Article ADS CAS PubMed  Google Scholar 

  21. Speidel, L., Forest, M., Shi, S. & Myers, S. R. A method for genome-wide genealogy estimation for thousands of samples.Nat. Genet.51, 1321–1329 (2019).

    Article CAS PubMed PubMed Central  Google Scholar 

  22. Speidel, L. et al. Inferring population histories for ancient genomes using genome-wide genealogies.Mol. Biol. Evol.38, 3497–3511 (2021).

    Article CAS PubMed PubMed Central  Google Scholar 

  23. Stern, A. J., Wilton, P. R. & Nielsen, R. An approximate full-likelihood method for inferring selection and allele frequency trajectories from DNA sequence data.PLoS Genet.15, e1008384 (2019).

    Article PubMed PubMed Central  Google Scholar 

  24. Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019.Nucleic Acids Res.47, D1005–D1012 (2019).

    Article CAS PubMed  Google Scholar 

  25. 1000 Genomes Project Consortium et al. A global reference for human genetic variation.Nature526, 68–74 (2015).

    Article  Google Scholar 

  26. Souilmi, Y. et al. Admixture has obscured signals of historical hard sweeps in humans.Nat. Ecol. Evol.6, 2003–2015 (2022).

    Article PubMed PubMed Central  Google Scholar 

  27. Allentoft, M. E. et al. Population genomics of Bronze Age Eurasia.Nature522, 167–172 (2015).

    Article ADS CAS PubMed  Google Scholar 

  28. Haak, W. et al. Massive migration from the steppe was a source for Indo-European languages in Europe.Nature522, 207–211 (2015).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  29. Enattah, N. S. et al. Independent introduction of two lactase-persistence alleles into human populations reflects different history of adaptation to milk culture.Am. J. Hum. Genet.82, 57–72 (2008).

    Article CAS PubMed PubMed Central  Google Scholar 

  30. Itan, Y., Powell, A., Beaumont, M. A., Burger, J. & Thomas, M. G. The origins of lactase persistence in Europe.PLoS Comput. Biol.5, e1000491 (2009).

    Article ADS MathSciNet PubMed PubMed Central  Google Scholar 

  31. Ségurel, L. & Bon, C. On the evolution of lactase persistence in humans.Annu. Rev. Genomics Hum. Genet.18, 297–319 (2017).

    Article PubMed  Google Scholar 

  32. Segurel, L. et al. Why and when was lactase persistence selected for? Insights from Central Asian herders and ancient DNA.PLoS Biol.18, e3000742 (2020).

    Article CAS PubMed PubMed Central  Google Scholar 

  33. Enattah, N. S. et al. Identification of a variant associated with adult-type hypolactasia.Nat. Genet.30, 233–237 (2002).

    Article CAS PubMed  Google Scholar 

  34. Wang, L. et al. A MicroRNA linking human positive selection and metabolic disorders.Cell183, 684–701 (2020).

    Article CAS PubMed PubMed Central  Google Scholar 

  35. Evershed, R. P. et al. Dairying, diseases and the evolution of lactase persistence in Europe.Nature608, 336–345 (2022).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  36. Mallick, S. et al. The Allen Ancient DNA Resource (AADR): a curated compendium of ancient human genomes. Preprint atbioRxivhttps://doi.org/10.1101/2023.04.06.535797 (2023).

  37. Willer, C. J. et al. Discovery and refinement of loci associated with lipid levels.Nat. Genet.45, 1274–1283 (2013).

    Article CAS PubMed PubMed Central  Google Scholar 

  38. Gallois, A. et al. A comprehensive study of metabolite genetics reveals strong pleiotropy and heterogeneity across time and context.Nat. Commun.10, 4788 (2019).

    Article ADS PubMed PubMed Central  Google Scholar 

  39. Buckley, M. T. et al. Selection in Europeans on fatty acid desaturases associated with dietary changes.Mol. Biol. Evol.34, 1307–1318 (2017).

    Article CAS PubMed PubMed Central  Google Scholar 

  40. Ye, K., Gao, F., Wang, D., Bar-Yosef, O. & Keinan, A. Dietary adaptation of FADS genes in Europe varied across time and geography.Nat. Ecol. Evol.1, 167 (2017).

    Article PubMed PubMed Central  Google Scholar 

  41. Mathieson, S. & Mathieson, I. FADS1 and the timing of human adaptation to agriculture.Mol. Biol. Evol.35, 2957–2970 (2018).

    Article CAS PubMed PubMed Central  Google Scholar 

  42. Lazaridis, I. The evolutionary history of human populations in Europe.Curr. Opin. Genet. Dev.53, 21–27 (2018).

    Article CAS PubMed  Google Scholar 

  43. Luu, K., Bazin, E. & Blum, M. G. B. pcadapt: an R package to perform genome scans for selection based on principal component analysis.Mol. Ecol. Resour.17, 67–77 (2017).

    Article CAS PubMed  Google Scholar 

  44. Sánchez-Solana, B., Li, D.-Q. & Kumar, R. Cytosolic functions of MORC2 in lipogenesis and adipogenesis.Biochim. Biophys. Acta1843, 316–326 (2014).

    Article PubMed  Google Scholar 

  45. Kim, S. V. et al. GPR15-mediated homing controls immune homeostasis in the large intestine mucosa.Science340, 1456–1459 (2013).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  46. Nguyen, L. P. et al. Role and species-specific expression of colon T cell homing receptor GPR15 in colitis.Nat. Immunol.16, 207–213 (2015).

  47. Monteleone, G. et al. Mongersen, an oral SMAD7 antisense oligonucleotide and Crohn’s disease.N. Engl. J. Med.372, 1104–1113 (2015).

    Article CAS PubMed  Google Scholar 

  48. Kurki, M. I. et al. FinnGen provides genetic insights from a well-phenotyped isolated population.Nature613, 508–518 (2023).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  49. Morris, J. A. et al. An atlas of genetic influences on osteoporosis in humans and mice.Nat. Genet.51, 258–266 (2018).

    Article PubMed PubMed Central  Google Scholar 

  50. Brinkworth, J. F. & Barreiro, L. B. The contribution of natural selection to present-day susceptibility to chronic inflammatory and autoimmune disease.Curr. Opin. Immunol.31, 66–78 (2014).

    Article CAS PubMed PubMed Central  Google Scholar 

  51. Barrie, W. et al. Elevated genetic risk for multiple sclerosis emerged in steppe pastoralist populations.Naturehttps://doi.org/10.1038/s41586-023-06618-z (2024).

  52. Jones, A. V. et al. GWAS of self-reported mosquito bite size, itch intensity and attractiveness to mosquitoes implicates immune-related predisposition loci.Hum. Mol. Genet.26, 1391–1406 (2017).

    Article CAS PubMed PubMed Central  Google Scholar 

  53. Gutierrez-Achury, J. et al. Functional implications of disease-specific variants in loci jointly associated with coeliac disease and rheumatoid arthritis.Hum. Mol. Genet.25, 180–190 (2016).

    Article CAS PubMed  Google Scholar 

  54. Stefansson, H. et al. A common inversion under selection in Europeans.Nat. Genet.37, 129–137 (2005).

    Article CAS PubMed  Google Scholar 

  55. Steinberg, K. M. et al. Structural diversity and African origin of the 17q21.31 inversion polymorphism.Nat. Genet.44, 872–880 (2012).

    Article CAS PubMed PubMed Central  Google Scholar 

  56. Kılınç, G. M. et al. The demographic development of the first farmers in Anatolia.Curr. Biol.26, 2659–2666 (2016).

    Article PubMed PubMed Central  Google Scholar 

  57. Broushaki, F. et al. Early Neolithic genomes from the eastern Fertile Crescent.Science353, 499–503 (2016).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  58. Jones, E. R. et al. The Neolithic transition in the Baltic was not driven by admixture with early European farmers.Curr. Biol.27, 576–582 (2017).

    Article CAS PubMed PubMed Central  Google Scholar 

  59. Andreadis, A., Brown, W. M. & Kosik, K. S. Structure and novel exons of the human tau gene.Biochemistry31, 10626–10633 (1992).

    Article CAS PubMed  Google Scholar 

  60. Jansen, P. R. et al. Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways.Nat. Genet.51, 394–403 (2019).

    Article CAS PubMed  Google Scholar 

  61. Desikan, R. S. et al. Genetic overlap between Alzheimer’s disease and Parkinson’s disease at the MAPT locus.Mol. Psychiatry20, 1588–1595 (2015).

    Article CAS PubMed PubMed Central  Google Scholar 

  62. Aoki, K. Sexual selection as a cause of human skin colour variation: Darwin’s hypothesis revisited.Ann. Hum. Biol.29, 589–608 (2002).

    Article PubMed  Google Scholar 

  63. Lona-Durazo, F. et al. Meta-analysis of GWA studies provides new insights on the genetic architecture of skin pigmentation in recently admixed populations.BMC Genet.20, 59 (2019).

    Article PubMed PubMed Central  Google Scholar 

  64. Jablonski, N. G. & Chaplin, G. The evolution of human skin coloration.J. Hum. Evol.39, 57–106 (2000).

    Article CAS PubMed  Google Scholar 

  65. Engelsen, O. The relationship between ultraviolet radiation exposure and vitamin D status.Nutrients2, 482–495 (2010).

    Article CAS PubMed PubMed Central  Google Scholar 

  66. Voight, B. F., Kudaravalli, S., Wen, X. & Pritchard, J. K. A map of recent positive selection in the human genome.PLoS Biol.4, e72 (2006).

    Article PubMed PubMed Central  Google Scholar 

  67. Martin, A. R. et al. An unexpectedly complex architecture for skin pigmentation in Africans.Cell171, 1340–1353 (2017).

    Article CAS PubMed PubMed Central  Google Scholar 

  68. Wu, H. et al. Transcriptome sequencing to detect the potential role of long noncoding RNAs in salt-sensitive hypertensive rats.BioMed Res. Int.2019, 2816959 (2019).

    Article PubMed PubMed Central  Google Scholar 

  69. Wang, L. et al. Peakwide mapping on chromosome 3q13 identifies the kalirin gene as a novel candidate gene for coronary artery disease.Am. J. Hum. Genet.80, 650–663 (2007).

  70. Zhang, K. et al. Genetic implication of a novel thiamine transporter in human hypertension.J. Am. Coll. Cardiol.63, 1542–1555 (2014).

    Article CAS PubMed PubMed Central  Google Scholar 

  71. Zang, X.-L. et al. Association of a SNP in SLC35F3 gene with the risk of hypertension in a Chinese Han population.Front. Genet.7, 108 (2016).

  72. Russo, L. et al. Cholesterol 25-hydroxylase (CH25H) as a promoter of adipose tissue inflammation in obesity and diabetes.Mol. Metab.39, 100983 (2020).

    Article CAS PubMed PubMed Central  Google Scholar 

  73. Demir, A., Kahraman, R., Candan, G. & Ergen, A. The role of FAS gene variants in inflammatory bowel disease.Turk. J. Gastroenterol.31, 356–361 (2020).

    Article PubMed PubMed Central  Google Scholar 

  74. Izawa, T. et al. ASXL2 regulates glucose, lipid and skeletal homeostasis.Cell Rep.11, 1625–1637 (2015).

    Article CAS PubMed PubMed Central  Google Scholar 

  75. Vazirani, R. P. et al. Disruption of adipose Rab10-dependent insulin signaling causes hepatic insulin resistance.Diabetes65, 1577–1589 (2016).

    Article CAS PubMed PubMed Central  Google Scholar 

  76. Thapa, D. et al. The protein acetylase GCN5L1 modulates hepatic fatty acid oxidation activity via acetylation of the mitochondrial β-oxidation enzyme HADHA.J. Biol. Chem.293, 17676–17684 (2018).

    Article CAS PubMed PubMed Central  Google Scholar 

  77. Ong, H. S. & Yim, H. C. H. inRegulation of Inflammatory Signaling in Health and Disease (ed. Xu, D.) 153–174 (Springer, 2017).

  78. Girirajan, S., Campbell, C. D. & Eichler, E. E. Human copy number variation and complex genetic disease.Annu. Rev. Genet.45, 203–226 (2011).

    Article CAS PubMed PubMed Central  Google Scholar 

  79. Weise, A. et al. Microdeletion and microduplication syndromes.J. Histochem. Cytochem.60, 346–358 (2012).

    Article CAS PubMed PubMed Central  Google Scholar 

  80. Girirajan, S. et al. Phenotypic heterogeneity of genomic disorders and rare copy-number variants.N. Engl. J. Med.367, 1321–1331 (2012).

    Article CAS PubMed PubMed Central  Google Scholar 

  81. Mallick, S. et al. The Simons Genome Diversity Project: 300 genomes from 142 diverse populations.Nature538, 201–206 (2016).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  82. Bergström, A. et al. Insights into human genetic variation and population history from 929 diverse genomes.Science367, eaay5012 (2020).

    Article PubMed PubMed Central  Google Scholar 

  83. Sudmant, P. H. et al. Diversity of human copy number variation and multicopy genes.Science330, 641–646 (2010).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  84. Crawford, K. et al. Medical consequences of pathogenic CNVs in adults: analysis of the UK Biobank.J. Med. Genet.56, 131–138 (2019).

    Article CAS PubMed  Google Scholar 

  85. Martin, A. R. et al. Human demographic history impacts genetic risk prediction across diverse populations.Am. J. Hum. Genet.100, 635–649 (2017).

    Article CAS PubMed PubMed Central  Google Scholar 

  86. Corder, E. H. et al. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer’s disease in late onset families.Science261, 921–923 (1993).

    Article ADS CAS PubMed  Google Scholar 

  87. Belloy, M. E., Napolioni, V. & Greicius, M. D. A quarter century of APOE and Alzheimer’s disease: progress to date and the path forward.Neuron101, 820–838 (2019).

    Article CAS PubMed PubMed Central  Google Scholar 

  88. Kolbe, D. et al. Current allele distribution of the human longevity gene APOE in Europe can mainly be explained by ancient admixture.Aging Cell22, e13819 (2023).

    Article CAS PubMed PubMed Central  Google Scholar 

  89. Rosenstock, E. et al. Human stature in the Near East and Europe ca. 10,000–1000 bc: its spatiotemporal development in a Bayesian errors-in-variables model.Archaeol. Anthropol. Sci.11, 5657–5690 (2019).

    Article  Google Scholar 

  90. Field, Y. et al. Detection of human adaptation during the past 2000 years.Science354, 760–764 (2016).

    Article ADS CAS PubMed PubMed Central  Google Scholar 

  91. Chen, M. et al. Evidence of polygenic adaptation in Sardinia at height-associated loci ascertained from the Biobank Japan.Am. J. Hum. Genet.107, 60–71 (2020).

    Article CAS PubMed PubMed Central  Google Scholar 

  92. Howe, L. J. et al. Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects.Nat. Genet.54, 581–592 (2022).

    Article CAS PubMed PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank all the former and current staff at the Lundbeck Foundation GeoGenetics Centre and the GeoGenetics Sequencing Core and colleagues across the many institutions detailed below. We are particularly grateful to L. Olsen as project manager for the Lundbeck Foundation GeoGenetics Centre project. We thank UKB for access to the UKB genomic resource. We want to acknowledge the participants and investigators of the FinnGen study. We are thankful to Illumina for collaboration. E.W. thanks St. John’s College, Cambridge, for providing a stimulating environment of discussion and learning. The Lundbeck Foundation GeoGenetics Centre is supported by the Lundbeck Foundation (R302-2018-2155 and R155-2013-16338), the Novo Nordisk Foundation (NNF18SA0035006), the Wellcome Trust (214300), Carlsberg Foundation (CF18-0024), the Danish National Research Foundation (DNRF94 and DNRF174), the University of Copenhagen (KU2016 programme), Ferring Pharmaceuticals A/S and a COREX ERC Synergy grant (ID 951385). This research has been conducted using the UKB Resource and the iPSYCH Initiative, funded by the Lundbeck Foundation (R102-A9118 and R155-2014-1724). This work was further supported by the Swedish Foundation for Humanities and Social Sciences grant (Riksbankens Jubileumsfond M16-0455:1) to K.K.. E.K.I.-P. and A.R.-M. were supported by the Lundbeck Foundation (R302-2018-2155) and the Novo Nordisk Foundation (NNF18SA0035006). A.P., R.D. and E.W. were supported by the Wellcome Trust (214300). R.M. was supported by a SSHRC doctoral studentship (G101449). R.A.H. and T.K. were supported by the Carlsberg Foundation (CF19-0712). L.S. was supported by a Sir Henry Wellcome fellowship (220457/Z/20/Z). L.F. was supported by the OAK Foundation (OCAY-15-520). P.H.S. was supported by the Institute of General Medical Sciences (R35GM142916) and a Vallee Scholars Award. R.N. was supported by the National Institutes of Health (R01GM138634). F.R. was supported by a Villum Young Investigator Grant (project no. 00025300), a Novo Nordisk Fonden Data Science Ascending Investigator Award (NNF22OC0076816) and by the European Research Council (ERC) under the European Union’s Horizon Europe programme (grant agreements No. 101077592 and 951385).

Author information

Author notes
  1. These authors contributed equally: Evan K. Irving-Pease, Alba Refoyo-Martínez, William Barrie, Andrés Ingason, Alice Pearson, Anders Fischer

  2. These authors jointly supervised this work: Daniel J. Lawson, Richard Durbin, Thorfinn Korneliussen, Thomas Werge, Morten E. Allentoft, Martin Sikora, Rasmus Nielsen, Fernando Racimo, Eske Willerslev

Authors and Affiliations

  1. Lundbeck Foundation GeoGenetics Centre, Globe Institute, University of Copenhagen, Copenhagen, Denmark

    Evan K. Irving-Pease, Alba Refoyo-Martínez, Andrés Ingason, Anders Fischer, Fabrice Demeter, Rasmus A. Henriksen, Tharsika Vimala, Hugh McColl, Gabriele Scorrano, Abigail Ramsøe, Anders Rosengren, Lei Zhao, Kristian Kristiansen, Thorfinn Korneliussen, Thomas Werge, Morten E. Allentoft, Martin Sikora, Rasmus Nielsen, Fernando Racimo & Eske Willerslev

  2. GeoGenetics Group, Department of Zoology, University of Cambridge, Cambridge, UK

    William Barrie, Ruairidh Macleod & Eske Willerslev

  3. Institute of Biological Psychiatry, Mental Health Services, Copenhagen University Hospital, Roskilde, Denmark

    Andrés Ingason, Andrew J. Schork & Anders Rosengren

  4. Department of Genetics, University of Cambridge, Cambridge, UK

    Alice Pearson & Richard Durbin

  5. Department of Zoology, University of Cambridge, Cambridge, UK

    Alice Pearson

  6. Department of Historical Studies, University of Gothenburg, Gothenburg, Sweden

    Anders Fischer, Karl-Göran Sjögren & Kristian Kristiansen

  7. Sealand Archaeology, Kalundborg, Denmark

    Anders Fischer

  8. Department of Integrative Biology, University of California Berkeley, Berkeley, CA, USA

    Alma S. Halgren & Peter H. Sudmant

  9. UCL Genetics Institute, University College London, London, UK

    Ruairidh Macleod & Leo Speidel

  10. Eco-anthropologie, Muséum national d’Histoire naturelle, CNRS, Université Paris Cité, Musée de l’Homme, Paris, France

    Fabrice Demeter

  11. Center for Computational Biology, University of California, Berkeley, CA, USA

    Andrew H. Vaughn, Aaron J. Stern & Peter H. Sudmant

  12. Ancient Genomics Laboratory, The Francis Crick Institute, London, UK

    Leo Speidel

  13. Neurogenomics Division, The Translational Genomics Research Institute (TGEN), Phoenix, AZ, USA

    Andrew J. Schork

  14. Oxford Centre for Neuroinflammation, Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, UK

    Astrid K. N. Iversen & Lars Fugger

  15. Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, UK

    Astrid K. N. Iversen

  16. Department of Clinical Medicine, Aarhus University Hospital, Aarhus, Denmark

    Lars Fugger

  17. MRC Human Immunology Unit, John Radcliffe Hospital, University of Oxford, Oxford, UK

    Lars Fugger

  18. Institute of Statistical Sciences, School of Mathematics, University of Bristol, Bristol, UK

    Daniel J. Lawson

  19. Wellcome Sanger Institute, Cambridge, UK

    Richard Durbin

  20. Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark

    Thomas Werge

  21. Institute of Biological Psychiatry, Mental Health Center Sct Hans, Copenhagen University Hospital, Copenhagen, Denmark

    Thomas Werge

  22. Trace and Environmental DNA (TrEnD) Laboratory, School of Molecular and Life Science, Curtin University, Perth, Western Australia, Australia

    Morten E. Allentoft

  23. Departments of Integrative Biology and Statistics, UC Berkeley, Berkeley, CA, USA

    Rasmus Nielsen

  24. MARUM Center for Marine Environmental Sciences and Faculty of Geosciences, University of Bremen, Bremen, Germany

    Eske Willerslev

Authors
  1. Evan K. Irving-Pease

    You can also search for this author inPubMed Google Scholar

  2. Alba Refoyo-Martínez

    You can also search for this author inPubMed Google Scholar

  3. William Barrie

    You can also search for this author inPubMed Google Scholar

  4. Andrés Ingason

    You can also search for this author inPubMed Google Scholar

  5. Alice Pearson

    You can also search for this author inPubMed Google Scholar

  6. Anders Fischer

    You can also search for this author inPubMed Google Scholar

  7. Karl-Göran Sjögren

    You can also search for this author inPubMed Google Scholar

  8. Alma S. Halgren

    You can also search for this author inPubMed Google Scholar

  9. Ruairidh Macleod

    You can also search for this author inPubMed Google Scholar

  10. Fabrice Demeter

    You can also search for this author inPubMed Google Scholar

  11. Rasmus A. Henriksen

    You can also search for this author inPubMed Google Scholar

  12. Tharsika Vimala

    You can also search for this author inPubMed Google Scholar

  13. Hugh McColl

    You can also search for this author inPubMed Google Scholar

  14. Andrew H. Vaughn

    You can also search for this author inPubMed Google Scholar

  15. Leo Speidel

    You can also search for this author inPubMed Google Scholar

  16. Aaron J. Stern

    You can also search for this author inPubMed Google Scholar

  17. Gabriele Scorrano

    You can also search for this author inPubMed Google Scholar

  18. Abigail Ramsøe

    You can also search for this author inPubMed Google Scholar

  19. Andrew J. Schork

    You can also search for this author inPubMed Google Scholar

  20. Anders Rosengren

    You can also search for this author inPubMed Google Scholar

  21. Lei Zhao

    You can also search for this author inPubMed Google Scholar

  22. Kristian Kristiansen

    You can also search for this author inPubMed Google Scholar

  23. Astrid K. N. Iversen

    You can also search for this author inPubMed Google Scholar

  24. Lars Fugger

    You can also search for this author inPubMed Google Scholar

  25. Peter H. Sudmant

    You can also search for this author inPubMed Google Scholar

  26. Daniel J. Lawson

    You can also search for this author inPubMed Google Scholar

  27. Richard Durbin

    You can also search for this author inPubMed Google Scholar

  28. Thorfinn Korneliussen

    You can also search for this author inPubMed Google Scholar

  29. Thomas Werge

    You can also search for this author inPubMed Google Scholar

  30. Morten E. Allentoft

    You can also search for this author inPubMed Google Scholar

  31. Martin Sikora

    You can also search for this author inPubMed Google Scholar

  32. Rasmus Nielsen

    You can also search for this author inPubMed Google Scholar

  33. Fernando Racimo

    You can also search for this author inPubMed Google Scholar

  34. Eske Willerslev

    You can also search for this author inPubMed Google Scholar

Contributions

E.K.I.-P., A.R.-M., W.B., A.I., A.P. and A.F. contributed equally to this work. P.H.S., D.J.L., R.D., T.K., T.W., M.E.A., M.S., R.N., F.R. and E.W. led the study. A.F., T.W., M.E.A., M.S. and E.W. conceptualized the study. P.H.S., D.J.L., R.D., T.K., T.W., M.E.A., M.S., R.N., F.R. and E.W. supervised the research. M.E.A., K.K., R.D., T.W., R.N. and E.W. acquired funding for research. E.K.I.-P., A.R.-M., A.I., A.P., W.B., A.H.V., L.S., A. J. Stern, K.K., D.J.L., R.D., T.K. M.E.A., M.S., R.N. and F.R. were involved in developing and applying methodology. E.K.I.-P., A.R.-M., A.I., A.P., W.B., A.S.H., R.A.H., T.V., H.M., A.H.V., L.S., A. Ramsøe, A. J. Schork, A. Rosengren, L.Z., P.H.S., T.K., M.E.A., M.S. and F.R. undertook formal analyses of data. E.K.I.-P., A.R.-M., A.I., A.P., A.F., W.B., K.-G.S., A.S.H., R.A.H., T.V., A. J. Stern, A. Ramsøe, A. Rosengren, L.Z., A.K.N.I., L.F., P.H.S., D.J.L., T.K., M.S., F.R. and E.W. drafted the main text. E.K.I.-P., A.R.-M., A.I., A.P., A.F., W.B., K.-G.S., A.S.H., R.A.H., T.V., A. J. Stern, G.S., A. Ramsøe, A. Rosengren, L.Z., A.K.N.I., L.F., P.H.S., D.J.L., M.S. and E.W. drafted theSupplementary Information andTables. E.K.I.-P., A.R.-M., A.I., A.P., A.F., W.B., K.-G.S., A.S.H., R.M., F.D., R.A.H., T.V., H.M., A. Ramsøe, A. J. Schork, L.Z., K.K., A.K.N.I., L.F., P.H.S., D.J.L., R.D., T.K., T.W., M.E.A., M.S., R.N., F.R. and E.W. were involved in reviewing drafts and editing. All co-authors read, commented on and agreed on the submitted manuscript.

Corresponding authors

Correspondence toEvan K. Irving-Pease,Rasmus Nielsen,Fernando Racimo orEske Willerslev.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature thanks the anonymous reviewers for their contribution to the peer review of this work.Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Selection at theMCM6 locus.

CLUES selection results for the most significant sweep locus, showing the pan-ancestry analysis (ALL) plus the four marginal ancestries: Western hunter-gatherers (WHG), Eastern hunter-gatherers (EHG), Caucasus hunter-gatherers (CHG) and Anatolian farmers (ANA). Row one shows zoomed Manhattan plots of the p values for each ancestry and row two shows allele trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Extended Data Fig. 2 Selection at theSLC45A2 locus.

CLUES selection results for the second most significant sweep locus, showing the pan-ancestry analysis (ALL) plus the four marginal ancestries: Western hunter-gatherers (WHG), Eastern hunter-gatherers (EHG), Caucasus hunter-gatherers (CHG) and Anatolian farmers (ANA). Row one shows zoomed Manhattan plots of the p values for each ancestry and row two shows allele trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Extended Data Fig. 3 Selection at the HLA locus.

CLUES selection results for the third most significant sweep locus, showing the pan-ancestry analysis (ALL) plus the four marginal ancestries: Western hunter-gatherers (WHG), Eastern hunter-gatherers (EHG), Caucasus hunter-gatherers (CHG) and Anatolian farmers (ANA). Row one shows zoomed Manhattan plots of the p values for each ancestry and row two shows allele trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Extended Data Fig. 4 Selection at theACAD10 locus.

CLUES selection results for the fourth most significant sweep locus, showing the pan-ancestry analysis (ALL) plus the four marginal ancestries: Western hunter-gatherers (WHG), Eastern hunter-gatherers (EHG), Caucasus hunter-gatherers (CHG) and Anatolian farmers (ANA). Row one shows zoomed Manhattan plots of the p values for each ancestry and row two shows allele trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Extended Data Fig. 5 Selection at theCCDC12 locus.

CLUES selection results for the fifth most significant sweep locus, showing the pan-ancestry analysis (ALL) plus the four marginal ancestries: Western hunter-gatherers (WHG), Eastern hunter-gatherers (EHG), Caucasus hunter-gatherers (CHG) and Anatolian farmers (ANA). Row one shows zoomed Manhattan plots of the p values for each ancestry and row two shows allele trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Extended Data Fig. 6 Selection at theRNA5SP158 locus.

CLUES selection results for the sixth most significant sweep locus, showing the pan-ancestry analysis (ALL) plus the four marginal ancestries: Western hunter-gatherers (WHG), Eastern hunter-gatherers (EHG), Caucasus hunter-gatherers (CHG) and Anatolian farmers (ANA). Row one shows zoomed Manhattan plots of the p values for each ancestry and row two shows allele trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Extended Data Fig. 7 Selection at theGATA4 locus.

CLUES selection results for the seventh most significant sweep locus, showing the pan-ancestry analysis (ALL) plus the four marginal ancestries: Western hunter-gatherers (WHG), Eastern hunter-gatherers (EHG), Caucasus hunter-gatherers (CHG) and Anatolian farmers (ANA). Row one shows zoomed Manhattan plots of the p values for each ancestry and row two shows allele trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Extended Data Fig. 8 Selection at theARL17B locus.

CLUES selection results for the eighth most significant sweep locus, showing the pan-ancestry analysis (ALL) plus the four marginal ancestries: Western hunter-gatherers (WHG), Eastern hunter-gatherers (EHG), Caucasus hunter-gatherers (CHG) and Anatolian farmers (ANA). Row one shows zoomed Manhattan plots of the p values for each ancestry and row two shows allele trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Extended Data Fig. 9 Selection at theIRF1 locus.

CLUES selection results for the ninth most significant sweep locus, showing the pan-ancestry analysis (ALL) plus the four marginal ancestries: Western hunter-gatherers (WHG), Eastern hunter-gatherers (EHG), Caucasus hunter-gatherers (CHG) and Anatolian farmers (ANA). Row one shows zoomed Manhattan plots of the p values for each ancestry and row two shows allele trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Extended Data Fig. 10 Selection at theKRT18P51 locus.

CLUES selection results for the tenth most significant sweep locus, showing the pan-ancestry analysis (ALL) plus the four marginal ancestries: Western hunter-gatherers (WHG), Eastern hunter-gatherers (EHG), Caucasus hunter-gatherers (CHG) and Anatolian farmers (ANA). Row one shows zoomed Manhattan plots of the p values for each ancestry and row two shows allele trajectories for the top SNPs across all ancestries (grey shading for the marginal ancestries indicates approximate temporal extent of the pre-admixture population).

Extended Data Fig. 11 The 17q21.31 inversion locus.

A) Haplotypes of the 17q21.31 locus: the ancestral (non-inverted) H1 17q21.31 and the inverted H2 haplotype. Duplications of theKANSL1 gene have occurred independently on both lineages yielding H1D and H2D haplotypes. B) Frequency of the 17q21.31 inversion and duplication haplotypes across present-day global populations (Human Genome Diversity Project82). C) Change in the frequency of the 17q21.31 inversion haplotype through time.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visithttp://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Irving-Pease, E.K., Refoyo-Martínez, A., Barrie, W.et al. The selection landscape and genetic legacy of ancient Eurasians.Nature625, 312–320 (2024). https://doi.org/10.1038/s41586-023-06705-1

Download citation

Download PDF

Associated content

Prehistoric events might explain European multiple sclerosis risk

  • Samira Asgari
  • Lionel A. Pousaz
NatureNews & Views

Elevated genetic risk for multiple sclerosis emerged in steppe pastoralist populations

  • William Barrie
  • Yaoling Yang
  • Eske Willerslev
NatureArticleOpen Access

100 ancient genomes show repeated population turnovers in Neolithic Denmark

  • Morten E. Allentoft
  • Martin Sikora
  • Eske Willerslev
NatureArticleOpen Access

Population genomics of post-glacial western Eurasia

  • Morten E. Allentoft
  • Martin Sikora
  • Eske Willerslev
NatureArticleOpen Access

Advertisement

Search

Advanced search

Quick links

Nature Briefing: Translational Research

Sign up for theNature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly.Sign up for Nature Briefing: Translational Research

[8]ページ先頭

©2009-2025 Movatter.jp