Movatterモバイル変換


[0]ホーム

URL:


Skip to main content
NCBI home page
Search in PMCSearch
As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsement of, or agreement with, the contents by NLM or the National Institutes of Health.
Learn more:PMC Disclaimer | PMC Copyright Notice
Molecular Biology and Evolution logo

Rapid Evolution and the Importance of Recombination to the Gastroenteric PathogenCampylobacter jejuni

Daniel J Wilson*,1,,Edith Gabriel†,2,Andrew JH Leatherbarrow,John Cheesbrough§,Steven Gee§,Eric Bolton,Andrew Fox§,,C Anthony Hart¶,3,Peter J Diggle,Paul Fearnhead*
*Department of Maths and Statistics, Lancaster University, Lancaster, United Kingdom
Department of Medicine, Lancaster University, Lancaster, United Kingdom
Faculty of Veterinary Science, University of Liverpool, Leahurst, Neston, United Kingdom
§Preston Microbiology Services, Royal Preston Hospital, Lancashire Teaching Hospitals NHS Foundation Trust, Preston, United Kingdom
Manchester Medical Microbiology Partnership, P.O. Box 209, Clinical Sciences Building, Manchester Royal Infirmary, Manchester, United Kingdom
Division of Medical Microbiology, School of Infection and Host Defence, University of Liverpool, Liverpool, United Kingdom

E-mail:djw@uchicago.edu.

1

Present address: Department of Human Genetics, University of Chicago, 920 East 58th Street, CLSC 410, Chicago, IL 60637 USA.

2

Present address: Université d'Avignon, IUT STID, Site Agroparc, BP 1207, Avignon, France.

3

Deceased.

Rasmus Nielsen, Associate Editor

Corresponding author.

Accepted 2008 Nov 6; Issue date 2009 Feb.

© 2008 The Authors

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

PMCID: PMC2639114  PMID:19008526

Abstract

Responsible for the majority of bacterial gastroenteritis in the developed world,Campylobacter jejuni is a pervasive pathogen of humans and animals, but its evolution is obscure. In this paper, we exploit contemporary genetic diversity and empirical evidence to piece together the evolutionary history ofC. jejuni and quantify its evolutionary potential. Our combined population genetics–phylogenetics approach reveals a surprising picture.Campylobacter jejuni is a rapidly evolving species, subject to intense purifying selection that purges 60% of novel variation, but possessing a massive evolutionary potential. The low mutation rate is offset by a large effective population size so that a mutation at any site can occur somewhere in the population within the space of a week. Recombination has a fundamental role, generating diversity at twice the rate of de novo mutation, and facilitating gene flow betweenC. jejuni and its sister speciesCampylobacter coli. We attempt to calibrate the rate of molecular evolution inC. jejuni based solely on within-species variation. The rates we obtain are up to 1,000 times faster than conventional estimates, placing theC. jejuniC. coli split at the time of the Neolithic revolution. We weigh the plausibility of such recent bacterial evolution against alternative explanations and discuss the evidence required to settle the issue.

Keywords:Campylobacter jejuni, molecular clock, recombination, selection, coalescent, Neolithic

Introduction

The World Health Organization expects that every year 1% of the population of developed nations will suffer campylobacteriosis (Humphrey et al. 2007), a diarrheal disease that can lead to serious sequelae such as Guillain–Barré syndrome and reactive arthritis (Zia et al. 2003).Campylobacter jejuni is the principal bacterial agent responsible for gastroenteritis, ahead ofSalmonella,Escherichia coli,Clostridium, andListeria combined (Adak et al. 2005). For such a common pathogen, surprisingly little is known about its evolution. What we do know is thatCampylobacter species are zoonotic pathogens that colonize the gut of a wide variety of birds and mammals.Campylobacter jejuni, the species responsible for 90% of human disease, is found commonly in cattle, sheep, pigs, poultry, wild birds, rabbits, other wild mammals, household pets, molluscs, sewage, and in natural water sources such as rivers and the coast (Jones 2001;Humphrey et al. 2007). TheC. jejuni gene pools in different host species are largely overlapping (McCarthy et al. 2007), but population genetic analysis has revealed that the majority of human cases of disease are caused by isolates associated with livestock and poultry (Wilson et al. 2008).

Qualitative evidence suggests that adaptation is ongoing inC. jejuni and is facilitated by horizontal gene transfer. The mechanism of virulence inC. jejuni is still poorly understood, but loci responsible for adherence, cellular invasion, toxin production, and flagellar motility are thought to be important virulence factors (Fouts et al. 2005). The genetic basis for antimicrobial drug resistance is known, and its spread by recombination has been demonstrated both withinC. jejuni (de Boer et al. 2002) and between related species (Oyarzabal et al. 2007). The resistance ofC. jejuni to a range of antibiotics is common throughout the world and is thought to have been driven by frequent use in animals farmed for meat (Moore et al. 2006).

What we do not know aboutC. jejuni is how dynamic is it as a species, what is the timescale of its evolution, how quickly might it adapt, and what is the extent to which recombination facilitates gene transfer withinC. jejuni and to or from its sister species? Recently,Sheppard et al. (2008) have suggested that the rate of recombination betweenC. jejuni and its sister speciesCampylobacter coli is sufficient to have begun to reverse the speciation process, but the timescale over which that might be happening is unclear.

Integral to these questions is the matter of calibrating the molecular clock, an issue fraught with difficulty in the bacterial world. Unlike multicellular eukaryotes, bacteria do not easily fossilize, and when they do their unremarkable morphology does not allow accurate taxonomic classification. Unlike viruses, bacteria mutate slowly, so slowly that the evolution of natural populations has not been readily measured in real time. To date, the rate of evolution in bacteria has been calibrated indirectly.Ochman and Wilson (1987) placed upper and lower bounds on a series of bacterial phylogenetic splits by cross-referencing other events that can be dated. For example, the common ancestor of mitochondria and their closest living bacterial relatives must have occurred more recently than the Palaeoproterozoic increase in atmospheric oxygen and prior to the radiation of mitochondrion-bearing eukaryotes.Moran et al. (1993) calibrated the molecular clock in bacterial endosymbionts by assuming cospeciation with their aphid hosts, for whom a fossil record is available. These phylogenetic approaches often conflict with empirical approaches that are based on laboratory measurements of generation lengths and mutation rates (Lenski et al. 2003;Ochman 2003).

Our study is based on a longitudinal sample of 1,205C. jejuni isolates collected over a 3-year period from patients in Lancashire, England (Wilson et al. 2008), and DNA sequenced using multilocus sequence typing (MLST,Dingle et al. 2001). We characterize the ongoing evolution ofC. jejuni using a population genetic (microevolutionary) model of the forces of drift, mutation, recombination, and natural selection. We exploit the longitudinal sample to calibrate the molecular clock forC. jejuni directly from within-species variation, and we utilize empirical measurements of generation lengths and mutation rates to quantify the species’ effective population size and evolutionary potential. Finally, we look at the evolution ofC. jejuni in the wider (macroevolutionary) context of theCampylobacter genus, employing our rate estimates to date phylogenetic splits, such as the common ancestor ofC. jejuni and its sister speciesC. coli. We evaluate our rate estimates in light of previous work and discuss the plausibility of a Neolithic origin ofC. jejuni.

Methods

Isolates

We analyzed 1,205 of theC. jejuni isolates ofWilson et al. (2008) that were available at the time of writing. The isolates were collected from patients diagnosed with campylobacteriosis and notified through general practitioners and hospitals to the Preston Microbiology Services Laboratory in the Preston postcode district between January 1, 2000, and December 31, 2002. The sampling time corresponds to the date at which the isolate was received at the laboratory. The study area covers 968 km2, consisting of both urban (Preston, Leyland, Chorley, and Garstang) and rural (Ribble estuary and Ribble valley) districts and comprised 403,000 people at the 2001 census. As is the norm with campylobacteriosis, the cases we studied were sporadic in nature; there was no evidence for outbreaks. The isolates were sequenced at seven housekeeping loci (aspA,glnA,gltA,glyA,pgm,tkt, anduncA) using the MLST (Dingle et al. 2001) producing 3,309 nucleotides in total per isolate.

Analysis Overview

We used a variety of standard tools from a molecular evolution toolkit in order to infer the evolutionary history ofC. jejuni: Structure (Falush et al. 2003) to identifyC. jejuni–C. coli hybrids, an importance sampler (Fearnhead 2008) to calibrate the rate of molecular evolution, approximate Bayesian computation (ABC) (Beaumont et al. 2002) to estimate population genetic parameters forC. jejuni, and BEAST (Drummond et al. 2002) to reconstruct the phylogeny of the genusCampylobacter.

Identification ofC. coli Hybrids

It was necessary to identifyC. jejuni–C. coli hybrids because the microevolutionary models that we subsequently used are based on the coalescent (Kingman 1982), which is a probabilistic model for the relatedness of individuals in a population (the genealogy). Interspecific gene flow introduces alleles that are very distantly related to the rest of the population, and therefore violate the assumptions of relatedness made by the coalescent. Gene flow betweenC. jejuni and its sister speciesC. coli has been reported previously (Dingle et al. 2005), so we used Structure (Falush et al. 2003) to identify alleles imported fromC. coli before fitting the evolutionary models.

Informally, Structure assigns individuals to populations (or species in our case) on the basis of allele frequencies. Alleles at one locus that are typically observed to be associated withC. coli alleles at other loci will be assigned toC. coli. Although they are sister species, relatively few alleles are found in both species. Except when there has been interspecific gene flow, there are usually enough fixed nucleotide differences betweenC. jejuni andC. coli alleles to accurately identify the origin (Dingle et al. 2001). For each polymorphic nucleotide, Structure gives a posterior probability ofC. jejuni ancestry (as opposed toC. coli ancestry). From this we defined hybrids as isolates with a posterior probability of dual ancestry greater than 0.95. Full details of the analysis are given in the supplementary methods,Supplementary Material online.

Microevolution ofC. jejuni

The evolutionary models that we employed, both population genetic and phylogenetic, are modular insofar as they comprise the following components:

  • Model of mutation

  • Model of recombination

  • Model of relatedness (the genealogy or phylogeny)

  • Method of statistical inference

In evolutionary genetics, the choice of model represents a tradeoff between the competing desires for a biologically realistic model and one for which statistical inference is feasible. The method of inference is the limiting step: Simpler models can be fitted using powerful likelihood-based methods (such as importance sampling [IS] or Markov chain Monte Carlo [MCMC]), which are statistically efficient in that they exploit all the information the data have to offer. More complex models can only be fitted using simulation-based methods (such as ABC,Beaumont et al. 2002), which are suboptimal because they use summaries of the full data.

Recombination in particular massively increases the complexity of a model, and intraspecific recombination is frequent inC. jejuni (Fearnhead et al. 2005), both within and between genes. To learn about the evolution ofC. jejuni, we adopted a two-stage approach that represents a compromise between our competing desires 1) to calibrate the rate of evolutionary change inC. jejuni and 2) to learn about the multifarious forces shapingC. jejuni. The former requires powerful likelihood-based inference to extract what is likely to be a weak signal, because our sampling period of 3 years is likely to be short relative to the timescale of bacterial evolution. The latter requires a complex model that incorporates drift, mutation, selection, and recombination.

In the first step, we fitted the following model that is simple enough to use a likelihood-based IS method (Example 3 ofFearnhead 2008). The 1,018 isolates were analyzed for which sampling times were available.

  • Infinite alleles model of mutation (Kimura and Crow 1964), in which new alleles are generated at rateθ. This rate encompasses the generation of novel alleles by mutation, intraspecific recombination, and interspecific recombination.

  • Free recombination between loci, which is to say the loci are assumed unlinked or independent.

  • The coalescent with serial samples (Rodrigo and Felsenstein 1999), in which the parameterNeg determines the rate of coalescence per year.

  • Likelihood-based IS (Fearnhead 2008).

The allelic mutation rateθ is measured in coalescent time units of 2Neg years, whereNe is the effective population size andg the generation length in years. In this model, the alleles identified asC. coli imports were assumed to have arisen by interspecific recombination, and the dates of their introduction were estimated. We used priors onθ andNeg that are flat on the logarithmic scale. The object of inference in this, the first step, was to estimate the timescale of evolution, which is determined by the coalescent parameterNeg. For an illustration of howNeg affects the signal of measurable evolution, seesupplementary figure S1,Supplementary Material online.

In the second step, we fitted a more complex model using ABC (see supplementary methods,Supplementary Material online, for full details), in which selection is modeled as a form of mutational bias. This analysis was based on the sequences of 881 pureC. jejuni isolates available at the time.

  • Nielsen and Yang (1998) codon model whose parameters are the synonymous mutation rateθS, the transition–transversion ratioκ, and thedN/dS ratioω.

  • A model of recombination suitable for bacteria (Wiuf and Hein 2000), whose parameters are the rate of recombinationρ and the average length of importτ.

  • The coalescent with serial samples, with parameterNeg.

  • Simulation-based ABC.

The parametersθS andρ are measured per kilobase (kb), in coalescent time units of 2Neg years. In this model,C. coli hybrids were excluded from the analysis. Except forNeg, we used priors that are flat on the logarithmic scale. In order to calibrate the rate of molecular change, which is to say convertθS into real-time units of years, we used the posterior from the simpler model as an informative prior forNeg. This was necessary because we had found that ABC was not sufficiently sensitive to estimateNeg.

To distinguish the calibrated parameters from those measured in coalescent time units, we useμS to denote the rate of synonymous mutation per kb per year andr to denote the rate of recombination per kb per year, whereasθS andρ denote the corresponding rates in coalescent time units. Formally,θS=2NegμS andρ=2Negr. We also useθN andμN to denote the corresponding rates of nonsynonymous change, and the total mutation rates areθ=θS+θN andμ=μS+μN.

Macroevolution ofCampylobacter

We put the evolution ofC. jejuni into a wider context by inferring the phylogenetic history of sevenCampylobacter species for which similar MLST schemes have been designed (Dingle et al. 2001;Miller et al. 2005;van Bergen et al. 2005). For each species we chose a typical isolate and tested that there was no interspecies recombination between the chosen isolates using a permutation test based on the correlation between physical distance and linkage disequilibrium (LD) (McVean et al. 2002). We then fitted the following phylogenetic model to the concatenated gene sequences. For further details see supplementary methods,Supplementary Material online.

We used informative priors forκ andω, taken from our posteriors estimated from the microevolutionary model. On the timescale of theCampylobacter phylogeny, our sequences are essentially sampled contemporaneously. Therefore, the data do not contain information regarding the rate of evolution. That was provided entirely by our informative prior onμS, which was taken from the posterior estimated from the microevolutionary model.

Results

In presenting our results, we begin by scrutinizing genetic diversity and LD in the contemporary population. This reveals a number of insights, including the identification of newC. jejuniC. coli hybrids, and the demonstration that recombination is the primary mechanism driving molecular change. Then we proceed to calibrating the real-time rate of molecular change. We detect a signal of measurable evolution inC. jejuni and employ that calibration to date historical events, including the importation of genes fromC. coli and the most recent common ancestor (MRCA) ofC. jejuni. We discuss the real-time evolutionary potential ofC. jejuni, including the likely efficacy of selection and the size of the gene pool. Finally, we utilize our estimate of the molecular clock to calibrate the phylogeny of the genusCampylobacter. We discuss the cultural changes that may have coincided with theC. jejuniC. coli split and the robustness of the approach in light of the conflict that arises with traditional estimates.

Recombination Dominates the Evolution ofC. jejuni

A cursory analysis of the patterns of nucleotide diversity inC. jejuni immediately reveals evidence of interspecies gene flow. For each of the seven genes,figure 1 illustrates the number of nucleotide differences between each pair of alleles. At six loci, there is distinct clustering between alleles that are genetically similar (in light gray) and dissimilar (in dark gray). The majority of alleles differ by fewer than 20 nucleotides, but a small number are highly divergent, differing by 40 nucleotides or more from the rest. By comparing our sample ofC. jejuni isolates to theC. coli isolates ofDingle et al. (2005) using the program Structure (Falush et al. 2003), we found that the genetically divergent alleles, most of which are observed at low frequency, are imports fromC. coli.

FIG. 1.—

FIG. 1.—

Histograms of the number of nucleotide differences between each pair of alleles at the seven MLST loci. WhenCampylobacter coli–derived alleles are removed, the dark gray portions of the histograms disappear.

Thirty of 1,205C. jejuni isolates were found to containC. coli alleles. Sequence type (ST-) 61, which carries theC. coli-deriveduncA-17 allele (Dingle et al. 2005), accounted for 23 of those. Five other STs were newly identified as hybrids; for each one,figure 2 illustrates the posterior probability ofC. jejuni ancestry across the seven loci. ST-1869 and ST-1933, represented by one isolate each, carriedC. coli–derived allelesgltA-30 andaspA-33, respectively. ST-2383, also represented by a single isolate, differed from ST-61 by a single nucleotide inaspA. The only evidence for intragenic mosaicism was found inpgm-93, carried by ST-350, of which there were two isolates. This allele contains a 200-bp stretch of DNA withC. coli ancestry, flanked on both sides by sequence ofC. jejuni ancestry. The rarityaspA-33,gltA-30, andpgm-93, and the uniformity of the background on whichuncA-17 is found, suggest these genes were introduced singly in four independent ancestral cross-species recombination events. However, the ancestral history of ST-1934, represented by a single isolate, is likely to be more complex. Five of the seven genes wereC. coli derived, intimating a more complex history involving multiple cross-species gene transfer events. A single isolate, ST-962, boreC. coli alleles at all seven genes, suggesting this isolate was misclassified.

FIG. 2.—

FIG. 2.—

Campylobacter jejuniCampylobacter coli hybrids identified by Structure. Six sequence types were identified as hybrids. For each hybrid, the posterior probability ofC. jejuni ancestry (as opposed toC. coli ancestry) is shown in gray for the seven MLST loci.

Removal of theC. coli alleles erases the genetic discontinuity from patterns ofC. jejuni diversity (leaving the light gray portions offig. 1) and allows us to identify the contribution of mutation and selection to population diversity. PureC. jejuni isolates differed by an average of 49.8 nucleotides out of the 3,309 sequenced by MLST. The majority of differences were synonymous, indicative of the housekeeping function of the loci. Using ABC, we estimated that a pair ofC. jejuni isolates, sampled at the same time, differ byθ = 13.7 mutations per kb on average, of whichθS = 11.8 would be synonymous (seetable 1 for credible intervals [CI]). This makesC. jejuni diverse relative to other bacteria (Pérez-Losada et al. 2006). The intense purifying selection experienced by these genes is reflected in the smalldN/dS ratio ofω = 0.0283. Mutations are also highly skewed in favor of transitions (κ = 19.0). To emphasize the strength of selection, we used our parameter estimates to calculate that the theoretical mutation rate, in the absence of selection (i.e., ifω = 1), would beθ0 = 35.6 per kb, some 2.6 times higher than the actual rate. This implies that purifying selection purges 60% of novel genetic variation.

Table 1.

Estimates of Evolutionary Parameters inCampylobacter Jejuni

ParameterUnitsA Posteriori
A priori 95% CI
Point Estimate95% CI
Allele model
NegMean coalescence timeYears209155–28892–820
θAllelic mutation rate(2Neg)−14.383.75–5.181.1–9.4
1
Sequence model
θTotal mutation ratekb−1 (2Neg)−113.78.35–23.92.1–180
θSSynonymous mutation ratekb−1 (2Neg)−111.87.51–19.71.9–170
θNNonsynonymous mutation ratekb−1 (2Neg)−11.90.84–4.20.2–10
θ0Neutral mutation ratekb−1 (2Neg)−135.622.0–61.85.8–510
κTransition–transversion ratio19.08.85–39.83.3–180
ωdN/dS ratio0.02830.0165–0.04920.0022–0.18
ρτRecombination rate between distant loci(2Neg)−16.083.19–11.22 × 10−4–8 × 105
ρRecombination ratekb−1 (2Neg)−11.310.0273–42.20.0014–72
τMean DNA import lengthkb4.540.100–2140.015–6,800

The geometric mean was used to obtain point estimates. The (2.5%, 97.5%) quantiles were used to calculate the 95% CI. All priors were uniform on the logarithmic scale.

Recombination is prevalent inC. jejuni (Fearnhead et al. 2005), facilitating gene flow within the species, as well as importing diversity from without. Permutation tests (McVean et al. 2002) showed significantly lower LD between loci than withinP < 0.01), but lacked power to demonstrate significant levels of intragenic recombination on a locus-by-locus basis. Nevertheless, formal model fitting revealed nonzero levels of intragenic recombination. Using ABC, we estimated that an average ofρ = 1.31 recombination breakpoints per kb would occur on the evolutionary branches separating a pair of isolates. Assuming an exponential distribution for the length of DNA imported during homologous recombination, we estimated an average import length ofτ = 4.54 kb. The wide CIs forρ andτ (table 1) attest to the weak intragenic signal, but these figures are consistent with those estimated by LDhat (Fearnhead et al. 2005). Appreciably more power was available to estimate the interlocus rate of recombination. A curious aspect of bacterial recombination is that residual, nonzero LD is expected even between distant loci (Wiuf and Hein 2000). This is because the rate of recombination plateaus to a maximum ofρτ, rather than increasing linearly with physical distance. We estimated the long-range rate of recombination to beρτ = 6.08, with a relatively tight CI (seetable 1).

A simple comparison of the rate of recombination to mutation conceals the primacy of recombination as the dominant force that generates molecular change. The higher rate of point mutation per se is offset by the wider effect that each recombination event has. Hundreds to thousands of nucleotides are imported during recombination, of which a proportionπ/1,000 will differ between the incoming and existing sequence, whereπ is the average number of pairwise nucleotide differences per kb in the population. Therefore, within-species recombination drives molecular change by a factorρτπ/2θ more quickly than de novo mutation, which we estimated to equal 2.67 (95% CI 1.39–4.95).

By quantifying the rate of recombination, we were able not only to establish the importance of intraspecific recombination relative to mutation but also to compare the relative importance of intraspecific with interspecific gene flow. We reasoned earlier that the ancestral history ofC. jejuni bears witness to at least four importations from sister speciesC. coli. From our estimate of the within-species recombination rate, we predict that during the same ancestral history, there were approximately 230 intraspecific recombination events. Although we did not formally infer the rate of recombination betweenC. jejuni andC. coli, we can deduce that the rate of cross-species gene flow is little more than an order of magnitude (roughly 230/4 = 57.5 times) less frequent than within-species gene flow. This reinforces the observation that recombination is fundamental, not just in driving molecular change withinC. jejuni but also in facilitating cross-species gene flow, which is likely to have important implications for long-term adaptation.

Is Genetic Variation inC. jejuni Just 400 Years Old?

Central to calibrating the rate of molecular evolution is estimatingNeg, the timescale of the genealogy. The product of the effective population size (Ne) and the generation length (g), this parameter dictates the rate of coalescence inC. jejuni, it determines the date of the MRCA, and it allows the rates of mutation and recombination to be measured in real-time units. To have power to estimateNeg, the population must be measurably evolving on the timescale of the sampling period (Drummond et al. 2003). That is equivalent to saying there must have been a detectable number of mutation, recombination, or coalescence events in the population during the 3-year longitudinal study.Supplementary figure S1,Supplementary Material online, illustrates the idea and shows where in the data the signal lies.

We used the importance sampler ofFearnhead (2008) to estimate the rate of molecular change inC. jejuni (table 1). To determine whether the population was measurably evolving, we conducted a formal hypothesis test in which we compared two models, one using the longitudinal sampling times and the other assuming all sequences were sampled simultaneously. For the two models, measurably evolving (M1) versus not measurably evolving (M0), we estimated the likelihood averaged over the parameter values and calculated a Bayes factor, which is the ratio of these likelihoods. We obtained a Bayes factor of 3.0 × 1020, which is much greater than one, indicating very strong support for the measurably evolving hypothesis.

We estimated thatNeg 209 years (95% CI 155–288), which can be understood as the average age of the common ancestor of a pair ofC. jejuni sequences sampled contemporaneously. The magnitude ofNeg, together with the fact that the population is measurably evolving on a timescale of just 3 years, suggests a rapid rate of evolution inC. jejuni. In a demographically stable population, the date of the MRCA is a measure of population turnover and is expected to equal 2Neg. In a recombining population such asC. jejuni, the MRCA varies between genes and even within a gene. We estimated a very recent date for the average MRCA across genes ofTMRCA = AD 1591 (95% CI 1304–1771; seetable 2). This suggests that the rate of turnover of genetic variation at the average locus is just 409 years. The MRCA represents merely the root of the contemporary genealogy and does not coincide with the birth of the species, but it does mark a horizon beyond which we cannot use intraspecific genetic variation to reconstruct the evolutionary past. Later, we employ homologous sequences from otherCampylobacter species to infer the deeper history ofC. jejuni.

Table 2.

Dating Ancestral Events

Calibrated DateUncalibrated Date (Units ofNeg)
EventPoint Estimate95% CIPoint Estimate95% CI
TMRCAMRCA1,5921,305–1,7721.879.21 × 10−1–3.68
TaspA-33Import ofaspA-33 fromCampylobacter coliAugust 2000Jan 1997–Mar 20011.34 × 10−38.68 × 10−5–0.0422
Tpgm-93Import ofpgm-93 fromC. coliJanuary 1998Jul 1965–May 20008.73 × 10−38.25 × 10−4–0.244
TuncA-17Import ofuncA-17 fromC. coliMarch 1966Sep 1726–Jul 19961.20 × 10−11.23 × 10−2–1.06

Evaluating the timescale of evolution inC. jejuni should allow events other than the MRCA to be dated. Of particular interest are the dates of the cross-species recombination events identified earlier. Those events are, namely, the importation ofaspA-33,gltA-30,pgm-93, anduncA-17. We report both calibrated and uncalibrated dates intable 2; the uncalibrated dates are measured in coalescent time units ofNeg. We found that two genes are very recent imports fromC. coli, consistent with their low frequency. The calibrated dates suggest that with 95% probability,aspA-33 was introduced between January 1997 and March 2001, andpgm-93 was imported between July 1965 and May 2000. The uncalibrated dates show that these rare alleles were imported very much more recently than theTMRCA. We were unable to date the importation ofgltA-30 because no sampling time was recorded for the isolate carrying it. However, based on its sample frequency, its importation date is likely to resemble that ofaspA-33. TheC. coli-deriveduncA-17 is the most frequent and most ancient import. Present in 25 isolates, we estimated its date of introduction to be March 1966 (95% CI September 1725–July 1995) or 6.5% of the evolutionary time since the MRCA.

Knowledge regarding the timescale of evolution inC. jejuni allows the absolute rate of molecular change, or the molecular clock, to be calibrated. The parameterθ is the number of mutations (per kb) by which the average pair of sequences differs. However, an equivalent interpretation is thatθ is the rate of mutation (per kb) in coalescent time units of 2Neg years. Similarly,ρ is the rate of recombination (per kb) per 2Neg years. Therefore, we can combine our estimate ofNeg obtained by IS with our estimates of population genetic parameters obtained by ABC to convert the mutation and recombination rates from coalescent time to real time.Table 3 summarizes the results of this conversion. We inferred an absolute mutation rate ofμ = 3.23 × 10−2 per kb per year, comprising a synonymous rate ofμS = 2.79 × 10−2 and a nonsynonymous rate ofμN = 4.4 × 10−3 per kb per year. This corresponds to an average waiting time, per lineage per kb, of 31.0 years before a mutation arises. We estimated an absolute recombination rate ofr = 3.07 × 10−3 per kb per year, rising to = 1.45 × 10−2 per year between distant loci. We go on to use the absolute rate of synonymous mutation (μS) calculated here to calibrate the phylogeny of the genusCampylobacter.

Table 3.

Calibrated Rate Parameters inCampylobacter jejuni

Parameter
UnitsPoint Estimate95% CI
μTotal mutation ratekb−1 year−13.23 × 10−21.86 × 10−2–5.81 × 10−2
μSSynonymous mutation ratekb−1 year−12.79 × 10−21.60 × 10−2–5.08 × 10−2
μNNonsynonymous mutation ratekb−1 year−14.40 × 10−32.60 × 10−3–7.30 × 10−3
μ0Neutral mutation ratekb−1 year−18.39 × 10−24.85 × 10−2–1.51 × 10−1
Recombination rate between distant lociYear−11.45 × 10−27.11 × 10−3–2.92 × 10−2
rRecombination ratekb−1 year−13.07 × 10−34.79 × 10−5–1.27 × 10−1

We have already seen thatC. jejuni experiences intense purifying selection at the housekeeping loci studied here (ω = 0.0273). In the absence of selection, the absolute mutation rate would beμ0 = 8.39 × 10−2 per kb per year (95% CI 4.85 × 0−2 − 15.1 × 10−2), 2.6 times higher than that observed. Besides reflecting the strict functional constraint of the genes in question, the strength of selection conveyed by these figures intimates a large effective population size and gene pool. Intense functional constraint suggests thatC. jejuni is already highly adapted, but it is of interest to quantify the evolutionary potential of the species or the rate at which it could respond to a change in selection pressure.

The Evolutionary Potential ofC. jejuni Is Immense

The evolutionary potential, or adaptability, of a species depends on two quantities not readily calculable from population genetic data. The real-time total-population rate of mutation,Neμ0, determines the rate at which advantageous variants may arise in the event of a change in selection pressure. Together with the amount of standing variation, it determines the rate of adaptation; informally, this is the size of the gene pool. The effective population size,Ne, limits the efficacy of selection (Kimura 1955) and hence how likely it is that an advantageous variant would spread should it arise. We cannot estimate either quantity directly from the sequence data. However, knowledge of the generation length (g) or the per-generation neutral mutation rate (m0) would be sufficient to calculate the quantities of interest, via the parameters we have been able to infer.

Campylobacter jejuni is a microaerophilic bacterium that is adapted to growth at 37 or 42 °C, typical of the mammalian and avian gut, respectively. Growth experiments in culture demonstrate that the generation length depends on many factors including the genotype, temperature, and the presence of competing strains (Velayudhan and Kelly 2002;Khanna et al. 2006;Jackson et al. 2007;Konkel et al. 2007). An excursion into the recent literature reveals that the doubling time of wild typeC. jejuni ranges from 90 min to 5 h, although we cannot discount longer generation times in vivo. By assuming that empirical estimates of generation length follow a log-normal distribution and utilizing vague priors on the mean and variance of that distribution, we obtained a posterior-predictive distribution on generation length (see, e.g.,Gelman et al. 2003, p. 74).Figure 3a shows this distribution, with the empirically measured generation times from 10 growth experiments (Velayudhan and Kelly 2002;Khanna et al. 2006;Jackson et al. 2007;Konkel et al. 2007) of wild typeC. jejuni at 37 and 42 °C, both alone and in mixed culture, indicated by black lines crossing the horizontal axis. We calculated a point estimate of 2.44 h, with 95% CI of 0.719–8.29 h (table 4).

FIG. 3.—

FIG. 3.—

Inference based on empirical parameter estimation. Posterior distributions of (a) generation lengthg, (b) mutation ratem0, (c) effective population sizeNe, and (d) waiting time for a novel mutationW, based on empirical estimates of the generation length (dark gray histograms) or mutation rate (light gray histograms), and population genetic estimate ofθ0 (c,d). The empirical data are indicated with black lines crossing the horizontal axis in (a) and (b). The background histogram in (c) and (d) additionally depends on the population genetic estimate ofNeg, rendering them sensitive to the calibration of the molecular clock. Seetable 4 for more details.

Table 4.

Inference Based on Empirical Parameter Estimates

Parameter
UnitsMethodPoint Estimate95% CI
Empirical estimates
gGeneration lengthYearsIn vitro doubling times2.79 × 10−48.20 × 10−5–9.46 × 10−4
Hours2.440.719–8.29
m0Neutral mutation ratekb−1 g−1Drake's method2.77 × 10−62.26 × 10−7–3.41 × 10−5
Composite estimates
NeEffective population sizeNe = θ0/(2m0)6.42 × 1064.95 × 105–8.29 × 107
Ne = Neg/g7.50 × 1052.13 × 105–2.64 × 106
WInverse population mutation rateDaysW = 365 × (2g)/θ05.711.52–21.4
DaysW = 365 × (4Negm0)/θ020.6670.0437–10.2
μ0Neutral mutation ratekb−1 year−1μ0 = m0/g9.95 × 10−36.18 × 10−4–1.60 × 10−1

Methods denoted by are calibration sensitive.

No direct estimate of the de novo per-generation mutation rate,m0, is available forC. jejuni, butDrake (1991) andDrake et al. (1998) reported a startlingly consistent pattern in the genomic rate of mutation between different micro-organisms. Per kb, the mutation rates of these organisms (bacteriophages,E. coli,Saccharomyces cerevisiae, andNeurospora crassa) vary 16,000-fold. Per genome, the rate varies just 2.5-fold, excepting outliers. Drake calculated a mean genomic mutation rate of 3.3 × 10−3, which equals 1.9 × 10−6 per kb forC. jejuni, which has a 1.7-Mb genome (www.nmpdr.org/content/campy.php). To quantify the uncertainty in this approach, we again assumed a log-normal distribution for variation inm0 to obtain a posterior-predictive distribution based on Drake's mutation rates, adjusted for genome size. Because we included outliers, we obtained a point estimate of 2.77 × 10−6 per kb and wide CI (table 4). The distribution is plotted infigure 3b, with the empirically measured mutation rates, adjusted for genome size, indicated by black lines crossing the horizontal axis.

From these empirical estimates, we can calculate evolutionary quantities of interest. A number of simple formulae relate the parametersg,m0,θ0, andNeg to the effective population size,Ne, and the real-time population mutation rate,Neμ0. An intuitive representation of this latter quantity isW = 365 × 1,000/(Neμ0), which gives the expected waiting time, in days, for a mutation to arise at any particular nucleotide, somewhere in the population. The formulae for calculatingNe andW are detailed intable 4. When the formula involvesNeg, the estimate is sensitive to the calibration of the molecular clock:Ne is calibration sensitive when estimated fromg but notm0.W is calibration sensitive when estimated fromm0 but notg. By comparing calibration-sensitive and insensitive estimates, this provides a useful check.

The posterior distribution ofNe is plotted as a light gray histogram in the foreground offigure 3c to emphasize its dependence onm0 and its insensitivity to calibration. In the background, in dark gray is plotted the estimate based ong, which is calibration sensitive. Based onm0, we estimate an effective population size of 6.42 million. The wide CIs (table 4) principally reflect the underlying uncertainty inm0. The calibration-sensitive estimate based ong is 0.75 million and has tighter, partially overlapping CIs that reflect the lesser uncertainty ing. Infigure 3d, the estimate ofW based ong (dark gray histogram in foreground) is insensitive to calibration and has tight CIs surrounding the point estimate of 5.71 days. The estimate based onm0 is 0.667 days (light gray histogram in background); the posterior distribution has wide CIs and is calibration sensitive.

Qualitatively, the results agree that the effective population size ofC. jejuni is large, on the order of hundreds of thousands to tens of millions. This suggests that selection is highly efficacious inC. jejuni, sensitive to fitness advantages as small as 1 × 10−6. Earlier, we estimated adN/dS ratio of 0.0283, suggesting that selection is, for the most part, purifying. However, a second consequence of the large effective population size is to counteract the low intrinsic mutation rate, such that the average waiting time for a new mutation to arise at any particular nucleotide is just 5.7 days, maybe less. Although most are lost by drift or purged by selection, novel genetic variants are arising at such a rate that the potential ofC. jejuni to adapt to changes in selection pressure appears to be immense.

Despite the qualitatively similar conclusions, there exist inconsistencies between the estimates ofNe andW based ong andm0. These inconsistencies are manifest in the nonoverlapping portions of the light and dark gray histograms infigure 3c andd. The reason for this partial overlap can be understood when we useg andm0 to obtain an empirical calibration of the neutral rate of molecular change inC. jejuni.Table 4 shows that the empirical point estimate ofμ0 is 9.95 × 10−3, an order of magnitude lower than that estimated by our population genetics method from within-species variation (table 3). Although the CIs of the empirical estimate are sufficiently wide (table 4) to subsume the population genetics estimate, the empirical evidence suggests a slower molecular clock overall.

IsCampylobacter Speciating on a Timescale of Thousands of Years?

Earlier, we remarked that the MRCA of a species constitutes a horizon beyond which we cannot use intraspecific genetic variation to reconstruct evolutionary history. In a recombining species, the MRCA can differ between loci, but we estimated that the date of the MRCA for the average locus inC. jejuni existed around 400 years ago. To delve deeper into the species' evolutionary history, it is necessary to employ more distantly related molecular sequences, so we used the sequences of sixCampylobacter species for which MLST schemes have also been developed:C. coli (Dingle et al. 2001),Campylobacter fetus (van Bergen et al. 2005),Campylobacter helveticus (Miller et al. 2005),Campylobacterinsulaenigrae (Stoddard et al. 2007),Campylobacter lari (Miller et al. 2005), andC. upsaliensis (Miller et al. 2005). The MLST schemes for these species have four genes in common:glnA,glyA,tkt, anduncA (otherwise known asatpA).

TheCampylobacter species we studied exhibit an interesting array of pathogenicities and host ranges. BesidesC. jejuni, which is responsible for 90% of human campylobacteriosis (Gillespie et al. 2002),C. coli,C. lari, andC. upsaliensis have been documented in sporadic cases or outbreaks of gastroenteritis in humans (Miller et al. 2005).Campylobacter coli has a host range largely overlapping with that ofC. jejuni albeit with a greater affinity for pigs (Dingle et al. 2005) and, like its sister species, tends to be carried asymptomatically in the gut of livestock, poultry, wild birds, and mammals.Campylobacter lari likewise has a wide host range but is characterized by its isolation from seagulls, mussels, and oysters (Miller et al. 2005).Campylobacter upsaliensis, together with the relatedC. helveticus, is associated with domestic cats and dogs (Miller et al. 2005). Two subspecies are known ofC. fetus.C. fetus subsp.fetus can induce abortion in sheep and less often in cattle and humans (van Bergen et al. 2005). The genetically uniformC. fetus subsp.venerealis is cattle restricted, in which it causes a venereal infection that can lead to infertility and abortion (van Bergen et al. 2005). A somewhat distinct, reptile-associated strain ofC. fetus has also been described (Tu et al. 2001), which has been documented in at least one case of human disease (Tu et al. 2004). The most recently described of these species,C. insulaenigrae, has been isolated from several marine mammals: common seals and a harbor porpoise in Scotland (Foster et al. 2004) and northern elephant seals in California (Stoddard et al. 2007). It too has been observed in a case of invasive human disease (Chua et al. 2007). VariousCampylobacter species are routinely isolated from sewage and environmental sources, including fresh and marine water (Jones 2001).

For each species, a typical isolate was chosen (see supplementary methods,Supplementary Material online). As no recombination was detected between these sequences (P = 0.47), we constructed a phylogeny using BEAST (Drummond et al. 2002), which we calibrated using our population genetic estimate of the synonymous mutation rate based on variation withinC. jejuni (table 3). The standard method of dating recent bacterial evolution (Achtman et al. 2004;Roumagnac et al. 2006) is to calibrate the rate of sequence divergence relative toE. coli andSalmonella typhimurium, whichOchman and Wilson (1987) estimated to have split 120–160 Ma. They calculated a molecular clock of 1% per 50 My in the 16S rRNA gene. That would date the split betweenC. jejuni andC. coli, which differ by 0.4% on average (Gorkiewicz et al. 2003), to 10 Ma.

By our method, we obtained a vastly different estimate of 6,580 years ago (95% CI 3,580–12,400). The phylogeny infigure 4 is labeled with the inferred split times of all seven species. Two sources of uncertainty exist in the date estimates. The first source, which is modest in this case, is uncertainty in the split times relative to one another, indicated by the error bars associated with each node in the phylogeny. The second source, which dominates here, is uncertainty in the calibration of the molecular clock, which we represent as uncertainty in the scale bar of the phylogeny. Relative and total uncertainty in split times is detailed intable 5. There was negligible uncertainty in the tree topology.

FIG. 4.—

FIG. 4.—

Phylogeny of the genusCampylobacter. Nodes are labeled with estimated divergence times using BEAST. Error bars associated with each node indicate relative uncertainty in node height. Uncertainty due to calibration of the molecular clock is represented by a 95% CI below the scale bar. Posterior uncertainty in the tree topology was negligible. The scale bar was calibrated from intraspecific variation inCampylobacter jejuni. For alternative scales, seetable 7.

Table 5.

Phylogenetic Split Times in the GenusCampylobacter

SplitPoint Estimate95% CI
Relative UncertaintyTotal Uncertainty
colijejuni6,5806,240–6,9303,580–12,400
helveticus upsaliensis4,4004,190–4,6302,400–8,290
insulaenigraelari6,5306,150–6,9403,560–12,500
helveticusjejuni16,00015,200–16,8008,800–30,300
larijejuni21,20020,300–22,10011,600–39,500
fetusjejuni33,80032,800–34,80019,000–62,600

Consistent with recent Neighbor-Joining (NJ) phylogenies based on the 16S rRNA,rpoB, andgroEL genes (Kärenlampi et al. 2004;Korczak et al. 2006), we found thatC. jejuni andC. coli are sister species, as are the pet-associatedC. helveticus andC. upsaliensis, and the recently discoveredC. insulaenigrae andC. lari. On the deeper structure, the NJ trees disagreed mutually, and with our Bayesian phylogeny, except on the observation thatC. fetus is the most divergent of the seven species. The phylogeny we inferred is consistent with the observation ofFouts et al. (2005) thatC.upsaliensis shares more protein identity (74.7%) withC.jejuni at the genomic level than doesC.lari (68.9%).

Our method of calibration suggests thatCampylobacter is speciating on the order of thousands, rather than millions of years. We dated the root of the tree to 33,800 years ago (95% CI 19,000–62,600). Using Yule's model (Yule 1924), a speciation rate ofλ = 0.0452 per lineage per 1,000 years was inferred (table 6), which equates to an expected waiting time of 22,000 years between speciation events.

Table 6.

Evolutionary Parameters in the GenusCampylobacter

ParameterPoint Estimate95% CI
μ2.93 × 10−21.60 × 10−2–4.99 × 10−2
κ1.811.51–2.15
ω0.01200.00961–0.0152
λ0.04520.0149–0.120

Calibration of the phylogeny is determined by the mutation rate, which we estimated atμ = 2.93 × 10−2 per kb per year, slightly lower than the mutation rate withinC. jejuni. The mutation rate is a function of the synonymous mutation rate (μS), which by our method was estimated from withinC. jejuni, and the transition–transversion ratio (κ) anddN/dS ratio (ω). The latter two quantities were coestimated with the phylogeny but informed with prior estimates from withinC. jejuni. As would be expected (Rocha et al. 2006), thedN/dS ratio was lower and in this case significantly lower (ω = 0.0120), betweenCampylobacter species than withinC. jejuni, which accounts for the lower total mutation rate. Curiously, we obtained a transition–transversion ratio 10-fold lower (κ = 1.81) betweenCampylobacter species than withinC. jejuni.

To compare the intraspecific, population genetics calibration with other methods of calibration, we offer alternative scales intable 7 for the scale bar infigure 4. The empirical method utilizes the neutral molecular clock estimate fromtable 4, which is based on in vitro doubling times and experimental estimates of the per-generation mutation rate. Using the empirical approach, the length of the scale bar is 42,200 years with a wide CI (2,690–661,000) that subsumes the CI for the intraspecific method. TheOchman and Wilson (1987) method has been used in several recent publications (e.g.,Achtman et al. 2004,Roumagnac et al. 2006) to calibrate the timescale of bacterial evolution and makes the length of the scale bar 7.6 My. The quantification of uncertainty in the original paper did not incorporate all sources of error, and so we do not put forward a CI. Finally, the coalescent approach usesθS/2 for the synonymous rate of molecular change, which yields date estimates in coalescent time units ofNeg years. TheTMRCA forC. jejuni occurred around 2Neg years ago, so this provides a relative means of dating the phylogeny. The length of the scale bar is 23.6Neg, with a CI of 14.1–39.6Neg. That makes theC. jejuniC. coli split around 17 times more ancient than the MRCA ofC. jejuni.

Table 7.

Alternative Scale Bars forCampylobacter Phylogeny

MethodScale (Years)95% CI
Intraspecifica5,0002,830–8,820
Empiricalb42,2002,690–661,000
Ochman–Wilsonc7,600,000Not quantified
Coalescentd23.6Neg14.1Neg–39.6Neg
a

Based onμS fromtable 3.

b

Based onμS calculated fromμ0 intable 4.

d

Based onθS fromtable 1.

Epidemiological Clustering Might Distort Calibration

In light of the 1,000-fold discrepancy between our intraspecific method of calibrating the rate ofCampylobacter evolution and conventional estimates, we performed further statistical tests of robustness. We defer biological considerations of the plausibility of our estimates to the discussion. The formal hypothesis test we performed earlier strongly supported a model in whichC. jejuni is measurably evolving, over one in which it is not. When above 1, the Bayes factor supports the measurably evolving model, and when below 1, it supports the simpler model in which all sequences were treated as if they were sampled simultaneously. We obtained a Bayes factor of 3.0 × 1020, which is much greater than 1. However, for completeness, we performed a permutation test in which the sampling times of the sequences were randomized 100 times, and the Bayes factor recomputed in each case in order to obtain a reference distribution. No Bayes factor even remotely as large as 3.0 × 1020 was observed during the permutations, confirming that a signal consistent with measurable evolution does indeed exist.

It is possible that epidemiological clustering over time could cause an artifactual signal of ongoing evolution. For example, successive epidemics of genetically relatedC. jejuni sweeping through the human population could cause a correlation between sampling time and genetic distance: Organisms sampled closer together in time would be genetically more similar, mimicking the pattern expected in a measurably evolving population. There was no obvious pattern of recurrent epidemics in the data—indeedC. jejuni generally occurs sporadically (Wilson et al. 2008)—but in any event it would be difficult to distinguish from ongoing evolution. In an attempt to eliminate subtle epidemiological clustering, we thinned the data set so that no two cases occurred fewer than seven days apart, leaving 116 sequences. Having removed 90% of the sequences, we obtained a much smaller Bayes factor in favor of the measurably evolving model of 33.1. We repeated the permutation test as before. This time, out of 100 permutations, a Bayes factor larger than 33.1 was obtained three times. This suggests the measurably evolving model is still preferred, but much less so than before. It remains unclear whether by thinning the data, a large reduction in the Bayes factor was obtained because epidemiological clustering was causing an artifactual signal of ongoing evolution or because removing 90% of the sequences weakens inferential power.

Discussion

A Neolithic Origin forC. jejuni

Starkly at variance with conventional estimates of the timescale of bacteria evolution, how plausible is the molecular clock we estimated from within-species variation? Our estimate of theC. jejuniC. coli split time coincides with the Neolithic domestication of a wide variety of animal and plant species, a time of broad cultural transition known as the Neolithic revolution, which began around 10,000 years ago. In particular, a date of 6,580 years ago coincides roughly with the spread of domestic pigs from the Near East and the first known domestication of wild boar of European descent in the Paris Basin, in the early fourth millennium BC (Larson et al. 2005,2007). The relevance of this is the particular association ofC. coli with domestic pigs (Dingle et al. 2005), which invites speculation as to the role of the advent of the domestic pig in driving the speciation ofC. jejuni orC. coli from their common ancestor.

Interestingly, evidence is accumulating (Mira et al. 2006) that the Neolithic revolution played an important role in creating new niches for a variety of pathogens of humans and their domesticated species, perhaps as a result of changes in agricultural practice or the advent of animal domestication. The genomes of pathogens such asBordetella pertussis (whooping cough in humans),Pseudomonas syringae pathovartomato (bacterial speck on tomatoes), andBurkholderia mallei (glanders in horses) exhibit a proliferation of insertion sequences connected with niche specialization and subsequent genome reduction/corruption. However,C. jejuni is unusual in harboring virtually no insertion sequences (Parkhill et al. 2000), and both it andC. coli retain wide host ranges.

No Resolution for the Bacterial Molecular Clock

Our synonymous rate estimate ofμS = 2.79 × 10−2 per kb per year is considerably at odds with traditional estimates of the bacterial molecular clock (Ochman and Wilson 1987), which would date theC. jejuni–C. coli split at 10 Ma. There are of course a number of reasons to exercise caution. Our Bayesian approach accounts for all sources of evolutionary uncertainty under the model, but uncertainty in the fundamental model assumptions is not so easily quantified. Extrapolation is the most dangerous form of statistical prediction, so the assumption that mutation rates are constant within and between species is a major one. It is widely appreciated that the long-term molecular clock is slower than short term (Rocha et al. 2006), but this is generally thought to be accounted for by the delayed effects of purifying selection. Although we attempted to abate the problem by assuming only that the synonymous rate is constant, we saw that not just thedN/dS ratio but also the transition–transversion ratio was significantly lower between species than withinC. jejuni. Secondly, we demonstrated that a signal exists indicating thatC. jejuni is measurably evolving on the timescale of our sample, but we cannot definitively rule out an artifactual association between sampling time and genetic similarity. For example, concurrent epidemic clusters sweeping through the population might generate such an association, although there was no evidence for it.

Even so, a rapid bacterial molecular clock cannot be dismissed out of hand. Indeed, two other attempts to calibrate the rate of bacterial evolution from intraspecific genetic variation have yielded similar estimates.Pérez-Losada et al. (2007) estimated a mutation rate of 4.58 × 10−2 kb−1 year−1 for housekeeping genes inNeisseria gonorrhoeae based on surveys of gonorrhoea patients in Baltimore, MD, between 1991 and 2005.Falush et al. (2001) used sequential biopsies from carriers ofHelicobacter pylori in New Orleans, LA, and Colombia to estimate that the rate of mutation in housekeeping genes may have been as high as 4.1 × 10−2 kb−1 year−1.

Traditional rate estimates are themselves at odds with empirical mutation rate estimates (Lenski et al. 2003;Ochman 2003), as we found here. Per-generation mutation rates estimated in the laboratory are several orders of magnitude higher than expected from Ochman and Wilson's molecular clock, even allowing for natural selection. Furthermore, certain commensal and pathogenic bacteria whose divergence can be dated via their host species exhibit 16S rRNA genetic diversity that is higher than expected from a rate of 1% per 50 My (Mira et al. 2006). Our own empirical estimates of the rate of molecular evolution were closer to our population genetic estimates than to the Ochman–Wilson estimates, differing by a factor of 10 in their point estimates and overlapping partially in their CIs. That discrepancy could be explained if Drake's method of mutation rate estimation (Drake 1991,Drake et al. 1998) gives a conservative estimate. Drake's method makes many assumptions and does not take account of lineage-specific knowledge. For example,C. jejuni lacks a number of genes responsible for DNA repair (Parkhill et al. 2000), which could underestimate the mutation rate sufficiently to explain the discrepancy between empirical and population genetics estimates. What we can refute is the suggestion (Tu et al. 2001) that mammal and reptile-associatedC. fetus genotypes diverged 200 Ma, prior to the origin of mammals. This hypothesis is not even supported by the Ochman–Wilson method of calibration, which puts theC. jejuni–C. fetus split, the root of our phylogeny, at a mere 51.4 Ma.

Conclusion

Patterns of genetic variation within and between species provide an, albeit corrupted, document of evolutionary history. The picture of evolutionary history painted by genetic diversity inC. jejuni is one of a dynamically evolving species shaped by frequent recombination and intense purifying selection.Campylobacter jejuni is highly adaptable by virtue of its large effective population size, which counters a mutation rate that is low in comparison with viral pathogens (Drake et al. 1998). Together with routine cross-species gene flow, our analysis reveals a pathogen with the potential to adapt rapidly to changes in selection pressure.

Comparison with gene sequences from related species shows thatC. jejuni may have evolved recently, within the past 12,000 years and possibly in response to changes in agricultural practice and the advent of animal domestication coinciding with the Neolithic revolution.Campylobacter is a dynamic genus in which species are no boundary to gene flow. Host ranges are wide and overlapping, and zoonosis between host species is common.

We believe that studies of ancient DNA offer the best prospect for accurately calibrating recent evolution in bacteria, although progress so far has been limited (Willerslev et al. 2004;Barnes and Thomas 2006). We have shown that large within-species samples can be analyzed with complex evolutionary models that incorporate recombination, without resorting to phylogenetic tree building. By integrating microevolutionary (population genetic) and macroevolutionary (phylogenetic) approaches in a Bayesian manner, we were able to quantify cumulative evolutionary uncertainty and perform detailed evolutionary inference. Certain gaps in our knowledge have been highlighted, including the generation length of pathogens in vivo and the per-generation rate of mutation in all but a few species. Finally, we have proposed thatCampylobacter may be evolving on a timescale of thousands, rather than millions of years. Our results contradict the traditional view of the rate of evolution in bacteria, but at the very least, they demand a re-evaluation of molecular clock estimates that are widely in use 20 years on.

Supplementary Material

Supplementary methods andsupplementary figure S1 are available atMolecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).

Supplementary Material

[Supplementary Data]
msn264_index.html (752B, html)

Acknowledgments

The authors would like to thank David Balding, Mark Beaumont, Bob Griffiths, Rosalind Harding, Martin Maiden, Noel McCarthy, Gil McVean, Julian Parkhill, Andrew Rambaut, Roisin Ure, and Ian Wilson for advice and useful discussion. This work is part of the Veterinary Training Research Initiative, jointly funded by the Higher Education Funding Council of England and the Department for Environment, Food, and Rural Affairs. Funding was also received from the Engineering and Physical Sciences Research Council.

References

  1. Achtman M, Morelli G, Zhu P, et al. (17 co-authors) Microevolution and history of the plague bacillus, Yersinia pestis. Proc Natl Acad Sci USA. 2004;101:17837–17842. doi: 10.1073/pnas.0408026101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Adak GK, Meakins SM, Yip H, Lopman BA, O'Brien SJ. Disease risks from foods, England and Wales, 1996–2000. Emerg Infect Dis. 2005;11:365–372. doi: 10.3201/eid1103.040191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barnes I, Thomas MG. Evaluating bacterial pathogen DNA preservation in museum osteological collections. Proc R Soc Lond B. 2006;273:645–653. doi: 10.1098/rspb.2005.3339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Beaumont MA, Zhang W, Balding DJ. Approximate Bayesian computation in population genetics. Genetics. 2002;162:2025–2035. doi: 10.1093/genetics/162.4.2025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Chua K, Gürtler V, Montgomery J, Fraenkel M, Mayall BC, Grayson ML. Campylobacter insulaenigrae causing septicaemia and enteritis. J Med Microbiol. 2007;56:1565–1567. doi: 10.1099/jmm.0.47366-0. [DOI] [PubMed] [Google Scholar]
  6. de Boer P, Wagenaar JA, Achterberg RP, van Putten JPM, Schouls LM, Duim B. Generation of Campylobacter jejuni genetic diversity in vivo. Mol Microbiol. 2002;44:351–359. doi: 10.1046/j.1365-2958.2002.02930.x. [DOI] [PubMed] [Google Scholar]
  7. Dingle KE, Colles FM, Falush D, Maiden MCJ. Sequence typing and comparison of population biology of Campylobacter coli and Campylobacter jejuni. J Clin Microbiol. 2005;43:340–347. doi: 10.1128/JCM.43.1.340-347.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Dingle KE, Colles FM, Wareing DRA, Maiden MCJ, Ure MCJ, Maiden R, Fox AJ, Bolton FE, Bootsma HJ, Willems RJ, Urwin R, Maiden MC. Multilocus sequence typing system for Campylobacter jejuni. J Clin Microbiol. 2001;39:14–23. doi: 10.1128/JCM.39.1.14-23.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Drake JW. A constant rate of spontaneous mutation in DNA-based microbes. Proc Natl Acad Sci USA. 1991;88:7160–7164. doi: 10.1073/pnas.88.16.7160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Drake JW, Charlesworth B, Charlesworth D, Crow JF. Rates of spontaneous mutation. Genetics. 1998;148:1667–1686. doi: 10.1093/genetics/148.4.1667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Drummond AJ, Nicholls GK, Rodrigo AG, Solomon W. Estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data. Genetics. 2002;161:1307–1320. doi: 10.1093/genetics/161.3.1307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Drummond AJ, Pybus OG, Rambaut A, Forsberg R, Rodrigo AG. Measurably evolving populations. Trends Ecol Evol. 2003;18:481–488. [Google Scholar]
  13. Falush D, Kraft C, Taylor NS, Correa P, Fox JG, Achtman M, Suerbaum S. Recombination and mutation during long-term gastric colonization by Helicobacter pylori: estimates of clock rates, recombination size, and minimal age. Proc Natl Acad Sci USA. 2001;98:15056–15061. doi: 10.1073/pnas.251396098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Falush D, Stephens M, Pritchard JK. Inference of population structure from multilocus genotype data: linked loci and correlated allele frequencies. Genetics. 2003;164:1567–1587. doi: 10.1093/genetics/164.4.1567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fearnhead P. Computational methods for complex stochastic systems: a review of some alternatives to MCMC. Stat Comput. 2008;18:151–171. [Google Scholar]
  16. Fearnhead P, Smith NGC, Barrigas M, Fox A, French N. Analysis of recombination in Campylobacter jejuni from MLST population data. J Mol Evol. 2005;61:333–340. doi: 10.1007/s00239-004-0316-0. [DOI] [PubMed] [Google Scholar]
  17. Foster G, Holmes B, Lawsono PA, Thorne P, Byrer DE, Ross HM, Xerry J, Thompson PM, Collins MD. Campylobacter insulaenigrae sp. nov., isolated from marine mammals. Int J Syst Evol Microbiol. 2004;54:2369–2373. doi: 10.1099/ijs.0.63147-0. [DOI] [PubMed] [Google Scholar]
  18. Fouts DE, Mongodin EF, Mandrell RE, et al. (21 co-authors) Major structural differences and novel potential virulence mechanisms from the genomes of multiple Campylobacter species. PLoS Biol. 2005;3:e15. doi: 10.1371/journal.pbio.0030015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gelman A, Carlin JB, Stern HS, Rubin DB. Bayesian data analysis. 2nd ed. Boca Raton (FL): Chapman Hall; 2003. [Google Scholar]
  20. Gillespie IA, O'Brien SJ, Frost JA, Adak GK, Horby P, Swan AV, Painter MJ, Neal KR. A case–case comparison of Campylobacter coli and Campylobacter jejuni infection: a tool for generating hypotheses. Emerg Infect Dis. 2002;8:937–942. doi: 10.3201/eid0809.10.3201/eid0809.010187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gorkiewicz G, Feierl G, Schober C, Dieber F, Köfer J, Zechner R, Zechner EL. Species-specific identification of Campylobacters by partial 16S rRNA gene sequencing. J Clin Microbiol. 2003;41:2537–2546. doi: 10.1128/JCM.41.6.2537-2546.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Humphrey T, O'Brien SJ, Madsen M. Campylobacters as zoonotic pathogens: a food production perspective. Int J Food Microbiol. 2007;117:237–257. doi: 10.1016/j.ijfoodmicro.2007.01.006. [DOI] [PubMed] [Google Scholar]
  23. Jackson RJ, Elvers KT, Lee LJ, Gidley MD, Wainwright LM, Lightfoot J, Park SF, Poole RK. Oxygen reactivity of both respiratory oxidases in Campylobacter jejuni: the cydAB genes encode a cyanide-resistant, low-affinity oxidase that is not of the cytochrome bd type. J Bacteriol. 2007;189:1604–1615. doi: 10.1128/JB.00897-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Jones K. Campylobacters in water, sewage and the environment. J Appl Microbiol. 2001;90:68S–79S. doi: 10.1046/j.1365-2672.2001.01355.x. [DOI] [PubMed] [Google Scholar]
  25. Kärenlampi RI, Tolvanen TP, Hänninen M-L. Phylogenetic analysis and PCR-restriction fragment length polymorphism identification of Campylobacter species based on partial groEL gene sequences. J Clin Microbiol. 2004;42:5731–5738. doi: 10.1128/JCM.42.12.5731-5738.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Khanna MR, Bhavsar SP, Kapadnis BP. Effect of temperature on growth and chemotactic behaviour of Campylobacter jejuni. Lett Appl Microbiol. 2006;43:84–90. doi: 10.1111/j.1472-765X.2006.01904.x. [DOI] [PubMed] [Google Scholar]
  27. Kimura M. Stochastic processes and distributions of gene frequencies under natural selection. Cold Spring Harbor Symp. 1955;20:33–53. doi: 10.1101/sqb.1955.020.01.006. [DOI] [PubMed] [Google Scholar]
  28. Kimura M, Crow JF. The number of alleles that can be maintained in a finite population. Genetics. 1964;49:725–738. doi: 10.1093/genetics/49.4.725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kingman JFC. On the genealogy of large populations. J Appl Prob. 1982;19A:27–43. [Google Scholar]
  30. Konkel ME, Christensen JE, Dhillon AS, Lane AB, Hare-Sanford R, Schaberg DM, Larson CL. Campylobacter jejuni strains compete for colonization in broiler chicks. Appl Environ Microbiol. 2007;73:2297–2305. doi: 10.1128/AEM.02193-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Korczak BM, Stieber R, Emler S, et al. Genetic relatedness within the genus Campylobacter inferred from rpoB sequences. Int J Syst Evol Microbiol. 2006;56:937–945. doi: 10.1099/ijs.0.64109-0. [DOI] [PubMed] [Google Scholar]
  32. Larson G, Albarella U, Dobney K, et al. (19 co-authors) Ancient DNA, pig domestication, and the spread of the Neolithic into Europe. Proc Natl Acad Sci USA. 2007;104:15276–15281. doi: 10.1073/pnas.0703411104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Larson G, Dobney K, Albarella U, et al. (13 co-authors) Worldwide phylogeography of wild boar reveals multiple centers of pig domestication. Science. 2005;307:1618–1621. doi: 10.1126/science.1106927. [DOI] [PubMed] [Google Scholar]
  34. Lenski RE, Winkworth CL, Riley MA. Rates of DNA sequence evolution in experimental populations of Escherichia coli during 20,000 generations. J Mol Evol. 2003;56:498–508. doi: 10.1007/s00239-002-2423-0. [DOI] [PubMed] [Google Scholar]
  35. McCarthy ND, Colles FM, Dingle KE, Bagnall MC, Manning G, Maiden MC, Falush D. Host-associated genetic import in Campylobacter jejuni. Emerg Infect Dis. 2007;13:267–272. doi: 10.3201/eid1302.060620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. McVean GAT, Awadalla P, Fearnhead P. A coalescent method for detecting recombination from gene sequences. Genetics. 2002;160:1231–1241. doi: 10.1093/genetics/160.3.1231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Miller WG, On SLW, Wang G, Fontanoz S, Lastovica AJ, Mandrell RE. Extended multilocus sequence typing system for Campylobacter coli, C. lari, C. upsaliensis, and C. helveticus. J Clin Microbiol. 2005;43:2315–2329. doi: 10.1128/JCM.43.5.2315-2329.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Mira A, Pushker R, Rodríguez-Valera F. The Neolithic revolution of bacterial genomes. Trends Microbiol. 2006;14:200–206. doi: 10.1016/j.tim.2006.03.001. [DOI] [PubMed] [Google Scholar]
  39. Moore JE, Barton MD, Blair IS, et al. (18 co-authors) The epidemiology of antibiotic resistance in Campylobacter. Microbe Infec. 2006;8:1955–1966. doi: 10.1016/j.micinf.2005.12.030. [DOI] [PubMed] [Google Scholar]
  40. Moran NA, Munson MA, Baumann P, Ishikawa H. A molecular clock in endosymbiotic bacteria is calibrated using insect hosts. Proc R Soc Lond B. 1993;253:167–171. [Google Scholar]
  41. Nielsen R, Yang Z. Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics. 1998;148:929–936. doi: 10.1093/genetics/148.3.929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Ochman H. Neutral mutations and neutral substitutions in bacterial genomes. Mol Biol Evol. 2003;20:2091–2096. doi: 10.1093/molbev/msg229. [DOI] [PubMed] [Google Scholar]
  43. Ochman H, Wilson AC. Evolution in bacteria: evidence for a universal substitution rate in cellular genomes. J Mol Evol. 1987;26:74–86. doi: 10.1007/BF02111283. [DOI] [PubMed] [Google Scholar]
  44. Oyarzabal OA, Rad R, Backert S. Conjugative transfer of chromosomally encoded antibiotic resistance from Helicobacter pylori to Campylobacter jejuni. J Clin Microbiol. 2007;45:402–408. doi: 10.1128/JCM.01456-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Parkhill J, Wren BW, Mungall K, et al. (21 co-authors) The genome sequence of the food-borne pathogen Campylobacter jejuni reveals hypervariable sequences. Nature. 2000;403:665–668. doi: 10.1038/35001088. [DOI] [PubMed] [Google Scholar]
  46. Pérez-Losada M, Browne EB, Madsen A, Wirth T, Viscidi RP, Crandall KA. Population genetics of microbial pathogens estimated from multilocus sequence typing (MLST) data. Infect Genet Evol. 2006;6:97–112. doi: 10.1016/j.meegid.2005.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Pérez-Losada M, Crandall KA, Zenilman J, Viscidi RP. Temporal trends in gonococcal population genetics in a high prevalence urban community. Infect Genet Evol. 2007;7:271–278. doi: 10.1016/j.meegid.2006.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Rocha EPC, Maynard Smith J, Hurst LD, Holden MTG, Cooper JE, Smith NH, Feil EJ. Comparisons of dN/dS are time dependent for closely related bacterial genomes. J Theor Biol. 2006;239:226–235. doi: 10.1016/j.jtbi.2005.08.037. [DOI] [PubMed] [Google Scholar]
  49. Rodrigo A, Felsenstein J. Coalescent approaches to HIV-1 population genetics. In: Crandall K, editor. Molecular evolution of HIV. Baltimore (MD): John Hopkins University Press; 1999. pp. 233–272. [Google Scholar]
  50. Roumagnac P, Weill F-X, Dolecek C, et al. (11 co-authors) Evolutionary history of Salmonella typhi. Science. 2006;314:1301–1304. doi: 10.1126/science.1134933. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Sheppard SK, McCarthy ND, Falush D, Maiden MCJ. Convergence of Campylobacter species: implications for bacterial evolution. Science. 2008;320:237–239. doi: 10.1126/science.1155532. [DOI] [PubMed] [Google Scholar]
  52. Stoddard RA, Miller WG, Foley JE, Lawrence J, Gulland FMD, Conrad PA, Byrne BA. Campylobacter insulaenigrae isolates from northern elephant seals (Mirounga angustirostris) in California. Appl Environ Microbiol. 2007;73:1729–1735. doi: 10.1128/AEM.01816-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Tu Z-C, Dewhirst FE, Blaser MJ. Evidence that the Campylobacter fetus sap locus is an ancient genomic constituent with origins before mammals and reptiles diverged. Infect Immun. 2001;69:2237–2244. doi: 10.1128/IAI.69.4.2237-2244.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Tu Z-C, Zeitlin G, Gagner J-P, Keo T, Hanna BA, Blaser MJ. Campylobacter fetus of reptile origin as a human pathogen. J Clin Microbiol. 2004;42:4405–4407. doi: 10.1128/JCM.42.9.4405-4407.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. van Bergen MAP, Dingle KE, Maiden MCJ, Newell DG, van der Graaf-Van Bloois L, van Putten JP, Wagenaar JA. Clonal nature of Campylobacter fetus as defined by multilocus sequence typing. J Clin Microbiol. 2005;43:5888–5898. doi: 10.1128/JCM.43.12.5888-5898.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Velayudhan J, Kelly DJ. Analysis of gluconeogenic and anaplerotic enzymes in Campylobacter jejuni: an essential role for phosphoenolpyruvate carboxykinase. Microbiology. 2002;148:685–694. doi: 10.1099/00221287-148-3-685. [DOI] [PubMed] [Google Scholar]
  57. Willerslev E, Hansen A, Rønn R, Brand TB, Barnes I, Wiuf C, Gilichinsky D, Mitchell D, Cooper A. Long-term persistence of bacterial DNA. Curr Biol. 2004;14:R9–R10. doi: 10.1016/j.cub.2003.12.012. [DOI] [PubMed] [Google Scholar]
  58. Wilson DJ, Gabriel E, Leatherbarrow AJH, Cheesbrough J, Gee S, Bolton E, Fox A, Hart CA, Diggle PJ, Fearnhead F. PLoS Genet. 2008;4:e1000203. doi: 10.1371/journal.pgen.1000203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Wiuf C, Hein J. The coalescent with gene conversion. Genetics. 2000;155:451–462. doi: 10.1093/genetics/155.1.451. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Yule GU. A mathematical theory of evolution, based on the conclusions of Dr. J.C. Willis. Philos Trans Roy Soc Lond Ser B. 1924;213:21–87. [Google Scholar]
  61. Zia S, Wareing D, Sutton C, et al. Health problems following Campylobacter jejuni enteritis in a Lancashire population. Rheumatology. 2003;42:1083–1088. doi: 10.1093/rheumatology/keg303. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

[Supplementary Data]
msn264_index.html (752B, html)
msn264_1.pdf (472.3KB, pdf)

Articles from Molecular Biology and Evolution are provided here courtesy ofOxford University Press

ACTIONS

RESOURCES


[8]ページ先頭

©2009-2025 Movatter.jp