Phylogenetic Diversity Measures and Their Decomposition: A Framework Based on Hill Numbers

  • Chapter
  • Open Access


Part of the book series: Topics in Biodiversity and Conservation (TOBC, volume 14)

Abstract

Conservation biologists need robust, intuitive mathematical tools to quantify and assess patterns and changes in biodiversity. Here we review some commonly used abundance-based species diversity measures and their phylogenetic generalizations. Most of the previous abundance-sensitive measures and their phylogenetic generalizations lack an essential property, the replication principle or doubling property. This often leads to inconsistent or counter-intuitive interpretations, especially in conservation applications. Hill numbers or the “effective number of species” obey the replication principle and thus resolve many of the interpretational problems. Hill numbers were recently extended to incorporate phylogeny; the resulting measures take into account phylogenetic differences between species while still satisfying the replication principle. We review the framework of phylogenetic diversity measures based on Hill numbers and their decomposition into independent alpha and beta components. Both additive and multiplicative decompositions lead to the same classes of normalized phylogenetic similarity or differentiation measures. These classes include multiple-assemblage phylogenetic generalizations of the Jaccard, Sørensen, Horn and Morisita-Horn measures. For two assemblages, these classes also include the commonly used UniFrac and PhyloSør indices as special cases. Our approach provides a mathematically rigorous, self-consistent, ecologically meaningful set of tools for conservationists who must assess the phylogenetic diversity and complementarity of potential protected areas. Our framework is applied to a real dataset to illustrate (i) how to use phylogenetic diversity profiles to completely convey species abundances and phylogenetic information among species in an assemblage; and (ii) how to use phylogenetic similarity (or differentiation) profiles to assess phylogenetic resemblance or difference among multiple assemblages.


Introduction

Many of the most pressing and fundamental questions in biodiversity conservation require robust and sensible measures for quantifying and assessing changes in biodiversity. Many environmental and monitoring projects also require objective and meaningful similarity (or differentiation) measures to compare the diversities of multiple assemblages and their degree of complementarity in order to best conserve genetic, species, and ecosystem diversity. An enormous number of diversity measures and related similarity (or differentiation) indices have been proposed, not only in ecology but also in genetics, economics, information science, linguistics, physics, and social sciences, among others. See Magurran (2004) and Magurran and McGill (2011) for overviews.

In traditional species diversity measures, all species are considered to be equally different from each other; only species richness and abundances are involved. There are two general approaches: parametric and non-parametric (Magurran 2004). Parametric approaches assume a particular species abundance distribution (such as the lognormal or gamma) or a species rank abundance distribution (such as the negative binomial or log-series), and then use the parameters (e.g., Fisher’s alpha) of the distribution to quantify diversity. However, these methods often do not perform well and the results are uninterpretable unless the “true” species abundance distribution is known (Colwell and Coddington 1994; Chao 2005). The parametric approach also does not permit meaningful comparison of assemblages with different abundance distributions. For example, an assemblage whose abundances follow a log-normal distribution cannot be compared with an assemblage whose abundances follow a gamma distribution. Non-parametric methods make no assumptions about the distributional form of the underlying species abundance distribution. The most widely used abundance-sensitive non-parametric measures have been the Shannon entropy and the Gini-Simpson index. These two measures, along with species richness, were integrated into a class of measures called generalized entropies (Havrda and Charvat 1967; Daróczy 1970; Patil and Taillie 1979; Tsallis 1988; Keylock 2005), which will be briefly reviewed in this chapter.

How to quantify abundance-based species diversity in an assemblage has been one of the most controversial issues in community ecology (e.g. Hurlbert 1971; Routledge 1979; Patil and Taillie 1982; Purvis and Hector 2000; Jost 2006, 2007; Jost et al. 2010). There have also been intense debates on the choice of diversity partitioning schemes; see Ellison (2010) and the Forum that follows it. Surprisingly, all authors in that forum achieved a consensus on the use of Hill numbers, also called the “effective number of species”, as the best choice to quantify abundance-based species diversity. Hill numbers are a mathematically unified family of diversity indices (differing among themselves only by a parameter q) that incorporate species richness and species relative abundances. They were first used in ecology by MacArthur (1965, 1972), developed by Hill (1973), and recently reintroduced to ecologists by Jost (2006, 2007).

Hill numbers obey the replication principle or doubling property, an essential mathematical property that captures biologists’ notion of diversity (MacArthur 1965; Hill 1973). This property requires that if we have N equally diverse, equally large assemblages with no species in common, the diversity of the pooled assemblage must be N times the diversity of a single assemblage. In other words, Hill numbers are linear with respect to the addition of equally common species. We will review different versions of this property later. Classical diversity measures, such as Shannon entropy and the Gini-Simpson index, do not obey this principle and can lead to inconsistent or counter-intuitive interpretations, especially in conservation applications (Jost 2006, 2007). Hill numbers resolve many of the interpretational problems caused by classical diversity indices. Diversity measures that obey the replication principle yield self-consistent assessments in conservation applications, have intuitively interpretable magnitudes, and can be meaningfully decomposed. In this chapter, Hill numbers are adopted as a general framework for quantifying and partitioning diversities.

Pielou (1975, p. 17) was the first to notice that traditional abundance-based species diversity measures could be broadened to include phylogenetic, functional, or other differences between species. We here concentrate on phylogenetic differences, though our framework can also be extended to functional traits (Tilman 2001; Petchey and Gaston 2002; Weiher 2011). For conservation purposes, an assemblage of phylogenetically divergent species is more diverse than an assemblage consisting of closely related species, all else being equal. Phylogenetic differences among species can be based directly on their evolutionary histories, either in the form of taxonomic classification or well-supported phylogenetic trees (Faith 1992; Warwick and Clarke 1995; McPeek and Miller 1996; Crozier 1997; Helmus et al. 2007; Webb 2000; Webb et al. 2002; Pavoine et al. 2010; Ives and Helmus 2010, 2011; Vellend et al. 2011; Cavender-Bares et al. 2009, 2012 among others). Three special issues in Ecology were devoted to integrating ecology and phylogenetics; see McPeek and Miller (1996), Webb et al. (2006), and Cavender-Bares et al. (2012) and papers in each issue. Phylogenetic diversity measures are especially relevant for conservation applications, since they quantify the amount of evolutionary history preserved by the assemblage; see Lean and MacLaurin (chapter “The Value of Phylogenetic Diversity”).

The most widely used phylogenetic metric is Faith’s phylogenetic diversity (PD) (Faith 1992), which is defined as the sum of the branch lengths of a phylogenetic tree connecting all species in the target assemblage. As shown in Chao et al. (2010), Faith’s PD can be regarded as a phylogenetic generalization of species richness. The rarefaction formula for Faith’s PD was developed by Nipperess and Matsen (2013) and Nipperess (chapter “The Rarefaction of Phylogenetic Diversity: Formulation, Extension and Application”). Recently, Chao et al. (2015) derived an integrated sampling, rarefaction, and extrapolation methodology to compare Faith’s PD of a set of assemblages. Like species richness, Faith’s PD does not consider species abundances. For some conservation applications, the mere presence or absence of a species is all that matters, or all that can be determined from the available data. In those cases, Faith’s PD is a good measure of phylogenetic diversity. However, there are important advantages to incorporating abundance information into phylogenetic diversity measures for conservation. For example, some human impacts can result in the phylogenetic simplification of an ecosystem, reducing the population shares of phylogenetically distinct species relative to typical species. An abundance-based measure can catch this effect before it leads to actual extinctions.

Ecosystem simplification may be worthy of conservation concern even if it does not lead to extinctions of focal organisms. Often, the focal organisms for conservation represent a tiny fraction of the ecosystem’s biomass or richness. Each focal species will be tied to a web of non-focal species whose abundances are not usually monitored (e.g., insects). All else being equal, a more equitable distribution of the abundances of focal organisms will be able to support a more diverse, robust and stable set of non-focal species. Faith (chapter “Using Phylogenetic Dissimilarities Among Sites for Biodiversity Assessments and Conservation”) rightly argues that phylogenetic diversity is a good proxy for functional diversity. Therefore an ecosystem with a more equitable distribution of abundance across phylogenetic lineages should also exhibit greater functional complexity (per interaction between individuals) than an ecosystem whose phylogenetically unusual elements are rare. If we have to prioritize such ecosystems, the more phylogenetically equitable one, which thoroughly integrates diverse lineages, should be preferred. In addition to being more resistant to lineage extinctions, a complex, well-integrated ecosystem may be worth preserving in and of itself, above and beyond its component species; conservation is not just about species. Evolution may take a different course in ecosystems whose members are constantly surprised by their interactions compared with an ecosystem whose interactors are highly predictable. These conservation goals – robustness against extinction of distinctive lineages, and preservation of well-integrated ecosystems with unique future option values – require phylogenetic diversity measures that incorporate species importance values.

Rao’s quadratic entropy Q (Rao 1982), a generalization of the Gini-Simpson index, was the first diversity measure that accounts for both phylogeny and species abundances. The phylogenetic entropy \( H_P \) (Allen et al. 2009) extends Shannon entropy to incorporate phylogenetic distances among species. Since Shannon entropy and the Gini-Simpson index do not obey the replication principle, neither do their phylogenetic generalizations. These generalizations will therefore have the same interpretational problems as their parent measures; see Chao et al. (2010, their Supplementary Material) for examples.

Chao et al. (2010) extended Hill numbers and related similarity measures to incorporate phylogeny. The new phylogenetic Hill numbers obey a generalized replication principle. Their measures were subsequently extended by Faith and Richards (2012) and Faith (2013). Both the original Hill numbers and their phylogenetic generalizations facilitate diversity decomposition (Jost 2007; Chiu et al. 2014). As with the original Hill numbers, both additive and multiplicative decompositions of phylogenetic Hill numbers lead to the same classes of similarity (or differentiation) measures. Hill numbers therefore provide a unified framework to quantify both abundance-based and phylogenetic diversity.

In this chapter, we first briefly review the classic abundance-based species diversity measures (section “Generalized Entropies”) and their phylogenetic generalizations (section “Phylogenetic generalized entropies”) for an assemblage. Then we focus on the framework of Hill numbers (section “Hill numbers and the replication principle”), phylogenetic Hill numbers (section “Phylogenetic Hill numbers and related measures”) and related phylogenetic diversity measures. We also discuss the replication principle and its phylogenetic generalization (section “Replication principle for phylogenetic diversity measures”). For multiple assemblages, we review the diversity decomposition based on phylogenetic diversity measures (section “Decomposition of phylogenetic diversity measures”). The associated phylogenetic similarity and differentiation measures are then presented (section “Normalized phylogenetic similarity measures”). We use a real example for illustration (section “An example”). Our practical recommendations are provided in section “Conclusion”.

Classic Measures and Their Phylogenetic Generalizations

Generalized Entropies

The species richness of an assemblage is a simple count of the number of species present. It is the most intuitive and frequently used measure of biodiversity, and is a key metric in conservation biology (MacArthur and Wilson 1967; Hubbell 2001; Magurran 2004). However, it does not incorporate any information about the abundances of species, and it is a very hard number to estimate accurately from small samples (Colwell and Coddington 1994; Chao 2005; Gotelli and Colwell 2011).

Shannon entropy is a popular classical abundance-based diversity index and has been used in many disciplines. Shannon entropy is

$$ {H}_{Sh}=-{\displaystyle \sum}_{i=1}^S{p}_i \log {p}_i, $$
(1a)

where S is the number of species in the assemblage, and the ith species has relative abundance \( p_i \). Shannon entropy gives the uncertainty in the species identity of a randomly chosen individual in the assemblage. Another popular measure is the Gini-Simpson index,

$$ {H}_{GS}=1-{\displaystyle \sum}_{i=1}^S{p}_i^2, $$
(1b)

which gives the probability that two randomly chosen individuals belong to different species. These two abundance-sensitive measures, along with species richness, can be united into a single family of generalized entropies:

$$ {}{}^qH=\left(1-{\displaystyle \sum}_{i=1}^S{p}_i^q\right)/\left(q-1\right). $$
(1c)

The parameter q determines the sensitivity of the measure to the relative frequencies of the species. When q = 0, \( {}^qH \) becomes S − 1; when q tends to 1, \( {}^qH \) tends to Shannon entropy; and when q = 2, \( {}^qH \) reduces to the Gini-Simpson index. This family was discovered independently many times in different disciplines (Havrda and Charvat 1967; Daróczy 1970; Patil and Taillie 1979; Tsallis 1988; Keylock 2005). There are many other families of generalized entropies, notably the Rényi entropies (Rényi 1961).
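The special cases of Eq. (1c) can be checked numerically. The following is a minimal sketch, not code from the chapter: the function name `generalized_entropy` and the three-species abundance vector are illustrative assumptions, and the q = 1 case is handled by substituting the Shannon limit.

```python
import math

def generalized_entropy(p, q):
    """Generalized entropy of order q, Eq. (1c), for relative abundances p."""
    p = [x for x in p if x > 0]      # absent species are dropped
    if abs(q - 1.0) < 1e-12:
        # Eq. (1c) is 0/0 at q = 1; its limit is Shannon entropy, Eq. (1a)
        return -sum(x * math.log(x) for x in p)
    return (1.0 - sum(x ** q for x in p)) / (q - 1.0)

p = [0.5, 0.3, 0.2]                  # hypothetical three-species assemblage
richness_minus_one = generalized_entropy(p, 0)   # S - 1
shannon = generalized_entropy(p, 1)              # Eq. (1a)
gini_simpson = generalized_entropy(p, 2)         # Eq. (1b)
print(richness_minus_one, shannon, gini_simpson)
```

Evaluating the function at an order very close to 1 gives a value very close to `shannon`, which is a quick way to see the limit at work.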

Although the traditional abundance-sensitive generalized entropies and their special cases have been useful in many disciplines (e.g., see Magurran 2004), they do not behave in the same intuitive linear way as species richness. In ecosystems with high diversity, mass extinctions hardly affect their values (Jost 2010). They also lead to logical contradictions in conservation biology, because they do not measure a conserved quantity (e.g., under a given conservation plan, the proportion of “diversity” lost and the proportion preserved can both be 90 % or more); see Jost (2006, 2007) and Jost et al. (2010). Thus, changes in their magnitude cannot be properly compared or interpreted. Also, the main measure of similarity in the additive approach for traditional measures, the within-group or “alpha” diversity divided by the total or “gamma” diversity, does not actually quantify the compositional similarity of the assemblages under study. This ratio can be arbitrarily close to unity (supposedly indicating high similarity) even when the assemblages being compared have no species in common. Finally, these measures each use different units (e.g., the Gini-Simpson index is a probability whereas Shannon entropy is in units of information), so they cannot be compared with each other. All these problems are consequences of their failure to satisfy the replication principle. Hill numbers obey the replication principle and resolve all these problems; see section “Hill numbers and the replication principle”.

Phylogenetic Generalized Entropies

The classic measures reviewed in section “Generalized Entropies” were extended to incorporate phylogenetic distances between species. As mentioned in the Introduction, and as will be shown in section “Phylogenetic Hill numbers and related measures”, Faith’s PD can be regarded as a phylogenetic generalization of species richness.

Rao’s quadratic entropy takes account of both phylogeny and species abundances (Rao 1982):

$$ Q={\displaystyle \sum}_{i,j}{d}_{ij}{p}_i{p}_j, $$
(2a)

where \( d_{ij} \) denotes the phylogenetic distance (in years since divergence, number of DNA base changes, or other metric) between species i and j, and \( p_i \) and \( p_j \) denote the relative abundances of species i and j. This index measures the average phylogenetic distance between any two individuals randomly selected from the assemblage. Rao’s Q represents a phylogenetic generalization of the Gini-Simpson index because in the special case of no phylogenetic structure (all species are equally related to one another), with \( d_{ii} = 0 \) and \( d_{ij} = 1 \) (i ≠ j), it reduces to the Gini-Simpson index.
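The reduction of Rao’s Q to the Gini-Simpson index can be illustrated with a short sketch. The function name `rao_q`, the abundances, and the small three-species tree are hypothetical choices made for illustration; distances are taken to be times since divergence.

```python
def rao_q(d, p):
    """Rao's quadratic entropy, Eq. (2a): sum over all ordered pairs (i, j)."""
    n = len(p)
    return sum(d[i][j] * p[i] * p[j] for i in range(n) for j in range(n))

p = [0.5, 0.3, 0.2]   # hypothetical relative abundances of species A, B, C

# No phylogenetic structure: d_ii = 0 and d_ij = 1 for i != j
d_flat = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
gini_simpson = 1 - sum(x ** 2 for x in p)

# Hypothetical ultrametric tree: A and B diverged 1 time unit ago,
# and both diverged from C 2 time units ago (d_ij = time since divergence)
d_tree = [[0, 1, 2], [1, 0, 2], [2, 2, 0]]

q_flat = rao_q(d_flat, p)
q_tree = rao_q(d_tree, p)
print(q_flat, gini_simpson, q_tree)
```

With the flat distance matrix the two printed values agree; with the tree, pairs that diverged longer ago contribute more to Q.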

The phylogenetic entropy \( H_P \) is a generalization of Shannon’s entropy to incorporate phylogenetic distances among species (Allen et al. 2009):

$$ {H}_P=-{\displaystyle \sum}_i{L}_i{a}_i \log {a}_i $$
(2b)

where the summation is over all branches of a rooted phylogenetic tree, \( L_i \) is the length of branch i, and \( a_i \) denotes the summed relative abundance of all species descended from branch i.

For ultrametric trees, Faith’s PD, Allen et al.’s \( H_P \), and Rao’s Q can be united into a single parametric family of phylogenetic generalized entropies (Pavoine et al. 2009):

$$ {}^qI=\left(T-{\displaystyle \sum}_{i\kern0.22em }{L}_i{a}_i^q\right)/\left(q-1\right). $$
(2c)

Here, \( L_i \) and \( a_i \) are defined as in Eq. (2b) and T is the age of the root node of the tree. Then \( {}^0I \) equals Faith’s PD minus T; \( {}^1I \) is identical to Allen et al.’s entropy \( H_P \) given in Eq. (2b); and \( {}^2I \) is identical to Rao’s quadratic entropy Q given in Eq. (2a). In the special case that T = 1 (the tree height is normalized to unit length) and all branches have unit length, the phylogenetic generalized entropy reduces to the classical generalized entropy defined in Eq. (1c), with species relative abundances \( \{p_1, p_2, \dots, p_S\} \) as the tip-node abundances.
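The three special cases of Eq. (2c) can be verified on a small example. This is a sketch under stated assumptions: the tree ((A:1, B:1):1, C:2), its abundances, and the function name `phylo_generalized_entropy` are all hypothetical, and each branch is stored as a pair (length, summed abundance of its descendants).

```python
import math

def phylo_generalized_entropy(branches, T, q):
    """Eq. (2c) for an ultrametric tree; branches is a list of (L_i, a_i)."""
    if abs(q - 1.0) < 1e-12:
        # limit q -> 1 is the phylogenetic entropy H_P of Eq. (2b)
        return -sum(L * a * math.log(a) for L, a in branches)
    return (T - sum(L * a ** q for L, a in branches)) / (q - 1.0)

# Hypothetical tree ((A:1, B:1):1, C:2) with abundances A=0.5, B=0.3, C=0.2.
# Entries: tip A, tip B, the internal branch above (A, B), tip C.
branches = [(1, 0.5), (1, 0.3), (1, 0.8), (2, 0.2)]
T = 2                                   # age of the root node

faith_pd = sum(L for L, _ in branches)  # total branch length
I0 = phylo_generalized_entropy(branches, T, 0)
I1 = phylo_generalized_entropy(branches, T, 1)
I2 = phylo_generalized_entropy(branches, T, 2)
print(I0, faith_pd - T)   # both are Faith's PD minus T
print(I1)                 # phylogenetic entropy H_P
print(I2)                 # Rao's Q when d_ij is the time since divergence
```

For this tree, `I2` matches the value of Eq. (2a) computed from pairwise divergence times of 1 (A versus B) and 2 (either versus C).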

The abundance-sensitive (q > 0) phylogenetic generalized entropies provide useful information, but they do not obey the replication principle and thus have the same interpretational problems as their parent measures. This motivated Chao et al. (2010) to extend Hill numbers to phylogenetic Hill numbers, which obey the replication principle; see section “Phylogenetic Hill numbers and related measures”.

Hill Numbers and Their Phylogenetic Generalizations

Hill Numbers and the Replication Principle

Pioneering work by Kimura and Crow (1964) in genetics and MacArthur (1965) in ecology showed that the Shannon and Gini-Simpson measures can be easily converted to an “effective number of species” (i.e., the number of equally abundant species that are needed to give the same value of the diversity measure), which uses the same units as species richness. Shannon entropy can be converted by taking its exponential, and the Gini-Simpson index can be converted by the formula \( 1/(1-H_{GS}) \). Hill (1973) integrated species richness and the converted Shannon and Gini-Simpson measures into a class of diversity measures called “Hill numbers” of order q, or the “effective number of species”, defined as

$$ {}{}^qD={\left({\displaystyle \sum}_{i=1}^S{p}_i^q\right)}^{1/\left(1-q\right)},q\ge 0,q\ne 1. $$
(3a)

This measure is undefined for q = 1, but its limit as q tends to 1 exists and gives

$$ {}{}^1D=\underset{q\to 1}{ \lim }{}{}^qD= \exp \left(-{\displaystyle \sum}_{i=1}^S{p}_i \log {p}_i\right)= \exp \left({H}_{Sh}\right). $$
(3b)
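The limit in Eq. (3b) follows from a standard L'Hôpital argument, sketched here as a supporting step rather than quoted from the chapter. Taking logarithms of Eq. (3a) gives a 0/0 form at q = 1, because the relative abundances sum to one; differentiating numerator and denominator with respect to q yields

$$ \log {}^qD=\frac{ \log {\sum}_{i=1}^S{p}_i^q}{1-q},\qquad \underset{q\to 1}{ \lim } \log {}^qD=\underset{q\to 1}{ \lim }\frac{{\sum}_{i=1}^S{p}_i^q \log {p}_i\Big/{\sum}_{i=1}^S{p}_i^q}{-1}=-{\sum}_{i=1}^S{p}_i \log {p}_i={H}_{Sh}. $$

Exponentiating both sides gives the exponential of Shannon entropy.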

The relationship between the Hill number of order q (q ≠ 1) and the generalized entropy can be expressed as

$$ {}{}^qD={\left[1-\left(q-1\right)\left({}{}^qH\right)\right]}^{1/\left(1-q\right)}. $$
(3c)

When q = 0, the species abundances do not count at all and \( {}^0D = S \) is obtained. When q = 1, the species are weighted in proportion to their frequencies, and the measure \( {}^1D \) (in Eq. (3b)) can be interpreted as the effective number of common or “typical” species (i.e., species with typical abundances) in the assemblage. When q = 2, abundant species are favored and rare species are discounted; the measure \( {}^2D \) becomes the inverse Simpson concentration. The measure \( {}^2D \) can be interpreted as the effective number of dominant or very abundant species in the assemblage. In general, if \( {}^qD = x \), then the diversity of order q of this community is the same as that of an idealized reference community with x equally abundant species. All Hill numbers are in units of “species”. It is thus possible to plot them on a single graph as a continuous function of the parameter q. This diversity profile characterizes the species-abundance distribution of an assemblage and provides complete information about its diversity. The steepness of its slope graphically illustrates the degree of dominance in the assemblage. An example is given in section “An example”.
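A diversity profile is simply the Hill number evaluated over a range of orders. The sketch below is illustrative only: the function name `hill_number`, the two hypothetical four-species assemblages, and the chosen orders are assumptions, not data from the chapter.

```python
import math

def hill_number(p, q):
    """Hill number of order q: Eq. (3a), with the q = 1 limit of Eq. (3b)."""
    p = [x for x in p if x > 0]
    if abs(q - 1.0) < 1e-12:
        return math.exp(-sum(x * math.log(x) for x in p))
    return sum(x ** q for x in p) ** (1.0 / (1.0 - q))

even = [0.25, 0.25, 0.25, 0.25]      # hypothetical, perfectly even
skewed = [0.7, 0.1, 0.1, 0.1]        # hypothetical, one dominant species
orders = [0, 0.5, 1, 2, 3]

profile_even = [hill_number(even, q) for q in orders]
profile_skewed = [hill_number(skewed, q) for q in orders]
for q, d_even, d_skewed in zip(orders, profile_even, profile_skewed):
    print(f"q={q}: even={d_even:.3f}, skewed={d_skewed:.3f}")
```

The even assemblage gives a flat profile at four species, whereas the profile of the skewed assemblage starts at four and declines as q increases, which is the graphical signature of dominance described above.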

Hill numbers differ fundamentally from Shannon entropy and the Gini-Simpson index in that they obey the replication principle. Hill (1973) proved a weak version of the doubling property: if two completely distinct assemblages (i.e., no species in common) have identical relative abundance distributions, then the Hill number doubles if the assemblages are combined with equal weights. Chiu et al. (2014, their Appendix B) recently proved a strong version of the doubling property: if two completely distinct assemblages have identical Hill numbers of order q (relative abundance distributions may be different, unlike the weak version), then the Hill number of the same order doubles if the two assemblages are combined with equal weights. Species richness is a Hill number (with q = 0) and obeys both versions of the doubling property, but most other diversity indices do not obey even the weak version. Because Hill numbers obey this replication principle, changes in their magnitude have simple interpretations, and the ratio of alpha diversity to gamma diversity accurately reflects the compositional similarity of the communities. The replication principle is best known in economics, where it has long been recognized as an important property of concentration and diversity measures (Hannah and Kay 1977). In ecology, the doubling property has been extensively discussed by many authors (MacArthur 1965, 1972; Hill 1973; Whittaker 1972; Routledge 1979; Peet 1974; Jost 2006, 2007, 2009; Ricotta and Szeidl 2009; Jost et al. 2010) and has been extended to phylogenetic measures (Chao et al. 2010); see below.
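Both versions of the doubling property can be checked numerically. In this sketch the helper `pool_distinct` and all abundance vectors are hypothetical; the two assemblages in the strong-version check were chosen so that both have a Hill number of order 2 equal to 2 despite having different abundance distributions.

```python
import math

def hill_number(p, q):
    p = [x for x in p if x > 0]
    if abs(q - 1.0) < 1e-12:
        return math.exp(-sum(x * math.log(x) for x in p))
    return sum(x ** q for x in p) ** (1.0 / (1.0 - q))

def pool_distinct(p1, p2):
    """Pool two assemblages with no shared species, using equal weights."""
    return [x / 2 for x in p1] + [x / 2 for x in p2]

# Weak version: same abundance distribution, different species
p = [0.5, 0.3, 0.2]
pooled = pool_distinct(p, p)
for q in (0, 1, 2):
    print(q, hill_number(p, q), hill_number(pooled, q))   # second value is double

# Strong version at q = 2: different distributions, same Hill number
p1, p2 = [0.5, 0.5], [2 / 3, 1 / 6, 1 / 6]
strong = hill_number(pool_distinct(p1, p2), 2)
print(hill_number(p1, 2), hill_number(p2, 2), strong)

# The Gini-Simpson index does not double when the assemblages are pooled
def gini_simpson(x):
    return 1 - sum(v ** 2 for v in x)

print(gini_simpson(p), gini_simpson(pooled))
```

The Gini-Simpson index rises only from about 0.62 to about 0.81 on pooling, even though the pooled assemblage contains twice as many equally structured species.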

Phylogenetic Hill Numbers and Related Measures

When the branch lengths are proportional to divergence time, all branch tips are the same distance from the root (the first node). Such trees are called “ultrametric” trees. We first discuss the phylogenetic diversity measures for ultrametric trees. The phylogenetic Hill numbers developed by Chao et al. (2010) for an ultrametric tree can be intuitively explained as the Hill number of a time-average of a tree’s generalized entropy over some evolutionary time interval of interest. Suppose the phylogenetic tree for an assemblage is calibrated to some relative or absolute timescale. We can slice this phylogenetic tree at any time t in the past; see the left panel of Fig. 1 (reproduced from Chao et al. 2010) for illustration and details about how to deal with shared lineages. The number of lineages at that time is the number of branch cuts, and the relative importance of each of these lineages for the present-day assemblage is the sum of the relative abundances of the branch’s descendants in the present-day assemblage. Using these relative importance values, we can calculate the generalized entropy of order q for the slice. The mean of these entropies, beginning at time −T (i.e., T years before present) and continuing until the present, is converted to a Hill number using Eq. (3c). This is the phylogenetic Hill number, which conveys information about the shape of the tree over the time interval of interest. Chao et al. (2010) symbolize it as \( {}^q\overline{D}(T) \), and also refer to it as the mean phylogenetic diversity of order q over T years (or simply the mean diversity for the interval [−T, 0]):

$$ {}{}^q\overline{D}\;(T)={\left\{{\displaystyle \sum}_{\kern0.22em i\in {B}_T}\frac{L_i}{T}\;{a}_i^q\right\}}^{1/\left(1-q\right)}=\frac{1}{T}{\left\{{\displaystyle \sum}_{\kern0.22em i\in {B}_T}{L}_i{\left(\frac{a_i}{T}\right)}^q\right\}}^{1/\left(1-q\right)},q\ge 0,q\ne 1; $$
(4a)
$$ {}{}^1\overline{D}\;(T)=\underset{q\to 1}{ \lim }{}{}^q\overline{D}\;(T)= \exp \left[-{\displaystyle \sum}_{i\in {B}_T}\frac{L_i}{T}{a}_i \log {a}_i\right], $$
(4b)

where \( B_T \) is the set of all branches in the time interval [−T, 0], \( L_i \) is the length of branch i in the set \( B_T \), and \( a_i \) is the total relative abundance descended from branch i. The mean diversity \( {}^q\overline{D}(T) \) is interpreted as “the effective number of equally abundant and equally distinct lineages all with branch length T during the time interval from T years ago to the present”. Here “equally distinct” also implies that the phylogenetic distance between any two species is T, so lineages are completely distinct (i.e., there are no shared branches).
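Eqs. (4a) and (4b) translate directly into a few lines of code. The sketch below is an illustration under assumptions: the function name `mean_phylo_diversity`, the small tree ((A:1, B:1):1, C:2), and its abundances are hypothetical. A star tree with the same abundances is included to represent completely distinct lineages of length T.

```python
import math

def mean_phylo_diversity(branches, T, q):
    """Mean phylogenetic diversity, Eqs. (4a)-(4b).

    branches: list of (L_i, a_i) for every branch in the interval [-T, 0].
    """
    if abs(q - 1.0) < 1e-12:
        return math.exp(-sum((L / T) * a * math.log(a) for L, a in branches))
    return sum((L / T) * a ** q for L, a in branches) ** (1.0 / (1.0 - q))

T = 2
# Hypothetical tree ((A:1, B:1):1, C:2), abundances A=0.5, B=0.3, C=0.2
tree = [(1, 0.5), (1, 0.3), (1, 0.8), (2, 0.2)]
# Same abundances on a star tree: three completely distinct lineages of length T
star = [(T, 0.5), (T, 0.3), (T, 0.2)]

for q in (0, 1, 2):
    print(q, mean_phylo_diversity(tree, T, q), mean_phylo_diversity(star, T, q))
```

For the star tree the result equals the ordinary Hill number of the three species, while the shared history of A and B lowers the effective number of lineages for the nested tree.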

Fig. 1

(a) A hypothetical ultrametric rooted phylogenetic tree with four species. Three different slices corresponding to three different times are shown. For a fixed T (not restricted to the age of the root), the nodes divide the phylogenetic tree into segments 1, 2 and 3 with duration (length) \( T_1 \), \( T_2 \) and \( T_3 \), respectively. In any moment of segment 1, there are four species (i.e. four branches cut); in segment 2, there are three species; and in segment 3, there are two species. The mean species richness over the time interval [−T, 0] is \( \left({T}_1/T\right)\times 4+\left({T}_2/T\right)\times 3+\left({T}_3/T\right)\times 2 \). In any moment of segment 1, the species relative abundances (i.e. node abundances corresponding to the four branches) are \( \{p_1, p_2, p_3, p_4\} \); in segment 2, the species relative abundances are \( \{g_1, g_2, g_3\} = \{p_1, p_2+p_3, p_4\} \); in segment 3, the species relative abundances are \( \{h_1, h_2\} = \{p_1+p_2+p_3, p_4\} \). (b) A hypothetical non-ultrametric tree. Let \( \overline{T} \) be the weighted (by species abundance) mean of the distances from the root node to each of the terminal branch tips: \( \overline{T}=4\times 0.5+\left(3.5+2\right)\times 0.2+\left(1+2\right)\times 0.3=4 \). Note that \( \overline{T} \) is also the weighted (by branch length) total node abundance because \( \overline{T}=0.5\times 4+0.2\times 3.5+0.3\times 1+0.5\times 2=4 \). Conceptually, the ‘branch diversity’ is defined for an assemblage of four branches: each has, respectively, relative abundance \( 0.5/\overline{T}=0.125 \), \( 0.2/\overline{T}=0.05 \), \( 0.3/\overline{T}=0.075 \) and \( 0.5/\overline{T}=0.125 \); and each has, respectively, weight (i.e. branch length) 4, 3.5, 1 and 2. This is equivalent to an assemblage with 10.5 equally weighted ‘branches’: there are four branches with relative abundance \( 0.5/\overline{T}=0.125 \); 3.5 branches with relative abundance \( 0.2/\overline{T}=0.05 \); one branch with relative abundance \( 0.3/\overline{T}=0.075 \) and two branches with relative abundance \( 0.5/\overline{T}=0.125 \) (This figure is reproduced from Fig. 1 of Chao et al. 2010)
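The arithmetic in panel (b) of the caption can be reproduced with a few lines. This sketch uses only the branch lengths and node abundances quoted in the caption; the variable names are illustrative.

```python
# Branches of the tree in Fig. 1b as (length, node abundance)
branches = [(4, 0.5), (3.5, 0.2), (1, 0.3), (2, 0.5)]

# Abundance-weighted mean distance from the root to each tip
tips = [(4, 0.5), (3.5 + 2, 0.2), (1 + 2, 0.3)]   # (distance from root, abundance)
t_bar_tips = sum(dist * p for dist, p in tips)

# The same quantity as the branch-length-weighted total node abundance
t_bar_branches = sum(L * a for L, a in branches)

# Relative abundance of each branch and the total weight of the branch assemblage
rel_abund = [a / t_bar_branches for _, a in branches]
total_weight = sum(L for L, _ in branches)
print(t_bar_tips, t_bar_branches, rel_abund, total_weight)
```

Weighting each branch's relative abundance by its length and summing gives one, so the 10.5 equally weighted ‘branches’ form a proper assemblage.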

The phylogenetic Hill numbers are invariant to the units used to measure branch lengths. When all lineages are completely distinct, the measure \( {}^q\overline{D}(T) \) reduces to the Hill numbers \( {}^qD={\left({\sum}_i{a}_i^q\right)}^{1/\left(1-q\right)} \). This includes the special case that T tends to zero, i.e., the case that we ignore phylogeny and only consider the present-day community. This shows that the framework based on Hill numbers provides a unified approach to integrate abundances and phylogeny. Also, here we have a simple idealized reference tree to understand the value of \( {}^q\overline{D}(T)=z \) for an arbitrary tree: the mean phylogenetic diversity of the tree over the time period [−T, 0] is the same as the diversity of an idealized assemblage consisting of z equally abundant and equally distinct lineages all with branch length T.

For q = 0, when T is chosen as the age of the root node, we have \( {}^0\overline{D}(T)=\mathrm{Faith}'\mathrm{s}\;\mathrm{PD}/T \), which can be interpreted as lineage richness. Faith’s PD can thus be regarded as a phylogenetic generalization of species richness. We can roughly interpret \( {}^1\overline{D}(T) \) as the effective number of common lineages, and \( {}^2\overline{D}(T) \) as the effective number of dominant lineages in the time period [−T, 0]. When T is chosen as the age of the root node, a simple relationship exists between the phylogenetic entropy \( H_P \) (Allen et al. 2009) and the measure \( {}^1\overline{D}(T) \):

$$ {}{}^1\overline{D}\;(T)= \exp \left({H}_P/T\right). $$
(4c)

For q = 2, when T is chosen as the age of the root node, there is a simple relationship between our measures and the widely used Rao’s quadratic entropy Q (Chao et al. 2010):

$$ {}{}^2\overline{D}\;(T)=\frac{1}{1-Q/T}. $$
(4d)
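Eqs. (4c) and (4d) can be checked numerically. The following sketch assumes a hypothetical tree ((A:1, B:1):1, C:2) with abundances 0.5, 0.3 and 0.2, and takes the phylogenetic distance in Rao’s Q to be the time since divergence of each pair; Q is computed from pairwise distances so that Eq. (4d) is tested against an independent quantity.

```python
import math

T = 2                                                  # age of the root node
p = {"A": 0.5, "B": 0.3, "C": 0.2}                     # hypothetical abundances
branches = [(1, 0.5), (1, 0.3), (1, 0.8), (2, 0.2)]    # (L_i, a_i)
# time since divergence for each unordered pair of species
divergence = {("A", "B"): 1, ("A", "C"): 2, ("B", "C"): 2}

H_P = -sum(L * a * math.log(a) for L, a in branches)               # Eq. (2b)
# Eq. (2a) sums over ordered pairs, so each unordered pair counts twice
Q = sum(2 * d * p[i] * p[j] for (i, j), d in divergence.items())

D1 = math.exp(-sum((L / T) * a * math.log(a) for L, a in branches))  # Eq. (4b)
D2 = 1.0 / sum((L / T) * a ** 2 for L, a in branches)                # Eq. (4a), q = 2

print(D1, math.exp(H_P / T))     # the two sides of Eq. (4c)
print(D2, 1.0 / (1.0 - Q / T))   # the two sides of Eq. (4d)
```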

The branch or phylogenetic diversity \( {}^qPD(T) \) of order q during the time interval from T years ago to the present is defined as the product of \( {}^q\overline{D}(T) \) and T. It quantifies the amount of evolutionary history in the system over the interval [−T, 0], or “the effective total branch-length” (Chao et al. 2010):

$$ {}^qPD\;(T)=T\times {}{}^q\overline{D}(T)={\left\{{\displaystyle \sum}_{\kern0.22em i\in {B}_T}{L}_i{\left(\frac{a_i}{T}\right)}^q\right\}}^{1/\left(1-q\right)}. $$
(5a)
$$ {}{}^1PD\;(T)=\underset{q\to 1}{ \lim }{}{}^qPD\;(T)= \exp \left[-{\displaystyle \sum}_{i\in {B}_T}{L}_i\frac{a_i}{T} \log \left(\frac{a_i}{T}\right)\right]. $$
(5b)

If q = 0 and T is the age of the root node, then \( {}^0PD(T) \) reduces to Faith’s PD, regardless of branching pattern or abundances. As explained by Chao et al. (2010), we could imagine that all the branch segments in the interval [−T, 0] form a single assemblage with relative abundance set \( \{a_i/T;\ i\in B_T\} \). In this assemblage, for each i there are \( L_i \) “branches” with relative abundance \( a_i/T \). Then the Hill number of order q for this assemblage is exactly the branch diversity \( {}^qPD(T) \) given in Eq. (5a). Dividing this Hill number by T, we obtain \( {}^q\overline{D}(T) \) given in Eq. (4a). Note that in our framework \( {}^qPD(T) \) is truly a class of Hill numbers (“the effective number of lineage-years”), whereas \( {}^q\overline{D}(T) \) (“the effective number of lineages”) denotes a (generalized) mean of Hill numbers. See Faith and Richards (2012) and Faith (2013) for extensions of the measure \( {}^qPD(T) \).
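The interpretation of the branch diversity as an ordinary Hill number of a “branch assemblage” can be illustrated as follows. This is a sketch under assumptions: the tree ((A:1, B:1):1, C:2), its abundances, and the function names are hypothetical, and the expansion into unit branches works directly only because every branch length here is an integer.

```python
import math

def hill_number(p, q):
    if abs(q - 1.0) < 1e-12:
        return math.exp(-sum(x * math.log(x) for x in p))
    return sum(x ** q for x in p) ** (1.0 / (1.0 - q))

def branch_diversity(branches, T, q):
    """qPD(T) of Eqs. (5a)-(5b); branches is a list of (L_i, a_i)."""
    if abs(q - 1.0) < 1e-12:
        return math.exp(-sum(L * (a / T) * math.log(a / T) for L, a in branches))
    return sum(L * (a / T) ** q for L, a in branches) ** (1.0 / (1.0 - q))

T = 2
branches = [(1, 0.5), (1, 0.3), (1, 0.8), (2, 0.2)]

# Expand each branch into L_i unit-length "branches" with abundance a_i / T
expanded = [a / T for L, a in branches for _ in range(L)]

for q in (0, 1, 2):
    print(q, branch_diversity(branches, T, q), hill_number(expanded, q))
```

At q = 0 both columns equal the total branch length of the tree, i.e. Faith’s PD; at the other orders the two columns still agree, which is the sense in which the branch diversity is a Hill number.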

Unlike previous phylogenetic diversity measures developed in the literature, \( {}{}^q\overline{D}\;(T) \) and \( {}^qPD\;(T) \) depend explicitly on two parameters, the abundance sensitivity parameter q and the time perspective (or time-depth) parameter T. The reasons we need this time-depth parameter, and our suggestions for choosing a time perspective, are as follows.

  1. 1.

    When we compare the phylogenetic diversities of several assemblages based on the measures \( {}{}^q\overline{D}\;(T) \) and \( {}^qPD\;(T) \), all measures should refer to the same time period to make meaningful comparisons. That is, the time-depth T should be kept the same for all assemblages. Therefore, a parameter is required to specify the time-depth.

  2. 2.

    The choice of time perspective should reflect an investigator’s aims and facilitate comparisons with other studies. We suggest that at least two time perspectives be included: T = 0, and T = the age of the root node of a phylogenetic tree connecting all species in the study. For the case of T = 0, the phylogeny is ignored and the diversity profile reduces to the profile in the present-day assemblage based on the ordinary Hill numbers. If we choose T to be the age of the oldest node in the tree, we recover some of the standard measures of phylogenetic diversity (see Eqs. (4c) and (4d)).

  3. 3.

    As suggested in Chiu et al. (2014), other time perspectives can be selected, such as T = the age of the node at which the group of interest diverges from the rest of the species. This choice of T is independent of the species actually sampled, so it allows statistically robust comparisons across investigations and regions (unlike the conventional choice of T as the root node of the tree containing the species actually observed). This choice also provides an accurate measure of the proportion of a taxonomic group’s evolutionary history preserved in a given assemblage. Another choice is the time of the most recent common ancestor of all taxa alive today. Other choices may be made, depending on the purpose of an investigation. The formula in Chiu et al. (2014, p. 42) can be used to convert phylogenetic diversity from one temporal perspective to another.

To see how the measures vary with q and time perspective T, we recommend using two types of profiles to completely characterize phylogenetic tree information and species abundances as described below. See section “An example” for examples. (1) The first type of diversity profile is obtained by plotting \( {}^qPD\;(T) \) or \( {}{}^q\overline{D}\;(T) \) as a function of order q as q varies from 0 to about 3 or 4 (beyond which there is usually little change), for some selected values of temporal perspective T. For this type of profile, \( {}^qPD\;(T) \) and \( {}{}^q\overline{D}\;(T) \) have similar patterns when T is fixed, so it is sufficient to plot the profile for only one measure. (2) The second type of diversity profile is obtained by plotting \( {}^qPD\;(T) \) and \( {}{}^q\overline{D}\;(T) \) as functions of T separately for q = 0, 1, and 2. This profile shows the effect of time-depth or evolutionary change on our diversity measures.

For the second type of profile, \( {}^qPD\;(T) \) and \( {}{}^q\overline{D}\;(T) \) generally exhibit different patterns: the profile of \( {}{}^q\overline{D}\;(T) \) is decreasing with T, whereas the profile of \( {}^qPD\;(T) \) for q = 0 (Faith’s PD) is always increasing and for q > 0 is generally increasing up to a certain point. The profiles for both measures are therefore informative. The parameter q gives the sensitivity of the two measures to present-day species relative abundances. As in the ordinary Hill numbers, the measures with q = 2 favor more abundant species, so they are useful in ecological studies that examine the phylogenetic relationships of the dominant species in a set of assemblages, or that examine functional diversity. The measures with q = 0 emphasize rare species, so they are useful when abundance information is not necessarily relevant (e.g., when ecologists try to identify past episodes of differentiation, or for some conservation biology applications). The measures with q = 1 weigh species according to their frequencies and can be used in most applications when neither dominant nor rare species should be favored.

When the measure of evolutionary change is based on the number of nucleotide base changes at a selected locus, or on the amount of functional or morphological differentiation from a common ancestor, the branches of the resulting tree will typically be uneven, so the tree is non-ultrametric. In this case, Chao et al. (2010) showed that the time parameter T in all formulas should be replaced by the mean base change or mean branch length \( \overline{T}, \) the (abundance-weighted) mean of the distances from the tree base to each of the terminal branch tips (i.e., the mean evolutionary change per species over the interval of interest). See the right panel of Fig. 1 for an illustrative example. Let \( {B}_{\overline{T}} \) denote the set of branches connecting all focal species, with mean branch length \( \overline{T}. \) Then we can express \( \overline{T} \) as \( \overline{T}={\displaystyle \sum}_{i\in {B}_{\overline{T}}}{L}_i{a}_i \). The diversity of a non-ultrametric tree with mean evolutionary change \( \overline{T} \) is the same as that of an ultrametric tree with time parameter \( \overline{T}. \) Therefore, the diversity formulas for a non-ultrametric tree are obtained by replacing T by \( \overline{T} \) in Eqs. (4a), (4b), (5a), and (5b). The resulting measures are denoted respectively as \( {}{}^q\overline{D}\;\left(\overline{T}\right) \), \( {}{}^1\overline{D}\;\left(\overline{T}\right) \), \( {}{}^qPD\;\left(\overline{T}\right) \) and \( {}{}^1PD\;\left(\overline{T}\right) \); see Chao et al. (2010) for details. When we compare the phylogenetic diversity based on the measures \( {}{}^q\overline{D}\;\left(\overline{T}\right) \) and \( {}{}^qPD\;\left(\overline{T}\right) \) for several non-ultrametric trees, all measures should refer to the same mean base change \( \overline{T} \) to make meaningful comparisons.
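As a small illustration of the mean branch length, the sketch below evaluates the sum of \( L_i a_i \) for a hypothetical non-ultrametric tree and checks that it agrees with the abundance-weighted mean root-to-tip distance. The function name and the tree are assumptions made for illustration only.

```python
def mean_branch_length(lengths, abundances):
    """T_bar = sum_i L_i * a_i over all branches in the tree."""
    return sum(L * a for L, a in zip(lengths, abundances))

# Hypothetical non-ultrametric tree: tips A and B share an internal branch AB.
lengths = [1.0, 3.0, 4.0, 2.0]       # A, B, C, internal branch AB
abundances = [0.5, 0.2, 0.3, 0.7]    # node abundances; AB carries 0.5 + 0.2

t_bar = mean_branch_length(lengths, abundances)

# The same quantity, written as the abundance-weighted mean
# of the root-to-tip distances of the three species.
root_to_tip = {"A": 1.0 + 2.0, "B": 3.0 + 2.0, "C": 4.0}
tip_abundance = {"A": 0.5, "B": 0.2, "C": 0.3}
t_bar_tips = sum(tip_abundance[s] * root_to_tip[s] for s in root_to_tip)

print(t_bar, t_bar_tips)
```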

Replication Principle for Phylogenetic Diversity Measures

The replication principle was generalized to a phylogenetic version in Chao et al. (2010). Suppose there are N equally large and completely phylogenetically distinct assemblages (no shared lineages across assemblages, though lineages within an assemblage may be shared); see Fig. 2 (reproduced from Chiu et al. 2014) for an illustrative example. Suppose these assemblages have the same phylogenetic Hill number X. If these assemblages are pooled, then the pooled assemblage must have a phylogenetic Hill number N × X. In the proof of this replication principle, Chao et al. (2010) assumed that these N assemblages have the same mean branch lengths. Here we relax this assumption and allow assemblages to have different mean branch lengths. (In the special case of ultrametric trees, this means that we allow different time perspectives for different assemblages.)

Suppose in assemblage k, the mean branch length is \( {\overline{T}}_k \), and the branch set is \( {B}_{{\overline{T}}_k,k} \) (we omit \( {\overline{T}}_k \) in the subscript and just use \( B_k \) in the following proof for notational simplicity) with branch lengths \( \{L_{ik};\ i\in B_k\} \) and the corresponding node abundances \( \{a_{ik};\ i\in B_k\} \), k = 1, 2, …, N. Assume that all assemblages have the same phylogenetic Hill number \( {}{}^q\overline{D}\;\left({\overline{T}}_k\right)=X, \) implying \( {\displaystyle \sum}_{i\in {B}_k}{L}_{ik}\kern0.24em {a}_{ik}^q={X}^{1-q}{\overline{T}}_k \) for all k = 1, 2, …, N. When the N trees are pooled with equal weight for each tree, each node abundance \( a_{ik} \) in the pooled tree becomes \( a_{ik}/N \), and the mean branch length becomes \( \overline{T}=\left(1/N\right){\displaystyle \sum}_{k=1}^N{\overline{T}}_k \). Then the phylogenetic Hill number of order q for the pooled assemblage becomes

$$ \begin{array}{c}{}{}^q\overline{D}\;\left(\overline{T}\right)={\left\{{\displaystyle \sum}_{k=1}^N{\displaystyle \sum}_{i\in {B}_k}\frac{L_{ik}}{\overline{T}}{\left(\frac{a_{ik}}{N}\right)}^q\right\}}^{1/\left(1-q\right)}={\left\{\frac{1}{N^q}{\displaystyle \sum}_{k=1}^N\frac{1}{\overline{T}}{\displaystyle \sum}_{i\in {B}_k}{L}_{ik}{a}_{ik}^q\right\}}^{1/\left(1-q\right)}\\ {}={\left\{\frac{1}{N^q}{\displaystyle \sum}_{k=1}^N\frac{{\overline{T}}_k}{\overline{T}}{X}^{1-q}\right\}}^{1/\left(1-q\right)}={\left\{{N}^{1-q}\times {X}^{1-q}\right\}}^{1/\left(1-q\right)}=N\times X.\end{array} $$
(6)

This proves a stronger version of the replication principle for phylogenetic Hill numbers. Note the mean branch length in the pooled assemblage is the average of individual mean branch lengths. For example, if \( {}{}^q\overline{D}\left({\overline{T}}_1=2\right)={}{}^q\overline{D}\left({\overline{T}}_2=6\right)=10, \) then in an effective sense, there are ten lineages with mean branch length 2 in Assemblage 1 and there are ten lineages with mean branch length 6 in Assemblage 2. The replication principle implies that there are 20 lineages in the pooled tree with mean branch length 4. Since \( {}{}^qPD\;\left({\overline{T}}_k\right)={}{}^q\overline{D}\;\left({\overline{T}}_k\right)\times {\overline{T}}_k \), the replication principle for the phylogenetic diversity \( {}{}^qPD\;\left(\overline{T}\right) \) does need the assumption that all assemblages have the same mean branch lengths \( \left({\overline{T}}_1={\overline{T}}_2=\dots ={\overline{T}}_N\right) \). The proof is parallel and thus omitted.
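The numerical example above can be checked with a short script. This is a sketch under simplifying assumptions: each assemblage is taken to consist of ten completely distinct, equally abundant lineages (so its phylogenetic Hill number is 10 for every q), and the helper name `mean_lineage_diversity` is hypothetical.

```python
import math

def mean_lineage_diversity(lengths, abundances, q):
    """Return (qD_bar(T_bar), T_bar) for one tree given L_i and a_i."""
    pairs = [(L, a) for L, a in zip(lengths, abundances) if a > 0]
    t_bar = sum(L * a for L, a in pairs)          # mean branch length
    if abs(q - 1.0) < 1e-12:
        d = math.exp(-sum((L / t_bar) * a * math.log(a) for L, a in pairs))
    else:
        d = sum((L / t_bar) * a ** q for L, a in pairs) ** (1.0 / (1.0 - q))
    return d, t_bar

# Assemblage 1: ten distinct, equally abundant lineages of length 2.
# Assemblage 2: ten distinct, equally abundant lineages of length 6.
len1, ab1 = [2.0] * 10, [0.1] * 10
len2, ab2 = [6.0] * 10, [0.1] * 10

# Pool with equal weight: each node abundance is divided by N = 2.
pooled_len = len1 + len2
pooled_ab = [a / 2 for a in ab1 + ab2]

for q in (0, 1, 2):
    print(q,
          mean_lineage_diversity(len1, ab1, q),
          mean_lineage_diversity(len2, ab2, q),
          mean_lineage_diversity(pooled_len, pooled_ab, q))
```

Each assemblage yields 10 effective lineages, and the pooled tree yields 20 effective lineages with mean branch length 4, even though the two assemblages have different mean branch lengths.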

Fig. 2

Replication Principle for two completely phylogenetically distinct assemblages with totally different structures. Left panel: Assemblage 1 (black) includes three species with species relative abundances \( \{p_{11}, p_{21}, p_{31}\} \) for the three tips. Assemblage 2 (grey) includes four species with species relative abundances \( \{p_{12}, p_{22}, p_{32}, p_{42}\} \) for the four tips. The diversity of the pooled tree is double that of each tree as long as the two assemblages are completely phylogenetically distinct as shown (no lineages shared between assemblages, though lineages within an assemblage may be shared) and have identical mean diversities (i.e., phylogenetic Hill numbers). Right panel: The same is valid for two completely phylogenetically distinct non-ultrametric assemblages (This figure is reproduced from Fig. 1 of Chiu et al. 2014)

Decomposition of Phylogenetic Diversity Measures

Decomposition of species richness and its phylogenetic analogues into within- and between-group (alpha and beta) components is widely used (Whittaker 1972; Faith et al. 2009). However, these take no notice of abundance differences between sites. Conservationists using these measures cannot distinguish a site whose species are equally abundant from a site with the same species but with a highly skewed abundance distribution whose most phylogenetically distinctive species are rare. The former site would be a better bet for conservation. These considerations, and others, motivate the development of decomposition theory for abundance-based phylogenetic diversity measures. The decomposition also leads to abundance-sensitive measures of phylogenetic similarity and complementarity.

When there are N assemblages, the phylogenetic Hill numbers \( {}{}^q\overline{D}\;(T) \) (Eqs. 4a and 4b) and phylogenetic diversity \( {}^qPD\;(T) \) (Eqs. 5a and 5b) of the pooled assemblage can be multiplicatively decomposed into independent alpha and beta components (Chiu et al. 2014). We briefly describe the decomposition of the measure \( {}{}^q\overline{D}\;(T) \) here for the ultrametric case, and only summarize the decomposition of the measure \( {}^qPD\;(T) \). The extension to the non-ultrametric case for both measures is obtained by simply replacing all T in the formulas with the mean branch length \( \overline{T} \) of the pooled assemblage.

To begin the partitioning, a pooled tree is constructed for the N assemblages. Assume that there are S species in the present-day assemblage (i.e., there are S tip nodes). For any tip node i, let \( z_{ik} \) denote any measure of species importance of the ith species in the kth assemblage, i = 1, 2, …, S, k = 1, 2, …, N. The measure \( z_{ik} \) is referred to as “abundance” for simplicity, although it can be absolute abundances, relative abundances, incidence, biomasses, cover areas or any other importance measure. Define \( {z}_{+k}={\displaystyle \sum}_{i=1}^S{z}_{ik} \) (i.e., the “+” sign in \( z_{+k} \) denotes a sum over the tip nodes only) as the current size of the kth assemblage. Let \( {z}_{++}={\displaystyle \sum}_{k=1}^N{z}_{+k} \) be the total abundance in the present-day pooled assemblage.

Now consider the phylogenetic tree in the time interval [−T, 0], and in the pooled assemblage define \( B_T \) and \( L_i \) as in section “Phylogenetic Hill numbers and related measures”. We extend the definition of \( z_{ik} \) to include all nodes and their corresponding branches by defining \( z_{ik} \) for all \( i\in B_T \) as the total abundance descended from branch i. (Here the index i can correspond to either a tip node or an internal node; if i is a tip node, then \( z_{ik} \) represents data of the current assemblage as defined in the preceding paragraph.) As shown in Fig. 2 of Chiu et al. (2014), the diversity for each individual assemblage can be computed from the pooled tree structure, and only the node abundances vary with assemblages.

In the pooled assemblage, the node abundance for branch i (\( i\in B_T \)) is \( {z}_{i+}={\displaystyle \sum}_{k=1}^N{z}_{ik} \) with branch relative abundance \( z_{i+}/z_{++} \), so the phylogenetic gamma diversity of order q can be calculated from Eq. (4a) as

$$ {}{}^q\overline{D}_{\gamma }(T)=\frac{1}{T}{\left\{{\displaystyle \sum}_{\kern0.22em i\in {B}_T}{L}_i{\left(\frac{z_{i+}/{z}_{++}}{T}\right)}^q\right\}}^{1/\left(1-q\right)},q\ge 0,q\ne 1. $$
(7a)

The limit when q approaches unity exists and is equal to

$$ {}{}^1\overline{D}_{\gamma }(T)=\underset{q\to 1}{ \lim }{}{}^q\overline{D}_{\gamma }(T)= \exp \left[-{\displaystyle \sum}_{i\in {B}_T}\frac{L_i}{T}\left(\frac{z_{i+}}{z_{++}}\right) \log \left(\frac{z_{i+}}{z_{++}}\right)\right]. $$
(7b)

The gamma diversity is the effective number of equally abundant and equally distinct lineages all with branch lengths T in the pooled assemblage.

Chiu et al. (2014) derived the following phylogenetic alpha diversity for q ≥ 0 and q ≠ 1:

$$ {}{}^q\overline{D}_{\alpha }(T)=\frac{1}{TN}{\left\{{\displaystyle \sum}_{\kern0.22em i\in {B}_T}{L}_i{\displaystyle \sum}_{k=1}^N{\left(\frac{z_{ik}/{z}_{++}}{T}\right)}^q\right\}}^{1/\left(1-q\right)} $$
(8a)

For q = 1, we have

$$ {}{}^1\overline{D}_{\alpha }(T)=\underset{q\to 1}{ \lim }{}{}^q\overline{D}_{\alpha }(T)= \exp \left[-{\displaystyle \sum}_{i\in {B}_T}{L}_i{\displaystyle \sum}_{k=1}^N\frac{z_{ik}/{z}_{++}}{T} \log \frac{z_{ik}/{z}_{++}}{T}- \log (NT)\right] $$
(8b)

The alpha diversity is interpreted as the effective number of equally abundant and equally distinct lineages all with branch lengths T in an individual assemblage. When normalized measures of species importance (like relative abundance or relative biomass) are used, we have \( z_{++}=N \) in Eqs. (8a) and (8b). The alpha formula then reduces to a generalized mean of the local diversities with the following property: if all assemblages have the same diversity X, the alpha diversity is also X (Jost 2007). For non-normalized measures of species importance, like absolute abundance or biomass, this property does not hold. This is because when species absolute abundances are compared, a three-species assemblage with absolute abundances {2, 5, 8}, for example, is not treated as identical to another three-species assemblage with absolute abundances {200, 500, 800}. However, these two assemblages are treated as identical when only relative abundances are compared.

Chiu et al. (2014) proved that the phylogenetic gamma Hill number (Eqs. 7a and 7b) is always greater than or equal to the phylogenetic alpha Hill number (Eqs. 8a and 8b) for all q ≥ 0 regardless of species abundances and tree structures. Based on a multiplicative partitioning, the phylogenetic beta diversity is the ratio of gamma diversity to alpha diversity:

$$ {}{}^q\overline{D}_{\beta }(T)=\frac{{}{}^q\overline{D}_{\gamma }(T)}{{}{}^q\overline{D}_{\alpha }(T)},\kern0.46em q\ge 0. $$
(9)

When the N assemblages are identical in species identities and species abundances, then \( {}{}^q\overline{D}_{\beta }(T)=1 \) for any T. When the N assemblages are completely phylogenetically distinct (no shared lineages), then \( {}{}^q\overline{D}_{\beta }(T)=N, \) no matter what the diversities or tree shapes of the assemblages. The measure \( {}{}^q\overline{D}_{\beta }(T) \) thus quantifies the effective number of completely phylogenetically distinct assemblages in the interval [−T, 0]. As proved by Chiu et al. (2014), the phylogenetic beta diversity \( {}{}^q\overline{D}_{\beta }(T) \) is always between unity and N for any given alpha value, implying that alpha and beta components are unrelated (or independent) for both measures, \( {}{}^q\overline{D}\;(T) \) and \( {}^qPD\;(T) \); see Chao et al. (2012) for a rigorous discussion of un-relatedness and independence of two measures. When all lineages in the pooled assemblage are completely distinct (no lineages shared) in the interval [−T, 0], the phylogenetic alpha, beta and gamma Hill numbers reduce to those based on ordinary Hill numbers. This includes the limiting case in which T tends to zero, so that phylogeny is ignored.

Parallel decomposition can be made for the phylogenetic diversity \( {}^qPD\;(T) \), and we summarize the following relations: \( {}{}^qP{D}_{\gamma }(T)={}{}^q\overline{D}_{\gamma }(T)\times T \) and \( {}{}^qP{D}_{\alpha }(T)={}{}^q\overline{D}_{\alpha }(T)\times T. \) Under a multiplicative partitioning scheme, we have \( {}{}^qP{D}_{\beta }(T)={}{}^qP{D}_{\gamma }(T)/{}{}^qP{D}_{\alpha }(T)={}{}^q\overline{D}_{\beta }(T) \), i.e., the beta components from partitioning the phylogenetic Hill numbers \( {}{}^q\overline{D}\;(T) \) and phylogenetic diversity \( {}^qPD\;(T) \) are identical, implying the interpretation and the corresponding similarity or differentiation measures (in the next section) are also identical. Thus, it is sufficient to focus only on the measure \( {}{}^q\overline{D}_{\beta }(T) \), which will be referred to as the phylogenetic beta diversity or beta component for simplicity.

For each of the two measures, \( {}{}^q\overline{D}\;(T) \) and \( {}^qPD\;(T) \), alpha and gamma diversities obey the replication principle. Then the beta diversity formed by taking their ratio is replication-invariant (Chiu et al. 2014). That is, when assemblages are replicated, the beta diversity does not change. Therefore, when we pool equally-distinct sub-trees, such as pooling equally-ancient subfamilies, the beta diversity is unchanged by pooling the subfamilies if all subfamilies show the same beta diversity (“consistency in aggregation”).
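The multiplicative partitioning of Eqs. (7a) to (9) can be sketched as follows. This is an illustrative sketch, not the authors' code: the function name `phylo_partition`, its argument layout, and the example tree are hypothetical. The tree is assumed to be ultrametric, and for q = 1 the alpha term is written in the simplified form obtained from Eq. (8b) by using the identity that the sum of \( L_i z_{i+}/z_{++} \) equals T.

```python
import math

def phylo_partition(lengths, z, T, z_total, q):
    """Return (gamma, alpha, beta) of the phylogenetic Hill numbers.

    lengths -- branch lengths L_i of the pooled tree
    z       -- z[i][k]: node abundance of branch i in assemblage k
    z_total -- z_++, total tip abundance over all assemblages
    """
    n = len(z[0])
    rows = list(zip(lengths, z))
    if abs(q - 1.0) < 1e-12:
        # Eqs. (7b) and (8b); zero abundances contribute nothing.
        g = -sum((L / T) * (sum(r) / z_total) * math.log(sum(r) / z_total)
                 for L, r in rows if sum(r) > 0)
        a = -sum((L / T) * (x / z_total) * math.log(x / z_total)
                 for L, r in rows for x in r if x > 0)
        gamma, alpha = math.exp(g), math.exp(a - math.log(n))
    else:
        # Eqs. (7a) and (8a)
        g = sum(L * (sum(r) / (z_total * T)) ** q
                for L, r in rows if sum(r) > 0)
        a = sum(L * (x / (z_total * T)) ** q
                for L, r in rows for x in r if x > 0)
        gamma = g ** (1.0 / (1.0 - q)) / T
        alpha = a ** (1.0 / (1.0 - q)) / (T * n)
    return gamma, alpha, gamma / alpha   # beta from Eq. (9)

# Hypothetical pooled tree (root age 4): tips A, B, C and internal branch AB.
T = 4.0
lengths = [2.0, 2.0, 4.0, 2.0]
# Two assemblages described by relative abundances, so z_++ = N = 2.
z = [[0.5, 0.1],   # A
     [0.2, 0.4],   # B
     [0.3, 0.5],   # C
     [0.7, 0.5]]   # AB = A + B within each assemblage

for q in (0, 1, 2):
    print(q, phylo_partition(lengths, z, T, 2.0, q))
```

With identical columns in `z` the sketch gives beta = 1, and with two assemblages that share no branch it gives beta = 2, matching the two limiting cases described above.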

We now give the phylogenetic beta diversities for the special cases of q = 0, 1 and 2.

  1. (a)

    When q = 0, we have \( {}{}^0\overline{D}_{\beta }(T)={L}_{\gamma }(T)/{L}_{\alpha }(T) \), where \( L_{\gamma}(T) \) denotes the total branch length of the pooled tree (the gamma component of Faith’s PD) and \( L_{\alpha}(T) \) denotes the average length of individual trees (the alpha component of Faith’s PD).

  2. (b)

    When q = 1, the phylogenetic beta diversity of order 1 is

    $$ {}{}^1\overline{D}_{\beta }(T)= \exp \left[\left({H}_{P,\gamma }-{H}_{P,\alpha}\right)/T+{\displaystyle \sum}_{k=1}^N\left(\frac{z_{+k}}{z_{++}}\right) \log \left(\frac{z_{+k}}{z_{++}}\right)+ \log N\right], $$
    (10a)

    where \( H_{P,\gamma} \) and \( H_{P,\alpha} \) denote respectively the gamma and alpha phylogenetic entropy. When the species importance measure \( z_{ik} \) represents the ith species relative abundance in the kth current-time assemblage, then \( {z}_{+k}=1,\kern0.24em {z}_{++}=N,\kern0.24em {z}_{+k}/{z}_{++}=1/N. \) In this special case, we have \( {}{}^1\overline{D}_{\beta }(T)= \exp \left[\left({H}_{P,\gamma }-{H}_{P,\alpha}\right)/T\right] \). Thus an additive decomposition for phylogenetic entropy \( H_P \) holds (Pavoine et al. 2009; Mouchet and Mouillot 2011), as for ordinary Shannon entropy (Jost 2007).

  3. (c)

    When q = 2, the phylogenetic beta diversity can be expressed as

    $$ {}{}^2\overline{D}_{\beta }(T)=\frac{N{\displaystyle \sum}_{i\in {B}_T}{L}_i{\displaystyle \sum}_{k=1}^N{z}_{ik}^2}{{\displaystyle \sum}_{i\in {B}_T}{L}_i{z}_{i+}^2}\;. $$

In the special case of \( {z}_{+k}=1,\kern0.24em {z}_{++}=N \), this phylogenetic beta diversity of order 2 can be linked to quadratic entropy as

$$ {}{}^2\overline{D}_{\beta }(T)={\left(1-{Q}_{\gamma }/T\right)}^{-1}/{\left(1-{Q}_{\alpha }/T\right)}^{-1}, $$
(10b)

where \( Q_{\gamma} \) and \( Q_{\alpha} \) denote respectively the gamma and alpha quadratic entropy. The above formula is also applicable to non-ultrametric trees by replacing all T with \( \overline{T} \), the mean branch length in the pooled assemblage; see Chiu et al. (2014, Appendix C) for a proof.

Normalized Phylogenetic Similarity Measures

For traditional abundance-based diversity, the most commonly used similarity measures include N-assemblage generalizations of the Jaccard, Sørensen, Horn (1966) and Morisita-Horn (Morisita 1959) measures. The latter three measures were integrated into a class of \( C_{qN} \) measures by Chao et al. (2008). Jost (2006, 2007), Chao et al. (2008, 2012), and Chiu et al. (2014) have demonstrated that all the above measures are monotonic transformations of beta diversity based on the ordinary Hill numbers. This is an advantage of using the framework of Hill numbers: a direct link exists between diversity and similarity (or differentiation) among assemblages.

Chiu et al. (2014) extended this framework by proposing four classes of similarity (or differentiation) measures that are monotonic functions of phylogenetic beta diversity. The basic idea is that the phylogenetic beta diversity, a ratio of gamma and alpha phylogenetic Hill numbers, is independent of alpha and measures the pure differentiation among assemblages. The phylogenetic beta component always lies in the range [1, N] for any measures of species importance and all orders q ≥ 0. Since the range depends on N, the phylogenetic beta diversity cannot be used to compare phylogenetic differentiation among assemblages across multiple regions with different numbers of assemblages. To remove the dependence on N, several transformations can be used to transform the phylogenetic beta component onto [0, 1] to measure local overlap, regional overlap, homogeneity and turnover. We give a summary of these four transformations below and tabulate formulas and the relationship with previous measures in Table 1 for the two most important classes. The formulas for the special cases for q = 0, 1 and 2 are also displayed there.

  1. 1.

    A class of branch overlap measures from a local perspective:

    $$ {\overline{C}}_{qN}(T)=\frac{N^{1-q}-{\left[{}{}^q\overline{D}_{\beta }(T)\right]}^{1-q}}{N^{1-q}-1}. $$
    (11a)

    This gives the effective average proportion of shared branches in an individual assemblage. This class of similarity measures extends the \( C_{qN} \) overlap measure derived in Chao et al. (2008) to a phylogenetic version. The corresponding differentiation measure \( 1-{\overline{C}}_{qN}(T) \) quantifies the effective average proportion of non-shared branches in an individual assemblage.

    1. (1a)

      For q = 0, this similarity measure is referred to as the “phylo-Sørensen” N-assemblage overlap measure because for N = 2, it reduces to the measure PhyloSør (phylo-Sørensen) developed by Bryant et al. (2008) and Ferrier et al. (2007).

    2. (1b)

      For q = 1, this measure \( {\overline{C}}_{1N}(T) \) is called the “phylo-Horn” N-assemblage overlap measure because it extends Horn’s (1966) two-assemblage measure to incorporate phylogenies for N assemblages.

    3. (1c)

      For q = 2, \( {\overline{C}}_{2N}(T) \) is called the “phylo-Morisita-Horn” N-assemblage similarity measure because it extends the Morisita-Horn measure (Morisita 1959) to incorporate phylogenies for N assemblages. When the species importance measure is relative abundance, the differentiation measure \( 1-{\overline{C}}_{2N}(T) \) reduces to the measure proposed by de Bello et al. (2010). However, their measure is valid only for ultrametric trees (p. 7 of de Bello et al. 2010). Here, the measure can be applied to non-ultrametric trees to obtain

    $$ 1-{\overline{C}}_{2N}\left(\overline{T}\right)=\frac{1-\left[1/{}{}^2\overline{D}_{\beta}\left(\overline{T}\right)\right]}{1-1/N}=\frac{Q_{\gamma }-{Q}_{\alpha }}{\left(1-1/N\right)\left(\overline{T}-{Q}_{\alpha}\right)}, $$
    (11b)

    where \( Q_{\gamma} \) and \( Q_{\alpha} \) are respectively gamma and alpha quadratic entropy, and \( \overline{T} \) is the mean branch length in the pooled assemblage. A general form for any species importance measure (including absolute abundances) is

    $$ 1-{\overline{C}}_{2N}\left(\overline{T}\right)=\frac{{\displaystyle \sum}_{i\in {B}_{\overline{T}}}{L}_i{\displaystyle \sum}_{m>k}^N{\left({z}_{im}-{z}_{ik}\right)}^2}{\left(N-1\right){\displaystyle \sum}_{i\in {B}_{\overline{T}}}{L}_i{\displaystyle \sum}_{k=1}^N{z}_{ik}^2}\;. $$
    (11c)

    The above expression shows that the similarity index \( {\overline{C}}_{2N}\left(\overline{T}\right) \), as in all other abundance-sensitive similarity measures, is unity if and only if \( {z}_{ij}={z}_{ik} \) (i.e., species importance measures are identical for any node i in the branch set and for any two assemblages j and k). This reveals that the similarity index \( {\overline{C}}_{2N}\left(\overline{T}\right) \) quantifies the node-by-node resemblance among the N abundance sets \( \{z_{ik};\ i\in B_{\overline{T}}\} \), k = 1, 2, …, N from a local perspective. See Fig. 2 of Chiu et al. (2014) for a simple example of the framework.

  2. 2.

    A class of branch overlap measures from a regional perspective:

    $$ {\overline{U}}_{qN}(T)=\frac{{\left[1/{}{}^q\overline{D}_{\beta }(T)\right]}^{1-q}-{\left(1/N\right)}^{1-q}}{1-{\left(1/N\right)}^{1-q}} $$
    (12a)

    This class of measures quantifies the effective proportion of shared branches in the pooled assemblage. The corresponding differentiation measure \( 1-{\overline{U}}_{qN}(T) \) quantifies the effective average proportion of non-shared branches in the pooled assemblage.

    1. (2a)

      For q = 0, this measure is called the “phylo-Jaccard” N-assemblage measure because for N = 2 the measure \( 1-{\overline{U}}_{02}(T) \) reduces to the Jaccard-type UniFrac measure developed by Lozupone and Knight (2005) and the PD-dissimilarity measure developed by Faith et al. (2009).

    2. (2b)

      For q = 1, this measure is identical to the “phylo-Horn” N-assemblage overlap measure \( {\overline{C}}_{1N}(T) \); see Table 1.

    3. (2c)

      For q = 2, we refer to the measure \( {\overline{U}}_{2N}(T) \) as a “phylo-regional-overlap” measure. When the species importance measure is relative abundance, we have the following formula for non-ultrametric trees:

      $$ 1-{\overline{U}}_{2N}\left(\overline{T}\right)=\frac{{}{}^2\overline{D}_{\beta}\left(\overline{T}\right)-1}{N-1}=\frac{Q_{\gamma }-{Q}_{\alpha }}{\left(N-1\right)\left(\overline{T}-{Q}_{\gamma}\right)}, $$

      where \( \overline{T} \) denotes the mean branch length in the pooled assemblage. A general form for any species importance measure (including absolute abundances) is

      $$ 1-{\overline{U}}_{2N}\left(\overline{T}\right)=\frac{{\displaystyle \sum}_{i\in {B}_{\overline{T}}}{L}_i{\displaystyle \sum}_{m>k}^N{\left({z}_{im}-{z}_{ik}\right)}^2}{\left(N-1\right){\displaystyle \sum}_{i\in {B}_{\overline{T}}}{L}_i{z}_{i+}^2}\;. $$

    The numerator is the same as that in the expression for \( 1-{\overline{C}}_{2N}\left(\overline{T}\right) \) in Eq. (11c), revealing that the similarity index \( {\overline{U}}_{2N}\left(\overline{T}\right) \) also quantifies the node-by-node resemblance among the N abundance sets \( \{z_{ik};\ i\in B_{\overline{T}}\} \), k = 1, 2, …, N; but here the denominator (for the purpose of normalization) is different and takes a regional perspective.

  3. 3.

    A class of phylogenetic homogeneity measures

    $$ {\overline{S}}_{qN}(T)=\frac{1/{}{}^q\overline{D}_{\beta }(T)-1/N}{1-1/N}. $$
    (12b)

    This measure is linear in the proportion of regional phylogenetic diversity contained in a typical assemblage.

    1. (3a)

      For q = 0, it reduces to the “phylo-Jaccard” measure \( {\overline{U}}_{0N}(T) \), i.e., \( {\overline{S}}_{0N}(T)={\overline{U}}_{0N}(T) \).

    2. (3b)

      For q = 1, this measure does not reduce to the “phylo-Horn” overlap measure.

    3. (3c)

      For q = 2, this measure is identical to \( {\overline{C}}_{2N}(T) \), the “phylo-Morisita-Horn” similarity measure, i.e., \( {\overline{S}}_{2N}(T)={\overline{C}}_{2N}(T). \)

  4. 4.

    A class of measures of the complement of “phylogenetic turnover rate”:

    $$ {\overline{V}}_{qN}(T)=\frac{N-{}{}^q\overline{D}_{\beta }(T)}{N-1}=1-\frac{{}{}^q\overline{D}_{\beta }(T)-1}{N-1}. $$
    (12c)

    This measure is linear in the phylogenetic beta diversity, and the corresponding differentiation measure \( \left[{}{}^q\overline{D}_{\beta }(T)-1\right]/\left(N-1\right) \) quantifies the relative branch turnover rate per assemblage.

    1. (4a)

      For q = 0, the measure \( {\overline{V}}_{0N}(T) \) is identical to the “phylo-Sørensen” measure, i.e., \( {\overline{V}}_{0N}(T)={\overline{C}}_{0N}(T) \).

    2. (4b)

      For q = 1, this measure does not reduce to the “phylo-Horn” overlap measure.

    3. (4c)

      For q = 2, this measure is identical to \( {\overline{U}}_{2N}(T) \), the “phylo-regional-overlap” measure. That is, \( {\overline{V}}_{2N}(T)={\overline{U}}_{2N}(T) \).

As with the phylogenetic diversity measures, all the above similarity or differentiation measures are functions of two parameters: the sensitivity parameter q and the time perspective T. Thus, for each measure, we suggest using the two types of profiles described in section “Phylogenetic Hill numbers and related measures” for the two major similarity measures \( {\overline{C}}_{qN}(T) \) and \( {\overline{U}}_{qN}(T) \) (or their complements) to convey complete information about the similarity or differentiation of a set of assemblages. An example showing the two types of profiles is given in section “An example”.
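The four transformations of Eqs. (11a), (12a), (12b) and (12c) can be collected in one small helper. This is a sketch for illustration: the function name `similarity_classes` is hypothetical, and for q = 1 the common limit of the first two classes, 1 − log(beta)/log(N), is obtained here by taking q → 1 in Eq. (11a) rather than quoted from the text.

```python
import math

def similarity_classes(beta, n, q):
    """Map a phylogenetic beta diversity in [1, n] onto [0, 1].

    Returns the local overlap C, regional overlap U,
    homogeneity S and turnover complement V.
    """
    if abs(q - 1.0) < 1e-12:
        # Limit q -> 1 of Eqs. (11a) and (12a): the "phylo-Horn" measure
        c = u = 1.0 - math.log(beta) / math.log(n)
    else:
        c = (n ** (1 - q) - beta ** (1 - q)) / (n ** (1 - q) - 1)      # Eq. (11a)
        u = ((1 / beta) ** (1 - q) - (1 / n) ** (1 - q)) \
            / (1 - (1 / n) ** (1 - q))                                  # Eq. (12a)
    s = (1 / beta - 1 / n) / (1 - 1 / n)                                # Eq. (12b)
    v = (n - beta) / (n - 1)                                            # Eq. (12c)
    return {"C": c, "U": u, "S": s, "V": v}

# Hypothetical value: beta = 1.5 effective assemblages among N = 3.
for q in (0, 1, 2):
    print(q, similarity_classes(1.5, 3, q))
```

All four measures equal 1 when beta = 1 and 0 when beta = N. The output also reproduces the coincidences listed above: S = U and V = C at q = 0, and S = C and V = U at q = 2.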

The lineage excess \( {}{}^q\overline{D}_{\gamma }(T)-{}{}^q\overline{D}_{\alpha }(T) \) and the phylogenetic diversity excess \( {}{}^qP{D}_{\gamma }(T)-{}{}^qP{D}_{\alpha }(T) \) can be interpreted as the effective number of regional lineages (or regional phylogenetic diversity) not contained in a typical local assemblage. However, they cannot be directly applied to compare the similarity or differentiation across multiple regions because both depend not only on the number of assemblages, but also on their corresponding alpha diversity. Following Chao et al. (2012), Chiu et al. (2014, their Appendix D) proved that we can eliminate these dependences by using an appropriate normalization. After proper normalizations, the two measures lead to the same four classes of normalized similarity and differentiation measures as those obtained from the phylogenetic beta diversity. This is another advantage of using the framework of phylogenetic Hill numbers. That is, a consensus can be achieved on phylogenetic similarity and differentiation measures, including N-assemblage phylogenetic generalizations of the classic Jaccard, Sørensen, Horn and Morisita-Horn measures, regardless of whether one prefers multiplicative or additive decompositions.

Table 1 Two major classes of phylogenetic similarity measures based on the transformations of phylogenetic beta diversity when species importance measures are incidences (for q = 0), relative abundances or absolute abundance (for q = 1 and 2). The corresponding differentiation measures are the one-complements of the similarity measures. When all lineages are completely distinct (this includes \( T\to 0 \), ignoring phylogeny), these phylogenetic measures reduce to the corresponding non-phylogenetic versions. All measures can also be applied to non-ultrametric trees if \( \overline{T} \) is substituted for T

An Example

We apply the phylogenetic diversity measures and similarity (or differentiation) measures considered in this chapter to a real conservation biology case discussed by Pavoine et al. (2009), a heavily-fished assemblage of 52 rockfish species of the genus Sebastes collected for 20 years over three decades (1980–1986, 1993–1994, 1996, 1998–2007) from the Southern California Bight, USA. The phylogenetic tree for these 52 species was obtained from Hyde and Vetter (2007); see Fig. 3a. The age of the root for these species is around 7.9 million years (Myr).

Fig. 3
(a) The phylogenetic tree of 52 rockfish species of the genus Sebastes (Hyde and Vetter 2007) and the species relative abundances in three assemblages: 1980s (Decade I), 1990s (Decade II) and 2000s (Decade III). The age of the root is T = 7.9 Myr. (b) A sub-tree containing only the dominant species (those with relative abundance > 8 % in at least one assemblage); these species are marked in panel (a). All six species are shared by the three assemblages, and four of them diverged around 6 Myr ago (i.e., they have been in isolated lineages for 6 Myr). See Pavoine et al. (2009) for details

We separate the data into three decades: 1980s, 1990s and 2000s, which will be referred to as Assemblages (and Decades) I, II and III, respectively. Within each decade’s assemblage, species abundances are pooled. The species relative abundances for the three assemblages are shown in Fig. 3a. There were 48, 44 and 39 species in Decades I, II and III, respectively. (Note that each data point here is a mean of many years’ observations.) A sub-tree containing only the six dominant species (those with relative abundance > 8 % in at least one assemblage) is shown in Fig. 3b. All six species are shared by the three assemblages, and four of them have been in isolated lineages for 6 Myr.

As suggested in section “Phylogenetic Hill numbers and related measures”, we present two types of profiles for each assemblage. In Fig. 4a, we plot the measure \( {}{}^q\overline{D}\;(T) \) as a function of order q, 0 ≤ q ≤ 3, for two selected temporal perspectives: T = 0 (phylogeny is ignored) and T = 7.9 Myr (the whole phylogenetic tree in Fig. 3a is considered). In Fig. 4b, we plot \( {}{}^q\overline{D}\;(T) \) and \( {}^qPD(T) \) as functions of T separately for q = 0, 1, and 2 for 0 ≤ T ≤ 10.
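The two quantities plotted in these profiles can be sketched in a few lines of code. The example below is a minimal illustration, assuming an ultrametric tree truncated at depth T > 0 and the usual definitions \( {}^q\overline{D}(T)={\left[{\sum}_i\left({L}_i/T\right){a}_i^q\right]}^{1/\left(1-q\right)} \) and \( {}^qPD(T)=T\times {}^q\overline{D}(T) \), where \( L_i \) is the length of branch i and \( a_i \) is the total relative abundance descended from it. The function names and the three-species tree are hypothetical and are not the rockfish data.

```python
import math


def phylo_hill(branches, T, q):
    """Phylogenetic Hill number for an ultrametric tree truncated at depth T > 0.

    branches: list of (length, abundance) pairs, one per branch within [-T, 0];
    abundance is the summed relative abundance of the species descended from it.
    """
    if T <= 0:
        raise ValueError("T must be positive; T = 0 is the ordinary Hill number case")
    if math.isclose(q, 1.0):
        # Limit q -> 1: exponential of the phylogenetic entropy divided by T.
        entropy = -sum(L / T * a * math.log(a) for L, a in branches if a > 0)
        return math.exp(entropy)
    total = sum(L / T * a ** q for L, a in branches if a > 0)
    return total ** (1.0 / (1.0 - q))


def phylo_pd(branches, T, q):
    """Effective total lineage length: T times the phylogenetic Hill number."""
    return T * phylo_hill(branches, T, q)


# Hypothetical tree of depth T = 4: species A (0.5) and B (0.3) split 1 time unit
# ago; their common ancestor (abundance 0.8) split from C (0.2) at the root.
TOY_TREE = [(1.0, 0.5), (1.0, 0.3), (3.0, 0.8), (4.0, 0.2)]

for q in (0, 1, 2):
    print(q, round(phylo_hill(TOY_TREE, 4.0, q), 3), round(phylo_pd(TOY_TREE, 4.0, q), 3))
```

For q = 0 the second function returns the total branch length of the toy tree (Faith's PD), and for a star tree with all branches of length T both functions reduce to the ordinary Hill number (times T for the second).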

Fig. 4
(a) The first type of diversity profile plots \( {}{}^q\overline{D}\;(T) \) as a function of order q, 0 ≤ q ≤ 3, for two selected temporal perspectives: T = 0 (non-phylogenetic case) and T = 7.9 Myr (the age of the root of the phylogenetic tree in Fig. 3a). (b) The second type of diversity profile plots \( {}{}^q\overline{D}\;(T) \) (phylogenetic Hill number) and \( {}^qPD(T) \) (phylogenetic diversity) as functions of T, 0 ≤ T ≤ 10, separately for q = 0, 1 and 2

Based on our phylogenetic diversity measures, all profiles in Fig. 4 reveal that the diversity in the most recent decade (Decade III) is the lowest among the three decades in the rockfish assemblage. This implies an appreciable loss of species (as shown in the first type of profile for T = 0), loss of lineages (as shown in the second type of profile based on the measure \( {}{}^q\overline{D}\;(T) \)), and loss of evolutionary history (as shown in the second type of profile based on the measure \( {}^qPD(T) \)) over the three decades.

When species/lineage abundances are discounted (q = 0 in the left panels of Fig. 4b), both lineage richness (based on the measure \( {}{}^0\overline{D}\;(T) \)) and total branch length (based on the measure \( {}^0PD(T) \), i.e., Faith’s PD) exhibit the expected ordering: Decade I > Decade II > Decade III. When species/lineage abundances are counted (i.e., q = 1 and 2 in Fig. 4b), the profiles for Decades I and II cross because the abundances of the abundant species are more even in Decade II than in Decade I (see the first type of profile for T = 0 and Fig. 3a, b). Note that if the time-depth is greater than 6 Myr (including the age of the root), then all the abundance-sensitive phylogenetic measures for the three assemblages are very close because most of the dominant species began to diverge around 6 Myr ago (Fig. 3b). This also explains the closeness of the three profiles in the first type of profile for T = 7.9 Myr (the right panel in Fig. 4a).

To illustrate the phylogenetic differentiation among assemblages, we measure the phylogenetic differentiation between decades for three pairs (i.e., Decades I vs. II, Decades I vs. III and Decades II vs. III). To see how the phylogenetic differentiation measures vary with the order q and with the time perspective T, we show two types of profiles for each of the two differentiation measures \( 1-{\overline{C}}_{qN}(T) \) and \( 1-{\overline{U}}_{qN}(T) \) in Figs. 5 and 6. In Fig. 5a, we present the first type of profile, which plots the measure \( 1-{\overline{C}}_{qN}(T) \) as a function of q, where q is in the range [0, 3], for two time perspectives: T = 0 (non-phylogenetic case) and T = 7.9 Myr (the age of the root node). In Fig. 5b, the same type of differentiation profile is shown for the other measure \( 1-{\overline{U}}_{qN}(T) \). Then in Fig. 6a, b, we present the second type of profile, which shows the two measures as functions of the temporal perspective T, 0 ≤ T ≤ 10, for q = 0, 1 and 2 separately.
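A pairwise comparison of this kind can be sketched as follows. This is an illustration under stated assumptions, not the authors' R scripts: it assumes an ultrametric tree of depth T, the gamma and alpha formulas for phylogenetic Hill numbers as we understand them from Chiu et al. (2014), and the same transformations of beta used for the two overlap classes. The tree, the abundance counts and all function names are hypothetical.

```python
import math
from itertools import combinations

# Hypothetical ultrametric tree of depth T = 4: (branch length, descendant species).
TREE = [(1.0, {"A"}), (1.0, {"B"}), (3.0, {"A", "B"}), (4.0, {"C"})]
T = 4.0
# Hypothetical absolute abundances for three assemblages (not the rockfish data).
COUNTS = {
    "I": {"A": 50, "B": 30, "C": 20},
    "II": {"A": 20, "B": 30, "C": 50},
    "III": {"A": 10, "B": 40, "C": 50},
}


def branch_weights(tree, assemblages):
    """Per-branch abundances in each assemblage, divided by the pooled total."""
    total = sum(sum(a.values()) for a in assemblages)
    return [(L, [sum(a.get(s, 0) for s in desc) / total for a in assemblages])
            for L, desc in tree]


def decompose(branches, depth, q):
    """Return (gamma, alpha, beta) phylogenetic Hill numbers of order q."""
    n = len(branches[0][1])
    if math.isclose(q, 1.0):
        gamma = math.exp(-sum(L / depth * sum(w) * math.log(sum(w))
                              for L, w in branches if sum(w) > 0))
        alpha = math.exp(-sum(L / depth * wk * math.log(wk)
                              for L, w in branches for wk in w if wk > 0)) / n
    else:
        gamma = sum(L / depth * sum(w) ** q
                    for L, w in branches if sum(w) > 0) ** (1.0 / (1.0 - q))
        alpha = sum(L / depth * wk ** q
                    for L, w in branches for wk in w if wk > 0) ** (1.0 / (1.0 - q)) / n
    return gamma, alpha, gamma / alpha


def differentiation(beta, n, q):
    """Return (1 - C_qN, 1 - U_qN) from a beta diversity in [1, N]."""
    beta = min(max(beta, 1.0), float(n))  # absorb floating-point rounding
    if math.isclose(q, 1.0):
        d = math.log(beta) / math.log(n)
        return d, d
    c = ((1 / beta) ** (q - 1) - (1 / n) ** (q - 1)) / (1 - (1 / n) ** (q - 1))
    u = ((1 / beta) ** (1 - q) - (1 / n) ** (1 - q)) / (1 - (1 / n) ** (1 - q))
    return 1 - c, 1 - u


for first, second in combinations(COUNTS, 2):
    weights = branch_weights(TREE, [COUNTS[first], COUNTS[second]])
    for q in (0, 1, 2):
        _, _, beta = decompose(weights, T, q)
        one_minus_c, one_minus_u = differentiation(beta, 2, q)
        print(f"{first} vs. {second}, q={q}: 1-C={one_minus_c:.3f}, 1-U={one_minus_u:.3f}")
```

Repeating the loop over a grid of q values at fixed T, or over a grid of T values (with the tree truncated at each depth) at fixed q, would give the two types of differentiation profiles described above.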

Based on the two phylogenetic differentiation measures, all profiles in Figs. 5 and 6 show consistent patterns. When species/lineage abundances are discounted (q = 0), the differences among the differentiation measures of the three pairs of assemblages are not appreciable, as shown in the two left panels in Fig. 6 and in the initial point of each of the profiles in Fig. 5. When species/lineage abundances are counted (q > 0), the compositional differentiation between Decades I vs. II is generally close to that between Decades I vs. III, and the differentiation between the two recent decades (Decades II vs. III) is much lower than that of either of the other two pairs. This implies that the composition of species/lineage abundances changed after 1990. Examining the relative abundances of the dominant species listed in Fig. 3b, we see that the most abundant species in Decade I, S. paucispinis (23 %), became less abundant in both Decade II (9 %) and Decade III (11 %); the second most abundant species in Decade I, S. mystinus (11 %), became quite rare in both Decade II (4 %) and Decade III (5 %). Also, the species S. miniatus was rare in Decade I, but it became the most dominant species in both Decade II (12 %) and Decade III (25 %). These compositional changes for dominant species help explain the above findings.

As the time perspective T becomes large, more dominant shared lineages are added to the two assemblages, implying that the differentiation between any two assemblages should exhibit a non-increasing trend as T increases. Our two differentiation measures for q > 0 in Fig. 6 show the expected decreasing trend, and the rates of decline differ for q = 1 and q = 2. Based on Fig. 3b, most of the dominant and isolated species began to diverge around 6 Myr ago. Thus, the two differentiation profiles for q = 1 and 2 start to decrease sharply around 6 Myr, especially for order q = 2. Since the node abundances near the root (where the differentiation values are near zero) are relatively high and dominant in the whole tree, all values of the phylogenetic differentiation measures for T = 7.9 Myr (the first type of profile in the right panels of Fig. 5) are substantially lower than the corresponding non-phylogenetic values, as can be seen by comparing the two panels (T = 0 and T = 7.9 Myr) in each row of Fig. 5. The two types of profiles (Figs. 5a, b and 6a, b) demonstrate that the two differentiation measures \( 1-{\overline{C}}_{qN}(T) \) and \( 1-{\overline{U}}_{qN}(T) \) can incorporate the differences in both tree structure and lineage abundances.

In summary, our phylogenetic diversity measures show an appreciable loss of species, lineages and evolutionary history in the rockfish assemblage over time due to fishing pressure, and our phylogenetic differentiation measures show a pronounced change in species/lineage composition after 1990.

Fig. 5
(a) Differentiation profiles of the measure \( 1-{\overline{C}}_{qN}(T) \) and (b) of the measure \( 1-{\overline{U}}_{qN}(T) \) as a function of order q, 0 ≤ q ≤ 3, for two specific time perspectives: T = 0 (left panels, corresponding to non-phylogenetic differentiation profiles), and T = 7.9 Myr (right panels, corresponding to the profiles for the age of the root node of the pooled phylogenetic tree in Fig. 3a) for three pairs of assemblages (I vs. II, I vs. III, and II vs. III)

Fig. 6
(a) Differentiation profiles of the measure \( 1-{\overline{C}}_{qN}(T) \) and (b) of the measure \( 1-{\overline{U}}_{qN}(T) \), as a function of the time perspective (or time-depth) T, 0 ≤ T ≤ 10, for q = 0 (left panel), q = 1 (middle panel), and q = 2 (right panel) for three pairs of assemblages. All measures are computed for the interval [−T, 0], where T varies from 0 to 10

Conclusion

  1. To quantify the phylogenetic diversity of an assemblage, we suggest using two measures: (i) the phylogenetic Hill number \( {}{}^q\overline{D}\;(T) \) (Eqs. 4a and 4b), which measures “the effective number of equally abundant and equally distinct lineages all with branch lengths T”, and (ii) the phylogenetic or branch diversity \( {}^qPD(T) \) (Eqs. 5a and 5b), which measures the “effective total lineage-length”, i.e., the total evolutionary history of an assemblage since time T. These two measures depend explicitly on two parameters: the abundance sensitivity parameter q and the time perspective (or time-depth) parameter T.

  2. Two types of diversity profiles are recommended for considering species/branch abundances and phylogenetic information: (i) The first type of diversity profile is obtained by plotting \( {}^qPD(T) \) or \( {}{}^q\overline{D}\;(T) \) as a function of order q, for some selected values of the temporal perspective T, including T = 0 (i.e., the non-phylogenetic profile based on the ordinary Hill numbers) and T = the age of the most basal node. See Fig. 4a for an example. It would also be informative to include T = the age of the divergence between the group under study and the rest of the tree. (ii) The second type of diversity profile is obtained by plotting \( {}^qPD(T) \) and \( {}{}^q\overline{D}\;(T) \) as functions of T separately for q = 0, 1, and 2; see Fig. 4b for an example. The second type of profile shows the effect of time-depth or evolutionary change on our diversity measures.

  3. When there are multiple assemblages, the phylogenetic gamma Hill number is the effective number of equally abundant and equally distinct lineages in the pooled assemblage; the phylogenetic alpha Hill number is the effective number of equally abundant and equally distinct lineages per assemblage. Thus the phylogenetic beta Hill number, as the ratio of gamma to alpha, is interpreted as “the number of phylogenetically completely distinct assemblages”. In this case, alpha and beta are unrelated (or independent). The difference between the phylogenetic gamma and alpha Hill numbers is the lineage excess, which depends on both alpha and gamma. The phylogenetic beta Hill number and the lineage excess lead to the same classes of similarity and differentiation measures, listed in section “Normalized phylogenetic similarity measures”. See Table 1 for the two major classes of phylogenetic overlap measures, \( {\overline{C}}_{qN}(T) \) from a local perspective and \( {\overline{U}}_{qN}(T) \) from a regional perspective.

  4. To assess the phylogenetic resemblance or differentiation among assemblages, the same two types of profiles as in Point 2 are suggested for the two major classes of measures, \( {\overline{C}}_{qN}(T) \) and \( {\overline{U}}_{qN}(T) \) (Table 1); see Figs. 5 and 6 for examples.

References

  • Allen B, Kon M, Bar-Yam Y (2009) A new phylogenetic diversity measure generalizing the Shannon index and its application to phyllostomid bats. Am Nat 174(2):236–243

  • Bryant JA, Lamanna C, Morlon H, Kerkhoff AJ, Enquist BJ, Green JL (2008) Microbes on mountainsides: contrasting elevational patterns of bacterial and plant diversity. Proc Natl Acad Sci U S A 105:11505–11511

  • Cavender-Bares J, Ackerly DD, Kozak KH (2012) Integrating ecology and phylogenetics: the footprint of history in modern-day communities. Ecology 93(sp8):S1–S3

  • Cavender‐Bares J, Kozak KH, Fine PV, Kembel SW (2009) The merging of community ecology and phylogenetic biology. Ecol Lett 12(7):693–715

  • Chao A (2005) Species estimation and applications. In: Kotz S, Balakrishnan N, Read CB, Vidakovic B (eds) Encyclopedia of statistical sciences, vol 12, 2nd edn. Wiley, New York, pp 7907–7916

  • Chao A, Jost L, Chiang SC, Jiang Y-H, Chazdon RL (2008) A two-stage probabilistic approach to multiple-community similarity indices. Biometrics 64(4):1178–1186

  • Chao A, Chiu C-H, Jost L (2010) Phylogenetic diversity measures based on Hill numbers. Philos Trans R Soc Lond B 365:3599–3609

  • Chao A, Chiu C-H, Hsieh TC (2012) Proposing a resolution to debates on diversity partitioning. Ecology 93(9):2037–2051

  • Chao A, Chiu C-H, Hsieh TC, Davis T, Nipperess DA, Faith DP (2015) Rarefaction and extrapolation of phylogenetic diversity. Methods Ecol Evol 6:380–388

  • Chiu C-H, Jost L, Chao A (2014) Phylogenetic beta diversity, similarity, and differentiation measures based on Hill numbers. Ecol Monogr 84(1):21–44

  • Colwell RK, Coddington JA (1994) Estimating terrestrial biodiversity through extrapolation. Philos Trans R Soc Lond B 345:101–118

  • Crozier R (1997) Preserving the information content of species: genetic diversity, phylogeny, and conservation worth. Annu Rev Ecol Syst 28:243–268

  • Daróczy Z (1970) Generalized information functions. Inf Control 16(1):36–51

  • de Bello F, Lavorel S, Díaz S, Harrington R, Cornelissen JH, Bardgett RD, Berg MP, Cipriotti P, Feld CK, Hering D (2010) Towards an assessment of multiple ecosystem processes and services via functional traits. Biodivers Conserv 19(10):2873–2893

  • Ellison AM (2010) Partitioning diversity. Ecology 91:1962–1963

  • Faith DP (1992) Conservation evaluation and phylogenetic diversity. Biol Cons 61(1):1–10

  • Faith DP (2013) Biodiversity and evolutionary history: useful extensions of the PD phylogenetic diversity assessment framework. Ann NY Acad Sci 1289(1):69–89

  • Faith DP, Richards ZT (2012) Climate change impacts on the tree of life: changes in phylogenetic diversity illustrated for Acropora corals. Biology 1(3):906–932

  • Faith DP, Lozupone CA, Nipperess D, Knight R (2009) The cladistic basis for the phylogenetic diversity (PD) measure links evolutionary features to environmental gradients and supports broad applications of microbial ecology’s “phylogenetic beta diversity” framework. Int J Mol Sci 10:4723–4741

  • Ferrier S, Manion G, Elith J, Richardson K (2007) Using generalized dissimilarity modelling to analyse and predict patterns of beta diversity in regional biodiversity assessment. Divers Distrib 13(3):252–264

  • Gotelli NJ, Colwell RK (2011) Estimating species richness. In: Magurran AE, McGill BJ (eds) Biological diversity: frontiers in measurement and assessment. Oxford University Press, Oxford, pp 39–54

  • Hannah L, Kay JA (1977) Concentration in the modern industry: theory, measurement, and the U.K. experience. MacMillan, London

  • Havrda J, Charvát F (1967) Quantification method of classification processes: concept of structural α-entropy. Kybernetika 3(1):30–35

  • Helmus MR, Bland TJ, Williams CK, Ives AR (2007) Phylogenetic measures of biodiversity. Am Nat 169(3):E68–E83

  • Hill M (1973) Diversity and evenness: a unifying notation and its consequences. Ecology 54(2):427–432

  • Horn HS (1966) Measurement of “overlap” in comparative ecological studies. Am Nat 100:419–424

  • Hubbell SP (2001) A unified theory of biodiversity and biogeography. Princeton University Press, Princeton

  • Hurlbert SH (1971) The nonconcept of species diversity: a critique and alternative parameters. Ecology 52(4):577–586

  • Hyde JR, Vetter RD (2007) The origin, evolution, and diversification of rockfishes of the genus Sebastes (Cuvier). Mol Phylogenet Evol 44:790–811

  • Ives AR, Helmus MR (2010) Phylogenetic metrics of community similarity. Am Nat 176:128–142

  • Ives AR, Helmus MR (2011) Generalized linear mixed models for phylogenetic analyses of community structure. Ecol Monogr 81(3):511–525

  • Jost L (2006) Entropy and diversity. Oikos 113(2):363–375

  • Jost L (2007) Partitioning diversity into independent alpha and beta components. Ecology 88:2427–2439

  • Jost L (2009) Mismeasuring biological diversity: response to Hoffman and Hoffman (2008). Ecol Econ 68:925–927

  • Jost L (2010) Independence of alpha and beta diversities. Ecology 91:1969–1974

  • Jost L, DeVries P, Walla T, Greeney H, Chao A, Ricotta C (2010) Partitioning diversity for conservation analyses. Divers Distrib 16(1):65–76

  • Keylock C (2005) Simpson diversity and the Shannon-Wiener index as special cases of a generalized entropy. Oikos 109:203–207

  • Kimura M, Crow JF (1964) The number of alleles that can be maintained in a finite population. Genetics 49(4):725–738

  • Lozupone C, Knight R (2005) UniFrac: a new phylogenetic method for comparing microbial communities. Appl Environ Microbiol 71(12):8228–8235

  • MacArthur RH (1965) Patterns of species diversity. Biol Rev 40(4):510–533

  • MacArthur RH (1972) Geographical ecology. Harper & Row, New York

  • MacArthur RH, Wilson EO (1967) The theory of island biogeography. Princeton University Press, Princeton

  • Magurran AE (2004) Measuring biological diversity. Blackwell, Oxford

  • Magurran AE, McGill BJ (eds) (2011) Biological diversity: frontiers in measurement and assessment. Oxford University Press, Oxford

  • McPeek MA, Miller TE (1996) Evolutionary biology and community ecology. Ecology 77(5):1319–1320

  • Morisita M (1959) Measuring of interspecific association and similarity between communities. Memoires of the Faculty of Science, Kyushu University. Ser E (Biol) 3:65–80

  • Mouchet MA, Mouillot D (2011) Decomposing phylogenetic entropy into α, β and γ components. Biol Lett 7(2):205–209

  • Nipperess DA, Matsen FA IV (2013) The mean and variance of phylogenetic diversity under rarefaction. Methods Ecol Evol 4:566–572

  • Patil GP, Taillie C (1979) A study on diversity profiles and orderings for a bird community in the vicinity of Colstrip, Montana. In: Patil GP, Rosenzweig M (eds) Contemporary quantitative ecology and related econometrics. International Co-operative Publishing House, Fairland, pp 23–48

  • Patil GP, Taillie C (1982) Diversity as a concept and its measurement. J Am Stat Assoc 77:548–561

  • Pavoine S, Love MS, Bonsall MB (2009) Hierarchical partitioning of evolutionary and ecological patterns in the organization of phylogenetically structured species assemblages: application to rockfish (genus: Sebastes) in the Southern California Bight. Ecol Lett 12(9):898–908

  • Pavoine S, Baguette M, Bonsall MB (2010) Decomposition of trait diversity among the nodes of a phylogenetic tree. Ecol Monogr 80(3):485–507

  • Peet RK (1974) The measurement of species diversity. Annu Rev Ecol Syst 5:285–307

  • Petchey OL, Gaston KJ (2002) Functional diversity (FD), species richness and community composition. Ecol Lett 5(3):402–411

  • Pielou EC (1975) Ecological diversity. Wiley, New York

  • Purvis A, Hector A (2000) Getting the measure of biodiversity. Nature 405(6783):212–219

  • Rao CR (1982) Diversity and dissimilarity coefficients: a unified approach. Theor Popul Biol 21(1):24–43

  • Rényi A (1961) On measures of entropy and information. In: Neyman J (ed) Proceedings of the fourth Berkeley symposium on mathematical statistics and probability, vol 1. University of California Press, Berkeley, pp 547–561

  • Ricotta C, Szeidl L (2009) Diversity partitioning of Rao’s quadratic entropy. Theor Popul Biol 76(4):299–302

  • Routledge R (1979) Diversity indices: which ones are admissible? J Theor Biol 76(4):503–515

  • Tilman D (2001) Functional diversity. In: Levin SA (ed) Encyclopedia of biodiversity, vol 3. Academic Press, San Diego, pp 109–120

  • Tsallis C (1988) Possible generalization of Boltzmann-Gibbs statistics. J Stat Phys 52(1):479–487

  • Vellend M, Cornwell WK, Magnuson-Ford K, Mooers A (2011) Measuring phylogenetic biodiversity. In: Magurran AE, McGill BJ (eds) Biological diversity: frontiers in measurement and assessment. Oxford University Press, Oxford, pp 194–207

  • Warwick R, Clarke K (1995) New ‘biodiversity’ measures reveal a decrease in taxonomic distinctness with increasing stress. Mar Ecol Prog Ser 129(1):301–305

  • Webb CO (2000) Exploring the phylogenetic structure of ecological communities: an example for rain forest trees. Am Nat 156:145–155

  • Webb CO, Ackerly DD, McPeek MA, Donoghue MJ (2002) Phylogenies and community ecology. Annu Rev Ecol Syst 33:475–505

  • Webb CO, Losos JB, Agrawal AA (2006) Integrating phylogenies into community ecology. Ecology 87(sp7):S1–S2

  • Weiher E (2011) A primer of trait and functional diversity. In: Magurran AE, McGill BJ (eds) Biological diversity: frontiers in measurement and assessment. Oxford University Press, Oxford, pp 175–193

  • Whittaker RH (1972) Evolution and measurement of species diversity. Taxon 21:213–251

Acknowledgements

The authors thank Roseli Pellens, Samuel Scheiner and an anonymous reviewer for helpful comments and suggestions. This work was supported by the Taiwan National Science Council under Contracts 100-2118-M007-006 and 103-2628-M007-007 for AC. CC is supported by a post-doctoral fellowship, National Tsing Hua University, Taiwan. LJ was supported by a donation from John V. Moore to the Population Biology Foundation. All numerical results presented in section “An example” of this chapter were obtained by expanding the R scripts provided in Pavoine et al. (2009, their Appendix S1) to include Hill numbers and our phylogenetic measures.

Author information

Authors and Affiliations

  1. Institute of Statistics, National Tsing Hua University, Hsin-Chu, 30043, Taiwan

    Anne Chao & Chun-Huo Chiu

  2. EcoMinga Foundation, Baños, Tungurahua, Ecuador

    Lou Jost

Corresponding author

Correspondence to Anne Chao.

Editor information

Editors and Affiliations

  1. Institut de Systématique, Evolution, Biodiversité, ISYEB – UMR 7205 CNRS MNHN UPMC EPHE, Muséum National d’Histoire Naturelle, Sorbonne Universités, Paris, France

    Roseli Pellens & Philippe Grandcolas

Rights and permissions

Open Access This chapter is distributed under the terms of the Creative Commons Attribution-Noncommercial 2.5 License (http://creativecommons.org/licenses/by-nc/2.5/) which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

The images or other third party material in this chapter are included in the work’s Creative Commons license, unless indicated otherwise in the credit line; if such material is not included in the work’s Creative Commons license and the respective action is not permitted by statutory regulation, users will need to obtain permission from the license holder to duplicate, adapt or reproduce the material.

Copyright information

© 2016 The Author(s)

About this chapter

Cite this chapter

Chao, A., Chiu, CH., Jost, L. (2016). Phylogenetic Diversity Measures and Their Decomposition: A Framework Based on Hill Numbers. In: Pellens, R., Grandcolas, P. (eds) Biodiversity Conservation and Phylogenetic Systematics. Topics in Biodiversity and Conservation, vol 14. Springer, Cham. https://doi.org/10.1007/978-3-319-22461-9_8
