当前位置: 首页 > 医学版 > 期刊论文 > 基础医学 > 分子生物学进展 > 2004年 > 第2期 > 正文
编号:11259325
Ancient DNA Enables Timing of the Pleistocene Origin and Holocene Expansion of Two Adélie Penguin Lineages in Antarctica
     * Allan Wilson Centre for Molecular Ecology and Evolution, Auckland

    Institute of Molecular BioSciences, Massey University, Auckland

    School of Biological Sciences, University of Auckland, Auckland, New Zealand

    Dipartmento Scienze della Terra, Università di Pisa, and Consiglio Nazionale Ricerche, Centro Studio Geologia Strutturale, Pisa, Italy

    E mail: d.m.lambert@massey.ac.nz.

    Abstract

    The timing of divergent events in history is one of the central goals of contemporary evolutionary biology. Such studies are however dependent on accurate evolutionary rates. Recent developments in ancient DNA analysis enable the estimation of more accurate evolutionary rates and therefore more accurate timing of divergence events. Consequently, this leads to a better understanding of changes in populations through time. We use an evolutionary rate calculated from ancient DNA of Adélie penguins (Pygoscelis adeliae) to time divergent events in their history. We report the presence of two distinct and highly variable mitochondrial DNA lineages and track changes in these lineages through space and time. When the ancient DNA and the phylogenetic rates are used to estimate the time of origin of the lineages, two very different estimates resulted. In addition, these same rates provide very different estimates of the time of expansion of these lineages. We suggest that the rate calculated from ancient DNA is more consistent with the glacial history of Antarctica and requires fewer assumptions than does a narrative based on the phylogenetic rate. Finally, we suggest that our study indicates an important new role for ancient DNA studies in the timing of divergent events in history.

    Key Words: molecular clock ? phylogeography ? control region ? Bayesian inference ? rate heterogeneity

    Introduction

    Evolutionary biology, over the past 100 years, has focused on first determining phylogenetic relationships among groups of organisms and, second, representing these relationships against time scales. The latter requires, however, accurate estimates of rates of evolution. These can then be used to better understand changing patterns of genetic variation. More precise and higher resolution descriptions of patterns of genetic variation in populations through space and time are central activities in contemporary evolutionary biology. Accurate descriptions of such patterns can be informative of past vicariant events that have had an impact on the evolution of diverse biota. Such vicariant events are typically associated with periods of global environmental change. The Pleistocene epoch (2 Myr BP to 10 kyr BP) was, for example, characterized by a series of global ice ages that were remarkably synchronous (Williams 1989). Over the past 900 kyr, each glacial cycle lasted about 100 kyr and was associated with eccentricity changes in the Earth's orbit (Imbrie et al. 1992). During these ice ages, many populations and species are thought to have split into refugia (Roberts 1998). Each ice age was followed by relatively short deglacial periods, the most recent of which occurred during the Holocene (<10 kyr BP). The latter warm period was characterized by the expansion of many populations from ice age refugia (Roberts 1998).

    In Antarctica, the Late Pleistocene was distinguished by the repeated expansion and collapse of huge marine-based ice shelves, as well as by fluctuations in ice volume on the Antarctic landmass. These changes would have certainly caused large-scale disruptions to animal populations, as well as to their habitats. For example, Adélie penguins (Pygoscelis adeliae) breed in large colonies on ice-free areas close to the sea (Ainley, LaReseche, and Sladen 1983). Ross Island is presently the home to several hundred thousand breeding pairs of birds and is situated at the edge of the Ross Ice Shelf. In Antarctica, even small fluctuations in the volume of ice can have a significant impact on the number and extent of ice-free areas (Colhoun et al. 1992). At the last glacial maximum, the entire Ross Sea coastline(fig. 1) was uninhabitable to Adélie penguins because of the extent of the Ross Ice Shelf, and Ross Island itself was almost 900 km from the sea (Denton et al. 1989). Hence, these areas were uninhabitable to Adélie penguins, and such glacial events have undoubtedly influenced penguin abundance, distribution, and genetic diversity.

    FIG. 1. (A) and (B) The locations on Antarctica where 16 modern populations of Adélie penguins and the 17 relic colonies containing subfossil remains were sampled. Circled numbers correspond to the sites listed in tables 1 and 2. The areas marked by dots show the two largest marine-based ice shelves, both fed from the West Antarctic Ice Sheet. The dashed line in the insert marks the grounding line of the Ross Ice Shelf at the last glacial maximum (LGM) and during its retreat 7,600 years BP. Each pie diagram indicates the relative frequencies of the A (black) and RS (white) lineages in each regional grouping. (C), (D), and (E) The subfossil sampling locations on the coast of the Ross Sea and the ages and lineages for all 96 ancient DNA sequences recovered from three different time periods

    Because of the presence of extensive ancient DNA deposits in the undisturbed Antarctic environment, Adélie penguins have enabled the direct estimation of a rate of evolution of the control region of the mitochondrial genome (Lambert et al. 2002). Consequently, this species provides an ideal model to detect any vicariant events precipitated by major global environmental changes; for example, the Pleistocene cooling and later Holocene warming of Antarctica (Roberts 1998). The timing of vicariant events that have structured populations or resulted in speciation has generally been inferred using a "molecular clock" (Stoneking et al. 1992; Rand 1994; Klicka and Zink 1997; Held 2001). To estimate the divergence times between lineages, virtually all authors use phylogenetic rates adopted from other species or populations. Molecular clocks are typically calibrated in absolute time by comparison of sequences from species whose time of divergence is established from either known-age fossils (e.g., Rand 1994), vicariant events (e.g., Held 2001), or archaeological evidence (e.g., Stoneking et al. 1992). However, in some cases, rates estimated in this way result in divergence times that are grossly inconsistent with the fossil record when applied in other lineages or to other taxa (Heckman et al. 2001). Perhaps this is not surprising, because a range of errors can affect phylogenetic methods for estimating evolutionary rates. First, inadequate sampling of taxa and short sequences can result in an inaccurate tree topology. Second, a series of potential errors involves the use of the fossil record. These potential errors include misdiagnosis of fossil taxa, uncertainty in dating fossils, inaccurate positioning of relevant fossils on a phylogenetic tree, improper dating of the stem lineage of an extant group, and the minimum age estimates that fossils provide (Soltis et al. 2002). Moreover, other factors that can bias estimates of rate may include a history of mitochondrial introgression, nonneutrality, or rate heterogeneity among lineages (Mindell and Thacker 1996). This diversity of potential errors, in any particular case, is likely to result in either an overestimate or underestimate of the rate of evolution.

    The first estimate for a rate of evolution for an avian mitochondrial genome was from Anser and Branta geese (Shields and Wilson 1987). Using restriction fragment length polymorphisms (RFLPs) of mitochondrial genomes, Shields and Wilson recorded 9% sequence difference between Anser and Branta geese. Knowing that that the oldest fossils from each genus were dated at 4 to 5 Myr, Shields and Wilson (1987) proposed a mean rate of divergence of 2%/Myr (9/4.5 = 2) for the entire mitochondrial genome. Quinn (1992) later estimated that the Branta mitochondrial DNA (mtDNA) control region (CR) evolved at a rate of 20.8%/Myr. This was based on the finding that the 5' end of the CR of two Branta subspecies evolved 10.4 times faster than the entire mitochondrial genome (2%/Myr [Shields and Wilson 1987]). This relative difference was suggested because the CRs of these two subspecies had 13.5% sequence difference, accompanied by 1.3% RFLP difference between their entire mitochondrial genomes. This was the basis on which Quinn (1992) calculated a 10.4 times difference in rates between the entire molecule and the CR itself (13.5/1.3 = 10.4). Subsequently, a large number of authors have adopted Quinn's relative rate estimate of 20.8%/Myr as a general rate for the CR of avian species (for example Baker and Marshall 1997).

    Not withstanding all the above difficulties, until recently, the phylogenetic approach has represented the only method for estimating evolutionary rates and thereby timing divergent events. However, using a CR rate of evolution calculated from ancient DNA, we report here changes through space and time of two highly divergent mitochondrial lineages of Adélie penguins. Furthermore, we estimated the time to the most recent common ancestor (tMRCA) of mitochondrial lineages, using both the phylogenetic rate typically adopted for bird species (i.e., 20.8%/Myr) and the ancient DNA rate specific to Adélie penguins (i.e., 53%/Myr to 143%/Myr). These two rates resulted in very different times of origin of the lineages. We discuss the relative merits of these rates and suggest that the time of divergence of lineages, as estimated using ancient DNA data, is more consistent with the glacial history of Antarctica.

    Materials and Methods

    Sample Collection

    Blood sampling of Adélie penguins from 16 populations (table 1) was conducted in Antarctica during the austral summers of 1996 to 2001. Subfossil Adélie penguin bones were collected from 17 locations (table 2) along the coast of the Ross Sea, which included relic nesting sites in the vicinity of presently occupied rookeries, as well as from abandoned rookeries. We dug for subfossil bones using a stratigraphic method of excavation (Baroni and Orombelli 1994). Subfossil bone samples were collected and stored frozen, then returned to the laboratory where they were kept at –20°C. A total of 380 sequences from modern and 96 sequences from ancient Adélie penguin samples have been included in the present study from our previous work (Lambert et al. 2002). These sequences can be retrieved at the GenBank (accession numbers AF474792 to AF474997).

    Table 1 The Locations in Antarctica Where Modern Penguin Populations Were Sampled.

    Table 2 The Locations in Antarctica Where Subfossil Bones Were Sampled.

    DNA Extraction and Sequencing

    Modern DNA was extracted from 177 whole-blood samples following standard methods (Sambrook, Fritsch, and Maniatus 1989). The hypervariable region 1 (HVR-1) of the mtDNA control region was amplified using PCR and primers specific to Adélie penguins. For the modern DNA samples, we amplified a 640-bp fragment using the primers L-tRNAGlu (5'-CCCGCTTGGCTTYTCTCCAAGGTC-3') and H-A650 (5'-CTGACATAGGAACCAGAGGCGC-3') (Ritchie and Lambert 2000; Roeder, Ritchie, and Lambert 2002). Samples were amplified using a Hybaid Omn-E thermal cycler at 94°C for 10 s, 50°C or 61°C for 10 s, and 72°C for 25 s for 30 cycles. All resulting PCR products were purified using High Pure PCR purification columns (Roche), sequenced using the ABI PRISM? BigDyeTM Terminator Cycle Sequencing kit (Applied Biosystems), and analyzed on a ABI 377A automated sequencer. DNA sequences were aligned using ClustalW (Thompson, Higgins, and Gibson 1994) and can be retrieved at the GenBank accession numbers AY525167–AY525326.

    Summary Statistics

    The rate of each type of nucleotide substitution relative to GT was estimated under a general time-reversible model (GTR) with maximum likelihood (Rodríquez et al. 1990) using PAUP* version 4.0 (Swofford 2002). Haplotypic (h) and nucleotide ( ± SE) diversity, number of segregating sites (S), and nucleotide differences (k) were estimated in DNASP version 3.5 (Rozas and Rozas 1999). To test for neutrality, we calculated Tajima's D statistic (Tajima 1989), the difference between S and , which are both estimates of the expected number of substitutions per site.

    Phylogenetic Analyses

    A phylogenetic analysis of all sequences using the neighbor-joining (NJ) method (Saitou and Nei 1987) was conducted using a HKY85 substitution model (Hasegawa, Kishino, and Yano 1985) in PAUP* version 4.0. The phylogenetic tree was rooted with two sequences from chinstrap penguins (P. antarctica) and one sequence from a gentoo penguin (P. papua). We assessed the level of support for the major features of the NJ tree in the data set by resampling with a bootstrap procedure of 1,000 replicates. Support for the monophyly of lineages was also assessed by simulating the uncertainty of the tree () given the data set (X) in a Bayesian framework. The analysis began with the NJ tree, and the posterior probability densities were estimated with a Metropolis-coupled Markov chain Monte Carlo (MCMC) integration algorithm using MrBayes version 2.01 (Huelsenbeck and Ronquist 2001).

    Time to a Most Recent Common Ancestor

    The tMRCA was estimated for the modern HVR-1 sequences (352 bp) by simulating the uncertainty of the genealogy in a Bayesian framework, using the 95% highest probability density (HPD) (Box and Tiao 1992) intervals for μe using a population model of constant growth. In addition, we also estimated divergence times given the phylogenetic rate of nucleotide change of 0.208 substitutions/site/Myr (s/s/Myr). In both cases, the posterior probability density of the tMRCA was estimated from a distribution of likely trees, which were calculated as a product of a likelihood function Pr(X|, μe, Q), where Q is a GTR substitution model, the prior probabilities of the coalescent process (,|Ne), μe, and Ne (effective population size), and Q (Drummond et al. 2002). The posterior probability densities were approximated using MCMC integration in MEPI version 1.0 (Drummond et al. 2002). The prior probabilities for Ne were allowed to vary between 105 and 107 individuals. The estimated evolutionary rate for Adélie penguin HVR-1 sequences, and their clocklike behavior, has previously been established at 0.96 s/s/Myr) (95% HPD interval 0.53 to 1.43) (Lambert et al. 2002). In the first analysis, μe was allowed to vary between 0.53 and 1.43, and for the second analysis, μe was fixed at a point estimate of 0.208 s/s/Myr. The latter rate has been estimated from the divergence of Anser and Branta species of geese (Shields and Wilson 1987; Quinn 1992) and is a generally accepted phylogenetic rate for the HVR-1 of avian species. The Markov chains were 107 steps long, and the first 5 x 105 iterations were discarded as the burn-in time, until a steady state was reached.

    Results

    The HVR-1 sequences from the modern Adélie penguins were typically 594 bp long and were aligned to 96 ancient DNA sequences 352 bp long. These sequences are homologous to the CR sequences for which Quinn (1992) estimated an evolutionary rate of 20.8%/Myr. For all modern samples (n = 557), the homologous 352 bp region showed unequal base frequencies (fA = 0.31, fT = 0.30, fC = 0.19, and fG = 0.20 from the H-strand sequence) and revealed 440 haplotypes among 557 sequences (h = 0.998). The relative rates for each substitution type was biased towards transitions (AC = 1.11, AG = 21.52, AT = 0.33, CG = 1.49, and CT = 15.99, relative to GT = 1.0 from the heavy strand sequence) over 193 segregating sites, of which 47 were singleton changes, and = 0.043 (±SE 0.002). Tajima's D statistic was –1.712, and there was no significant deviation from neutrality (P > 0.05).

    Two Mitochondrial DNA Lineages

    The most prominent feature of any phylogenetic analysis of HVR-1 sequence data for Adélie penguins is the presence of two monophyletic lineages of Adélie penguins (fig. 2). One lineage was recorded mainly in the Ross Sea and is designated the Ross Sea (RS) lineage, whereas the other was present at all locations around the Antarctic continent and was consequently designated the Antarctic (A) lineage (fig. 1A). There were, on average, 27.7 nucleotide differences between sequences from the A and RS lineages (the overall d = 0.081 ± SE 0.012, and the net d = 0.057), significantly more than the average number of differences among sequences within each lineage (kA = 7.1 and kRS = 9.8). The two lineages shared 49 polymorphic sites, but there were no fixed differences between them. Sequences from the A lineage had 98 polymorphic sites that were monomorphic among the RS sequences, whereas sequences from the RS lineage had 46 polymorphic sites that were all monomorphic among the A lineage sequences. When the two outgroup species, the chinstrap and gentoo penguins, were compared with the Adélie penguins sequences, the proportion of nucleotide differences were 0.305 (±SE 0.022) and 0.294 (±SE 0.027), respectively. Furthermore, there was a 100 bp insertion/deletion event when the HVR-1 sequences from Adélie and gentoo penguins were compared with those from chinstraps.

    FIG. 2. The neighbor-joining tree of the 557 modern and 96 ancient Adélie penguin HVR-1 sequences and the outgroup is represented by the chinstrap and gentoo penguins. The monophyly of Antarctic (A) and Ross Sea (RS) Adélie penguin lineages was well supported by the bootstrap analysis (100%) and Bayesian inference (1.0)

    The monophyly of each lineage was well supported. There was 100% bootstrap support for the split, and the Metropolis-coupled MCMC algorithm found that the monophyly of each lineage was 100% credible. A sequence from one lineage never clustered with a sequence from the other lineage in the full posterior distribution of likelihood trees. Plots of the log probability of the observed data through time showed that the Markov chain reached stability after about 8,000 iterations, and the first 50,000 iterations were discarded as the burn-in time (data not shown). To check for convergence, a second run was conducted that began with a random tree, and again this showed no topological variance in the monophyly of the A and RS lineages.

    The Geographic Pattern of mtDNA Lineages

    Populations in the Ross Sea have a higher nucleotide diversity ( = 0.042 ± SE 0.006) than those sampled from other Antarctic locations ( = 0.019 ± SE 0.001), because of the presence of both lineages in the Ross Sea and the absence of one lineage outside of that region. A feature of the lineage proportions is the decreasing frequency of the RS lineage with increasing latitude. The geographic distribution of each lineage is presented in figure 1A. In figure 1A, the frequency cline has been divided into regional groups according to known locations of the grounded ice shelf during the retreat of the Ross Ice sheet since the last glacial maximum. These five regional groupings were designated for convenience. They include Ross Island (Capes Bird, Royds and Crozier, and Beaufort Island), Terra Nova (Franklin and Inexpressible Islands, Adélie Cove and Edmonson Point), Adare (Capes Wheatstone, Hallett, and Adare), Balleny (Balleny Islands and Port Martin), and the non-Ross Sea locations (Torgersen, Welsh, and Gardner Islands). The number of the A and RS lineages in each of these regional groups are presented in table 1, and their relative frequencies are presented in figure 1A. The existence of a cline in A/RS proportions is generally supported by a series of contingence 2-test that were performed to compare the relative proportions of each lineage between regions. This test showed a significant difference in all comparisons between regional groupings (P < 0.001), excluding the comparison of Ross Island with Terra Nova (2 = 2.243, P = 0.191) and Balleny with non–Ross Sea regional groups (2 = 2.034, P = 0.99).

    Ancient DNA Samples

    DNA has previously been recovered, and HVR-1 sequences were determined from 96 subfossil bone samples (Lambert et al. 2002). The geographic distribution of the ages for each subfossil bone and its respective lineage are presented in figure 1B. Younger bones tended to be recovered from more southern locations, and the largest numbers of old bones were concentrated in the Terra Nova Bay area. DNA was recovered from 15 samples dated to between 5,706 and 6,082 years BP from Dunlop and Inexpressible islands and revealed similar numbers of the A (n = 8) and RS (n = 7) lineages. There was a significant difference between the relative frequencies of the two lineages when the pooled modern Ross Island and Terra Nova populations were compared with the relic Dunlop and Inexpressible islands populations (2 = 4.634, P = 0.037). There was no significant difference between the relative frequencies of the two lineages recovered from the less than 2,500 years and the 2,500 to 5,000 years BP time periods when compared with the modern populations in the same region (P > 0.05). The average divergence between the A and RS lineages was estimated for sequences that were older than 2,000 years (mean age = 4,200 years BP) at 0.074 (±SE 0.010). This result is slightly lower than the same comparison among the modern samples and may indicate a detectable amount of evolutionary change between the two lineages through time.

    The Posterior Distribution of Times to a Common Ancestor

    The estimated number of independent samples in the Markov chain, calculated by dividing the chain length by the integrated autocorrelation time, was 744 when the rate varied between 0.53 and 1.43 s/s/Myr (assuming a constant population) and 972 when the rate was fixed at 0.205 s/s/Myr. There was a significant difference between the two posterior probability densities of tMRCA (figs. 3 and 4). Using the rate derived from ancient DNA sampling of Adélie penguins, the MCMC approach estimated the tMRCA A-RS at 75 kyr BP (95% HPD interval 37 to 122 kyr BP). In contrast, using the phylogenetic rate, the same approach estimated the tMRCA A-RS at 308 kyr BP (95% HPD interval 227 to 391 kyr BP). The median estimate of the tMRCA A-A using the rate derived from ancient DNA sampling of Adélie penguins was 30 kyr BP (95% HPD interval 16 to 52 kyr BP) and tMRCA RS-RS was 32 kyr BP (95% HPD interval 18 to 52 kyr BP). In contrast, the median estimates of the tMRCA A-A using the phylogenetic rate were 127 kyr BP (95% HPD interval 89 to 176 kyr BP) and the tMRCA RS-RS were 143 kyr BP (95% HPD interval 121 to 186 kyr BP). These estimated divergence times differ from those previously presented in (Lambert et al. 2002) because the latter represented only estimates based on our best estimate of the rate of evolution and the sequence divergence between lineages.

    FIG. 3. The temperature changes over the past 420 kyr BP from present day at Vostok on the East Antarctic Ice Sheet are from Petit et al. (1999). The interglacial periods are interpreted from the temperature profile. Above the temperature graph are the posterior probability densities for the times to the most recent common ancestor using the "Adélie penguin" rate (left) and the avian "phylogenetic" rate (right). Arrows indicate the median value of divergence times of each distribution

    FIG. 4. The temperature changes on the East Antarctic Ice Sheet over the past 200 kyr BP from Petit et al. (1999) and the posterior probability densities for the times to the most recent common ancestor of each mitochondrial lineage. The tMRCA for the A and RS lineages were estimated separately using both the "Adélie penguin" rate (left) and the "phylogenetic" rate (right)

    Discussion

    The Origin of Two Lineages

    The present study has revealed the existence of two distinct and genetically diverse mtDNA lineages of Adélie penguins. We report that these groups of haplotypes are unevenly distributed through time and space around Antarctica. The fact that the Antarctic (A) lineage is distributed around the continent, whereas the Ross Sea (RS) lineage is restricted to the Ross Sea, is consistent with two ancestral refugia that were subject to population bottlenecks. The evidence presented of a cline in A/RS frequencies in the Ross Sea suggests subsequent expansion and secondary admixture. An evolutionary rate estimated for HVR-1 using Adélie penguin ancient DNA sequences of 0.96 s/s/Myr (0.53 to 1.43 s/s/Myr) suggests that the time to a most recent common ancestor of these two lineages (tMRCA A-RS) is 75 kyr BP (37 to 122 kyr BP). The latter estimate is for a constant population. Simulations for an exponentially growing population resulted in similar estimates (data not shown). This tMRCA A-RS is the best estimate of the age of the sequence that gave rise to all modern haplotypes of both lineages. Given that there was almost certainly a level of existing polymorphisms in the population at the time of separation of the lineages, this estimate represents the maximum age of any period of lineage sorting. Lineage sorting results from a population bottleneck in which a diverse set of haplotypes is reduced in number. A number of extreme models, consistent with these data, can be envisaged. First, the period of sorting might have begun after the onset of the last glacial cycle. An ancestral population of mtDNA haplotypes would have been dramatically reduced so that only the two ancestors of the A and RS lineages would have survived. Over the last glacial cycle, these two lineages would have accumulated differences as Adélie penguins expanded in population size and haplotype diversity, starting around the onset of the deglacial period. Second, lineage sorting may have occurred late in the recent glacial cycle, when ice conditions were at their most severe and not long before the subsequent expansion associated with the deglacial warming. In the latter scenario, there would be a significant temporal separation between the tMRCA and the sorting event itself.

    Using the ancient DNA rate of evolution calculated for Adélie penguins, the estimated time to a most recent common ancestor of both lineages (tMRCA A-RS) is consistent with our understanding of the glacial history of Antarctica. The Late Pleistocene climate in Antarctica has been well documented; for example, temperature records are available from ice cores taken from Vostok on the East Antarctic Ice Sheet (fig. 3). These records date back to 420 kyr BP (Petit et al. 1999), and provide a framework to test hypotheses about the relationship between environmental change and the history of Adélie penguins. A recent date of divergence of the lineages (approximately 75 kyr BP) coincides with the middle of the last glacial cycle. At that time, Adélie penguins may have had limited opportunities to breed on the Antarctic mainland, and our results suggest they were separated into refugia. Certainly at the time of the last glacial maximum, ice-free areas would have been rare, if present at all in Antarctica (Colhoun et al. 1992). The isolation of the Antarctic continent from other major landmasses would have limited the ability of Adélie penguins to progressively move north as ice-free areas were reduced and perhaps totally eliminated. Moreover, the expansion of the sea-ice would have meant that breeding grounds were far removed from the open sea, even if ice-free areas were present on the continent. Regular visiting of such feeding grounds—a necessity during the breeding season when adults are raising chicks—would be energetically expensive, if even physically possible. Such sea ice covering would have been extensive. Benthic sedimentation of 13C-rich organic material from sea-ice algae shows that at the last glacial maximum the summer sea ice may have extended as far as the winter sea ice now reaches (fig. 1A) (Cooke and Hays 1982; Gersonde and Zielinki 2000). The latter extends from hundreds to thousands of kilometers from the existing coastline and specifically the Ross Sea would have been completely covered at that time. These large-scale disruptions to gene flow between ice-age Adélie penguin refugia are likely to have persisted as recently as 17 to 20 kyr BP (Baroni and Orombelli 1994).

    In contrast, the phylogenetic rate (0.208 s/s/Myr) suggests that the tMRCA A-RS was 308 kyr BP and that, consequently, the A and RS lineages persisted through two or three glacial cycles (fig. 3). According to this scenario, Adélie penguin population sizes would have remained large enough over the last two or three cycles for both lineages to survive. Since each glacial cycle is likely to have resulted in some form of population bottleneck—given that there would have been a loss of ice-free nesting areas—the above scenario would seem less likely than the explanation based on the higher evolutionary rate calculated directly from Adélie penguin ancient DNA. The latter suggested a divergence during the last glacial cycle and therefore requires the lineages to survive only a single ice-age event.

    The Expansion of the Two Lineages

    Using the ancient DNA rate, we estimated tMRCA of modern haplotypes of each of the A and RS lineages separately. This resulted in very similar estimates in the case of each lineage (tMRCA A-A = 30 kyr BP [16 to 52 kyr BP] and tMRCA RS-RS = 32 kyr BP [18 to 52 kyr BP]) (fig. 4). We suggest that these two lineages began to diversify some time after these estimates because there will inevitably be a time lag between the tMRCA and any dramatic increase in the number of haplotypes. In contrast, the phylogenetic rate estimates the tMRCA of all modern A haplotypes to be 127 kyr BP (89 to 176 kyr BP) and all modern RS haplotypes to be 143 kyr BP (121 to 186 kyr BP). This suggests that there was a dramatic increase in the number of haplotypes of both lineages and that the number of haplotypes accumulated over the last ice age. However, this period was characterized by a general contraction in the distribution rather than expansion of many species (Roberts 1998). As we have discussed above, in the case of Adélie penguins, this period was probably one that was associated with a reduction in numbers of Adélie penguins in Antarctica.

    Ice Age Refugia

    The restricted distribution of the RS lineage suggests a refugium that was geographically close to, or adjacent to, the Ross Sea. Moreover, the high relative frequency of the RS lineage at sites near the entrance to the Ross Sea, compared with lower frequencies nearer to the permanent ice shelf, suggests that after deglaciation, the RS lineage moved down the coast following the retreat of the Ross Ice Shelf (fig. 1). In contrast, because of the Antarctic-wide distribution of the A lineage, it is difficult to infer a possible location of any refugium. However, it is interesting to note that the A lineage was well established in the Ross Sea by 6,082 years BP, and, in fact, at that date, it was in approximately equal proportions to the RS lineage at Terra Nova Bay in the Ross Sea (fig. 1). This would suggest that both lineages quickly established themselves in the Ross Sea, and our ancient DNA results show them to be sympatric at many locations in past times. Our study also shows, however, that the proportions of each lineage, at individual sites, have changed during this Holocene expansion period.

    Rate Heterogeneity and the Molecular Clock

    The observation that the amount of sequence difference between any two taxa does not have a precise linear relationship to the time to their common ancestor has plagued the molecular clock hypothesis for many years (Gillespie 1991). Hence, a degree of heterogeneity in evolutionary rates is inevitable. A variety of mechanisms have been proposed to explain such rate heterogeneity among species or groups of species, including, for example, the metabolic rate hypothesis (Martin and Palumbi 1993), the body temperature hypothesis (Prager et al. 1974), and the generation time hypothesis (Kohne 1970). Moreover, the rate of evolution is not a deterministic process and can vary along a lineage, and nucleotide changes have been described using a Poisson model (Gillespie 1991). Rate heterogeneity along a species lineage can be caused by changes in the underlying mutation rate or the fixation of nearly neutral alleles as the effective population size fluctuates (Ohta 1987). Therefore, it has been recognized that a rate of evolution estimated using fossils calibration dates of millions of years might not be transferable to other species and to more recent (or older) time periods.

    Most studies that use the molecular clock as a dating tool rely on the assumption that, first, there is a constant rate at which the clock "ticks" and that, second, there are available relatively precise estimates of the true rate of evolution for any particular species. The majority of studies are restricted to adopting a rate of evolution previously established from a related group of species. A rate of evolution is usually deemed a good approximation to the true rate if there have been two or more studies of independent taxa that have provided corroborating estimates. For example, Klicka and Zink (1997) used the level of mtDNA sequence divergence between 35 sister species of oscine passerine and a clock rate of 2%/Myr to refute the conventional model that many North American bird populations speciated during the glacial cycles of the Late Pleistocene. Klicka and Zink (1997) suggested that a rate of 2%/Myr for mtDNA was a reasonable estimate because it was corroborated among a diverse array of studies, including, for example, Shields and Wilson (1987), Nunn et al. (1996), and Tarr and Fleischer (1993). However, our study of Adélie penguins has been the first to employ a rate that was directly estimated from our study species, independent of both the fossil record and a phylogeny of closely related species. Hence, we were able test how different our results would be from one where we adopted the conventional CR rate for birds. We recorded significantly different tMRCA using rate specific to Adélie penguins. Importantly, this had significant implications for biogeographic explanations of the origin of two distinct mtDNA lineages found in Adélie penguins.

    Finally, ancient DNA has been widely used to investigate a variety of issues, from changing levels of genetic variation (e.g., Nielsen, Hansen, and Loeschcke 1997), to analyses of the phylogeny of extinct species (e.g., H?ss et al. 1996), to investigations of ancient diets (Poinar et al. 1998). Our research on ancient DNA of Adélie penguins suggests a new role in evolutionary biology; namely the timing of divergent events and tracking changes in the proportion of lineages over space and time. This approach, when used to detect the time to the MRCA of the A and RS lineages, is more consistent with the known glacial history of Antarctica and requires fewer assumptions than does a scenario based on an adopted phylogenetic rate of mitochondrial DNA evolution. Hence, our results indicate that complex biogeographical narratives based on evolutionary rates adopted from other species could be seriously misleading. As an increasing number of ancient DNA studies are reported, with consequently better estimates of evolutionary rates, this will enable a more accurate understanding of rate heterogeneity and consequently the genetic changes that have accompanied major global environmental events, such as the ice ages in Antarctica.

    Acknowledgements

    This research was made possible by a grant from the Marsden Fund of New Zealand and Centres of Research Excellence Fund. We also acknowledge support from Massey University, the Italian Antarctic Research Programme, Antarctica New Zealand, and the U.S. Coast Guard. P.A.R. acknowledges a Massey University doctoral scholarship. For assistance we thank P. Barrett, L. Davis, A. Drummond, S. Eyton, K. Kerry, J. Macdonald, P. Metcalf, J. Robins, L. Shepherd, C. Vleck, and E. Young. This paper represents publication No. 97 from the Allan Wilson Centre for Molecular Ecology and Evolution.

    Literature Cited

    Ainley, D. G., R. E. LaReseche, and W. J. Sladen. 1983. Breeding biology of Adélie penguins. University of California Press, Berkeley.

    Baker, A. J., and H. D. Marshall. 1997. Mitochondrial control region sequences as tools for understanding evolution. Pp. 51–82 in D. P. Mindell, ed. Avian molecular evolution and systematics. Academic Press, San Diego.

    Baroni, C., and G. Orombelli. 1994. Abandoned penguin rookeries as Holocene paleoclimatic indicators in Antarctica. Geology 22:23-26.

    Box, G. E. P., and G. C. Tiao. 1992. Bayesian inference in statistical analysis. Wiley, New York.

    Colhoun, E. A., M. C. G. Mabin, D. A. Adamson, and R. M. Kirk. 1992. Antarctic ice volume and contribution to sea-level fall at 20,000 yr BP from raised beaches. Nature 358:316-319.

    Cooke, D. W., and J. D. Hays. 1982. Estimates of Antarctic ocean seasonal sea-ice cover during glacial intervals. Antarctic geoscience: symposium on Antarctic geology and geophysics. University of Wisconsin Press, Madison, Wisconsin.

    Denton, G. H., J. G. Bockheim, S. C. Wilson, and M. Stuiver. 1989. Late Wisconsin and early Holocene glacial history, inner Ross embayment, Antarctica. Quaternary Res. 31:151-182.

    Drummond, A., G. K. Nicholls, A. G. Rodrigo, and W. Solomon. 2002. Estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data. Genetics 161:1307-1320.

    Gersonde, R., and U. Zielinki. 2000. The reconstruction of late Quaternary Antarctic sea-ice distribution—the use of diatoms as a proxy for sea-ice. Palaeo 162:263-286.

    Gillespie, J. H. 1991. The causes of molecular evolution. Oxford University Press, Oxford, UK.

    Hasegawa, M., H. Kishino, and T. Yano. 1985. Dating the human-ape splitting by a molecular clock of mitochondrial DNA. J. Mol. Evol. 22:160-174.

    Heckman, D. S., D. M. Geiser, B. Eidell, R. L. Stauffer, N. L. Kardos, and S. B. Hedges. 2001. Molecular evidence for the early colonization of land by fungi and plants. Science 293:1129-1133.

    Held, C. 2001. No evidence for slow-down of molecular substitution rates at subzero temperatures in Antarctic serolid isopods (Crustacea, Isopoda, Serolidae). Polar Biol. 24:497-501.

    H?ss, M., A. Dilling, A. Currant, and S. P??bo. 1996. Molecular phylogeny of the extinct ground sloth Mylodon darwinii. Proc. Natl. Acad. Sci. USA 93:181-185.

    Huelsenbeck, J. P., and F. Ronquist. 2001. MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 17:754-755.

    Imbrie, J., E. A. Boyle, and S. C. Clemans, et al. (18 co-authors). 1992. On the structure and origin of major glaciation cycles 1: linear responses to Milankovitch forcing. Paleooceanography 7:701-738.

    Klicka, J., and R. M. Zink. 1997. The importance of recent Age ages in speciation: a failing paradigm. Science 277:1666-1669.

    Kohne, D. E. 1970. Evolution of higher-organism DNA. Q. Rev. Biophys. 33:327-375.

    Lambert, D. M., P. A. Ritchie, C. D. Millar, B. Holland, A. J. Drummond, and C. Baroni. 2002. Rates of evolution in ancient DNA from Adélie penguins. Science 295:2270-2273.

    Martin, A. P., and S. R. Palumbi. 1993. Body size, metabolic rate, generation time, and the molecular clock. Proc. Natl. Acad. Sci. USA 90:4087-4091.

    Mindell, D. P., and C. E. Thacker. 1996. Rates of molecular evolution: phylogenetic issues and applications. Annu. Rev. Ecol. Syst. 27:279-303.

    Nielsen, E. E., M. M. Hansen, and V. Loeschcke. 1997. Analysis of microsatellite DNA from old scale samples of Atlantic salmon Salmo salar: a comparison of genetic composition over 60 years. Mol. Ecol. 6:487-492.

    Nunn, G. B., J. Cooper, P. Jouventin, C. J. R. Robertson, and G. G. Robertson. 1996. Evolutionary relationships among extant albatrosses (Procellariiformes: Diomedeidae) established from complete cytochrome-b gene sequences. Auk 113:784-801.

    Ohta, T. 1987. Very slightly deleterious mutations and the molecular clock. J. Mol. Evol. 26:1-6.

    Petit, J. R., J. Jouzel, and D. Raynaud, et al. (19 co-authors). 1999. Climate and atmospheric history of the past 420,000 years from the Vostok ice core, Antarctica. Nature 399:429-436.

    Poinar, H. N., M. Hofreiter, W. G. Spaulding, P. S. Matin, B. A. Stankiewicz, H. Bland, R. P. Evershed, G. Possnert, and S. P??bo. 1998. Molecular coproscopy: dung and diet of the extinct ground sloth Nothrotheriops shastensis. Science 281:402-406.

    Prager, E. M., A. H. Brush, R. A. Noaln, M. Nakanishi, and A. C. Wilson. 1974. Slow evolution of transferrin and albumin in birds according to micro-complement fixation analysis. J. Mol. Evol. 3:243-262.

    Quinn, T. W. 1992. The genetic legacy of Mother Goose—phylogeographic patterns of lesser snow goose Chen caerulescens caerulescens maternal lineages. Mol. Ecol. 1:105-117.

    Rand, D. M. 1994. Thermal habit, metabolic rate and the evolution of mitochondrial DNA. Trends Ecol. Evol. 9:125-131.

    Ritchie, P. A., and D. M. Lambert. 2000. A repeat complex in the mitochondrial control region of Adélie penguins from Antarctica. Genome 43:613-618.

    Roberts, N. 1998. The Holocene: an environmental history. Blackwell Publishing, Oxford, UK.

    Rodríquez, F., J. L. Oliver, A. Marín, and J. R. Medina. 1990. The general stochastic model of nucleotide substitution. J. Theor. Biol. 142:485-501.

    Roeder, A. D., P. A. Ritchie, and D. M. Lambert. 2002. New DNA markers for penguins. Cons. Genet. 3:341-344.

    Rozas, J., and R. Rozas. 1999. DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15:174-175.

    Saitou, H., and M. Nei. 1987. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406-425.

    Sambrook, J., E. F. Fritsch, and T. Maniatus. 1989. Molecular cloning: a laboratory manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York.

    Shields, G. F., and A. C. Wilson. 1987. Calibration of the mitochondrial DNA evolution in geese. J. Mol. Evol. 24:212-217.

    Soltis, P. S., D. E. Soltis, V. Savolainen, P. R. Crane, and T. G. Barraclough. 2002. Rate heterogeneity among lineages of tracheophytes: integration of molecular and fossil data and evidence for molecular living fossils. Proc. Natl. Acad. Sci. USA 99:4430-4435.

    Stoneking, M., S. T. Sherry, A. J. Redd, and L. Vigilant. 1992. New approaches to dating suggest a recent age for human mtDNA ancestor. Philos. Trans. R. Soc. Lond. B Biol. Sci. 337:167-175.

    Swofford, D. L. 2002. PAUP*: phylogenetic analysis using parsimony (*and other methods). Version 4.0b10. Sinauer Associates, Sunderland, Mass.

    Tajima, F. 1989. Statistical methods for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595.

    Tarr, C. L., and R. C. Fleischer. 1993. Mitochondrial-DNA variation and the evolutionary relationships in the Amakihi complex. Auk 110:825-831.

    Thompson, J. D., D. G. Higgins, and T. J. Gibson. 1994. CLUSTAL W: improving the sensitivity of progressive mutliple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acid Res. 22:4673-4680.

    Williams, M. A. J., D. Dunkerley, P. DeDeckker, P. Kershaw, and M. Chappell. 1989. Quaternary environments. Arnold, London.(Peter A. Ritchie*,, Craig)