Data For Research

Confidence Limits on Phylogenies: An Approach Using the Bootstrap

  • Joseph Felsenstein
  • Evolution, Vol. 39, No. 4 (Jul., 1985), pp. 783-791

  • Published by: Society for the Study of Evolution
  • Stable URL: http://www.jstor.org/stable/2408678
  • Abstract: The recently-developed statistical method known as the "bootstrap" can be used to place confidence intervals on phylogenies. It involves resampling points from one's own data, with replacement, to create a series of bootstrap samples of the same size as the original data. Each of these is analyzed, and the variation among the resulting estimates taken to indicate the size of the error involved in making estimates from the original data. In the case of phylogenies, it is argued that the proper method of resampling is to keep all of the original species while sampling characters with replacement, under the assumption that the characters have been independently drawn by the systematist and have evolved independently. Majority-rule consensus trees can be used to construct a phylogeny showing all of the inferred monophyletic groups that occurred in a majority of the bootstrap samples. If a group shows up 95% of the time or more, the evidence for it is taken to be statistically significant. Existing computer programs can be used to analyze different bootstrap samples by using weights on the characters, the weight of a character being how many times it was drawn in bootstrap sampling. When all characters are perfectly compatible, as envisioned by Hennig, bootstrap sampling becomes unnecessary; the bootstrap method would show significant evidence for a group if it is defined by three or more characters.
  • Subjects: statistics
  • Keyterms: bootstrap, phylogeny, character, confidence, tree, monophyletic, sampling, jackknife, resampl, efron, parsimoniou, estimate, systematist, parsimony, camin, specie, distribution, infer, consensus, binary, weight
  • CiteRank: 16
  • Times cited by articles in JSTOR: 2261
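
The resampling scheme in the abstract can be illustrated with a small sketch. It is not the paper's program. It assumes the perfectly compatible case the abstract mentions, where a group is recovered in a bootstrap sample whenever at least one of the characters defining it has been drawn. The function names, the choice of 20 characters, and the convention that the first k characters define the group are all hypothetical. Under that assumption the chance that none of k defining characters is drawn in n draws is (1 - k/n)^n, which is at most e^(-k); for k = 3 that bound is about 0.0498, just under 0.05.

```python
import random
from collections import Counter


def bootstrap_weights(n_chars, rng):
    """Draw n_chars characters with replacement.

    Returns a list whose i-th entry is how many times character i was
    drawn, i.e. the character weights described in the abstract.
    """
    draws = Counter(rng.randrange(n_chars) for _ in range(n_chars))
    return [draws.get(i, 0) for i in range(n_chars)]


def exact_support(k, n_chars):
    """Probability that at least one of k given characters is drawn."""
    return 1.0 - (1.0 - k / n_chars) ** n_chars


def simulated_support(k, n_chars, replicates, seed=0):
    """Fraction of bootstrap replicates in which the group is recovered.

    Assumption (illustrative only): characters 0..k-1 define the group,
    all characters are compatible, and the group appears whenever any
    defining character has a nonzero weight.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(replicates):
        weights = bootstrap_weights(n_chars, rng)
        if any(weights[:k]):
            hits += 1
    return hits / replicates
```

Calling exact_support(3, 20) and simulated_support(3, 20, 2000) gives two estimates of the same quantity, and both can be compared against the 95% threshold; repeating with k = 2 shows why two defining characters are generally not enough once there are more than a few characters.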

Top Words
Word          Count
the           374
of            247
a             129
in            97
is            95
to            92
that          86
and           74
be            65
bootstrap     64
are           55
we            55
characters    51
for           49
by            46
Top Bigrams
Bigram           Count
## ##            117
### ###          78
of the           74
# the            38
### #            38
# ###            37
the ###          34
the bootstrap    33
in the           29
### and          24
## #             21
### the          20
to be            15
that the         15
### we           15
Top Trigrams
Trigram                    Count
## ## ##                   101
### ### ###                26
### ### #                  12
# ## #                     11
confidence limits on       9
most parsimonious trees    9
# ### ###                  9
of the bootstrap           8
of the ###                 7
### of the                 7
estimate of the            7
# ### #                    7
a series of                6
of the phylogeny           6
### ### and                6
Top Quadgrams
Quadgram                           Count
## ## ## ##                        86
### ### ### ###                    12
estimate of the phylogeny          5
### ## ## ##                       5
### of the time                    4
### ### ### #                      4
in a majority of                   4
confidence limits on phylogenies   4
can be used to                     4
## ## ## ###                       4
most parsimonious trees #          4
# each of these                    4
## ### ### ##                      4
the number of times                3
### merychippus ### ###            3
Top Keyterms
Keyterm         Weight
bootstrap       1.0
phylogeny       0.509
character       0.497
confidence      0.255
tree            0.204
monophyletic    0.203
sampling        0.195
jackknife       0.175
resampl         0.17
efron           0.15
parsimoniou     0.139
estimate        0.136
systematist     0.131
parsimony       0.13
camin           0.125

References

CAMIN, J. H., AND R. R. SOKAL. 1965. A method for deducing branching sequences in phylogeny. Evolution 19:311-326.

CAVENDER, J. A. 1978. Taxonomy with confidence. Math. Biosci. 40:271-280 (Vol. 44, p. 308, 1979).

CAVENDER, J. A. 1981. Tests of phylogenetic hypotheses under generalized models. Math. Biosci. 54:217-229.

DIACONIS, P., AND B. EFRON. 1983. Computer-intensive methods in statistics. Sci. Amer. 249:116-130.

EFRON, B. 1979. Bootstrap methods: Another look at the jackknife. Ann. Statist. 7:1-26.

EFRON, B. 1982. The jackknife, the bootstrap, and other resampling plans. CBMS-NSF Regional Conference Series in Applied Mathematics No. 38. Society for Industrial and Applied Mathematics, Philadelphia, PA.

EFRON, B., AND G. GONG. 1983. A leisurely look at the bootstrap, the jackknife, and cross-validation. Amer. Statist. 37:36-48.

FELSENSTEIN, J. 1983a. Statistical inference of phylogenies. J. Roy. Statist. Soc. A 146:246-272.

FELSENSTEIN, J. 1983b. Parsimony in systematics: Biological and statistical issues. Ann. Rev. Ecol. Syst. 14:313-333.

FELSENSTEIN, J. 1985. Confidence limits on phylogenies with a molecular clock. Systematic Zoology 34:152-161.

MARGUSH, T., AND F. R. MCMORRIS. 1981. Consensus n-trees. Bull. Mathemat. Biol. 43:239-244.

TEMPLETON, A. R. 1983. Phylogenetic inference from restriction endonuclease cleavage site maps with particular reference to the evolution of humans and the apes. Evolution 37:221-224.
