Confidence Limits on Phylogenies: An Approach Using the Bootstrap
- Joseph Felsenstein
- Evolution, Vol. 39, No. 4 (Jul., 1985), pp. 783-791
- Published by: Society for the Study of Evolution
- Stable URL: http://www.jstor.org/stable/2408678
- Abstract: The recently-developed statistical method known as the "bootstrap" can be used to place confidence intervals on phylogenies. It involves resampling points from one's own data, with replacement, to create a series of bootstrap samples of the same size as the original data. Each of these is analyzed, and the variation among the resulting estimates taken to indicate the size of the error involved in making estimates from the original data. In the case of phylogenies, it is argued that the proper method of resampling is to keep all of the original species while sampling characters with replacement, under the assumption that the characters have been independently drawn by the systematist and have evolved independently. Majority-rule consensus trees can be used to construct a phylogeny showing all of the inferred monophyletic groups that occurred in a majority of the bootstrap samples. If a group shows up 95% of the time or more, the evidence for it is taken to be statistically significant. Existing computer programs can be used to analyze different bootstrap samples by using weights on the characters, the weight of a character being how many times it was drawn in bootstrap sampling. When all characters are perfectly compatible, as envisioned by Hennig, bootstrap sampling becomes unnecessary; the bootstrap method would show significant evidence for a group if it is defined by three or more characters.
- Subjects: statistics
- Keyterms: bootstrap, phylogeny, character, confidence, tree, monophyletic, sampling, jackknife, resampl, efron, parsimoniou, estimate, systematist, parsimony, camin, specie, distribution, infer, consensus, binary, weight
- CiteRank: 16
- Times cited by articles in JSTOR: 2261
Top Words
|
Top Bigrams
|
Top Trigrams
|
Top Quadgrams
|
Top Keyterms
|
References
CAMIN, J. H., AND R. R. SOKAL. 1965. A method for deducing branching sequences in phylogeny. Evolution19:311-326.
CAVENDER, J. A. 1978. Taxonomy with confi- dence. Math. Biosci.40:271-280
Vol. 44, p. 308, 1979
—1981. Tests of phylogenetic hypotheses under generalized models. Math. Biosci.54:217- 229.
DIACONIS, P., AND B. EFRON. 1983. Computer- intensive methods in statistics. Sci. Amer.249: 116-130.
EFRON, B. 1979. Bootstrap methods: Another look at the jackknife. Ann. Statist.7:1-26.
—1982. The jackknife, the bootstrap, and other resampling plans. CBMS-NSF Regional Conference Series in Applied Mathematics No. 38. Society for Industrial and Applied Mathe- matics. Philadelphia, PA.
EFRON, B., AND G. GONG. 1983. A leisurely look at the bootstrap, the jackknife, and cross-vali- dation. Amer. Statist.37:36-48.
FELSENSTEIN, J. 1983a. Statistical inference of phylogenies. J. Roy. Statist. Soc. A146:246- 272.
.1983b. Parsimony in systematics: Bio- logical and statistical issues. Ann. Rev. Ecol. Syst.14:313-333.
—. 1985. Confidence limits on phylogenies with a molecular clock. Systematic Zoology34: 152-161.
MARGUSH, T., AND F. R. MCMORRIS. 1981. Con- sensus n-trees. Bull. Mathemat. Biol.43:239- 244.
TEMPLETON, A. R. 1983. Phylogenetic inference from restriction endonuclease cleavage site maps with particular reference to the evolution of hu- mans and the apes. Evolution37:221-224.