DFR : Confidence Limits on Phylogenies: An Approach Using the Bootstrap

Confidence Limits on Phylogenies: An Approach Using the Bootstrap

Joseph Felsenstein
Evolution, Vol. 39, No. 4 (Jul., 1985), pp. 783-791

Published by: Society for the Study of Evolution
Stable URL: http://www.jstor.org/stable/2408678
Abstract: The recently-developed statistical method known as the "bootstrap" can be used to place confidence intervals on phylogenies. It involves resampling points from one's own data, with replacement, to create a series of bootstrap samples of the same size as the original data. Each of these is analyzed, and the variation among the resulting estimates taken to indicate the size of the error involved in making estimates from the original data. In the case of phylogenies, it is argued that the proper method of resampling is to keep all of the original species while sampling characters with replacement, under the assumption that the characters have been independently drawn by the systematist and have evolved independently. Majority-rule consensus trees can be used to construct a phylogeny showing all of the inferred monophyletic groups that occurred in a majority of the bootstrap samples. If a group shows up 95% of the time or more, the evidence for it is taken to be statistically significant. Existing computer programs can be used to analyze different bootstrap samples by using weights on the characters, the weight of a character being how many times it was drawn in bootstrap sampling. When all characters are perfectly compatible, as envisioned by Hennig, bootstrap sampling becomes unnecessary; the bootstrap method would show significant evidence for a group if it is defined by three or more characters.
Subjects: statistics
Keyterms: bootstrap, phylogeny, character, confidence, tree, monophyletic, sampling, jackknife, resampl, efron, parsimoniou, estimate, systematist, parsimony, camin, specie, distribution, infer, consensus, binary, weight
CiteRank: 16
Times cited by articles in JSTOR: 2261

Top Words

Word	Count
the	374
of	247
a	129
in	97
is	95
to	92
that	86
and	74
be	65
bootstrap	64
are	55
we	55
characters	51
for	49
by	46

Top Bigrams

Bigram	Count
## ##	117
### ###	78
of the	74
# the	38
### #	38
# ###	37
the ###	34
the bootstrap	33
in the	29
### and	24
## #	21
### the	20
to be	15
that the	15
### we	15

Top Trigrams

Trigram	Count
## ## ##	101
### ### ###	26
### ### #	12
# ## #	11
confidence limits on	9
most parsimonious trees	9
# ### ###	9
of the bootstrap	8
of the ###	7
### of the	7
estimate of the	7
# ### #	7
a series of	6
of the phylogeny	6
### ### and	6

Top Quadgrams

Quadgram	Count
## ## ## ##	86
### ### ### ###	12
estimate of the phylogeny	5
### ## ## ##	5
### of the time	4
### ### ### #	4
in a majority of	4
confidence limits on phylogenies	4
can be used to	4
## ## ## ###	4
most parsimonious trees #	4
# each of these	4
## ### ### ##	4
the number of times	3
### merychippus ### ###	3

Top Keyterms

Keyterm	Weight
bootstrap	1.0
phylogeny	0.509
character	0.497
confidence	0.255
tree	0.204
monophyletic	0.203
sampling	0.195
jackknife	0.175
resampl	0.17
efron	0.15
parsimoniou	0.139
estimate	0.136
systematist	0.131
parsimony	0.13
camin	0.125

References

CAMIN, J. H., AND R. R. SOKAL. 1965. A method for deducing branching sequences in phylogeny. Evolution19:311-326.

CAVENDER, J. A. 1978. Taxonomy with confi- dence. Math. Biosci.40:271-280

Vol. 44, p. 308, 1979

—1981. Tests of phylogenetic hypotheses under generalized models. Math. Biosci.54:217- 229.

DIACONIS, P., AND B. EFRON. 1983. Computer- intensive methods in statistics. Sci. Amer.249: 116-130.

EFRON, B. 1979. Bootstrap methods: Another look at the jackknife. Ann. Statist.7:1-26.

—1982. The jackknife, the bootstrap, and other resampling plans. CBMS-NSF Regional Conference Series in Applied Mathematics No. 38. Society for Industrial and Applied Mathe- matics. Philadelphia, PA.

EFRON, B., AND G. GONG. 1983. A leisurely look at the bootstrap, the jackknife, and cross-vali- dation. Amer. Statist.37:36-48.

FELSENSTEIN, J. 1983a. Statistical inference of phylogenies. J. Roy. Statist. Soc. A146:246- 272.

.1983b. Parsimony in systematics: Bio- logical and statistical issues. Ann. Rev. Ecol. Syst.14:313-333.

—. 1985. Confidence limits on phylogenies with a molecular clock. Systematic Zoology34: 152-161.

MARGUSH, T., AND F. R. MCMORRIS. 1981. Con- sensus n-trees. Bull. Mathemat. Biol.43:239- 244.

TEMPLETON, A. R. 1983. Phylogenetic inference from restriction endonuclease cleavage site maps with particular reference to the evolution of hu- mans and the apes. Evolution37:221-224.

JSTOR is part of ITHAKA, a not-for-profit organization helping the academic community use digital technologies to preserve the scholarly record and to advance research and teaching in sustainable ways.
©2000-2010 ITHAKA. All Rights Reserved. JSTOR®, the JSTOR logo, and ITHAKA® are registered trademarks of ITHAKA.

Confidence Limits on Phylogenies: An Approach Using the Bootstrap

References

Back to search...