Tree reliability Flashcards
In tree length frequency distributions, what do skewed distributions indicate?
stronger phylogenetic signal
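One common way to quantify skew is the g1 statistic (third central moment divided by the 1.5 power of the variance). The sketch below assumes you already have a list of tree lengths from a sample of trees; the example lengths are made up for illustration. A clearly negative g1 (a long tail toward short trees) is the pattern usually read as signal.

```python
def g1_skewness(lengths):
    """Skewness g1 = m3 / m2**1.5, using population (divide-by-n) moments."""
    n = len(lengths)
    mean = sum(lengths) / n
    m2 = sum((x - mean) ** 2 for x in lengths) / n
    m3 = sum((x - mean) ** 3 for x in lengths) / n
    return m3 / m2 ** 1.5

# Hypothetical tree lengths: most trees are long, a few are much shorter.
left_skewed = [10, 14, 15, 15, 16, 16, 16]
symmetric = [1, 2, 3]

g1_left = g1_skewness(left_skewed)   # negative: tail toward short trees
g1_sym = g1_skewness(symmetric)      # zero: no skew
```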
In tree length frequency distributions what tree is considered more reliable for the lower of these two data sets?
the shortest tree?
What is the formula for the consistency index?
CI = (m/s)
–> m = minimum number of steps (number of states - 1)
–> s = number of steps in the tree (for that character)
–> for the whole tree, CI = (sum of m over all characters) / (sum of s over all characters)
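A minimal sketch of the per-character and whole-tree calculation, using made-up step counts for three binary characters (so m = 1 for each). The function names are illustrative only.

```python
def consistency_index(m, s):
    """Per-character CI = m / s."""
    return m / s

def ensemble_ci(m_values, s_values):
    """Whole-tree CI = sum of m / sum of s across all characters."""
    return sum(m_values) / sum(s_values)

# Hypothetical data: three binary characters, each with m = 1.
m_values = [1, 1, 1]
s_values = [1, 2, 3]   # observed steps on the tree

per_character = [consistency_index(m, s) for m, s in zip(m_values, s_values)]
tree_ci = ensemble_ci(m_values, s_values)   # 3 / 6
```

The first character has CI = 1 (no homoplasy); the extra steps in the other two pull the whole-tree value down.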
What does the maximum value of 1 mean for the consistency index?
this means no homoplasy is observed
What is the problem with the consistency index?
- autapomorphies inflate the CI but are not informative
What is the definition of the retention index?
fraction of potential synapomorphies retained as synapomorphies on the tree
What is the formula for the retention index?
RI = (g-s)/(g-m)
–> m = minimum number of steps (number of states - 1)
–> s = number of steps in the tree (for that character)
–> g is the maximum number of steps = number of steps on a polytomy
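A small sketch of the formula with hypothetical values. It also shows why autapomorphies cannot inflate RI: for a state found in a single taxon, g equals m, so the denominator is zero and the character contributes nothing. Returning None for that case is a choice made here for illustration.

```python
def retention_index(g, s, m):
    """RI = (g - s) / (g - m); undefined when g == m (e.g. an autapomorphy)."""
    if g == m:
        return None  # uninformative character: no potential synapomorphy to retain
    return (g - s) / (g - m)

# Hypothetical binary character (m = 1) whose derived state occurs in 3 taxa (g = 3).
ri_perfect = retention_index(g=3, s=1, m=1)   # one step on the tree: fully retained
ri_half = retention_index(g=3, s=2, m=1)      # one extra step
ri_none = retention_index(g=3, s=3, m=1)      # as many steps as on a polytomy
ri_autapomorphy = retention_index(g=1, s=1, m=1)
```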
Which is the better estimate of the efficiency with which the tree explains the data: RI (retention index) or CI (consistency index)?
RI because autapomorphies do not inflate it unlike for CI
Describe consensus trees
- the goal is to summarize information from rival trees
- can have a strict consensus tree where the tree contains the clades that occur in ALL the rival trees
- can have majority rule consensus trees where clades included are those that are found in a majority of the trees
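The two consensus rules can be sketched by treating each rival tree as a set of clades, with each clade a frozenset of taxon names. That representation and the three example trees are assumptions for illustration, not the output of any particular program.

```python
from collections import Counter

def strict_consensus(trees):
    """Clades present in ALL rival trees."""
    return set.intersection(*(set(tree) for tree in trees))

def majority_rule_consensus(trees):
    """Clades present in more than half of the rival trees."""
    counts = Counter(clade for tree in trees for clade in tree)
    return {clade for clade, n in counts.items() if n / len(trees) > 0.5}

AB, ABC, ABD = frozenset("AB"), frozenset("ABC"), frozenset("ABD")

# Three hypothetical rival trees.
rival_trees = [{AB, ABC}, {AB, ABC}, {AB, ABD}]

strict = strict_consensus(rival_trees)           # only AB is in every tree
majority = majority_rule_consensus(rival_trees)  # AB (3/3) and ABC (2/3)
```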
Describe Bremer support
the difference in number of steps between the length of the most parsimonious tree(s) and the length of the most parsimonious tree that does not contain a particular clade (node, branch)
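As a rough sketch, the definition above can be applied to a list of candidate trees with known lengths. The tree lengths and clades here are invented, and the sketch assumes the list already contains the shortest tree lacking the clade; real analyses usually find that tree with a separate constrained search.

```python
def bremer_support(trees, clade):
    """Shortest length among trees lacking `clade`, minus the overall shortest length.

    `trees` is a list of (length, set_of_clades) pairs.
    Returns None if every tree in the list contains the clade.
    """
    best = min(length for length, _ in trees)
    lacking = [length for length, clades in trees if clade not in clades]
    if not lacking:
        return None
    return min(lacking) - best

AB, AC, ABC, ABD = (frozenset(x) for x in ("AB", "AC", "ABC", "ABD"))

# Hypothetical candidate trees: (length in steps, clades on the tree).
candidates = [(100, {AB, ABC}), (102, {AB, ABD}), (105, {AC, ABC})]

support_AB = bremer_support(candidates, AB)    # 105 - 100
support_ABC = bremer_support(candidates, ABC)  # 102 - 100
```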
Describe Bootstrap support
this method involves resampling the data with replacement. You draw one character at random, put it back, and draw again until you have drawn the same number of characters as in your original matrix (so characters can be repeated). You save the best tree for each replicate and then find the majority rule consensus tree of all replicates. The proportion of replicate trees supporting a clade is the bootstrap proportion
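The resampling step can be sketched as follows. The matrix is a toy example, and `infer_clades` is a hypothetical placeholder for whatever tree search you use (it should return the set of clades on the best tree); here it is replaced by a dummy that always returns the same clade, just so the sketch runs.

```python
import random
from collections import Counter

def bootstrap_matrix(matrix, rng):
    """Draw characters (columns) with replacement until the original count is reached."""
    n_chars = len(next(iter(matrix.values())))
    cols = [rng.randrange(n_chars) for _ in range(n_chars)]
    return {taxon: [states[c] for c in cols] for taxon, states in matrix.items()}

def bootstrap_proportions(matrix, infer_clades, replicates, seed=0):
    """Proportion of replicates whose best tree contains each clade."""
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(replicates):
        counts.update(infer_clades(bootstrap_matrix(matrix, rng)))
    return {clade: n / replicates for clade, n in counts.items()}

# Toy matrix: taxon -> character states (4 characters).
matrix = {"A": [0, 0, 1, 1], "B": [0, 1, 1, 0], "C": [1, 1, 0, 0]}
replicate = bootstrap_matrix(matrix, random.Random(42))

# Dummy stand-in for a real tree search (assumption, for illustration only).
def fixed_tree(_matrix):
    return {frozenset("AB")}

proportions = bootstrap_proportions(matrix, fixed_tree, replicates=10)
```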
How does jackknife support work?
- this is also a resampling procedure like bootstrapping, but each replicate analyses only a subsample of the characters
- it resamples a proportion of the characters WITHOUT replacement (unlike bootstrapping), so no character is repeated within a replicate
- the best tree from each replicate is saved and a majority rule consensus is obtained from all the trees; the proportion of trees supporting a clade is the jackknife proportion
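The only change from bootstrapping is how the characters are drawn. A minimal sketch of the subsampling step, where the 0.5 retention fraction is an illustrative assumption and not a recommended setting:

```python
import random

def jackknife_columns(n_chars, keep_fraction, rng):
    """Pick a subset of character indices without replacement."""
    k = max(1, round(n_chars * keep_fraction))
    return sorted(rng.sample(range(n_chars), k))

rng = random.Random(1)
kept = jackknife_columns(n_chars=10, keep_fraction=0.5, rng=rng)
# `kept` holds 5 distinct column indices; no character appears twice.
```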
Describe posterior probability
- used with Bayesian analysis
- the posterior probability of a tree is the conditional probability that the tree is correct, taking into account the new evidence (the data) and the prior probability
- generates millions of trees using a kind of random walk simulation technique (MCMC)
- compiles the trees into a single consensus tree
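The idea behind the posterior can be illustrated with Bayes' rule on a tiny, made-up set of three candidate trees. The likelihoods and priors below are invented numbers; with realistic numbers of taxa the set of possible trees is far too large to enumerate like this, which is why MCMC sampling is used instead.

```python
def posterior_probabilities(likelihoods, priors):
    """Posterior for each tree = likelihood * prior, normalised to sum to 1."""
    joint = {tree: likelihoods[tree] * priors[tree] for tree in likelihoods}
    total = sum(joint.values())
    return {tree: value / total for tree, value in joint.items()}

# Hypothetical values: equal priors, tree1 fits the data twice as well.
likelihoods = {"tree1": 0.02, "tree2": 0.01, "tree3": 0.01}
priors = {"tree1": 1 / 3, "tree2": 1 / 3, "tree3": 1 / 3}

posteriors = posterior_probabilities(likelihoods, priors)
# tree1 ends up near 0.5, the other two near 0.25 each.
```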
Compare Bayesian values with bootstrap values
Bayesian values (posterior probabilities) are generally higher than bootstrap values for the same clade
Describe multispecies coalescent probability
- this estimates gene tree-species tree congruence
- coalescent theory is a model of how gene variants sampled from a population may have originated from a common ancestor