Lecture 16 - Patterns and Models Flashcards

1
Q

What is the approximate time to fixation for neutral alleles in diploid populations?

A

4N generations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a substitution model used for?

A

Used to describe the patterns and rates of sequence change over evolutionary time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does Juke’s and Cantor’s one-parameter model say?

A

All substitutions will occur with equal probability (i.e., no bias in the direction of change).

Only one parameter here - the rate of change of one nucleotide to another (α, alpha).
- Thus 3α is the rate at which one nucleotide can change to any other
- Probability of the nucleotide staying the same after one unit of time = 1 - 3α

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Is the assumption made by Juke and Cantor (i.e., that all base substitutions are equally likely) a realistic assumption to make?

A

NO - because like-to-like changes are more probable.
- Transitions
- Transversions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Is the assumption made by Juke and Cantor (i.e., that all base substitutions are equally likely) a realistic assumption to make?

A

NO - because like-to-like changes are more probable.
- Transitions
- Transversions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a transition?

A

Like-to-like substitution.
- Purine to purine (A <-> G)
- Pyrimidine to pyrimidine (C <-> T)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a transversion?

A

A pyrimidine to purine change (or vice versa).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does Kimura’s two parameter model say?

A

Assumes that like-to-like changes are more probable.
- This is seen in nature, as transitions are observed TWICE as frequently as transversions
- Thus, mutational bias towards transitions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What frequency will each reach nucleotide eventually (in both models)?

A

25% frequency (i.e., equilibrium).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the “multiple hits” phenomenon?

A

The more time that passes, the higher the chance that the same site will undergo multiple substitutions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When is it more likely that multiple hits have occurred?

A

The higher the divergence, the more substitution events occurred, thus the more likely it is that multiple hits have occurred.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What problem arises from multiple hits?

A

The number of differences that are observed might not necessarily reflect the number of substitutions that have occurred.

Thus, by simply counting the differences, one can extremely underestimate the amount of evolutionary change!
(SEE LECTURE 16 part 1 @ 18:30 mins)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do sophisticated models attempt to take multiple hits into account?

A

By looking at the degree of divergence between the two sequences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What problem occurs in these sophisticated models (that take multiple hits into account) when looking at high levels of divergence?

A

Saturation.
(SEE LECTURE 16 part 1 @ 21 mins)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When do our substitution models work best?

A

When the degree of divergence is relatively low.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the two major violations to the assumptions made in our models (that are seen in nature)?

A
  1. Probability of mutation can vary across the gene or genome regions depending on base pair composition and other factors (e.g., chromatin organisation)!
  2. Probability of fixation can vary across the gene or genome regions, due to differing strengths of purifying selection!
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Give an example of the variability of mutation rates within genomes.

A

CpG dinucleotides (i.e., C followed by G in sequence).
- Susceptible to methylation!
- Cytosine deaminated into methylcytosine
- Methylcytosine more likely to give rise to T substitutions

Rate of transitions at CpG sites is approx. 18 times higher than non-CpG sites!

18
Q

How does strength of purifying selection vary within the genome?

A
  1. Mutations in protein coding regions are more likely to be deleterious (thus will be removed by purifying selection).
    - Protein coding regions are the slowest evolving regions
  2. Even within genes, some regions are under less selective constraint than others!
19
Q

What is a missense mutation?

A

A nucleotide substitution that result in an amino acid change.

20
Q

What is a nonsense mutation?

A

A nucleotide substitution that results in a premature “stop” codon.

21
Q

What are the 3 amino acid sequences that encode STOP codons?

A
  1. UAA
  2. UAG
  3. UGA
22
Q

What is the amino acid sequence that encodes a start codon?

A

AUG

23
Q

What is a non-synonymous mutation?

A

A nucleotide change that alters the amino acid sequence.
- Missense
- Nonsense

24
Q

What is a synonymous mutation?

A

A nucleotide change that does NOT alter the amino acid sequence.

25
Q

What codon position is often synonymous?

A

3rd codon position.

26
Q

What codon position is NEVER synonymous?

A

2nd codon position.

27
Q

What codon position is sometimes synonymous?

A

1st codon position.

28
Q

For synonymous sites, what is rate of substitution?

A

Rate of substitution = Rate of mutation (µ)

29
Q

What are non-degenerate sites?

A

Sites at which any nucleotide change will result in an AA change (i.e., never synonymous, 2nd codon).
- LOWEST rate of substitution observed here

30
Q

What are two-fold degenerate sites?

A

Sites at which one of three possible nucleotide changes results in AA change (i.e., sometimes synonymous, 1st codon).
- INTERMEDIATE rate of substitution observed here

31
Q

What are four-fold degenerate sites?

A

Sites at which no nucleotide change will change the AA (i.e., often synonymous, 3rd codon)
- HIGHEST rate of substitution observed here

32
Q

What percentage of AA changes are deleterious?

A

Approx. 80%

33
Q

What percentage of AA changes are neutral?

A

Approx. 20%

34
Q

What percentage of AA changes are advantageous?

A

0.0% (rarely do we see advantageous changes!)

35
Q

How do we detect selection?

A

By comparing Ka to Ks.
- Ka = number of non-synonymous substitutions per non-synonymous site
- Ks = number of synonymous substitutions per synonymous site

Their ratio (i.e., Ka/Ks) is the test for selection!
- Ka/Ks ratio, also known as the dn/ds ratio

36
Q

What would we expect if no selection is occurring (when comparing Ka and Ks)?

A

Ka/Ks should be roughly equal to 1.

37
Q

If Ka/Ks ratio is much less than 1, what does this indicate?

A

Ka/Ks «< 1 = suggests negative/purifying selection is occurring!

38
Q

If Ka/Ks ratio is greater than 1, what does this indicate?

A

Ka/Ks&raquo_space;> 1 = suggests positive selection is occurring!

39
Q

Although Ka/Ks being greater than 1 is rare, why is it particularly interesting?

A

As it may indicate adaptive evolution!
- E.g., FOXP2., the “language gene” in humans!

40
Q

What are some issues with the Ka/Ks ratio?

A

SEE LECTURE 16 part 2 @ 26:30 mins!!!

41
Q

Why is it harder to identify homology in non-coding regions?

A

As they evolve faster!