Topic 3: Neutral Theory and DNA variation Flashcards

1
Q

What is the neutral theory?

A

Motoo Kimura proposed that most variation is neutral, otherwise deleterious mutations would be removed by selection and beneficial ones would be rare and quickly fixed.
This came about because high levels of polymorphism observed are not consistent with strong selection at all loci.
New variants arise through mutation and allele frequencies are due to an equilibrium between mutation and genetic drift

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is genetic drift?

A

Random change in allele frequency that occur from generation to generation as a result of sampling effects. Is much more extreme in small populations were alleles are fixed and lost at much more random rates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the four forces of evolutionary change?

A

Genetic drift
Gene flow
Mutation
Selection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What 5 things can we predict based on the principles of the neutral theory (variation is a consequence of the balance between genetic drift and mutation)?

A

Probability of fixation of an allele
Rate of fixation of an allele
Time between fixation of an allele
Time is takes for a new allele to become fixes
Amount of diversity on a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

All of the things we can predict from the principles of Neutral theory can be predicted from what two simple functions?

A

Population size and mutation rate (if we ignore selection and migration)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the probability of fixation of a new allele?

A

The probability that a neutral allele will reach fixation is equal to its frequency at the time of consideration, p.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the probability that a new mutation will reach fixation? (an allele that arose via mutation)

A

There are 2N alleles in a population, thus the new allele has a frequency of 1/(2N) therefore the probability of a new mutation becoming fixed is 1/2N where N is the number of individuals in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the rate of fixation of new mutations?

A

The rate at which new mutations are fixed in the population is μ. This results from the fact that the rate of new mutations per bp/ gene and per generation is μ . The probability of fixation for each new mutation is 1/2N, and there are 2N μ new mutations each generation. Combining these two formulas you get 2N μ /2N and this is just μ . This means that the rate of new mutations per bp/ gene per generation is the same thing as the rate of fixation of new alleles.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the average time between the fixation of new alleles?

A

1/ μ (reciprocal the rate of fixation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the average time to fixation and time to loss of a new allele?

A

Mutations that are destined to become fixed, do so in 4Ne generations, and mutations that are destined to be lost are gone in (2Ne/N) ln (2N) generations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Ne?

A

This is the effective population size, it is an idealized number that is usually smaller than the actual population size (N) because it only includes successful breeders.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is genetic drift stronger in small populations?

A

It is stronger in small populations because there is greater sampling error between generations. In other words, allele frequencies fluctuate more widely in smaller populations in the same amount of time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Are mutations lost or fixed more quickly in populations? What does this explain?

A

They are lost much faster than they are fixed, but both processes take time in larger populations. This explains why larger populations are more genetically diverse; because they stay polymorphic for longer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the usual mutation rate for a population?

A

1 x 10^-6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you calculate predicted gene diversity (expected heterozygosity) as a function of population size and mutation rate?

A

He = 4Ne μ / 4Ne μ +1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the shortcomings of studying genetic variation using allozyme electrophoresis?

A

Not all proteins are enzymes
Enzymes have variable functional constraints on their sequence characteristics (more likely to be under selection)
Not all amino acid changes are detectable by electrophoresis (silent, those that do not result in shape or charge changes)
Proteins are more unstable and more difficult to work with than DNA

17
Q

What is the role of ADH in Drosophila? What is its distribution and alleles?

A

ADH is an enzyme that helps to break down alcohols, and there is a fast allele (more common in colder areas, higher activity), and a slow allele (more common in warmer areas, more stable). 11 copies of ADH (5 fast and 6 slow) from 5 populations were cloned and sequences to look for hidden variation.
The ADH gene has 4 exons and three introns and they are expressed differently in larva and adults

18
Q

What was the result on the study of the 11 ADH genes that were sequenced?

A

There were 9 unique alleles, and 43 variable DNA sites, BUT only one amino acid change was seen which caused to slow to fast polymorphism(Lys to Arg). This means that all other variations/polymorphisms found were either silent substitutions in exons, introns, or untranslates 5’ and 3’ UTR
This means that there was MUCH more DNA variation than protein variation, but unlike any other gene, there was no hidden variation.

19
Q

How do you quantify variation at the level of the nucleotide?

A

You calculate to proportion of nucleotide sites that differ in a sample of genes:
Pn= S/N where S is the number of variable sites and N is the total sites compared

20
Q

What factor does Pn not account for? How do we fix this?

A

Pn does not account for the number of sequences that were observed in the sample. We account for this by using the unbiased estimate of the polymorphisms per site.
THETA = Pn/a
a is the sum of 1/i, where i is up until number of sequences-1

21
Q

What issue does theta correct? What issue does theta have? How do we correct this?

A

theta corrects for the bias caused by sampling (the more sites you compare the higher the Pn), BUT it is imperfect because it does not account for different frequency of the polymorphisms, because sites are simply counted as being polymorphic or not
We correct this by using nucleotide diversity, which is a measure that accounts for the frequency of polymorphisms in a way similar to heterozygosity

22
Q

What is pi?

A

Nucleotide diversity
The proportion of sites that differ between pairs of sequences.
In other words it is heterozygosity at the nucleotide level.