Topic 13: Assignment Tests and Population Structure Flashcards

1
Q

What are assignment tests based on?

A

Likelihood of individuals genotype being found in either population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is involved in the assignment test principle and use?

A

Assign each individual genotype to the subpopulation in which it has the highest likelihood of occuring.
When subpopulations have very different allele frequencies, individuals should be assigned to the subpopulation in which they were born (natal population) where their genotype has higher probability
Immigrants should be assigned to their natal population and not the subpopulation they were sampled in after dispersal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What two things can assignment tests do? What can they not do?

A

They can measure population structure and they can quantify contemporary dispersal
CANNOT quantify effective migration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which F stat is applicable to assignment tests?

A

FST

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are strongly differentiated populations?

A

LARGE FST VALUES therefore large genetic distances and very different allele frequencies, they are both far from the middle line on the graph and have all their shapes in seperate places

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are weakly differentiated populations?

A

Not much population structure and genetic distance, similar allele frequencies, small FST values, both close to middle line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are non-differentiated populations?

A

Pretty much one population, no clear pattern, populations grouped together on center line and exchange over line between them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does shape and location on graph mean in an assignment plot?

A

Shape: where they were sampled
Location: population where the individual is assigned due to genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a measure of genetic divergence on an assignment plot?

A

The number of cross assigned individuals. They can either arise by chance or indicate it is a migrant into the population it was found, from its original population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the three major uses of assignment plots?

A

Population structure (genetic differentiation), wildlife forensics (matching specimens to source populations), measuring contemporary gene flow (identifying recent immigrants or dispersers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the traditional approach to determine the differentiation between subpopulations?

A

Consider geographic sample locations as subpopulations and then estimate genetic structure, genetic distance, test for differentiation between the designated subpopulations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When do we use Bayesian methods? What is it?

A

When population structure is cryptic and we do not know how many randomly mating subpopulations there really is.
You use genetic information first to cluster individuals in such a way as to maximize the differences between groups (FsT) and minimize differentiation within groups (FIS) for any given number of groups.
The likelihood of observing the actual genotype data given K groups
The k that gives the highest probability of the data is the best way of designating the subpopulations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What do Bayesian methods also allow for?

A

They also allow for admixture (interbreeding) between genetic groups and resulting in mixed individual ancestry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is an admixture or Structure plot?

A

A graph that visualizes each individual’s genetic assignment
Each individual is a column, and each genetically distinct cluster (corresponding to a genetic background) plotted in a different colour.
The genetic composition of each individual column shows how much of their genetic background can be assigned to each of the K clusters.
If there is complete population structure and every subpopulation is genetically distinct, then all the individuals within each subpopulations will have the same solid color bar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does underestimating structure lead to?

A

Management actions at the wrong spatial scale, inflating population size estimates and preventing protection under legislation such as Species at Risk Act,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What can overestimating structure lead to?

A

Costly conservation actions for “rare taxa” such as translocations or habitat restoration to unnecessarily promote gene flow.

17
Q

How can you tell on a graph which K is best?

A

Where the plateau on the graph is the best K value, if there is a big jump then take the lower K value

18
Q

What is principal component analysis?

A

A statistical procedure that uses an orthogonal transformation to convert a set of observations into a set of values linearly uncorrelated variables called principal components
For n observations of p variables the number of principal components will be (n-1,p).
In all cases the first principal component accounts for as much of the variability in the data as possible, and each succeeding component explains the highest amount of variation remaining from the preceding components

19
Q

What is the goal of Principal Component Analysis?

A

Goal is to reduce complexity of the dataset to ease visualization, attempt to show as much genetic variability as possible in a few axes. Individuals close to each other on the axes are more genetically similar

20
Q

What are pros and cons to Bayesian methods?

A

PROS: can be used to find K and can identify admixed individuals
CONS: assumes HWE within populations, assumes linkage equilibrium between loci, computationally intensive.

21
Q

What are pros and cons of PCA?

A

PROS: no assumptions made about underlying genetic model, computationally quick, can handle large data sets
CONS: inherently explorative, generally cannot be used to find k, cannot be used to identify admixed individuals