Alignment post-processing & variant calling Flashcards

1
Q

Why is it difficult to call SNPs?

A

What is a SNP and what is noise?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Outline the workflow from mapping and alignment

A

Sorting, filtering and indexing
Local realignment
Remove duplicates
Recalibrate base score qualities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is local realignment? How to do it?

A

Align each read again in a way where they do not span indels or have clusters of mismatched bases (here, they are likely to be misaligned).
Find relevant regions and do multiple sequence realignment here.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why do we have duplicates in our data?
How do we deal with them?
What are the drawbacks of doing this?

A

PCR amplification
We keep the read with the best quality
Natural duplicates and sequencing errors are not taken into consideration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to recalibrate base qualities?

A

Find reported score, position in read, dinucleotide context
Count number of times the site was a mismatch
Calculate Phred quality score from number of mismatches to ref. +1 and number of observed bases +2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is variant calling and genotyping?

A

VC: Identify polymorphic sites
Genotyping: Finding the genotype of an individual at a specific site.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Name some programs for genotyping and describe method

A

Samtools, GATK

Bayesian calculations based on conditioned probabilities, Hardy-Weinberg, prob. of wrong base call and more.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is variant filtering?

A
To remove FPs, use known polymorphic sites as training set and use the program to filter away sites that look different.
If this (soft) is not possible, use quality, depth, strand bias, and more instead.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly