How are FASTA and BLAST different from NW and SW methods?
NW and SW are good for doing pairwise alignments between two sequences of interest.
But it’s way too slow to compare millions of pairwise alignments.
FASTA and BLAST are heuristic methods that can do this, though they are not guaranteed to find the globally optimal alignments (definition of heurisitic)
Though, both FASTA and BLAST are loosely based on the Smith-Waterman algoritithm
What is the nr database?
Non-redundant database (non-currated / largest)
Give the steps of the FASTA algorithm
List five types of BLAST
What does E mean in BLAST/FASTA?
THe expected number of scores equal or greater in identity to the alignment you’ve found.
Give the steps of BLAST
How do you tell if an alignment is significant?
Searching a database
Single pairwise alignments
- Monte-carlo test to calculate p-value for the real S
What is the extreme value distribution?
The extreme value distribution is obtained by choosing the largest values from N independent and identically distributed samples from a population.
In FASTA and SW, what type of score distributions to you get in the output?
- Observed (p-value)
give the formula for an expect value (E)
E = kmne^-λs
k: constant
m: query sequence length
n: database sequence length
λ: scaling constant
s: SCORE