Lecture 2 - RH Flashcards

1
Q

What is BLAST used for?

A

To search for sequences by typing those sequences in the search bar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the output of a BLAST search?

A

A pairwise alignment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is found using BLAST searches?

A

Relatives can be found using BLAST.

BLAST can check whether a protein encodes:

  • Known protein with known function
  • Known protein unknown function
  • Unknown protein known function (familiar functional domains)
  • Unknown protein unknown function
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are some uses for BLAST?

A

Study evolution

Discover function

Find crucial features (motif finding)

Identify causes of disease (detect variable sites in alignment)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is homology?

A

Genes or proteins are homologous if they share a common ancestor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the types of “ologies” hat can exist between proteins or nucleotides?

A

Homology (common ancestor)

Orthology (Descent from speciation)

Paralogy (Duplication event causes separate evolution)

Xenology (Horizontal transfer event)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When are proteins said to be homologous?

A

> 25% identical aas (likely)

18 - 25% twilight zone (needs further investigation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is a pairwise alignment interpreted?

A

Choose 2 sequences

Select algorithm that generates score

Allow gaps (insertions/deletions)

Score reflects similarity

Alignments can be global or local

Probability that the alignment occured by chance is estimated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which is more informative; protein or DNA sequences?

A

Protein:

It contains 20 rather than 4 characters

Codons are degenerate

Protein sequences offer a longer “look-back” time

DNA sequences can be converted into protein sequences and used in pairwise alignments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why are DNA alignments used if protein alignments are more accurate?

A

To identify cDNA, non-coding regions of DNA, and to identify DNA polymorphisms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the types of alignments?

A

Uniformative (in exercise book)

Ungapped alignment (“”””)

Gapped alignment (“”””)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why are gapped alignments used?

A

It is more maleable due to accounting for frameshift mutations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What BLAST tools are available on NCBI?

A

Nucleotide BLAST

Protein BLAST

Translated BLAST

Genome BLAST

smart BLAST (can be used to create a phylogenetic tree)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the difference between global and local alignments?

A

Global alignment: Aligns all of 2 sequences and finds global similarity

Local alignment: Looks for regions of similarity. (99% of cases)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is dynamic programming?

A

Dynamic programming (usually referred to as DP ) is a very powerful technique to solve a particular class of problems. It demands very elegant formulation of the approach and simple thinking and the coding part is very easy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why is dynamic programming used in alignments?

A

Finding optimal alignments is a computationally difficult task so problem is divided into states and for each state a decision is made

17
Q

What are the 3 steps of dynamic programming for sequence alignments?

A

Initialization

Scoring the matrix

Traceback

18
Q

What happens to similarity score when a gap is introduced or extended?

A

a large penalty is applied to the score

19
Q

What is a PAM?

A

Point Accepted Mutation

*Number of changes/100 amino acids

20
Q

Which matrices are used by BLAST?

A

BLOSUM matrices

21
Q

When is BLOCKS used?

A

For conserved motifs

22
Q

What does the number after BLOSUM represent?

A

Percentage identity based on observed alignments

23
Q

What are the penalties applied by BLOSUM62 matrix?

A

Opening a new gap = -11

Extending a gap = -1