Lecture 3 - RH Flashcards

1
Q

Why do we perform alignments?

A

To find homologues

To see if homologue is associated with protein structure

To determine function

To determine evolutionary relationships

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How much information can be obtained from aa sequence, pair of homologues, and many aligned sequences?

A

aa sequence = very little

pair of homologues = some info

many alignments = a lot of information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why are MSAs performed?

A

To elucidate functional info within protein sequence

To perform evolutionary analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

In a pairwise alignment what does the positioning of 2 aa’s at the same point imply?

A

That they have the same role in homologous proteins

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What happens when more sequences are added to an alignment?

A

More accurate results revealing results that are not obvious in a pairwise alignment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is a MSA performed?

A

Find sequences you wish to align

Prune them if necessary

Run multi-alignment algorithm

Inspect the output

Remove disruptive sequences and repeat

Identify conserved aa’s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How are alignments scored?

A

Alignment arranged so a maximum number of characters in each sequence are matched

Scoring is done according to the Sum of Pairs (SP)

Each column is scored by summing all possible matches, mismatches, and gaps.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the sum of pairs result indicate?

A

2^n where n is the final score of the Sum of Pairs. This is the number that represents the number of times that a sequence is identical due to homology rather than pure chance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What can MSAs tell us?

A

Most highly conserved residues may correspond to the active site.

Insertions and deletions are probably in surface loops

Conserved pattern of hydrophobicity with spacing 2 may indicate beta sheet

Spacing 4 may indicate alpha helix

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are some ways to construct phylogenetic trees?

A

Distance matrix methods

Maximum parsimony methods

Maximum likelihood methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Neighbour Joining?

A

Similar to UPGMA using stepwise build

Corrects for evolutionary rate

Creates an unrooted tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is character based maximum parsimony? What is the problem with this method?

A

Based on sequence characters rather than distances

Trees are constructed by searching all possible tree topologies and looking for one with the least changes

Problem: Computationally expensive and so not all sites are used

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is maximum likely based on?

A

Searches for the evolutionary model that has the highest likelyhood of producing the observed data

Uses a substitution model that incorporates probability.

In practice every position in alignment is scored based on probability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is bootstrapping and what is it used for?

A

A method of statistically validating a tree.

Data is resampled (generally 1000 times) after being slightly

Statistics are hard to define if a node is present 700 times from 1000 then that means 95% probability that it is in the correct position.

*Low bootstrap numbers are bad news

How well did you know this?
1
Not at all
2
3
4
5
Perfectly