9. Protein structure prediction Flashcards

1
Q

The structure

A

is determined by the aa seq.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The folding will

A

correspond to the energy minima

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why protein modelling?

A
  • Structure is important for function

- Gap between known sequences and structures is huge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Common structures

A

a-helix
b-sheet
b-turn
random coil

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Program available for secondary structure prediction

A

DSSP

STRIDE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Different approaches

A
  • Statistical methods
  • Knowledge-based methods
  • Machine learning
  • Consensus method
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Evaluation 2nd structure

A

Q3

Sov

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Q3

A

fraction correctly predicted residues - Accuracy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Sov

A

Fractional overlap of segments - ability to pick up correct structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How is Q3 and Sov used

A

They can only evaluate the method itself not your prediction as it looks at already known structures
- one should use both Q3 and Sov for good evaluation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Q3 equation

A

Correctly predicted residues/total residues =Q3%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Example of machine learning methods

A

PSIPRED
PHD
Jnet

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Membrane topology

A

look if protein is bound to a membrane or not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Common characteristics of TM region (3)

A
  • W or Y at the edges of the membrane
  • ca 20 hydrophobic residues inside the membrane
  • Positive inside
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Database to identify TM regions

A

TOPCON

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

TOPCON

A

accounts different methods and make a consensus

- gives exact positions of TM region in the bottom

17
Q

3D structure prediction

A
  • More complex than secondary

- Two main approaches

18
Q

Two main approaches of 3D modelling

A
  • Homology modelling

- Ab initio modelling

19
Q

Homology modelling

A
  • use known structure as template
  • higher seq similarity -> better prediction
  • alignment and template selection is very important
20
Q

Methodology (entire 3D)homology modelling (5steps)

A
  • Identify related structure
  • Align target seq to template structure
  • Generate “known” backbone and side chains
  • Generate loops
  • Refine
21
Q

Template selection

A
  • select optimal crystal structure
22
Q

Common approaches for template selection

A

sequence similarity
homology (both of these: BLAST, Prosite, Pfam)

Fold recognition

23
Q

Fold recognition (threading)

A
  • structure more conserved than sequence
  • compare to a known library of folds (CATH, SCOP
  • align sequence to a fold
  • ENERGY CALCULATIONS
  • does not require a similar sequence
24
Q

Loops

A
  • exposed regions are more variable than the protein core
  • often important for protein function
  • loops longer than 5 residues is hard to model
25
Approaches short, med, long
short - analytical approach medium - database approach long - fragment based approach
26
Optimisation of model
- use ENERGY MINIMISATION to fix bad parts * side chain clashes * bad peptide bond angles
27
Common errors (of 3D template) 5st
- side chain packing - distortions/shifts - bad loops - misalignment - bad template
28
Ab initio modelling
- predict structure without any prior knowledge but the sequence - used when template-modelling fails - works best for small proteins - great computational cost
29
CASP
- evaluation of 3D structure methods | - similarity of model to the native structure
30
Function prediction
``` predict the function - use results from eg * signalP/TargetP * 2nd structure pred * topology predictions * 3D predictions - use machine learning methods to connect results usually given as gene ontology terms ```