McPherson Lectures 1-3 Flashcards
(34 cards)
what is directed evolution
the creation of new protein activities through the principles of natural selection to chang: pH and thermal stability, optimum pH and temp, binding properties (i.e. km for substrates) and catalytic properties
although the ultimate goal is to be able o design a protein de novo which performs a certain function, why do most directed evolution experiments explore a sample of proteins with activity
most random sequences won’t fold into a functional protein so existing proteins already have stable folds, display functional properties and may have minor activities which can be enhanced by sequence changes
what are the methods for introducing variability
error prone per, DNA shuffling, family shuffling or random mutagenesis
why is sparse sampling used in directed evolution experiments
possible sequence space = 1.3x10^55kg of protein = 2x mass of earth so sparse sampling just uses enough diversity to select interesting proteins
how does error prone PCR work
reduce the PCR fidelity e.g. using non-proof reading Taq polymerase, high conc of mgcl2, increased extension time or altered relative nucleotide conc to introduce variability
how many mutation per codon are produced in error prone pcr
1
why is recombination usually performed after error prone pcr
it is unlikely to lead to concerted accumulation of beneficial mutations
describe the process of DNA shuffling
- fragment DNA by physical method e.g. dnase 1
- heat denaturing allows random annealing which self prime upon addition of DNA polymerase. repeat cycle to build up size of whole gene
- resulting genes can be amplified by PCR to form a variant library which can be cloned into a vector
- screen for desired activity then carry out saturation mutagenesis to improve
why can error prone pcr/dna shuffling not be used to explore complete sequence space
limited by codon usage- with 1 mutation/codon tryptophan, for e.g., can only be mutated to 5 aa
how are protein coding sequences isolated once the desired protein is selected for
a physical association between the expressed protein and its coding protein has to be created. in vivo it can then be transfected into a cell then cells are grown and DNA coding sequence is recovered as plasmid DNA or by PCR. in vitro can be done via ribosome display and reverse transcription, mRNA display or micro fluids encapsulation
what are 3 possible methods for selection
survival phenotypes, fluorescence selection, chromogenic enzyme activities
how can we overcome the problem of not being able to select for “better” mutants if you find something that allows the cell to grow
downregulate protein expression or introduce a degradation signal to select for mutants which allow growth under these pressures
describe screening
pick individual colonies, put them in a 96 well plate, grow, lyse and assay for activity
what is the problem with screening
low throughput, inefficient and expensive if most variants are inactive
which substrate does Cyt p450 usually use
camphor
which substrate did they try to change cyt p450 to use
naphthalene
describe the first stage of the directed evolution experiments performed on cytochrome p450
error prone PCR was used to generate 200,000 clones of which 32000 were screened using fluorescence and found 3 variants
which 3 variants were further tested following error prone pcr and fluorescence screening of cyt p450
M7-6H which had 11 fold higher activity and contained the mutations R280L and E331K, M7-4H 5 fold higher activity and E331K mutation and M7-8H with 5 fold higher activity and mutations E331K and C242F
where was the E331K mutation discovered by directed evolution situated
not near the active site- would not have been an obvious choice
what is APEX
a haem peroxidase
why was directed evolution carried out on APEX
it can be used for intracellular specific protein imaging by EM and spatially resolved proteomic mapping but it isn’t as efficient as one might like
describe the experiment to produce APEX2 via directed evolution
first a base enzyme was generated which can be expressed in cytosol. they then generated a library of variants using error prone pcr, cloned them into ecoli then transformed into yeast. treated with substrate to label the variants then used FACS to select for improved activity. in 2nd and 3rd round increased selection pressure was applied to test for efficient haem incorporation. found APEX2 and VPGAPEX and in both most mutations were away from the active site
what was the major mutation in APEX2
A134P
what are the applications of APEX2
as reporter to generate contrast for EM/VM, mass spec analysis