Transcriptional control of gene expression Flashcards

(58 cards)

1
Q

DNA structure in solution

A

Right-handed, 10.5bp per turn, close to B-form (grooves roughly equal depth, bp sit on helical axis). Bases attach to 2’deoxyribose by N-glycosidic bond, exist primarily in amino+ keto tautomer forms. Helps transcription start by allowing seq-specific recognition of dsDNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

DNA structure in solution stability determined by

A
  • base pairing(4 Watson-Crick base pairings isomorphic, consistent with DNA’s uniform structure- H bonding->stability and specificity),
  • stacking (stability via hydrophobic effect (entropy) and favourable electrostatic/vdW interactions. Stacking maximised by propellor twist of 16-18o. Favours extended DNA conformation as bending reduces stacking. Optimal in 5’-purine-pyrimidine-3’ bp steps (e.g., 5’-G-C-3’), hence sequences like TATA most flexible as minimal H bonding and minimal stacking.
  • electrostatic repulsion between -ve phosphates. Inter-strand destabilises double helix (can be reduced with higher cation concentration). Intra-strand favours extended conformation (countered by bound proteins that facilitate bending. Follows Coulomb’s law (F=k(q1xq2/r2))
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

B-form varies in structure (variable geometry of base-pair steps) (shown by X-ray):

A

Roll (opening along bp axis +20/-10o); Twist (rotation per base pair ~36o); slide (displacement along bp-axis +2 to -1 A; High propellor twist limits degree of slide)
Also seq dependent variation in dimension of groove: minor groove narrower and major groove wider in AT rich seq.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Protein-DNA recognition

A

initial docking can involve local variation w/ seq-dependent twist/roll/slide. True specific recognition-> sampling bases in the grooves.
Major and minor grooves formed by base pair displacement from helical axis: 120/240o angle between glycosidic bonds in each bp (not rotationally symmetrical around long axis of helix)-> groove width. Extent of bp displacement from helical axis-> groove depth.
Major groove accessible to amino acid chains and info-rich: each bp has unique profile of H-bond donors and acceptors, methyls and non-polar Hs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Other DNA forms

A

B ID’d when prepared under high humidity and A under low humidity for X-ray.
* Z form can occur with purine-pyrimidine repeats (CGCGCG e.g.), left handed, alternating anti/syn positioning creates zigzag (purine N9-C1’ bonds syn)- creates torsional stress, bases not planar. Dynamic and less stable than B, more energetically costly to maintain in physio conditions. Major groove v shallow, wider, fewer specific interactions.
* A form right handed, broader and more compact than B, 11 bp/turn, 0.26nm rise/bp, major groove v. deep and much more narrow, inaccessible to amino acid sidechains, minor groove shallow and info-poor (helical axis in major groove). W-C bp RNA has A form- hard to recognise, cellular RNA rarely found like this. In mammalian cells, long dsRNA often recognised as foreign, triggers interferon response.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

RNA special features

A

Can have non-canonical bps, e.g., G=U wobble bp. 27 different bps involving 2 H bonds (+GC w/ 3H bonds)- many observed in r/tRNAs, riboswitches, ribozymes, spliceosomes, etc, allow more versatile conformations such as G quadruplex structures. Remember only WC bps are isomorphic.
Special features: structured RNA can be catalytic (e.g., ribosome, spliceosome) or bind small molecule ligands (riboswitches- can have 1 of 2 conformations depending on bound ligand); Mg2+/proteins often needed for tertiary fold+ catalytic activity; 2o structure formation usually via W-C (canonical) bps; X-ray structures show numerous structural motifs involved in tertiary folds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

RNA structural motifs (5)

A
  • Base-triples, e.g., one A binds U on W-C face and another U on Hoogsteen face (also found on G) to form U:A:U triple. Base triples-> triple helices, e.g., expression and nuclear retention elements (ENE) in viral RNAs+ cellular non-coding RNAs. 3’ A-tract forms triple helix with 2 internal U tracts- v. stable. ENE stabilises RNA+ cause nuclear retention (no export to cytoplasm)
  • Pseudoknots: base pairing of loop seq w/ complementary seq outside the stem closing the loop, stabilised by co-axial stacking of 2 helices
  • Complex 3D folds, similar to globular protein, made of multiple stem-loops, e.g., bacterial 16S rRNA.
  • Helix-turn-helix(HTH): common for seq specific binding for DNA binding dimers/tetramers; usually recognise “half sites” (inverted repeats) with 1 turn separation, bind as homodimers; recognition helix R fits major groove seq-specifically, Stabilisation/positioning helix P increases affinity+ stabilises R helix, sits across major groove. Helices at right angles to cover all angles of DNA spiral.
  • Handshake motif
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

RNA recognition

A

by non-bp paired regions (seq-specific) or by shape. Fully WC bp dsRNA forms uniform A-form helix: major groove inaccessible, long fully dsRNA usually recognised as foreign.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Investigating gene expression mechanisms by…

A

clone+ seq gene(can use genomic clone (for studying transcription/splicing) or cDNA clone (to study translation); assay system in vivo or vitro; ID cis-acting seq (essential, typically act as binding site for trans-acting factors (protein or RNA) by finding consensus seqs/ investigating effects of mutations; ID trans-acting factors; investigate cis+ trans combine to ctrl f(x).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Assays in vivo vs in vitro

A

can be in vivo (physiological conditions but little ctrl over variables, hard to monitor reaction intermediates, can be unphysiological if test gene/expressed RNA too abundant) or in vitro (precise variable ctrl, can detect activities then purify trans factors, often inefficient, unphysiological). Also in silico approaches (deep seq, new hypothesis generation for testing in vivo or vitro).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Prokaryotic transcription general points

A

RNA transcribed from template/antisense/non-coding strand. Initiation: rate-limiting as no primer, promoter specifies TSS, consists of cis elements, usually shortly upstream on TSS+1, recognised by RNA Pol+/ TFs. Most RNAs start w/C/G, then synthesis/elongation 5-3’ using NTP substrate (NMP incorporated, PPi released) ~45nt/s, error rate 1/10000.
NB: NAD+ cap stabilises prokaryotic mRNA (is a mechanism for exp ctrl): @ some promoters with TSS+1=A, NAD+ can be incorporated at +1 by RNAP. Contains ADP-ribose-nicotinamide, only occurs at some promoters (seq of promoter is important)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Bacterial core promoter elements

A

Asymmetric. In downstream order: Up element at highly active rRNA promoters~20 bp AT rich element recognised by C-terminal domain of alpha subunits (bind via minor groove); -35 box (TTGACA) recognised by sigma region 4, down mutations inhibit initial RNAP binding; -10/pribnow box (TATAATG) melted non-template strand recognised by sigma region 2, down mutation inhibits promoter melting, optimal distance from -35 box 16-18bp.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Bacterial RNAP

A

single core (alphax2+omega (assembly), beta+ beta’ (active site) transcribes all m/r/tRNA. More abundant than sigma subunits (promoter recognition), so both core+ holoenzyme present.
Alpha N-terminal and C-terminal domains connected by flexible linker. Active site in cleft at base of beta, beta’ claw-like pincers: downstream DNA enters cleft between pincers (mobile, tightly binds ~20 bp of downstream DNA when RNAP transcribing. Beta=flap, beta’= upper jaw)
Core binds non-specifically (“loose”), non-specific initiation from multiple locations on both strands. Holoenzyme non-specific binding reduced 1000-10000-fold, specific promoter affinity increased 1000 fold, accurate initiation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Bacterial RNAP sigma subunit domains

A

2+4 have helix-turn-helix motifs, recognise -10+-35;
Domain 1.1=-ve DNA mimic, suppressed inappropriate DNA binding by free sigma (interacts with sigma 4, stops DNA binding) and holoenzyme (occupies downstream DNA binding cleft, reducing non-specific binding, displaced upon promoter binding).
3.2 linker region occupies RNA exit channel in holoenzyme beneath flap

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Sequence during initiation of prokaryotic transcription

A

Holoenzyme binds at -35 box (reversible) promoter binding ejects sigma1.1, pincers clamp around downstream DNA->
* closed binary complex (-55 to -10)-> promoter melting (irreversible) at -10 box from -11 to -3 begins with base flipping, where bases A-11 and T-7 flipped out without ATP use into specific binding pockets on sigma2 to form 14nt bubble (detected by KMnO4 probing)-> open binary complex (-55 to +20)->
* Initial transcribing complex (first NTP binds with low affinity by bp to template, subsequent NTPs bind with 10x affinity due to bp and stacking interactions. Phosphodiester bonds form 2-9 nts, aligned by RNAP active site, and initiation bubble expands to 23nts in cycles of abortive initiation where short RNA transcripts are released. Productive transcription (sigma 3.2 still blocks RNA exit channel), then promoter escape after 8-9nt:
* Sigma released, bubble collapses from 5’end to 14nt to form ternary elongation complex (35bp footprint)- highly stable and processive. Sigma 3.2 still blocks RNA exit channel.
NB: promoter escape not easy due to: sigma2/-10 and sigma4/-35 (and optional alpha-CTD) contacts with DNA, 3.2 linker/RNA exit channel and sigma4/beta and sigma2+sigma3/beta+beta’ contacts with RNAP need to be broken. New interactions stabilise elongation complex: 9bp RNA:DNA complex, clamp around 20nt upstream DNA, ssRNA (6-10nt) contact with exit channel.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

IDing and characterising promoters:

A
  • For specific bases/short seqs: Find consensus seqs by alignment to TSS e.g., -10/Pribnow box or -35 box (closer match= stronger promoter- permanent up/down tuning)
  • Examine natural promoter mutations- affect mRNA quantity but not seq. Can be UP/DOWN, latter more common, UP more likely for weaker promoters.
  • Generate targeted mutations guided by info on consensus seq.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

For ID of more extensive regions of DNA, biochemical mapping: DNA melting

A

KMnO4oxidation of pyrimidines in ssDNA regions. KMnO4 reacts preferentially with unpaired thymine, oxidizing C5-C6 double bond+ adding OH to both Cs-> add alkali to cleave phosphodiester backbone @modified positions. Alternative: block primer extension reaction to monitor position of modified Ts. Sensitivity to KMnO4 helps monitor where DNA unwinds (e.g., promoter melting during initiation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Biochemical mapping of DNA-protein interactions (also for RNA-protein):
EMSA/ gel-shift assay (in vitro) procedure

A

mix end-labelled DNA with pure protein/cell extract, run native gel, then image. Seq specificity demonstrated by titrating in xs unlabelled DNA of same/random/mutated seq. Variant on this: supershift, for when you think you know the binding protein (DNA probe incubated w/ cell extract, test for candidate protein with antibodies- DNA:antibody:protein complex migrates even slower than DNA:protein complex on gel).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Footprinting (in vitro

A

if and where protein binds. one strand of DNA end-labelled, incubate with and without protein, digest mildly (single hit)- protein protects DNA where bound. Run high-res denaturing urea gel, image and compare gels with+ without protein (gaps in protein gel where protein bound)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Modification interference

A

Modification interference: find where chemical modification of DNA prevents protein binding. 1 strand of DNA end-labelled; DNA modified (average 1 mod/strand w/ ENU (phosphates) or DMS (methylates purines)); incubate with protein; separate bound+ unbound DNA (e.g., EMSA); purify DNA from both modified DNA and post-incubation DNA and cleave at modified sites; run urea gel: missing bands where modification prevented protein binding and therefore protecting DNA on DNA bound by protein compared to input DNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

ChIP-seq

A

ChIP(-seq) (chromatin immunoprecipitation): which DNA sites specific proteins bind to in vivo, mostly used for eukaryotes (TFs, RNAPs, histone post-translational mods). Proteins bind genomic DNA in vivo; in vivo crosslinking with formaldehyde, then cell lysis and DNA fragmentation into ~200bp fragments (sonication or MNase); immunoprecipitation with antibody for query protein; reverse crosslinks (heat), purify DNA; NGS seq and map assembly, then seq analysis and motif ID (computational) to ~100bp resolution (low res but genome-wide).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Prokaryotic Transcription Regulation

A

Achieved in 2 ways: classes of genes can be co-ordinately ctrled by switching sigma factors (7 types in E coli), e.g., by heat shock.:
Sigma70=general, TTGACA and TATAAT -35 and -10 with 16-18bp separation.
SigmaN=nitrogen starvation, CTGGNA -20 box and TTGCA -10 box with 6bp separation.
Regulation by activator/repressor proteins: induction by small-molecule substrate inducer (metabolising enzymes switched on). Repression by availability of a nutrient co-repressor (biosynthetic enzymes switched off). Inducers and co-repressors are allosteric regulators or activator/repressor proteins (bind site remote to DNA binding site), which have 2 conformations: 1 stabilised by inducer/co-repressor and 1 binds DNA with higher affinity. Often bind as dimer/tetramer to palindromic binding sites

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Lac operon general rules/qualities

A

Lac operon shows +ve and -ve regulation in response to lactose/glucose presence to reduce waste of biosynthetic capacity by unnecessary enzyme production. LacZ, Y+A fully switched on by and gate logic: Active when lactose+ and glucose-(cAMP elevated).
Low basal levels of transcription (leaky promoter)-> enough LacY product (permease) to allow Lac uptake into cell when available. Side reaction of beta-galactosidase-> allolactose (inducer that binds Lac repressor).
Regulated promoter has suboptimal -10 and -35-> max levels of transcription require CAP protein (activator). Binding site for CAP upstream of -35 but not overlapping, centre of the two (A/T)GTGA half sites are 10bp/1 helical turn apart.
Upstream of -35 not AT rich, not a great binding site for CTD binding (no UP element).
Overall, Lac -vely regulated by Lac repressor (bound when no lactose, released by inducer (lactose) binding) and +vely regulated by CAP (binds DNA when cAMP bound, promotes DNA binding- note that cAMP accumulates in glucose absence). If repressor and activator both not bound, transcription only at around 2% or max bc RNAP binding v. weak (no UP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

In vivo assay evidence for Lac operon regulation

A

When repressor mutated, lac active whenever lactose present;
When operator mutated (repressor binds weakly), more leakage when both lactose and glucose present;
When CAP mutated, lac activity lowered when lactose present; when -35 mutated, lac activity lowered regardless of conditions

25
negative regulation by Lac repressor
(transcribing LacI mRNA encoding lac repressor: dimer of dimers/tetramer only active at Lac): only ~10 repressor tetramers/cell (constitutive but weak (v. weak -35). Low basal levels of transcription (leaky promoter)-> enough LacY product (permease) to allow Lac uptake into cell when available. Side reaction of beta-galactosidase-> allolactose. Operator/palindromic binding site for Lac repressor overlaps TSS/RNAP footprint. -Operator recognised by N-terminal HTH motif of one dimer docking major groove; additional binding sites at +410, -80, at least 1 needed for full repression; blocks RNAP binding; inducers (allolactose) bind repressor, reduce affinity for operator DNA but only while lac available (inducer metabolised by beta-Gal); artificial inducers bind Lac repressor+ not metabolised-> sustained activation.
26
Lac repressor DNA binding
largely non-specific, as non-specific binding sites in large xs over specific. Affinity of repressor to non-specific DNA enough for almost all repressor to be bound. Non-specific binding accelerates specific binding by x100-1000 through facilitated transfer (i.e., restricts effective diffusion volume for repressor which can slide along DNA (1D diffusion), intersegment transfer (helped by tetramer structure, walks along DNA)
27
+ve regulation of Lac operon by regulated promoter (CAP):
2x22kDa dimer. cAMP bound form binds symmetric site upstream of RNAP via HTH: 7aa positioning/stabilisation helix 2 makes non-specific contacts w/ backbone, 4aa beta turn, 9aa recognition (F) helix 3 docks major groove specifically by H-bonding to GC bp complementary to arg+glu and vdW contacts. cAMP binding induces folding of 3 additional turns of helix in dimerization domain, recognition helices rotated 60o and brought 7A closer-> positioned to bind DNA so only cAMP-CAP complex can bind. DNA-CAP binding not sufficient for activation- also need CTD. CAP +ve ctrl mutants can’t activate but can bind, some RNAP alpha-CTD mutants still good for general transcription but not activated by CAP. Activator bypass experiment replace alpha-CTD w/ artificial substitute, which interacts artificially with CAP- show CAP only recruits RNAP: DNA bound CAP interacts with C-terminal domain of alpha subunit to recruit it, compensating for suboptimal -35/-10.
28
Euk chromatin packaging: naked DNA and micrococcal nuclease
Naked DNA extends due to -ve/-ve backbone repulsion, base pair stacking. Light microscopy shows hetero+ euchromatin. Electron microscopy shown interphase chromatin at low cation concentration like beads on a string, further packing at physiological ionic strength (like bead string wrapped round a pole). Extensive micrococcal nuclease treatment with histones protecting DNA show 147bp nucleosome core particle (invariable), with limited digestion showing nucleosome and linker length around 200bp- linker length can very by cell type 20-60bp.
29
Histones
octamer is 2 each of H4, H2B+ H2A, H3 in order of size (11-14kDa), + linker H1 at around 21kDA. H2s form homodimers, H3+H4 form tetramers. Core histone fold is 3 alpha-helices (short-long-short); Linker histone +ve at N and C tail, increases protection of linker from micrococcal nuclease from 147 to 166bp chromatosomes. Several DNA contacts. Nucleosomes with H1 more ordered zigzag arrangement than beads on a string (condenses 10nm to 30nm fibre together with N tails of core octamer, x40 compaction, with further levels having chromatin loops attached to proteinaceous nuclear scaffold)- not accessible to RNAP/DNase I.
30
Histone N terminal tails
rich in +ve Lys/Arg; post-translational modifications (ser-P, Lys-acet alter charge, arg/lys mono/di/tri methylation alter binding site) change affinity to DNA; can be cleaves from main histone w/ trypsin (accessible, extended, flexible).
31
Core euk nucleosome
resolved w/ Xray. 1.65 left-handed wraps of DNA around histone. N-tails stick out, stabilise DNA. H3/4 tetramer interacts w/ DNA ends and 60bp middle, H2s interact 30bp flanking central 60. 14 sites of contact where minor groove approached histone core, 33 H bonds w/ backbone and 7 H bonds to bases in minor grooves. +ve histone charge helps side-by-side DNA packing. Beads on a string=x6 compaction.
32
Nucleosome positioning
Nucleosome positioning: can sometimes be positioned to specific seq: ~5bp periodicity of AT:GC rich seq favours positioned nucleosome (low stacking, high flexibility), DNA-binding proteins can maintain specific nucleosome-free areas. Genome-wide mapping of Micrococcal nuclease treated chromatin (ChIP seq for PolII density) show: nucleosomes repositioned during gene activation but still resent in active genes; strong positioning of nucleosomes at active TSS.
33
DNase sensitivity experiments
DNase I sensitivity experiments: Experiments involved chromatin digestion w/ seq-non-specific DNase I (accessible DNA), DNA extraction and cleavage with restriction enzyme (seq-specific), southern blot and probe for expressed and unexpressed genes, then compare intensities of bands in preparations in which chromatin digested with increasing DNase amounts. Showed that expressed genes digested by DNase I 5x faster than unexpressed, therefore expressed genes, while not nucleosome free, are associated with lower level chromatin packaging.
34
DNase hypersensitive regions
cleaved 10-20x faster than surrounding sensitive regions. Locations mapped by indirect end-labelling: light DNase I digestion, purify and cut with restriction enzyme, southern blot, probe with labelled fragment that hybridises to one end of target restriction fragment. Hypersensitive sites are: nucleosome free (confirmed by deep-seq), at active promoters/enhancers, transcription factors bound.
35
In vivo packaging of euk DNA (as opposed to in vitro)
don’t really get 30nm fibre except certain cell types, usual arrangement in both interphase and mitosis variable, flexible, dynamic with no uniform structure. Found in chromosome territories, with smaller chromatin domains for intra/inter chromosome interaction.
36
Euk DNA compaction regulation
loss/modification of H1 (ChIP shows depletion near active promoters); histone tail mods by HATs recruited by transcriptional activators/ HDACs recruited by transcriptional repressors (acetyl-Lys uncharged, worse DNA packing, also recognised by bromodomains often found in transcription factors, reinforce activation); methylation (of Lys) recognised by chromodomains in repressive proteins.
37
Archaea, prok and euk genome and transcription differences
Note on archaea+ prokaryotes: Eukarya+ archaea (circular genomes+ cell division apparatus more resemblant of bacteria, no organelles or complex cytoskeleton) more similar in transcription that prokarya, w/ many Pol II smaller euk subunits related to those in archaea (also have histone-like proteins and nucleosome-like structures)
38
Key differences of euk transcription compared to bacteria
monocistronic (no operons), more stable+ processed (from pre-mRNA) mRNA, tighter ctrl due to background DNA compaction. Note that Kozak seq in euk equivalent to prok TATA. Consensus sequences are helpful in studying new/extinct species. Upstream seqs include proximal promoters (also in coding region e.g., snRNAs) and distal enhancers (also in introns/downstream of gen). For both euk and prok, RNA synth 5’-3’, template read 3’-5’. Start site +1.
39
RNAPs in euks
I in nucleolus (pre-rRNA, genes generally in tandem-arrays of 150-200 copies) * II in nucleoplasm (pre-mRNA+ most snRNAs. Requires accessory factors for regulation, initiation, processivity+ accuracy) * III in nucleoplasm (pre-tRNAs, small stable RNAs (some snRNAs+ 5S rRNA), genes scattered throughout genome). Capping and splicing sometimes occur in I and III transcripts. 10-12 subunits, with largest 3 (RPB 1-3) homologous to prokaryotic beta’/beta/alpha. All involve DNA melting, DNA and nascent RNA exiting through separate channels. Evidence that euk transcription happens in discrete sites/ transcription factories. Topoisomerases release super-helices ahead+ behind polymerase. Can be distinguished in vivo/vitro by sensitivity to alpha-amanitin (Pol II most sensitive, I least) Core RNAP II insufficient for accurate initiation.
40
RPB 1 has unique C-terminal domain… (6 points)
* with linker followed by multiple heptad repeats, consensus YSPTSPS. * Flexible, intrinsically disordered not seen in X-ray structure, easily cleaved. * 2 main forms of RPB 1 found on SDS-PAGE: Pol IIa has CTD hypo-Pi, associated with initiation; Pol IIo has CTD hyper-Pi, associated with elongation- pi and de-pi’d by different kinases during transcription. Major pi-acceptors are Ser2+5 in consensus. Depending on Pi state can bind initiation/elongation factors, histone mod enzymes, RNA processing factors. Initially de-pi’d during initiation, early Pi-S5, later removed, later Pi-S2. * CTD essential (removal lethal), conserved (with different repeat numbers in different species)+ has no analogous structure in Pol I/III. * Joined to Pol II close to RNA exit channel to deposit RNA processing factors on pre-mRNA. * Largest subunit of Pol II, may be linked to ctrl of Pol II processivity+ linking PolII transcription to splicing, PolyA-tion and mRNA export. Yeast proteomic analysis ID’d >100 proteins associated with PiCTD- CTD functions as landing platform for factors involved in co-transcriptional processes.
41
RNA Pol II promoter
seqs specify TSS, which can be mapped by alignment of cDNA+ genomic DNA. Downstream promoter element (DPE) = common component of RNA Pol II promoters that don’t contain a TATA.
42
5’-CAGE
(cap analysis of gene exp) selects mRNA by cap->cDNA->seq->mapping reveals 2 main promoter classes: * Sharp/focused- single TSS, associated w/1 or both consensus elements (TATAAA @-25 and INR element at +1 (YAYTCYY), plus additional elements) and includes stronger and tissue-specific promoters * Broad/Dispersed: multiple start sites spread over 150ish nt, lack refined elements like TATA/INR, enriched with CpGs.
43
In vitro transcription assays in euks help assess accuracy and quantity of initiation. Can mutate DNA templates to demonstrate hypotheses:
Cis elements: need only TATAAA or INR for accurate initiation in vitro- if these mutated, lose accuracy. Core promoter properties: 1 copy, fixed position relative to TSS, asymmetric, orientation specific. Sufficient for basal in vitro transcription+ accurate initiation. Major role= specify TSS. Trans factors: transcribes in vitro w/non-specific TSS. HeLa extract shows accurate and efficient initiation. Need additional GTFs for accurate TATA-box dependent initiation. Purification from extracts ID’d TFII, A, B, D, E, F, H. GTFs assemble preinitiation complex with promoter and Pol II. Only needs NTPs added to initiate. Preinitiation complex assembled stepwise: promoter binds TFIID, then A->B->F+Pol II->E->H->use of ATP (to ADP released)-> initiation (melting and abortive initiation) +elongation.
44
GTF roles: Order of TFII s
TFIID and TAFs, A, B, F, E, H
45
TFIID
TFIID has 13 subunits including TATA binding protein (TBP- recognises TATA box (which doesn’t melt!) via beta sheet structure via minor groove. TFIID responds better than TBP to certain transcriptional activators, but they act equally as well in vitro in assays in absence of transcriptional activator) and TBP associated factors (TAFs)-> intercalation of Phe residues between bases, with insertion of 4xPhe-> 80o bending, minor groove widened and flattened (TBP contains 2 direct repeats, forms “molecular saddle”, bends DNA). Specificity bc TA bp steps more readily distorted. At TATA box promoters, TBP-> basal transcription. At non-TATA box promoter also need. TFIID nucleates assembly of other TFs, helps determine TSS. TBP= universal TF (also needed for Pol I+ III), highly conserved.
46
TAFs
TAFs (co-activators). TAF1 has bromodomain, recognises Ac-Lys (mutation-> neurodevelopmental disorder). Bind promoter seqs, especially important for promoters w/out TATAs. Some contacted by upstream seq-specific activators (different activators contact different TAFs- combinatorial ctrl needed for specific regulation). Evidence of “variant” TFIID complexes vary in TAF content (possibly tissue-specific). Some TAFs have histone-like folds.
47
TFIIA and B
TFIIA: binds+ stabilises TFIID/DNA interaction. TFIIB: bridges between TFIID and Pol II. Some seq preference for DNA binding. Binds weakly-conserved BRE element upstream of TATA, helps define TSS. N-terminal region interacts Pol II w/ upstream regulators. The rest of the protein made up of 2 direct repeats.
48
TFIIF, E and H
TFIIF binds Pol II stably (elongation+ initiation factor), reduces non-specific Pol II binding. 2 proteins, weak seq similarities w/bacterial sigma. TFIIE: 2 polypeptides In humans, enters complex after F+ Pol II. TFIIH recruited after Pol, has 1 kinase+2 DNA helicase/ATPase subunits. Kinase Pis RPB1 CTD on Ser5, helicase melts DNA and clears promoter (ATP hydrolysis). Also functions in DNA excision repair. NER targeted preferentially to transcribed regions (template).
49
Mediator complex
Mediator complex needed in whole nuclear extract. In humans, 26 subunits, binds Pol II via hyper-Pi CTD, co-activator and co-repressor subunits. SAGA complex (chromatin modifying by acetylation and de-ubiquitination) has similar function.
50
Studying general transcription factors
Studying GTFs: biochemical (cell-free) transcription assays largely capture effects of factors binding proximal promoters in vivo. Prep nuclear extract, add DNA template+ ribonucleotides (include radiolabelled nucleotides), incubate (transcription complexes assemble, transcription occurs), detect by autoradiography. Bottom-up approach: add components and see which action conferred by each component, finds component mix sufficient for a process. Top-down removes components to find what is necessary for the process. In vitro biochemical studies allow detailed mechanistic study but only DNA elements close to initiation site have detectable activity in the assay (reflects fact that naked DNA used unlike in vivo chromatin, so some effects not captured). Genetic in vivo and structural studies complement assays.
51
Activating eukaryotic transcription factors and enhancers
High chromatin packing-> Pol II+GTFs+core promoters not enough for initiation. Additional TFs bind cis-elements, activate in vivo transcription (tissue-specific, developmentally regulated)
52
In vivo transcription/ reporter assays
In vivo transcription assays: transfect plasmid-> mammalian cells (liposomes/ co-precipitate w/Ca phosphate), detect transient exp in 1-3 days (harvest RNA+ analyse accuracy+ amount initiation or use reporter gene). Reporter assay involves deletion/point mutation in test promoter ID cis elements, assuming reporter activity proportional to transcription initiation rate, promoter elements all upstream. Generic/ minimal promoters are TATA box+ TSS. Results: 3’ end/TATA deletion modestly reduces efficiency, but several start sites mean further deletions reduce efficiency more. 5’ end deletions impact transcription showing additional cis elements needed in vivo. Deletions ID upstream elements (UEs): precisely defined by point mutations+ internal deletions, necessary for transcription in vivo, binding sites for activating TFs, e.g., Beta globin promoter has TATA and two UEs, ID’d by point mutations -100 to +1. Most mutations without effect.
53
Upstream element properties in euks
UE properties: role in efficiency, not accuracy; constitutive OR inducible; position+ spacing variable; usually within 200bp of TSS; orientation independent; 1 or more copies; 6-10bp+ palindromic; response elements; multiple types; combinations vary (patchwork arrangement).
54
Enhancers
Enhancers= cis elements that v. strongly activate from 10-100s kbps away, either orientation up/downstream of gene. Originally ID’d by enhancer trap assay (reporter gene downstream of core promoter and UEs with candidate enhancer fragments clones downstream in both orientation). E.g., muscle specific enhancer in MYL1 increased CAT (reporter) activity only in differentiated myotubes but not myoblasts (undifferentiated). Enhancers generally highly conserved, ~100bp, contain array of many closely spaced TF binding sites w/ only few individual sites vital but mutations in most positions having effect, constitutive or tissue specific, transcribed into small eRNAs. About 1-5 per human gene. Activating TFs: modular, discrete domains for DNA binding, dimerization, activation and regulation (e.g., steroid hormone binding. E.g., LexA in yeast.
55
DNA binding domains (4 e.g.s, general properties)
many classes, seq-specificity often via alpha-helix docking into major groove. * Homeodomain: HTH (less variable in euk than bacteria), H binds from sidechains in helix 3 to bps in major groove. Often act as heterodimers * Zn fingers: beta-beta-alpha structure; Zn2+ tetrahedrally coordinated between Cys (beta) and His (alpha), stabilises structure. Each Zn finger can recognise 3-4bp, binding sites usually not palindromic. * Basic/leucine zipper: 4-5Leu 7aa apart form coil w/ aliphatic Leu from each subunit that stick together (zip). Have dimerization region and DNA binding region- basic electrostatic interaction w/ backbone docks N-terminal extension helix to major groove, seq-specific contacts. (Inhibitor dimerization) * Basic Helix-loop-helix. Like Leu zipper but each subunit has 2 helices separated by loop. Homo or heterodimerise. E.g., MyoD recognises 6bp E-box. NB: some DNA binding domains bind specific seq within nucleosome (“pioneer” TFs) Subunits of dimers cooperate to bind DNA (more points of contact add stability). Palindromic recognition+ clamping over DNA strand during assembly. Different combinations of homo/heterodimers increase complexity of regulatory ctrl, allowing response to wide range of conditions. Base flipping and H bonding interactions not sufficient, so shape of DNA used (melting, stretching, compressing, base stacking etc w/ conformational change in proteins)
56
Activation domains in eukaryotes
Activation domains: intrinsically disordered but conformation can be important (exposed, collapsed, folded+ bound). Deletion impairs activation but not DNA binding; fusion to heterologous DNA binding domain confers activation. Numerous targets (not Pol II directly), including nucleosome modifying factors (HATs de-compact chromatin+ facilitate binding by bromodomain), nucleosome remodelling factors consume ATP, increase DNA mobility, can help expose sites to TFs/transcription machinery. Mediator subunits+ TFIID TAFs (co-activators, not essential for basal transcription), TFIIB interact with activation domains. Activators recruit core transcription machinery. Multiple activator targets-> functional synergy of multiple TF binding elements. Repressor modes of action similar. This type of ctrl may allow more rapid+ synchronous transcriptional ctrl+ lower transcriptional “background noise”.
57
Models of activator binding to remote sites in euk transcription
Models of activator binding to remote sites: Bind DNA+ scan along to promoter; DNA conformational changes transmitted to promoter; direct interaction of promoter, activator and enhancer with intervening DNA forming a loop. In vitro experiment with SV40 enhancer/ beta globin promoter on separate biotin labelled DNA fragments showed result consistent with looping.
58
3C experiments and the looping model
Chromatin conformation recapture (3C) experiments: formaldehyde crosslink DNA and proteins in live cells, isolate DNA, digest with restriction enzyme close to enhancer and promoter, ligate at high dilution (only fragments held together by crosslinks ligate), reverse crosslinks, isolate DNS, q-PCR detect and quantitate products where chromatin loops. This shows loop between beta-globin control region and its active promoters- 3C signals between expressed globin genes and DNase hyper-sensitive sites were very high in liver, where it is expressed (but not in the brain where it is not expressed), suggesting active promoter and all control regions form an “active chromatin hub” with intervening DNA looped out. Live imaging of fluorescently labelled genomic loci also shows enhancers and promoters co-localise. Reality a little more complex than looping model. Typical enhancer has multiple activator binding site. TFs in vivo are required for basal transcription in chromatin, regulated transcription and specificity.