DNA & Genome Structure Flashcards

1
Q

Eukaryotic DNA structure - recap

A
  • Hierarchical:
    o Double helix DNA slit into 2 strands w/genetic information
    o Coiled up into double helix →-ve backbone interacts w/ +ve histones
    o Histones coil up into coils → form coiled coils → form supercoils
  • Huge amount of DNA into v. small space
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Bacterial DNA structure -recap

A
  • Less coiled than eukaryotic
  • Highly organised supercoiled circular nucleoids
  • Some circular; some linear
  • Have scaffolding proteins e.g. Histone-like nucleoid structuring protein (H-NS)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

DNA synthesis – recap

A
  • Synthesis of polynucleotide chain by DNA polymerase
    o Requires ATP hydrolysis
    o Mg2+
  • Nucleophilic attack by 3’-OH on growing chain → hydrolyses incoming base → form phosphodiester bonds
  • Polymerase always works 5’ –> 3’ → DNA strands antiparallel
  • Endonuclease = cut into DNA → makes nick
  • Exonuclease = digest DNA from open ends
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Stability of DNA

A
  • DNA carries genetic information from one generation to next → must be stable over many lifetimes
  • DNA is susceptible to damage from environmental mutagens (e.g. smoke, chemicals, radiation)
    o ~10,000x per cell per day
    o Leads to disease e.g. cancer
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

DNA Damage

A
  • Can block replication/transcription
  • Can cause alteration in genetic code (mutation)

Causes:
1. Chemical alteration to DNA
o Exogenous – environmental mutants e.g. UV radiation
o Endogenous – internally generated damaging agents e.g. hydroxyl radicals
2. Spontaneous damage to DNA
o e.g. deamination, depurination

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Examples of DNA damage induced by exogenous agents

A
  1. UV light causes pyrimidine dimers (most common: T-T)
    o 2 adjacent pyrimidines covalently joined by cyclobutene ring structure
    o Can lead to skin cancer
  2. Alkylation in wrong place of DNA
    o Alkylation = addition of methyl/ethyl groups to various positions on DNA bases
    o E.g. alkylation of the O6 position of guanine → O6-methylguanine → changes Watson-Crick base paring potential of guanine
  3. Carcinogens (e.g. benzo-pyrenes)
    o Reactive bulky groups added to various positions within DNA bases
    o Often have to be activated by cellular enzymes e.g. cytochrome P450 (catabolic enzymes) try to break down foreign bulky chemicals but create reactive intermediates then react w/DNA
    o Cytochrome P450 highly relevant for drug metabolism → they break down into many products for foreign chemicals like drugs
    o Liver packed with cytochrome P450
    o Bulky groups prevent DNA polymerase moving properly through DNA
    o Can potentially change Watson-Crick base pairing potential → when DNA replicates will incorporate wrong base
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Spontaneous damage to DNA

A
  • Deamination of adenine, cytosine and guanine
    o Taking off amine off DNA base
    o E.g. NH2 of cytosine replaced by O → uracil (same base pair potential)
    o E.g. NH2 of adenine replaced by O → hypoxanthine (different base pair potential)
    o Hypoxanthine has C=O and NH that can H-bond (looks like G) → has H-bond acceptor which forms 2 H-bonds w/cytosine
    o Just losing one amine group spontaneously → DNA replicates telling cell to add C instead of T at position of A → creates permanent mutation in DNA → transfers down through generations
  • Depurination
    o Removing purine (A and G) → cleaving bond between purine and deoxyribose
    o Forms apurinic (AP) site in DNA → replaced entire DNA base w/OH → DNA loses entire coding potential → information lost from the cell
    o “Silver soup tureen → purine → AG”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

2 general types of DNA repair mechanisms

A
  1. Direct reversal
    o Chemical reaction responsible for DNA damage is reversed
    o Usually needs specific enzymes for specific kinds of DNA damage
    o More specialised repair mechanism
    Example: Repair of pyrimidine dimers caused by UV exposure
    o Photoreactivation → uses photo-reactivating/photo-lyase enzyme
    o Uses visible light to break cyclobutene ring
    o Enzymes found in bacteria, yeasts, some plants/animals but not in humans
  2. Excision repair
    o Remove DNA damage → single bases or stretches of DNA → replace w/new synth DNA
    o More common and most important in humans than direct reversal
    Example: alkylation → methylation of guanine → O6-methylguanine base pairs with thymine
    o Repaired by enzyme: O6-methylguanine methyltransferase
    o Has reactive cysteine in active site → reacts w/methyl group to form cov. bond (S-CH3)
    o Reaction is not fully enzymatic → cell needs to reduce S-CH3 back to Cys to be reused
    o Common enzyme because widespread in prokaryotes and eukaryotes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Mechanisms of Excision Repair

A
  1. Base-excision repair
    o Base removed leaving deoxyribose backbone intact
  2. Nucleotide-excision repair
    o Nucleotide removed → gap in one strand (oligonucleotide is usually removed)
  3. Mismatch repair
    o Repair of post-replicative mismatches
    o After DNA synthesis → repairs errors of DNA Polymerase
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Base-excision repair

A
  • Start w/DNA w/lesion e.g. G:U → no H-bond correctly → bulge in DNA (U by deamination of C)
  • Cell detects lesion using uracil DNA glycosylase → recognizes uracil
  • Glycosylase cleaves uracil-deoxyribose bond → leaves AP site → gap in DNA
  • AP endonuclease recognizes AP site → cleaves DNA chain hydrolysing phosphodiester backbone → remaining deoxyribose removed by deoxyribose-phosphodiesterase
  • Resulting gap filled by DNA Polymerase → sealed by ligase → incorporates C opposite G
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Nucleotide excision repair (NER)

A
  • Process discovered because of disease such as…
  • Xeroderma Pigmentosum → genes involved in repair pathway are damaged → T-T cannot be removed from cells quickly → cannot repair UV damage → damage does heal
  • NER is major mechanism to repair thymidine dimers in humans
  • Bulge in DNA recognised and cleaved on both sides of T-T dimer by 3’ and 5’ endonucleases
  • Helicase unwinds DNA resulting in excision of oligonucleotide containing damaged bases
  • Resulting gap filled by DNA polymerase from 5’ –> 3’ → sealed by ligase
    o In E. coli DNA Pol 1; in humans DNA Pol ß
  • In E. coli:
    o Catalysed by 3 gene products (uvrA, B, C)
    o Mutations of genes leads to high sensitivity to UV
    o UvrA recognises damaged DNA
    o UvrB/UvrC endonucleases cleave at 3’ and 5’ sides; helicase activity excises oligonucleotide
  • In eukaryotes:
    o Catalysed by RAD (radiation damage) gene products in yeast
    o Genes identified in humans with Xeroderma Pigmentosum (rare genetic disease affecting ¼ mill people)
    o 7 different repair genes involved → v. highly conserved → important in maintaining genomic integrity because of all the time exposed to UV
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mismatch repair in E. coli

A
  • Mismatch repair system detects and excises mismatched bases in newly replicated DNA
  • Must distinguish parental strand from newly synthesised daughter strand
  • DNA in E. coli is methylated by Dam methylase
  • Following replication new daughter strand will not be methylated at dam sites → DNA is hemi-methylated
  • Methylases also protect bacteria from own restriction enzymes
    o Foreign DNA from phage viruses infect bacteria → not methylated → cut up by restriction endonucleases → methylated DNA not cut
  • Once cell knows there’s old strand and new strand because of hemi-methylated DNA → mutHLS repair system removes lesion on new strand
    o MutS recognizes mismatch
    o MutL inly binds MutS at mismatches (ATP hydrolysis, forms DNA loops, translocates along DNA looking for hemi-methylated dam site) → either remains bound to mismatches or migrates away translocating in both directions
    o MutH endonuclease activated when bound to MutL → cleaves unmodified strand opposite site of hemi-methylation (GATC) → can discriminate newly synthesised DNA → discrimination does not require complex to be bound at mismatch site (presence of complex enough to signal mismatch is present)
    o Random process → mistake can happen anywhere
    o Depending on orientation nick can be upstream/downstream of mismatch
    o Different exonucleases required depending on polarity (have different 5’/3’ ends)
    o In one direction use exonuclease 7 (recJ); other direction use exonuclease 1 → end result is the same regardless of orientation but exonuclease digests single-stranded DNA until past lesion sites so lesion removed in new DNA → old strand still contains correct DNA base → can use DNA Pol and ligases to fill in gap and seal final nick by ligating P to 3’-OH
    o Extra components (UvrD helicase) to dissociate strands
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Mismatch repair in mammals

A
  • Recognition mechanism based on fact that DNA in eukaryotes has lots of strand breaks → DNA replication is semi-discontinuous
    o Cell differentiates between old DNA (no gaps) and new DNA (gaps)
  • Eukaryotic DNA contains many replicons that occur in different strands → end up w/Okazaki fragments on both strands
  • Cell can consistently detect mistakes in new DNA
  • Analogous to E. coli mismatch repair
    o Instead of MutHLS system → MHS complex
  • Method:
    o Lesion in DNA caused by newly-replicated DNA
    o Enzyme complex recognises old and new strand and binds in particular orientation
    o Helicase and exonuclease nicking either side of the lesion
    o Exonuclease digests oligonucleotide stretch
    o DNA polymerase fills gap and DNA ligase seals gap
  • In humans, mutations in hMsh2 and hMsh1 genes are cause of inherited non-polyposis colorectal cancer:
    o Affects 1:200
    o Causes ~15% of UK colorectal cancers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Double strand break

A
  • DNA broken to leave 2 free open stranded ends
  • Can occur due to:
    o DNA damage
    o DNA lesions prevent processivity of DNA Polymerase during DNA synthesis
    o DNA nicks by endonuclease damage
    o Ionising radiation (e.g. X-ray damage)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Non-homologous end joining (NHEJ)

A
  • Simplest mechanism; found in eukaryotes
  • Take broken ends ligated by enzymes (5’P joined to 3’OH)
  • Error prone process → often introduces mutations into DNA
    o Complicated because sometimes can have unwinding/overlapping at broken ends → deletion or addition base pairs
    o If you have microhomology, can predict indels within DNA as consequence
  • Exploited by modern genome editing tech (e.g. CRISPR/Cas9) for targeted mutations/knockouts
    o When guiding CRISPR/Cas9 to make ds break in genome, depending on chosen sequence at the ends, can increase/decrease favourability of having NHEJ or introduction if indels
    o Useful because if pick sequence likely to have microhomology and lead to deletion → v. efficient way of knocking out target gene on genome by changing reading frame, introducing stop codons, etc
  • Associated with recombination of variable regions in antibodies within human immune system
    o Generates more diversity within antibody regions
  • Only found in certain bacteria → homologous to human machinery
    o Kill microplasma bacterium by cutting genome because cannot repair properly by NHEJ
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Homologous recombination

A
  • General mechanism for repair where intramolecular template information has been lost
    o Uses information from paired DNA to repair broken strands → stimulated by ds breaks
    o Also repairs nicked DNA → not always fully ds breaks
  • Relative error-free process
  • Requires homologous DNA as template
    o In diploid organisms: pairs of chromosomes → every gene has 2 alleles → natural source of homologous DNA
    o In haploid organisms: homologous DNA template readily found at replication forks
  • When making ds breaks in biotech can provide own homologous template → can have mutations as long as ends are 100% homologous
    o Exploited by modern genome editing technology (e.g. CRISPR/Cas9) to make site-targeted gene repair, integration or modification
  • Basic process:
    1. Break two homologous DNA pairs
    2. Pair strands by Watson-Crick base pairing
    3. Reform phosphodiester bonds so new strands are crossed over and joined together
    4. Break the strands and reform again
    5. Any gaps filled in by DNA polymerase and ligase
  • Intermolecular HR:
    o Double crossover events common in meiosis
    o Generate diversity in eggs/sperms for gene swapping over
    o Source of variation for natural selection and evolution in longer term
  • Intramolecular HR:
    o Small stretches of DNA recombine between themselves within same stretch of DNA
    o Direct repeats → same orientation → cuts out DNA from longer stretch and join what was on either side of that region
    o Inverted repeats → pointing towards each other → after recombination, flip orientation of DNA stretch within a longer region
    o Evolution: can change orientation of regulatory promoter regions, enhancers, genes, etc thus change overall gene expression pattern of a cell
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

E. coli RecBCD recombination pathway

A
  • Provides model for initial steps of homologous recombination
  • Orientation of DNA 3’-OH and 5’-P ends → geometrical constraint
  • Exploited by bacterial genome editing technology with oligonucleotides → “recombineering”
  1. Ds break in DNA → damage causes initiation of HR
  2. RecBCD uwinds DNA and degrades one strand exposing 3’ strand
  3. 3’ strand bound by RecA to form a filament → stabilise it and allow strand invasion
  4. Ss RecA filament invades homologous strand and pairs up
  5. RecA comes off → start of crossover
  6. Have a d-loop in homologous strand (bulge of unpaired ssDNA)
  7. To complete formation of crossover → break second DNA → nicking
  8. End up w/ fully crossed over DNA → strand exchange of homologue
  9. Gaps filled by DNA polymerase
  10. Once reaches 5’-end → ligation to repair phosphodiester backbone
  11. DNA strands connected by covalent linkage through the molecule and crossed over in the middle (holliday junction)
  12. Resolution: get rid of crossover → break DNA and ligate again → final resolved products
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Homology search and strand invasion

A
  • Precise details of homology search not known
  • RecA filament formation is essential for strand invasion
  • Helical nature of RecA filament can form triplex structure w/homologus DNA duplex
    o Start w/ ssDNA coated by RecA protein → 3 strands of DNA come together during invasion: invading ss and 2 strands unwound to form d-loop
    o One original DNA strand displaced by invading strand
  • ATP required to drive process over longer lengths of DNA
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Holiday junction and recombination

A
  • Found by Robin Holliday in 1964
  • Following strand invasion, invaded duplex must be nicked using specific nucleases
  • Gaps on strand must be resealed using ligase
  • Holliday junction forms:
    o Drives swap of genetic information by moving randomly using RuvA+RuvB
    o Carries part of one chromosome/DNA template along w/them
    o Depending on how much it moves, determines amount of genetic information swapped
  • RuvA forms flat structure which protects crossover and stabilises it
    o Contains hydrophobic ‘pin’ in the middle helps separate strands → machinery slides along and swaps DNA
  • 2x RuvB motors bind either side of RuvA and use ATP to translocate DNA
  • KEY: Homologous strands are similar but do not have to be identical
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Resolution

A
  • After branch migration, resolve crossover to break Holliday junction and repair DNA to restore 2 original DNA strands
  • Resolution occurs in 2 different geometries
    o Cut horizontally (1,2) → patch
    o Vertically (3,4) → splice (have hybridised DNA between 2 strands)
    o Different outcomes in downstream products of recombination
  • After replication → segregation of different half-strands dividing → compDNA synthesised on either side of them
    o Leads to variations in inheritance of potential gene sequence in daughter cells
    o Can get different kinds of swapping over/variation (shown by diagram)
21
Q

What determines orientation of swapping over?

A
  • Position of RuvC (nuclease) binding and cleavage delineates outcome
  • Not known how this is coordinated between patch and splice variations
22
Q

Processes involving Homologous Recombination

A
  • Meiosis → gene shuffling → variation in offspring from sexual reproduction
    o Scrambles genes of maternal/paternal chromosomes leading to non-parental combinations
  • Forms physical links between homologous chromosomes to allow chromosome alignment during meiotic prophase
  • Evolution (viral, bacterial, etc) → horizontal gene transfer
  • Important in DNA repair
    o Caveat: always need to have clean/unbroken DNA to repair broken DNA
  • Exploited in biotechnology: genome editing with CRISPR/Cas (HDR: homology-directed repair)
23
Q

Meiotic recombination in yeast

A
  • Same pathway as single-cross over model from E. coli
    o But get 2 strand invasions
    o Also get filament stabilising protein (similar to RecA)
  • Double cross-over model → 2 Holliday junctions
    o 3’-overhangs invade d-loops in homologous DNA twice
  • Get branch migration of both cross-over events
    o After resolution have patchwork of recombinants
24
Q

Random outcome for swapped DNA

A
  • Potential mechanisms coming in ie. mismatch repair
    o Drive different heritable genetic traits to daughter cells
  • E.g. recombination between chromosomes w/2 different alleles
    o Get hybrids with mixture of alleles
    o Could be SNPs → now mismatched → have bulges in DNA
    o If just after synthesis, mismatch repair machinery could restore new strands
  • If not repaired, changes persist → following recombination will be inherited by one daughter cell
25
Q

Site-specific recombination: recombinases (viral and other non-H rec)

A
  • Require shorter homologous DNA than HR
  • Require specific enzymes evolved to recognize short sequences
  • Part of viral life cycles
  • Inverted repeats or tandem repeats leads to inverting genes or chopping out DNA within sequence
    o Important for integration of viruses or removing virus from DNA
26
Q

Integrases (viral and other non-H rec)

A
  • Integrases and recombinases → homologous
  • Important class of transposition enzymes
  • Lambda phage integrase (Int) first to be characterized
    o From virus that infects E. coli
    o Site specific at att sites → attP (phage site) and attB (bacteria site)
    o Int brings together att sites, cuts DNA, crosses over, ligate
  • Different to RecBCD pathway:
    o Happens in small enzyme, not series of different proteins coming together
    o No ATP for energy
    o Enzyme evolved to store high-energy intermediates → Tyr in active site gets phosphorylated and are temporarily bound to backbone of DNA (highly reactive)→ then hydrolyse P to recover Tyr and drive final recombination reaction
    o Fewer components
  • IHF (integration host factor –bacterial protein) and nuclease Xis get hijacked by integrase
  • Do not integrate if in wrong orientation (diagram shows right orientation
27
Q

Mechanism of integrase-class recombinases (viral and other non-H rec)

A
  • Tetramer → symmetry in system
  • DNA ends (5’ & 3’) marked for direction
  • Flips over DNA to create crossover-type systems
  • To complete system have re-joined and created 2 dsDNA that incorporate swapped over DNA
28
Q

Transposable genetic elements (viral and other non-H rec)

A
  • Transposons = mobile genetic elements move randomly along genomes to change composition
  • Do not require sequence homology
  • Simple transposons contain only genes required for their transposition
  • ‘Selfish DNA’
  • Complex transposons contain other genetic information e.g. antibiotic resistance
29
Q

Bacterial genomes

A
  • E. coli
    o v. well organised into operons so all genes for one function are lined up together
    o Practically no gaps
    o Chaperone genes: dnaK, dnaJ
    o Metabolite genes: trNA synthetase
    o Operons are co-regulated  regulated separately from each other
    o Highly evolved because short life cycle (~20 mins)  goes through more generations/ cycles of optimisation and natural selection
30
Q

Human genome

A
  • Human haploid genome 109 bases (E. coli 106)
  • ~20,000 human genes (estimates keep falling; before sequence thought 100,000)
  • C. elegans worm also has ~20,000
  • Many functionally different proteins from one single human gene by making alternative splice variants w/different exons being expressed
  • Small genes hard to locate
    o Defining what is a gene: small regulatory microRNAs important for biology  not coding for proteins?
  • Rarely expressed genes hard to detect
  • Gene density is surprisingly low
  • Non-coding repeats make ~50% of human genome

Physical structure of genome
* Organised into chromosomes
* Studied using metaphase spreads:
o Use stains to visualise DNA; different intensities depending on properties e.g coiling
o Dark bands AT rich
o Light bands GC rich
* Usually can only repeat smaller chromosomes; big ones are toxic

31
Q

Chromosomes numbers vary

A
  • No correlation between complexity and genome size
  • Yeast: haploid S. cerevisae has 12Mbp, 16 linear chromosomes
  • Salamander species: genome 10x bigger than human, only 14 chromosomes
  • Amoeba: 100,000Mbp (polyploid → many copies of homologous chromosomes)
  • Bigger cell = more DNA
  • Prokaryotes → ~3-12Mbp
32
Q

Other DNA structures

A
  • B chromosomes:
    o Extra small chromosomes
    o Originate from autosomes/sex chromosomes in intra-/inter-species crosses
  • Holocentric chromosomes:
    o Entire chromosomes act as centromeres
    o E.g C. elegans
  • Extrachromosomal DNA:
    o Plasmids → circular
    o Organelle DNA (e.g from mitochondria, chloroplast) → idea of endosymbiosis
33
Q

Organelle genomes

A
  • Mitochondrial DNA (all eukaryotes)
    o Covalently closed circular DNA (rarely linear e.g. Chlamydomonas)
    o Only 37 genes encoding for 13 protein open reading frames → rest of machinery made in nucleus
    o ~16 kbp
  • Chloroplast genome
    o ~120-170 kbp
    o Single closed circular DNA (rare exceptions)
    o Codes ~100 proteins → typically for photosynthesis
34
Q

Gene distribution in eukaryotic chromosomes

A
  • Disorganised and uneven
  • Centromeres → gene deserts → lower density
  • Telomeres (ends) → have own repeats (tend to not have genes)
  • Multi-gene families → same gene appearing repeatedly in genome
  • Gene superfamilies → v. big gene families e.g. Zinc fingers (+400 domains) → same gene appears in many places
  • Layout → mixture of elements of randomness & natural selection advantage
35
Q

General trends in gene distribution

A
  • Overall organisation differs between eukaryotes → reflect evolutionary histories of different organisms
  • Gene density possibly lower in more complex eukaryotes
    o Arguments of simple vs complex genomes
    o Sparser gene densities give you more chance for recombination and shuffling events → more repeat densities
36
Q

Human genome

A
  • Gene-rich parts
    o 700kb class III region of MHC
    o Chromosome 6; 60 genes; 1 pseudogene; v.high GC content 54% (41% whole genome)
  • Gene-deserts
    o Defined as 1Mb with no genes
    o 82 deserts identified (3% of genome, 144Mb)
    o 25% genome has patches of 500,000kb w/no genes
    o Largest is 5.1Mb
  • Are they really deserts?
    o Could contain regulatory regions acting as enhancers for other genes
    o Eukaryotes v. often have distal enhancer 50Mb away from particular promoter transcription start site
    o Regions could have many functions we don’t know about
    o Could contain v. big genes in them e.g. muscular dystrophy spans +2Mb → classic cloning methods do not pick that up → not easy to manipulate
    o Identify big genes by using reverse transcriptase of RNA samples of cells (but don’t get full sample of cDNA)
    o Not always trivial to identify exons/introns computationally
  • 62% of human genome = intergenic regions
    o Regions in between genes
    o 2 types: Unique vs repeated
    o Genome-wide repeats ~1400Mb of human genome → DNA transposons, LINEs, SINEs, LTR

Human genome repeats
* Classified by length, tandem (one after each other) or interspersed (random)
* Non-coding repeats make ~50% of human genome

37
Q

Repeated intergenic regions

A
  • Differ in GC content from rest of genome
  • Can be classified into different satellite sequences
  • Can be v. variable; typically, less than 10%
  • Some play structural role (centromeres and telomeres –also protect ends of chromosomes)
  • 5 detailed classes:
    o Transposons derived repeats (45% of genome)
    o Inactive gene copies (processed pseudogenes)
    o Simple sequence repeats (2-5 nucleotides)
    o Segmental duplications (5% of genome) –inter & intra chromosomal
    o Repeated structural sequences (centromeres, telomeres, etc)
38
Q

Classic discovery of repeat regions

A
  • Run CsCl desnity gradients of purified DNA from cells → in centrifuge tube
  • DNA migrating according to size and competition
  • Visualise with staining
  • Main genomic band + small satellite bands
39
Q

Satellite DNA

A

Often found in heterochromatin in centromeres
α-satellite family
171bp repeat unit
β-satellite family
68bp repeat unit; more interspersed; found in pseudogenes
Variable number tandem repeats (VNTRs)
Vary from human to human → useful in DNA fingerprinting
Minisatellites = 10-100bp; form clusters up to 20kb; associated w/structural features (e.g. centromere)
Microsatellites = less than 13bp; ‘simple tandem repeats’; telomeres (e.g. 5’-TTAGGG-3’)
Dinucleotide repeats = v. common forms 140,000 versions in genome of ‘CA’ repeat; 120,000 copies of ‘AA’

40
Q

Modes of satellite formation

A
  • Microsatellites → DNA polymerase slippage
    o Ends of short repeat units are fraying giving single stranded intermediates → then loop out → daughter strand slips back 1 repeating unit → each slippage event add 1 unit → sometimes deletions; mostly insertions
  • Tandem repeats
    o DNA recombination → ‘unequal crossing over’ → not perfectly lined up
41
Q

Tandem repeats show divergence

A
  • V. variable between individuals and generations within an individual
  • E.g. CAG coding repeat in Huntington gene → variations in polyCAG repeats → too many Glu → protein aggregating in brain → neurodegenerative repeat
42
Q

Interspersed repeats

A
  • 4 classes:
    o LINES – long interspersed nuclear elements
    o SINES – short interspersed nuclear elements
    o LTR – long terminal repeat (retrotransposons) → RNA intermediate; similar to viruses
    o DNA transposons → encode for transposases (recombination machines)
43
Q

DNA transposons in maize

A
  • First discovered in maize
  • Barbara McClintock
  • Black/yellow colours → DNA transposons moving around genome
  • Individual kernels pigmentation → transposons move in/out of pigment gene
  • White regions caused by genomes w/transposable elements inside pigment gene → activated → recombines outward looking repeats → excises transposable element → pigment gene restored → pigment colour expressed
  • Small spots: frequent excision late in kernel development
  • Large spot: excision early in kernel development
  • Full white: no excision, autonomous element in genome
  • Full black: element excised, expression restored
44
Q

DNA transposons

A
  • Different to CRE and lambda recombinases → don’t have sequence requirements
    o Target DNA randomly → generate compatible overhangs
    o Sticky end cut and ligated by transposase
    o Transposase requires machinery to repair damage
    o Flanked by direct repeats → can differ from location to location → defined by host genome → drive excision reaction in backwards process
    o “Cut and paste” process at DNA level w/o RNA intermediate
  • Mutagenic → when entering DNA, can disrupt it; source of genetic variation in mutation
  • E.g. mariner transposon
    o 14,000 copies in human genome (2.6 million bp)
    o 14% of insects have it
    o Transposition is old (~50 million years)
45
Q

LINES

A
  • Act via RNA intermediate
  • 5’ and 3’ untranslated regions
  • PolyA region
  • 2 open reading frames
    o 1 codes RNA binding protein; 2 codes for reverse transcriptase (converts own RNA-DNA)
  • Process:
    o Retrotransposon → converted to RNA → exported to cytoplasm → retrotranscribed to DNA → binds RNA binding protein → back to nucleus → jumps in somewhere else in genome (interspersed)
46
Q

SINES

A
  • V. similar mechanism → has polyA integration site
  • Does not have own reverse transcriptase; borrows it from LINES (non-autonomous)
  • Much smaller than lines
  • V. high copy number
  • 14% of human genome
  • Tend to not have own genes
  • Common family = Alu family
    o 1.2 million copies in human genome
    o Generate variation as they jump around
    o Generate repeats → substrate for homologous recombination
47
Q

LTR transposons

A
  • Bigger
  • More complex → look like retrovirus but haven’t propagated
48
Q

HERV (human endogenous retroviruses)

A
  • Look like full retrovirus but inactive
  • Careful: could get fully/partially activated and generate new virus
49
Q

Why so many repeats?

A
  • Bigger genome → metabolic burden → more energy/material to replicate genome
  • Rate of propagation counteracts rate of elimination
  • Continually discovering new roles for non-coding DNA
  • They mechanistically drive gene regulation and evolutionary change
  • ENCODE → nearly all genome is transcribed → maybe have functions yet to be discovered