Organisation of the Human Genome Flashcards
(30 cards)
What is the human genome made up of?
3 billion base pairs.
23 pairs of linear chromosomes.
And mitochondrial genome = 16,569 base pairs, circular DNA
What is a way to analyse a genome
DNA Melt-Reassociation study.
Denature DNA = single-stranded DNA fragments.
Rapid = highly repeated
Intermediate = moderately repeated
Slow = unique
= reannealed double stranded DNA fragments
What are the different elements (+%) of the genome?
Repetitive DNA including transposable elements = 44%
Introns / regulatory sequences = 24%
Unique noncoding DNA = 15%
Repetitive DNA unrelated to transposable elements = 15%
Exons (protein coding) = 1.5%
How is the eukaryotic DNA sequence organised?
Single copies.
Gene families.
Tandem gene arrays.
Intermediate repeats (transposable elements).
Simple sequence repetitive DNA.
What does single copy DNA do?
= Protein coding genes.
=25% of genome (exons only 1%)
= average gene is 27kb with 9 exons
What is the largest gene in the human genome?
DMD (dystrophin) gene
over 2 million base pairs on X chromosome
main mRNA = 14 kb containing 79 exons
What is one of the smallest genes in the human genome?
IFNA6 (Interferon alpha 6) gene
around 1.5kb on chromosome 9
is a single exon mRNA (no introns)
What does non-protein coding single copy DNA do?
24% of genome = intron
15% of genome = single copy NOT part of protein-coding gene
function = most is transcribed to RNA
What is the function of non-coding DNA?
Most of the genome can be transcribed.
22,219 non-coding genes.
What are examples of structural RNAs?
rRNAs, tRNAs, snRNAs
What are microRNAs (miRNAs) involved in?
Gene regulation.
Roles in development.
Can cause cancer when dysregulated.
What do long non-coding (lnc) RNAs do?
Some are known to be functional.
e.g. Xist:
Can target regulatory proteins.
Can be disease markers - DD3/PCA3 prostate cancer
Possible causative agents in disease - BACE1
What are some examples of human gene families? (+number of genes)?
alpha-globulins = 4
beta-globulins = 5
actin = 15
keratin type 1 = 19
beta-tubulin = 19
alpha-tubulin = 10
What is a pseudogene?
A non functional, inactive copy of a gene
What are the 2 classes of eukaryotic transposable elements?
Retrotransposons. (retroposons).
DNA-DNA transposable elements.
What is the role of eukaryotic transposable elements in genome evolution?
They are a source of regulatory elements, sites of recombination.
Insertion can cause disease.
What do retrotransposons do?
Transpose via an RNA intermediate.
Can be viral:
retrovirus like = endogenous retrovirus
LINE-like = LINE1 + LINE2
Can be non-viral:
SINEs
Processed pseudogenes
What do DNA-DNA transposable elements do?
Transpose directly from DNA to DNA.
Similar to bacterial transposons.
(NOT active in human genome).
What is the transposable element composition of the human genome? (30%)
Non - mobile element (ME) sequence = 33%
ME + repeat remnants = 21%
LINE-1 = 17.6%
LINE-2 = 3.5%
Alu = 10.7%
SVA = 0.1%
(H)ERV = 8.9%
DNA = 3.4%
Other MEs = 0.7%
Non-ME repeats = 2%
What is the retrovirus / retrotransposon life cycle?
What is the structure of viral retrotransposons?
= 250-260 bp
What is the structure of the LINE-1 element (L1)?
Over 500,000 copies in human genome.
1-6 kb in length.
Only 40-50 are active.
Has 2 open reading frames:
ORF1 = 1137 bp - homology to gag
ORF2 = 3900 bp - homology to pol
No LTRs
What is the timing and tissue specificity of L1 transposition
Mostly repressed (methylation).
Demethylation and increased transposition in tumours.
Germ cells (many unique new insertions).
Early embryos (somatic cells).
Neural progenitor cells during childhood.
Each human is unique - can be neuronal differences.
What are non-viral elements?
A class of retrotransposons.
SINEs (13% of genome)
= genomic copies of small RNAs
= most belong to Alu family (7SL RNA)
= also copies of snRNAs and tRNAs
Processed pseudogenes
= genomic copies of mRNAs