Genome Structure Flashcards
Describe the basic structure of DNA
• DNA is deoxyribonucleic acid
• It is a macromolecule consisting of a linear strand of nucleotides
• Single linear strands bind to complementary strands to form double-stranded DNA
Two antiparallel strands of DNA
Bases “stacked”
Two grooves
• Major
• Minor
Describe the charge of a DNA molecule
Negatively charged because of the phosphates
What type of molecule is a single strand of DNA
linear macromolecule
How many base pairs is the human genome and how many genes are there in a human?
- Human genome is 3 x 109 base pairs – 3Gbp
* It contains ~20 000 genes.
What is the main issue with DNA?
DNA is very lengthy:
There is around 2m of DNA in a nucleated cell
• 37.2 trillion cells in your body
• That is 7.44x1013 metres of DNA
So how do we fit DNA into nucleated cells?
The solution is Histones
Describe the charge of histones
How many histones form a nucleosome?
What binds the linker DNA
Basic positively charged that binds to negative DNA
Eight histones 2x(H2A+H2B+H3+H4) form the nucleosome
Histone 1
Have a look at the image of X-ray crystallography of DNA around histones
On image
What are the phases of DNA packing?
- DNA double helix
- Nucleosomes
- Chromatin fibre
- Extended section of chromosome
- Loops of chromatin fibre
- Metaphase chromosome
Descirbe the structure of a chromosome
On image
What is a human karotype and what does it show?
- Stained chromosomes, nucleus in metaphase
* Banding patterns
Define genome (3)
- The primary DNA sequence encodes all the gene products necessary for a human
- The primary DNA sequence also includes a large number of regulatory signals
- Much of the DNA sequence does not have an assigned function as yet
Define exon (there are two)
• The exome is made up of gene sequences
Coding regions of DNA
• Some definitions use all of the coding sequences (~37 Mbp – 1.2% of genome)
• Some definitions use all of the gene sequences (~60Mbp – 2% of genome)
Define gene and describe the structure of a gene
- All of the DNA that is transcribed into RNA plus all of the cis-linked (local) control regions that are required to ensure quantitatively appropriate tissue-specific expression of the final protein
- It is NOT just the bits that encode the final protein, regulation of the gene is very important.
On image
Where are the size of the genes globin and dystrophin ?
globin = 1.8kb, dystrophin = 2.4Mb
What are intergenic regions?
Intergenic regions contain sequences of no known function, such as repetitive DNA, endogenous retroviruses, pseudogenes. They may contain many regulatory elements.
How do genes tend to exist and what does this allow?
Genes often cluster in families – e.g. globin clusters
- allows for co-ordinated gene regulation
- may just reflect evolutionary history
What is an intron?
How many are there and what is the size of them?
Non-coding regions of DNA
- Vary in number – from 0 to at least 311
- Vary in size - 30bp to 1Mbp
Describe the structure of a gene
On image
What are the two major regions that the promoter region contains and what are there functions?
- Regulatory region - needed to regulate the recruitment of RNA polymerase 2
- TATA box - needed to recruit general transcription factors and RNA polymerase
3 general functions of the promoter region
- Promoters recruit RNA polymerase to a DNA template
- RNA polymerase binds asymmetrically and can only move 5’ to 3’
- Regulation occurs via transcription factors
What are the purpose of enhancers?
• Enhancers upregulate gene expression – they are short sequences that can be in the gene or many kilobases distant. They are targets for transcription factors (activators).
What are the purpose of silencers?
• Silencers downregulate gene expression. They are also position-independent and are also targets for transcription factors (repressors).
What are insulators?
• Insulators are short sequences that act to prevent enhancers/silencers influencing other genes