Human Genome Flashcards

(31 cards)

1
Q

How much of the genome encodes for proteins ?

A

5%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are Isochores ?

A

Large DNA segments (>300 kb) which are characterised by an internal variation in GC content

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Describe CpG Islands

A
  • Cytosine base followed by a guanine base is rare In vetebrate DNA
  • Cytosines following guanines tend to be methylated - the methylation state of these CpG islands can regulates the expression of the genes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Describe LINE retroansposon

A
  • repetitive element in genome
  • retro = going through an RNA intermediate
  • LINE = Long Interspersed Elements
  • complete sequence is 6000-8000 bp long
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe a SINE retrotransposon

A
  • SINE = Short Interspersed Elements
  • its a parasite’s parasite –> depends on LINE for its propagation
  • Alu elements are most abundant in humans - 300bp long
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How much of the genome are interspersed repeats ?

A

46%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the 4 repetitive elements found in the human genome ?

A
  1. LINE retrotransposon
  2. SINE retrotransposon
  3. Retroviruses
  4. DNA transposon
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are Alu Elements ?

A
  • only found in primates
  • can be sorted into distinct families according to shared patterns of variation
  • only one or several Alu “master copies” are capable of transposing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are flanking regions ?

A
  • consist of ‘unordered’ DNA
  • occur on each side of the repeat unit
  • critical because they allow for the development of locus-specific primers to amplify the microsatellites with PCR
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why are repetitive elements bad?

A
  • repetitive elements waste energy
  • insertion of of these repeats can be harmful
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why do repetitive elements still exist ?

A
  • generation & deletion of repeats have reached an equilibrium
  • mammalian genomes can tolerate them as they’ve developed mechanisms to control them - eg histone modification
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Describe a duplicated pseudogene

A
  • created from tandem duplication or unequal-crossover
  • segment duplication is prevalent
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a multigene family ?

A

groups of genes from the same organism that encode proteins with a similar sequence either over their full length or limited to a specific domain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What results in a multigene family ?

A
  • DNA duplications that involve 1 or more genes generate gene pairs
  • if both copies are maintained in subsequent generations then a multigene family will exist in the genome
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are histone proteins?

A
  • globular in shape
  • 5 members in the family = H1, H2A. H2B, H3 and H4
  • the family members are closely related but not identical in amino acid sequences
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Describe the structure of histone proteins

A
  • octamer 2x H2A, H2B, H3 & H4
  • H1 holds the structure together
  • the globular protein contain tails which are rich in amino acids which have positively charged R-groups
16
Q

Describe epigenetics

A
  • phenotype changes in a cell/organism
  • not the result of nucleotide changes
  • brought about by chemical changes to DNA (CpG) methylation and histones
17
Q

What is the first distinct mechanism caused by histone modifications and how does it affect chromosome function ?

A

modifications may alter the electrostatic charge of the histone resulting in a structural change in histones or their binding to DNA

18
Q

What is the second distinct mechanism caused by histone modifications and how does it affect chromosome function ?

A

modifications are binding sites for protein recognition modules, such as chrodomains that recognise acetylated lysines or methylated lysine

19
Q

What are 2 problems in genome sequencing ?

A
  1. genomes are incredibly large
  2. you need a lot of DNA to get a sequence
20
Q

What are the solutions to the problems of genome sequencing ?

A
  1. cut the DNA into smaller pieces then put it back together
  2. make lots of copies of each bit
21
Q

Describe PCR

A
  • Polymerase Chain Reaction
  • enables large amounts of DNA to be produced from very small/very complex samples
  • uses 2 primers & DNA polymerase
22
Q

What does PCR allow?

A
  • selecting the region of study
  • producing millions of copies of it
23
Q

Describe Sanger sequencing

A
  • similar to PCR
  • uses only a single primer and polymerase to make new single stranded DNA pieces
24
What is used to reduce the size of DNA in order to sequence it ?
- restriction enzymes - they recognise and cut specific sequences - there are hundreds of restriction enzymes are available
25
What is BAC ?
- Bacterial Artificial Chromosomes - after the use of restriction enzymes there is a soup of DNA fragments - we use the BAC to amplify and identify each fragment
26
What are the 2 purposes of the BAC libraries?
1. separates fragments so they can be sequenced individually 2. allows the production of lots of DNA by growing up lots of bacteria containing identical BACs or plasmids
27
Why is BAC library not suitable for genome sequencing on a large scale ?
- 20,000 different BAC clones are needed to contain the 3 billion pairs of bases in the genome - each inserts 150,000-200,000 base pairs - minimum of 1.5 million sub-clones needed
28
what are the simplified steps of DNA sample preparation ?
1. Extract DNA 2. Randomly Shatter (sonifcation) 3. Attach adapter sequence
29
Describe Illumina Next-Generation Sequencing
- most popular method - produces millions of reads - reads can be paired-end - very good for alignment
30
Describe DNA Barcoding
- barcoded = tagged with a short known sequence - allows multiple samples to be put on the same run - samples can be computationally isolated