Genetics Flashcards

(288 cards)

1
Q

What is the difference between heredity and variation?

A

Heredity is where offspring are similar to parents, variation is where offspring are different from their parents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a geno/phenotype?

A

Genetics relies on a link between them

Phenotype: discernible properties of an individual

Genotype: genetic information influencing the properties

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the three types of genetics?

A

Molecular/developmental genetics - transmitted from DNA to affect cell function and phenotypes

Transmission genetics: from parent to offspring

Population/evolutionary genetics: transmitted over many generations within large populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does the genetic material differ between eukaryotic and prokaryotic cells?

A

Eu: membrane bound nucleus with linear chromosomes composed of chromatin (DNA + protein) - multi/unicellular

Pro: no nucleus, genes usually on single, circular chromosome - unicellular

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is chromatin?

A

DNA + histones = chromatin

Histones are highly conserved proteins - DNA wraps around to form compact structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the structure of a linear chromosome?

A

Telomeres: end which protects them from degradation

Centromere: specialised region that acts as a site for kinetochores to attach

p arm: ‘petite’ - shorter arm

q arm: longer arm

Genes are found at locus (loci)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What do modifications to chromatin structures do?

A

EPIGENETICS

Alter gene activity without changing DNA sequence

Euchromatin = loosely packed chromatin - readily transcribed

Heterochromatin = condensed - no expression of genes

Controlled by addition of methyl and acetyl groups to histones

Heritable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the difference between haploid and dipoid?

A

Diploid: 2n, 2 homologous chromosomes with the same genes but different alleles - e.g. human nucleus

Haploid: n, 1 copy of each homologous chromosome - gametes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Are all eukaryotic cells diploid?

A

No, not all!

e.g. Saccharomyces cerevisiae spends a large part of its life in haploid state

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are genes?

A

Unit of hereditary information that occupies a fixed position (locus) on a chromosome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a typical eukaryotic protein-coding gene?

A

Upstream (5’) flanking region:
Enhancer region
Promotor region

Transcriptional unit (5’ non-coding sequence, introns, exons, 3’ non coding sequence)

Downstream (3’ flanking region)
Enhancer region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are alleles?

A

Different versions of a gene

Alleles can differ by one-hundreds of nucleotides

AA/aa - homozygous - homozygote organisms

Aa - heterozygotes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is cytogenetics?

A

The study of chromosomes

We can use G-banding - stain chromosomes with Giemsa

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a karyotype?

A

The chromosome complement (set) of an individual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the typical human karyotype?

A

Autosomes - 1-22
Sex chromosomes - X,Y

XX - female
XY - male

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is cytoplasmic inheritance?

A

There is some DNA in mitochondria and chloroplasts (originate from ancient endosymbiotic events)

Generally means maternal inheritance as the egg has a large volume of cytoplasm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is a clone?

A

Population of genetically identical single-celled organisms derived from a single ancestral cell

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What comes from a zygote?

A

Multicellular organism made of mostly genetically identical cells derived from a single fertilised egg (zygote)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Do all the cells in a multicellular organism contain the same genes?

A

Mostly

Some exceptions e.g. immune cells

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is binary fission in bacteria?

A

1) DNA duplicates - cell gets bigger
2) Replicated chromosomes move apart
3) A protein called FtsZ marks the middle
4) New cell wall
5) Two daughter cells

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the structure of the cell cycle?

A

Interphase: main part
- G1 - cell size increases, ribosomes/RNA produced, preparation for DNA replication
- S - DNA synthesised
- G2 - cell checks fidelity of DNA, prepares for nuclear division

Mitosis: cell division
- prophase, prometaphase, metaphase, anaphase, telophase
- cytokinesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is G0?

A

Cells are inactive/ not dividing

‘quiescent’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What happens to the amount of DNA in mitosis?

A

Double stranded - semiconservative replication in S phase - 2 sister chromatids - chromatids segregate - same number as beginning

Each chromosome exists in one or two copies at different stages of the cell cycle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is interphase?

A

G1 + S + G2 - most of the time of the cell cycle

Start of mitosis - DNA amount is twice what it usually is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is prophase?
Chromosomes condense - now visible Each chromosome comprises 2 chromatids and a single centrosome Sister chromatids are help by a protein called cohesin Later in prophase - nuclear membrane breaks down and chromosomes start to move towards equator - prometaphase
26
What is the mitotic spindle?
Cytoskeletal structure made of microtubules - polymers of small tubulin protein subunits Job - separate sister chromatids This happens due to microtubules getting shorter
27
What are centromeres and kinetochores?
Centromeres: specialised chromosome regions which direct equal segregation of chromosomes - can be found at any point Defined by specific epigenetic markers Sister chromatids are joined by a centromere Centromeres are connected to microtubules by a large protein complex - kinetochore
28
What is metaphase?
Centromeres align at spindle equator - midway of poles Microtubules attaching to each pole - tension between these keeps chromosomes in centre
29
What is anaphase?
Sister chromatid cohesin breaks down Chromatids - now separate chromosomes Centromeres split and move to opposite poles Depending on position of centromere - chromosomes can form V shape as dragged
30
What is telophase?
Chromosomes arrive at cell poles Chromosomes decondense - no longer recognisable threads Daughter nuclei reform
31
What is cytokinesis?
Division of cell Cleavage furrow (animals) or cell plate (plant) forms between poles Gives two daughter cells - can be symmetrical or two uneven shaped (e.g. in budding yeast)
32
Is mitosis the same in haploid and diploid species?
They look very similar In diploid - homologous chromosomes act separately
33
What could go wrong in mitosis?
Bridge chromosome: a chromosome that has erroneously has two centromeres is pulled towards both poles - ends up breaking Acentric chromosome - lacking a centromere - can't segregate properly
34
What is meiosis?
Two nuclear divisions - halves chromosome number Replication -> chromosome segregation -> chromatid segregation -> form tetrad (4 haploid products) Alternates between 2n and n Forms gametes
35
What is the nuclear lifecycle in Saccharomyces cerevisiae?
Haploid most of the life cycle Two mating types: a and alpha One a and one alpha = transient diploid stage Then meiosis occurs Gives 4 haploid products We can dissect these products (segregants) and culture them individually - gives us info on linkage relationships
36
What do we have at the end of meiosis I?
Two nuclei both containing both chromatids of one chromosome from each pair of homologues - different than parent cell
37
What is a dyad?
Each replicated chromosome 2 sister chromatids with a single centromere
38
What is a bivalent?
2 dyads joined together
39
What happens in prophase of meiosis I?
1) Leptotene - replicated chromosome contract 2) Zygotene - chromosomes line up in homologous pairs = synapsis, held together by synaptonemal complex - bivalents 3) Pachytene - crossing over (genetic exchange) between non-sister chromatids, bivalents get shorter and thicker 4) Diplotene - chromosomes separate mostly - including centromere but sites of crossing over are still visible (chiasmata) 5) Diakinesis - chromosomes contract further
40
What do we have at the end of meiosis II?
Sister chromatids line up - centromere splits and chromatids move to opposite poles Four nuclei - two contain one copy of first homologous chromosome, the other two contain one copy of the second
41
What is crossing over?
Sites of genetic exchange between homologous sequences on non-sister chromatids Double strand breaks are generated by an enzyme Homologous recombination occurs - relies on chromatids lining up correctly so identical sequences can be accessed. Sections of the chromosome are 'swapped' giving new allele combinations
42
What are sites of crossing over called?
Singular = chiasma Plural = chiasmata
43
Can we see crossing over in the diplotene stage?
Yes - but chromatids has separated slightly
44
What lines up on the equator?
Mitosis: sister chromatids line up Meiosis: I = homologous chromosomes, II = sister chromatids
45
What is a mutation?
The process that produces a gene or chromosome set differing to a wild type
46
What is a wild type?
The form that predominates in nature/in a standard lab stock
47
What is the effect of a spontaneous mutation?
Pass down to all daughter cells - population is no longer clonal - could be 'genetic mosaic' - mutation only passed on if it's in the germline
48
What is polyploidy and aneuploidy?
Polyploidy = unusual number of chromosome sets e.g. triploidy = 3n, tetraploidy = 4n, monoploidy = 1n (when 2n is normal) Aneuploidy = one or a few individual chromosomes missing
49
What are some large-scale chromosomal rearrangements?
Deletions: part of chromosome missing Inversions: part of chromosome has been flipped Translocations: part of chromosome has been moved
50
What are sources of mutations which cause genes to differ from wild type?
Mistakes during replication: point mutation (single nucleotide), small insertion/deletions (frameshift) Genes interrupted by transposons DNA breaks incorrectly repaired
51
What are the types of point mutations?
Silent - no change to AA sequence Nonsense - introduction of stop codon in gene Missense - changes AA sequence Null allele - total loss of function
52
What are auxotrophic mutants?
Mutants that are unable to synthesise essential compounds such as AA Cannot grow on minimal media - only on complete media or minimal media with the molecule they can't make
53
What are the two types of media?
Minimal growth media - only nutrients that an organism cannot synthesise for itself Complete media - extra nutrients e.g. all AA. This allows all mutants to grow as it makes up for some defects in metabolic pathways.
54
What is the nomenclature of naming species and mutants?
+ = wild type Gene name in italics Genes are usually named after the phenotype of the null allele
55
What are allelic and non-allelic mutations?
Allelic = both mutations in the same gene/step of a pathway in two mutant organisms Non-allelic = both in different genes
56
What is complementation?
The production of a wild-type phenotype when two haploid genomes bearing different recessive mutations are united in the same cell This can identify allelic vs non-allelic. Allelic = diploid still can't grow on minimal media - has two defective alleles, non-allelic = can grow on minimal media - has one wildtype allele on each gene.
57
Can we use complementation in habitual diploids?
Cross two homozygous recessive individuals - see the phenotype Allelic = non-complementation Non-allelic = complementation
58
What did Mendel do?
Worked in his monastery garden in mid-1800's - concentrated on peas. He worked with 7 phenotypes. These were luckily: simple traits controlled by one gene, straightforward dominance relationships, on different chromosomes
59
What are pure breeding lines?
All offspring from matings within those lines have that same character. E.g. following self-pollination, all offspring from a pure breeding plant giving green peas will also give green peas
60
What are the two types of pollination?
Cross: pollen from another plant Self: pollen from self Each pea is one offspring
61
What are Mendel's laws?
Law of equal segregation: during gamete formation - members of a gene pair separate equally. Each gamete carries one allele for each gene. Law of independent assortment: during gamete formation, chromosome pairs segregate independently from each other. Law of dominance: alleles can be dominant or recessive. An organism with at least one dominant allele will display the effect of that allele.
62
What is the law of equal segregation?
During gamete formation - members of a gene pair separate equally. Each gamete carries exactly one allele for each gene. You end up with the same allele ratio that you started.
63
What is the law of independent assortment?
Orientation of bivalents on meiosis I spindle is random Forms recombinants - combinations of alleles that are not found in parents
64
What is the law of dominance?
Alleles can be dominant or recessive - an organism with at least one dominant allele will display the effect of that gene. A slash shows alleles are a pair e.g. G/g
65
Are all phenotypes controlled by one gene?
No! Some are several e.g. height
66
What are parental and filial generations?
Parental P = first - round x wrinkled First filial F1 = second - all round Second filial F2 = third - (selfed) round and wrinkled
67
What can we use to show gamete combinations?
Punnet squares Probability trees These give you phenotypic ratios
68
What is the F2 ratio for a dominant phenotype Rr x Rr?
Around 3:1
69
What is the dominant allele?
Cross pure-breeding parental lines with each of the phenotypes: predict all F1 offspring will have dominant phenotype Self-fertilise the F1 generation: predict 75% of F2 will have dominant phenotype Cross-fertilise the F1 generation with a homozygous recessive plant: 50% of F2 will have dominant phenotype
70
How do we know if an individual with a dominant phenotype is homo/heterozygous?
Cross an individual with a recessive offspring If homozygous dominant: all are Aa - dominant phenotype If heterozygous: 1/2 have dominant, 1/2 are recessive
70
What do we use to analyse expected and observed phenotypes?
Chi-squared
71
What is chi-squared?
X^2 gives us the probability that we would get a deviation from the expected result as big as the one observed. 1) calculate observed and expected values 2) calculate chi-squared using the formula 3) use the table to get a p value 4) interpret this value
72
What is the chi-squared equation?
x2 = the sum of ((O - E)^2 / E)
73
What are degrees of freedom?
The number of parameters that can vary independently = number of classes - 1 It is minus 1 because the last one has no choice e.g. wrinkled and round
74
What do the different probabilities mean in degrees of freedom?
0.05> - low probability = poor agreement between observed and expected >0.05 - high probability - doesn't contradict our hypothesis
75
What is linkage?
Mendel's law of independent assortment only is true for genes that are not carried close together. e.g. B and C are close so are inherited together - these are linked
76
What is the TH Morgan data on Drosophilia?
pr: mutant allele giving purple eyes pr+: wild-type (red eyes) vg: mutant allele giving small, vestigial wings vg+: wild type (long wings) Crossed pr/pr . vg/vg male (double recessive so only need to worry about genotypes from female) with pr+/pr . vg+/vg female - we would predict 2 parental and 2 recombinant all in equal proportions 1:1:1:1 However! - not matches this ratio - lots of parental (equal numbers), little recombinant (pr+/vg, pr/vg+)
77
What did the TH Morgan data on Drosophilia show?
Shows linkage but with a few recombinants - due to crossing over
78
What is a dihybrid cross?
Think of both genes e.g. RRMM x rrmm - F1 all heterozygous F2 - self pollinate RrMm - 9/16 have R and M, 3/16 have m and R, 3/16 have M and r, 1/16 are rrmm - 9:3:3:1
79
What is the genotype of an individual with the dominant phenotype?
Cross with recessive - see if they hare recessive phenotype - is so it it heterozygous
80
What is the recombinant frequency equation?
RF = (recombinants/total meiotic products) x 100 = (NPD + 1/2 TT)/(PD + NPD + TT)
81
What is the recombination frequency?
It is roughly proportionate to the distance between the two loci. During meiosis I - multiple recombination events ensure the homologues are held together - at random positions. The closer two genes are - less likely there will be a recombination event in-between them. So the strength of linkage depends on the distance between genes.
82
What is the maximum recombination frequency?
0 - 50% Two loci right next to each other - no crossing over - 0% RF Two loci on separate chromosomes - 50% recombinant, 50% parental, 50% RF If the distance is far enough, its impossible to know if the two genes are far away on the same chromosome or if they are on two separate chromosomes.
83
What is the difference between permissive and selective conditions?
Permissive: all cells should be able to grow Selective: only wild-type
84
What are the selective and permissive conditions for mutants?
Auxotroph mutants: Selective - minimal media Permissive - complete media/MM + the relevant compound Temperature-sensitive mutants: Selective - slightly tough growing temp of 36 c - mutant proteins are unable to fold properly Permissive - ideal growing temp of 28
85
What do we call a tetrad which all four products are same as one of the parents?
Parental ditype - all products of meiosis are the same as one of the parents
86
What is a tetrad?
Group of four spores (haploid cells) that result from the meiosis of a diploid cell.
87
What are the genotypes of the offspring when there are two unlinked genes?
Independent assortment 50% of time = all non-parental - non-parental ditype 50% of the type = all parental - parental ditype
88
What is a tetratype?
Two parental and two non-parental - 4 different products Happens due to crossover between two non-sister chromatids and two loosely linked genes Can happen for a single crossover between a gene and its centromere - also gives a tetratype for unlinked genes
89
What is the recombination frequency for two loosely linked genes?
RF = (NPD + 1/2 TT/ PD + NPD + TT) x 100
90
What is an asynchronous population?
Length of stages - stages not in sync so you get cells in different stages
91
What does the TT/NPD/PD tell us?
If PD = NPD - independent assortment - unlinked genes If PD>>NPD - inherited together - linked genes
92
What causes Edwards syndrome?
Trisomy 18
93
How many phage particles in sea water?
Around 10^7 phage particles/ml
94
How many viruses have been described in detail?
4,958
95
What are viruses?
Genetic elements that cannot replicate independent of a living host They can exist as virus particles outside the host
96
What is the size of Smallpox?
Virus - 200 nm
97
What is the size of poliovirus?
Virus - 28 nm
98
What is the size of a viral genome?
0.5 - 1000 kb Some have fewer than 5 genes
99
What are the general properties of a virus?
Virion = extracellular form - nucleic acid surrounded by protein (and other molecules) Intracellular form - replicative state
100
What virus infects C.difficile?
ΦCD508 - tail fibres attach to S-layer - triggers a structural change so can inject its DNA We use cyro-electro microscopy to view it
101
What is the virus' genome?
- Very small and reliant on host replicative and metabolic machinery - DNA or RNA genome - single or double stranded - linear or circular
102
How is the virus genome adapted to the limits on physical size?
- One solution is to have a small number of protein species forming the capsid - The capsid proteins usually are self-assembly - e.g. TMV
103
What is the structure of an extended phage?
Head Neck Tail (225 nm) Baseplate
104
What are the protein shells for animal and phage viruses?
Spherical animal viruses and phages have protein shells built of many copies of a few identical subunits
105
What is the typical viral replication?
1) Attachment (absorption) - a structural change occurs 2) Penetration (injection) - protein coat remains outside 3) Synthesis of nucleic acid and protein 4) Assembly and packaging 5) Release (lysis) Some cases - viral proteins also enter
106
What viral proteins are expressed?
The timing is very important Early proteins = for replication of viral nucleic acid e.g. enzymes Late proteins = include coat proteins
107
What is transcribed first in T4?
T4 is a bacteriophage First = early mRNA/proteins = nucleases, DNA polymerase, new sigma factors Middle = middle mRNA/proteins = phage DNA Last = late mRNA/proteins = phage head proteins, tail, collar, base plate and tail fiber proteins Then self assembly Then T4 lysozyme production - lysis 25 minutes in total
108
What causes transcription?
RNA polymerase (core enzyme) and sigma factors Sigma recognises promotor and initiation site Transcription begins - sigma factor released Chain grows until termination site
109
How is viral transcription assured?
T4 doesn't encode own RNA polymerase - uses host This is modified to specifically recognise promotors on phage DNA For early proteins - host sigma factors are used Also encodes an anti-sigma factor that binds to host sigma factor σ70 - preventing host transcription
110
How is the switch to middle proteins achieved?
Some early phage proteins modify host RNA polymerase α subunits Some bind to the host RNA polymerase These alter polymerase specificity to recognise middle promotors One early protein, MotA, recognises a sequence in the middle promotors and guides RNA polymerase to correct sites
111
How is the switch from middle to late promotors?
Requires a new T4-encoded sigma factor Most late proteins are structural proteins
112
What is the difference between lysogenic and lytic?
Lysogenic - incorporation of viral genome into host genome - infecting from within Lytic - reproduction of viruses using host cell to manufacture new viruses
113
What is lysogeny in temperate bacteriophages?
The life cycle can result in a stable genetic relationship with the host called lysogeny Virus genome is integrated into host chromosome e.g. in λ phage, or in a plasmid form e.g. P1 phage - relies on a repressor
114
What is lysogeny?
Most virus genes are not expressed The virus genome (prophage) is replicated in synchrony with host chromosome Lysogenic viruses may revert to lytic pathway and produce virions
115
What is the role of repressor proteins in lysogeny?
Inactivation of repressor or prevention of repressor synthesis induced the prophase This results in lysis
116
How is λ integrated into the host DNA?
Viral genome integrates at attachment site, attλ Requires the λ integrase enzyme 1) genome cyclases at cohesive ends 2) nuclease creates staggered ends of phage and host DNA 3) integration of λ (lambda) DNA and closing gaps by DNA ligase
117
What is the lytic cycle?
The viral genome must be replicated ready for packaging and release from host Multiple copies of viral genome must be replicated
118
How is the virus genome replicated in the lytic pathway?
Rolling circle replication 1) 1 strand of the circular lambda genome is nicked - makes a long single-stranded concatamer using the unbroken strand as a template 2) second strand made using the single stranded concatamer as a template - the double stranded concatamer is cut into genome-sized lengths at the cos sites giving cohesive ends
119
How do eukaryotic viruses enter and release?
1) binding 2) fusion 3) reverse transcription if the viral genome is RNA 4) integration 5) transcription 6) translation 7) assembly 8) budding 9) release
120
Can polycistronic mRNA be translated in eukaryotes?
No!
121
How is mRNA processed?
5' capped Splicing 3' polyadenylation
122
What are the (+) strand RNA viruses of animals?
Typically very small (~30 nm) except coronaviruses e.g. poliovirus, rhinovirus (common cold), hepatitis A
123
What is poliovirus?
Single linear ssRNA genome Genome acts as the mRNA but there is no capping - instead the RNA at 5' end folds into stem loops and has a protein, Vpg, bound - mimics cap
124
What is the proteinsynthesis in poliovirus?
Genome RNA acts as mRNA Viral protein synthesised as one large polyprotein - common strategy in viruses Proteases then cleave the polyprotein to form structural coat proteins, RNA replicase ect.
125
What is replication in poliovirus?
Occurs in cytoplasm - does not involve DNA Host RNA and protein synthesis is inhibited as host cap-binding protein is destroyed RNA replicase - synthesises (-) from + then makes a - from the (+)
126
What are (-) strand viruses?
Includes rabies, influenza and ebola The - strand encodes the genome but cannot act as mRNA
127
How does rabies replicate?
A viral replicase is essential and carried by the virion 2 distinct classes of RNA are transcribed (-) strand parental (RNA pol makes mRNA from this) --> + strand RNA --> (-) strand genomic RNA
128
What is Influenza's genome and structure?
The (-) strand RNA is segmented Influenza A has 8 linear ssRNA molecules Structure: Inside (RNA genome - in 8 pieces, RNA endonuclease, viral RNA polymerase), Envelope, hemagglutinin, neuraminidase
129
How is Influenza replicated?
The viral nucleic acid replicates in the host nucleus The overall pattern of viral genome RNA synthesis resembles rabies Transcription results in viral mRNA with 5' caps - primers are cut from 5' ends of newly synthesised host mRNAs by viral endonuclease - poly A tails are added - viral mRNA moves to cytoplasm for translation
130
What is the antigenic drift?
The surface proteins are the main immunogenic regions of the virus Antigenic drift arises from mutations in the genes for the surface proteins Therefore, an annual vaccination mimicking the surface protein epitopes is needed
131
What causes an antigenic shift?
Portions of the RNA genome from 2 genetically different strains, both infecting a cell, are reassorted This leads to a different combination of surface proteins Antigenic shift is the origin of pandemics and epidemics
132
What is HIV?
RETROVIRUS Genome has 2 strands of ssRNA Genome is replicated through a DNA intermediate - requires a reverse transcriptase Structure: - RNA and enzymes (reverse transcriptase, protease, integrase) - surrounded by core protein - Surrounded by a core shell protein - Lipid bilayer - Transmembrane envelope protein - Surface envelope protein
133
What is the HIV genome?
Retroviruses have 2 identical ss(+)RNA strands Have R terminal repeats (at end of strands) - essential for replication Has gag region - encode structural proteins Has pol region - encodes reverse transcriptase and an integrase Has env region - encodes envelope proteins that sit in the membrane
134
How does HIV replicate?
1) Entrance 2) Uncoating 3) Reverse transcriptase 4) Travel to nucleus and integrate in host DNA 5) Transcription 6) Encapsidation 7) Budding 8) Release
135
How are HIV genes expressed?
In provirus form - viral genome may be expressed or remain latent Activation of promotors in LTR region leads to mRNA transcripts that are capped and polyadenylated Viral mRNAs are either encapsidated or translated Similar to polio - polyproteins are synthesised and then processed
136
What is an example of a virus with overlapping genes?
Simian virus 40 (SV40) One dsDNA circle (has early and late region) complexed with host cell histones No viral-encoded enzymes
137
What does SV40's genome encode - early?
A single RNA is made by the cellular RNA polymerase from the early region This is processed into 2 mRNAs - both capped The introns are excised out The expressed T antigen protein binds to origin of replication to initiate genome synthesis Genome too small to encode viral DNA polymerase - uses host DNA is replicated in a bidirectional fashion using host cell machinery
138
What does SV40's genome encode - late?
Begins at a promotor near origin of replication This late RNA is then spliced, capped and polyA'd to yield mRNA corresponding to the three coat proteins: VP1, VP2, VP3 The genes for these proteins overlap
139
What is the structure of SV40?
See your photos!
140
What is a coronavirus?
Single (+) strand RNA virus Replication in cytoplasm (like polio) Causes resp infections in humans - cause 15% of common cold but occasionally fatal e.g. COVID
141
What is the coronavirus structure?
Enveloped virions Glycoprotein spiked give 'crown' like appearance Largest known RNA viruses ~ 30 kb
142
What is the infection cycle for coronaviruses?
1) Entry - spike protein binds to receptor on host cell called ACE2 - host molecule TMPRSS2 cleaves spike protein exposing sites to fuse with 2) Inside - RNA is transcribed into non-structural proteins that quickly supress the translation of host messenger RNAs in favour of those belonging to virus 3) Remodelling - virus transforms the cell's ER into double-membrane vesicles - safe place for viral RNA to be transcribed 4) Exit - assemble into complete virus and leave through golgi/lysosomes 5) Furin cuts five amino acids on own spike protein - ready to infect another cell
143
How is the coronavirus' genome replicated?
Genome has 5' cap and poly(A) tail so can act as mRNA However only the replicase is translated This then generates a (-) strand of copied RNA From this - several monocistronic mRNAs are transcribed and translated into viral proteins - progeny genomes are also produced
144
How can we visualise bacteria chromosomes?
Osmotic disruption of E.Coli - causes the DNA to spew out - 1500 times longer than the cell - use electron microscopy
145
How is genetic material organised in bacteria?
A nucleoid - NOT NUCLEUS 30-100 domain loops in DNA after release from bacterial cell Supercoiled Some supercoils are relaxed in a nicked strand e.g. with endonuclease - loops out
146
What causes the nucleoid supercoiling in bacteria?
Supercoiling of 300 bp circular DNA molecule Enzymes (topoisomerases) make a double strand break in the circle - passes another part of DNA through then reseal
147
What is the DNA replication in bacteria?
15 minutes Semiconservative: at replication fork, DNA synthesis machine separates the two strands - forms a replication bubble, whilst extending the new strands From origin, oriC, to terminus, ter. 5'-3' Lagging has okazaki fragments
148
How is DNA replication terminated in bacteria?
There are >8 arrest sites at the terminator regions Replication forks moving clockwise are trapped by terB, C, F, G, J Replication forks moving anticlockwise are trapped by terA, D, E, H, I
149
What is the chromosome of E.coli K-12?
Circular - 4600kbp - 4288 genes No plasmids Order of transcription can be CW or ACW Replication proceeds in both directions from origin at 84.3 min There are restriction sites for Not1 and some Hfr origin ect.
150
What's a difference between eukaryotic and prokaryotic chromosomes?
Bacterial: tightly packed with coding genes with few mobile elements and little unused sequence Human: 95% non-coding, coding is separated with introns. Ancient gene duplications have decayed to pseudogenes (can't code for a protein)
151
What is the genome of Salmonella enterica serovar Typhimurium?
Has big differences between itself and other strains of the same species The + and - strands are transcribed in opposite directions
152
What is the genetic organisation of bacterial genes?
One gene makes a monocistronic message (one protein) An operon of e.g. three genes makes a polycistronic message In a chromosome - genes and operons may form a regulon controlled by a common regulatory protein
153
What are plasmids?
A genetic structure in a cell that can replicate independently of the chromosomes - typically a small circular DNA strand in the cytoplasm Ranging from few hundred - many kbp
154
What's a common cloning vector/plasmid?
pBR322 Used in molecular cloning 1) origin of replication 2) two antibiotic resistance genes (amp and tet) 3) three unique 6bp restriction sites (HindIII, PstI and BamHI)
155
What are some phenotypes that some plasmids have in prokaryotes?
Antibiotic production Conjugation Metabolic functions Resistance Virulence
156
What is the genetic map of the resistance plasmid, R100?
- Mercuric ion resistance - mer - Sulfonamide resistance - sul - Streptomycin resistance - str - Chloramphenicol (antibiotic) resistance - cat - Insertion sequences and transposon Tn10 - Transfer functions - can be transferred between enteric bacteria and are a threat to antibiotic therapy - tra - Replication functions - encode replication proteins
157
What are features of a mutation?
Heritable Rare Good/bad/neutral Usually small
158
What are the nomenclature rules in genotypes in bacterial genetics?
IMPORTANT Gene name - 3/4 letters All except the 4th in lower case Always italicised e.g. cydA, cydD, hmp A wild-type gene is written with a + e.g. cydA+ A mutant is written cydA If there are several alleles we write cydA1, cydA2
159
What are the nomenclature rules in phenotypes and proteins in bacterial genetics?
Phenotype: Not in italics First letter is capitalised +/- used to show presence/absence of a property Encoded protein: No italics/superscript - first letter is a capital e.g. The CydA protein, CydD
160
What mutations are caused by a base-pair substitution?
Missense - faulty protein Nonsense - stop codon Silent - normal Genetic code is degenerate and different types of mutant proteins may result
161
What is the mutation in sickle cell?
GAG to GUG
162
What's a mutagen?
Agents which increase mutation rates Radiation Intercalating dyes Chemicals reacting with DNA Base analogs
163
What are some examples of the mutagenic base analongs?
2-Bromouracil - substitutes T with itself and over time turns AT-GC or GC - AT 2-Aminopurine - over time turns AT-GC or GC-AT
164
What mutagens react with DNA?
Nitrous acid (HNO2) - deaminates A and C - turns AT-GC Hydroxylamine - reacts with C - GC - AT Alkylating agents: Monofunctional (e.g. ethyl methane sulfonate) - puts methyl on G - faulty pairing with T - GC -AT Bifunctional (e.g. nitrogen mustards, mitomycin) - crosslinks DNA strands - point mutations and deletions
165
How is radiation a mutagen?
UV - pyrimidine dimer formation Ionising e.g. x-rays - free radical attack Both may cause error or deletion when repaired
166
What are types of mutants?
Auxotrophs - loss of enzyme in biosynthetic pathway - can't grow on media missing nutrient Temp-sens - heat sensitive Cold sensitive - can't grow at low temp as essential protein is inactivated at low temp Drug-resistant - detoxification of drug or alteration of drug target - can grow on media containing a normal amount of drug Rough colony - loss of liposaccharide layer - granular, irregular colonies Nonencapsulated - loss of surface capsule - small colonies Nonmotile - compact colonies
167
How can DNA from a donor cell be introduced into a recipient cell?
1) Transformation - uptake of naked DNA 2) Transduction - phage mediated 3) Conjugation - plasmid mediated
168
How does transformation work?
DNA may or may not be degraded 1) DNA binds via protein 2) one strand enters using DNA translocase - second is degraded 3) internalised single strand is bound by RecA protein 4) as RecA has more than one DNA binding site - it can hold the ssDNA and dsDNA together 5) the Rec-A-ssDNA complex stretches the dsDNA to increase complementary recognition (conformational proofreading) 6) branch migration and homologous recombination follows 7) Transforming DNA doesn't have to be a plasmid - often is
169
What is the mechanism of transduction?
'normal' bacteriophage contains phage genes; transducing particles contain host genes too Lytic cycle or lysogenic cycle
170
What is the mechanism of conjugation?
Mating requires pili (sex pilus) responsible for cell pairing
171
What is the genetic map of the F plasmid of E.coli?
NEEDED FOR CONJUGATION F = fertility - only in donor cells Tra region - genes needed for conjugative transfer including sex pilus oriT - origin of transfer during conjugation IS regions - insertion sequences may recombine with regions on chromosome
172
How is the F plasmid transferred by conjugation?
1) pilus retracts 2) cell pairs stabilized - F plasmid nicked in one strand 3) transfer of one strand from F+ (donor) to F- (recipient) - F plasmid is replicated in F+ cell 4) Complementary strand is synthesised in recipient cell by rolling circle mechanism 5) Completion of DNA transfer - cells separate
173
Can chromosomes be transferred in conjugation?
yes! due to integrated F plasmid Hfr strains were isolated (efficient genetic donors) - high frequency of recombination for chromosomal markers The plasmid can't control own replication now - tra function operates and cell forms pili Then Hfr encounters recipient strain - the F+ plasmid is integrates - the bacteria is Hfr
174
What happens after the transfer of the plasmid in conjugation?
The Hfr strain transfers the F plasmid to a recipient as an F+ would The recipient does not become Hfr as part of the integrates Hfr is transferred Look at diagram
175
What has the Hfr strains allowed scientists to do?
As strains transfer genes in different orders and from different regions - can identify the arrangement and orientation of almost all genes before the genome sequence of E.coli was known
176
How are the three methods of gene transfer used?
Transformation - get plasmids/DNA in bacteria - some bacteria are natural competent to be transformed, some e.g. E.coli need chemical treatment or electroporation Transduction - second most important method - needed to move fragments of DNA/gene/plasmids from one cell to another Conjugation - has historical interest but used less
177
How big must the t-test class be?
Each class must have more than 5 members
178
What is Zea mays?
Maize or corn Kernels have two possible morphologies: starchy (smooth) and sweet (wrinkled) because starch is insoluble so contain less water so don't wrinkle when dried Also come in two colours: yellow or purple
179
What are locust genetics?
Normal sex-chromosome complement of male locust is XO, not XY All chromosomes are telocentric - centromeres at one end
180
What is a pericentric inversion?
When the centromere is in the inverted segment Pericentric = around centromere Paracentric = besides centromere A diploid individual with one normal chromosome and one with pericentric inversion = inversion heterozygote
181
What is an inversion loop?
When pairing occurs when there is extensive inversion
182
How does genetics affect our health throughout our lifespan?
Embryonic: predicted that 50% of embryos have chromosomal defects Newborn babies: 5% of babies suffer from congenital disorder - chromosomal, mitochondrial, single gene, complex (multi-factoral, genetic and environment) Later life: 2/3 diseases have genetic component in their aetiology
183
How common are chromosomal disorders?
Prenatal: 8% of clinically recognised conceptions terminate due to chromosome abnormalities - predicted that 50% of terminations are due to it Live births: 0.1% of live births display chromosomal abnormalities: aneuploidy e.g. down syndrome, trisomy 21, chromosome abnormalities
184
What are mitochondrial disorders?
Each cell has 100's of mitochondria - has own genome (2-10 copies of circular 16.6 kb genome) Normally all copies of the DNA are the same (homoplasmid) Mutation - mixture of genomes (heteroplasmid) Can lead to mitochondrial disease - don't follow mendelian patterns of inheritance
185
What are the differences between single gene disorders vs complex diseases?
Single gene vs multiple genes (polygenic) Single follows mendelian inheritance pattern, complex has no simple pattern but runs in families Single conditions are rare but collectively common (1/17), complex are common Single are high penetrance, complex suggest a susceptibility not deterministic Single has predictive tests, complex has no reliable tests Complex are influenced by environment
186
What is the size of the human genome?
3.2 Gb Only 1.1% is coding 20,500 protein coding genes Single: all mutations in coding sequence Complex: many disease variants lie outside coding
187
What is the difference between the human genome and the fly/yeast?
Human: 3.2 Gb, 20,500 genes Yeast: 12.1 Mb, 6000 genes Fly: 175 Gb, 14,000 genes
188
What are the non coding parts of the genome?
Regulatory elements, promotor Introns
189
What is ENCODE?
ENCyclopedia Of Dna Elements Goal is to build a comprehensive parts list of functional elements in human genome Achieved through genomic, functional and bioinformatic approaches List includes elements that act at the protein and RNA levels and regulatory elements
190
When was the human genome coded?
In 2001 following 15 years ($3 billion) 2 competing consortiums: - International Human Genome Sequencing Consortium - Celera Genomics Sequencing the whole euchromatic region (92%) minus the heterochromatic and repetitive regions
191
Why was sequencing the genome difficult?
- length of genome - generating enough clones to ensure complete coverage - accuracy-sequencing must be repeated - reassembling genome from individual sequences
192
What is next generation sequencing?
Novel sequencing technologies reduce the time and cost Different approached but based upon sequencing a lot of small fragments of target DNA, in contrast to sanger sequencing Whole genome sequencing (WES): enormous amount of data, more expensive Whole exome sequencing (WES): targeted sequencing of only protein encoding region of genome (1%)
193
Why is NGS better than sanger?
Average length of sequence read: 36 vs 800 for sanger Average number of reads: 35,000,000 vs 384 for sanger Total amount of sequencing: 1.3 Gb vs 300 kb (sanger) Cost/Mb: $4 vs $600 for sanger A whole genome can be sequenced in 24 hours for $600
194
What are some single gene disorders?
6,466 phenotypes, 4,412 genes Autosomal: huntingtons - autosomal dominant, cystic fibrosis - autosomal recessive X-chromosome linked disease - Duschenes muscular dystrophy
195
What are autosomal dominant disorders?
1 copy of mutation is needed Wile-type allele is recessive, defective is dominant More likely that there is one defective allele than 2 Inherited in dominant pattern or new mutation arises in single copy Phenotype in every generation - parent can transmit to every progeny according to mendelian inheritance ratios
196
What is Huntington's disease?
Late onset autosomal dominant disorder 1 in 10,000 Progressive neurogenerative brain disorder: - neuronal death - cerebral atrophy - CNS disorder - Uncontrolled movements - Decrease in cognitive function - Personality changes
197
When was the genomic region for Huntington's found?
1983 through positional cloning The HTT gene was identified and clones a decade later Dominant pattern - affects every generation
198
What type of disease is Huntington's?
Polyglutamine repeat disease HTT encodes a protein with a poly glutamine tract (encoded by CAG codon) Normal copies have less than 36 repeats - more than 36 = Huntington's Greater number of CAG repeats - earlier onset CAG are unstable and are prone to expansion and can expand upon parental transmission
199
What is the penetrance of Huntington's?
Shows incomplete penetrance Signs and symptoms appear at the earlier age as HD is passed from one gen to another - anticipation
200
What does the mutated Huntington's protein do?
Has poly Q expansion and has prosperity to misfold and aggregate These aggregates form inclusion bodies in neurones Impairs axon transport, mitochondrial dysfunction, dysregulation of transcription
201
What are autosomal recessive disorders?
2 recessive defective genes are needed Phenotype doesn't appear in every generation Both parents are carriers Transmission in accordance with mendelian inheritance
202
What is cystic fibrosis?
Most common severe autosomal recessive disorder 1 in 2000 are affected 1 in 22 are carriers Characterised by build up of thick mucus that can damage many body organs The CFTR gene and mutations were identified by positional cloning (like huntingtons)
203
What are therapies of cystic fibrosis?
Physiotherapy DNase to reduce mucus viscosity Antibiotics and anti-inflammatories Mannitol spray to increase osmolality of mucus
204
What id the CFTR gene?
Encodes the cystic fibrosis transmembrane conductance protein CFTR transports Cl- across the plasma membrane of cells that line the lung Ensures the hydration of the airway surface layer
205
206
What are the classes of CFTR mutations?
I - no protein II - no traffic III - no function IV - less function V - less protein VI - less stable I and V treatment: - production correctors (ataluren) - promote CFTR transcription e.g. nonsense mutations II and III - correctors (Lumacaftor) - improve intracellular processing of CFTR e.g. Phe508 IV and VI: - potentiators (Ivacaftor) - recovers function of CFTR
207
What do patients with CFTR homozygous Phe508del?
- potentiator alone is not affective - potentiator and a corrector (orkambi) received FDA approval and now available on NHS however orkambi is expensive (£100k per year)
208
What are X-linked disorders?
Most are recessive Males can inherit from female carrier parent A female can inherit only when both the parents carry allele Females are generally unaffected but are carriers Affected males progeny never show disease phenotype
209
What is Duchenne muscular dystrophy?
Most common X-linked recessive disorder Disease onset if before 6 years old Life expectancy = 20-30 Gene passed on by mother Multi-system disorder: causes muscle weakness, wasting and atrophy, skeletal muscle degeneration and cardiac myopathy
210
What is the gene causing duchenne muscular dystrophy?
On X chromosome Largest known human gene (2.4 mb) - 79 exons and encodes dystrophin protein - 427 kDa
211
What does the dystrophin protein do?
Located in skeletal and cardiac muscle In muscle: part of protein complex which strengthens muscle fibres by linking a muscle cell's cytoskeleton (actin) to ECM (connective fibres)
212
What are the mutations in duchenne muscular dystrophy?
Approx 60% of DMD cases are due to deletions of at least one exon Hotspot deletion zone - between exons 40-54 Deletions can cause frameshifts
213
How is DMD treated?
Anti-sense oligonucleotides - Eteplirsen - prevents recruitment of the splicing machinery
214
What will future NGS of whole genome/exome allow?
Identification of genetic determinants in: - novel conditions - characterised disorders - atypical presentation of a disease Analysis of genetic determinants of complex disorders
215
POSITIONAL CLONING??
216
What is the life cycle of yeast?
Haploid cells - MATa and MATalpha fuse to form a mating figure which buds off diploid cells. These diploids can reproduce by mitosis to produce a population of diploid cells - these can be studied directly to test dominance and complementation of genes contributed by the two haploid parents Alternatively, diploids can be sporulated to obtain haploid sexual progeny - study gene segregations, gene interactions and linkage - when diploids sporulate - each diploid nucleus divides meiotically to form four haploid products/segregants - the tetrad comprises four haploid ascospore enclosed in ascus
217
What information can we gather from looking at tetrads?
- parental ditype (PD) - tetrad only has parental genotypes - non-parental ditype (NPD) - tetrad only has non-parental (recombinant) genotypes - tetratype (TT) - tetras contains four different genotypes - 2 parental, 2 recombinant
218
What can analysing tetrads tell us about linkage?
For genes closely linked - we would see mostly parental (PD>>NPD) For two genes found on different chromosomes or far on the same - expect equal abouts (PD =NPD) Containing all four combinations - caused by single crossover between the two alleles or between one gene and the centromere - doesn't tell us about linkage
219
What are temperature sensitive mutants?
Changes thermal stability of protein - has normal phenotype at low temp (permissive/non-restricted) - mutated phenotype at high temp (restricted/non-permissive) The protein has a decreased stability meaning it denatures at a lower temp - if the protein is essential then the cells cannot grow cdc mutant - can grow at 25 - not 36
220
What are complex genes?
Common - 66% lifetime risk Multifactorial: polygenic, environmental Risk alleles suggest susceptibility - not deterministic Are familial
221
What are some examples of complex diseases?
Obesity, rheumatoid arthritis, Type1/2 diabetes, MS, cardiovascular disease, epilepsy, peptic ulcer, hyperthyroidism, some cancers Alzheimer's, bipolar, depression, tourette's Congenital: cleft lip/palate, neural tube defects, pyloric stenosis
222
Why do we identify the genetic components of complex disease?
Early diagnosis and treatment Allow lifestyle changes to lower risk Help understand the molecular basis to develop therapeutics
223
How do we identify risk alleles?
11-15 million common variants - single nucleotide polymorphisms (SNPs) SNPs are found in both coding and non-coding regions Minor allele frequency (MAF) is the second most common allele in the population Common SNP/MAF > 5% Rare SNP/MAF < 5% Risk from SNP can be small - correlation between disease and genotype can be weak Identifying alleles which increase risk requires large number of individuals
224
What are the methods to find risk alleles in complex disease?
Family based linkage analysis: - looks for linkage between mapping markers and occurence of disease in families Genome-wide association studies (GWAS): - search for alleles in a population that occur more frequently in disease cases than in matched controls - more powerful at identifying rare risk alleles and those that contribute a small increase of risk
225
What is the phenotypic variation?
The sum of genetic and environmental variation Vp = Vg + Ve Vp = phenotypic variance Vg = genetic variance Ve = environmental variance
226
How do we work out heritability?
The degree of variation within a population that is due to genetic variation = Vg/Vp Higher the heritability - greater the genetic contribution but environment can still influence likelihood
227
How can heritability be experimentally determined?
Twin studies allow the effects of genetics (heritability) and environment on phenotypic variance Monozygotic - identical twins - share 100% genes - shared environment Dizygotic - non-identical twins - share ~50% of alleles and share environment Assume equal environmental influence for identical and non-identical twins
228
How do we do the twin studies?
ACE model A (genetic variance): - MZ = constant, DZ = variable C (common environment): - MZ = constant, DZ = constant E (specific environment): - MZ = variable, DZ = variable Environmental contribution is the same
229
What is concordance in twin studies?
Percentage of identical or non-identical twins which share phenotype/disease MZ: - correlation = A (100%) + C DZ: - correlation = A (50%) + C The difference in concordance between identical and non-identical is a measure of the effect of 50% of genes on variation in the population The greater the difference in concordance between MZ and DZ - greater the heritability of the trait
230
What is the phenotypic variation in complex disease?
Most phenotypes show a continuous variation - some have complex - disease is controlled by multiple polygenes - each allele segregates by mendelian laws - many polygenes + environment = continuous phenotypic variation Threshold for developing disease: - most diseases are discontinuous - an individual either has or hasn't got the disease
231
Do polygenic diseases show a continuous phenotype?
Yes - many loci can contribute to variation -> broad range of phenotypic variation - environmental factors will also affect phenotype - same genotype will display different phenotypes - not all alleles will contribute the same risk - an increase in number of loci, greater the distribution of variation
232
Can complex diseases show a continuous phenotype?
Yes e.g. depression Identified 44 significant risk loci for major depression All humans carry greater or lesser Associated with clinical features and included drug targets
233
What is the threshold hypothesis for disease susceptibility?
Most disease phenotypes are discontinuous Polygenes and environment produce a distribution of liability The disease occurs when liability exceeds threshold Relatives of affective individuals have an increased risk
234
What is the evidence for threshold hypothesis?
Pyloric stenosis: - narrowing of opening between stomach and SI - projectile vomiting after feeding in infants - multiple genetic and environment factors have been identified as contributing to disease - five times more common in males - females have a higher liability threshold than males - affected females carry more risk alleles - relatives of affected females have greater risk than relatives of affected males
235
What is the genetic architecture of complex disease?
Different possible architectures: - small number of dominant alleles confer a large risk increase in risk - parkinsons - common disease, common variant model (CDCV) - many alleles confer a small increase in risk - type II diabetes - intermediate - one major allele exerts a large effect, numerous other lower risk alleles - breast cancer
236
What are SNPs?
Single nucleotide polymorphisms - between 11 and 15 million common SNPs (MAF (minor allele frequency) >5%) - uneven distribution of SNPs in genome - can occur at: coding regions - synonymous (no change in AA), non-synonymous (missense/nonsense), or non-coding regions - can affect expression/regulation of associated genes - complex diseases arise from combinations of multiple SNPs
237
What is the gnomAD database?
There are exome and genomes from unrelated individuals sequenced as part of various disease-specific and population genetics studies - gnomAD aggregates over 730,000 exome and 76,000 genome databases - over 910 million variants mapped - records frequency of alleles in a population - documents rare mutations - highly pathogenic variants seen with a lower frequency in the general population
238
What are genome wide association studies?
- examine a panel of SNPs in the genome for association with the disease phenotypes - search for alleles that occur more frequently in disease cases than in matched controls - requires many participants - GWAS studies have been performed for most common diseases - many risk loci have yet to be identified - missing loci contribute to the 'missing heritability problem' - GWAS compares the allelic frequency across the entire genome in case and control populations. - Significant differences in allelic frequency constitutes an association with disease.
239
How are SNPs associated with disease?
Association studies can tell us if an allele is associated with disease The SNP itself can increase risk The SNP correlates with the risk allele due to linkage disequilibrium - they are inherited together
240
What is linkage disequilibrium?
The non-random association of alleles at different genomic sites. Depends on: - distance between alleles - recombination rates The pattern of LD can be summarised as haplotype blocks
241
What is a haplotype block?
Region of high LD that are separated from other haplotype blocks by many historical recombination events In haplotype mapping - groups of alleles are clustered so a single SNP can identify cluster of alleles (Tag SNP)
242
How do we identify risk alleles?
GWAS studies identify SNPs associated with disease - not necessarily risk alleles Need integration with functional data on candidate regions to identify causality
243
How are alleles associated to the disease?
Measured in an odds ratio OR is a statistic that quantifies the strength of the association between two events OR = 1 - events independent OR > 1 - events are correlated OR< 1 - events are negatively correlated Common disease common variant (CDCV) model of complex diseases - multiple alleles with OR< 1.2 showing weak association to disease phenotype
244
What does GWAS rely on?
Statistical analysis and large cohorts Statistical significance is needed to differentiate true positives from false positives Genome wide significance is where p value < 5x10^-8 1/20 events are non-significant (0.05) For 1 million SNPS - expect 50,000 false positives Very large number of participants required Very large number of participants are required
245
How do we define risk variants?
Manhattan plot The threshold for significance is shown by red horizontal line (p = 5 x 10^-8) 44 risk loci defined with significant p-values (green stacks) GWAS susceptible to high number of false positives
246
What is type II diabetes?
Common chronic condition caused by inability to take up sugar Characterised by high BP, insulin resistance and a lack of insulin production Multifactorial (genetic and environmental): - familial - geographical and ethnicity - age, weight, diet, exercise
247
What is the GWAS of type II diabetes?
Red SNPs - previously identified common alleles Green SNPs - novel association loci determined due to increased statistical power The novel loci have low odds ratio (1.06-1.27) - each causes a small increase in risk TCF7L2 - allele providing greatest risk of type II diabetes, intronic variant, transcription factor required for pancreatic development FTO - intronic variant, involved in body weight regulation CDKN2A/B - non-coding regulatory variant
248
What is breast cancer?
Lifetime risk in females - 8-12% Risk increases if first-degree relatives suffer Rare coding mutations and common non-coding increase risk - rare coding mutations e.g. BRCA1/2 - huge risk - common variants in polygenes contribute small increase 5% of cases - BRCA1/2 autosomal dominant alleles - mapped by linkage analysis
249
What is the GWAS of breast cancer?
Identify high known risk factors that occur infrequently in the population Identify 66 common low-risk alleles - many in non coding regions of genome
250
What is missing heritability?
Heritability = proportion of variance in a particular phenotype in a population that is due to genetic variation We can identify risk alleles for complex diseases but heritability is not fully explained by these alleles e.g. Crohn's - 32 loci - 20% heritability explained by these alleles Type II diabetes - 18 loci - 6% Early onset myocardial infarction - 9 loci - 2.8% Missing risk can arise from: - false negatives in GWAS studies - rare variant alleles with MAF 1-5% - structural alteration of the genome - epigenetics - 3D genome organisations - NONE ARE DETECTED IN GWAS
251
What is cell differentiation?
e.g. haemopoietic stem cell gives rise to different blood cells Cells within a defined ancestry = lineage Same DNA but differences in gene expression and proteome
252
What is pattern formation?
The concept of position information proposes that cells acquire positional values as in a coordinate system - they interpret this by developing in particular ways to give rise to spatial patterns - e.g. organ placement Louis Wolpert made this concept
253
Is polarity fundamental to development?
Yes - we see it in vast majority of organisms Molecular events define the polarities e.g. Apical basal symmetry - bilateral symmetry in fly Floral symmetry - flowers e.g. actinomorphic (radial), zygomorphic (bilateral) Dorsal-ventral - shark
254
Do we need to understand development for disease?
Yes! epigenetics change differentiated -><- undifferentiated e.g. some diseases occur due to defects in development - can help us find treatments
255
Where are stem cells found?
Found in all multicellular organisms e.g. plants derive from stem cells generated in embryogenesis - shoot and root meristem
256
What is a stem cell?
Undifferentiated cell of a multicellular organism which is capable of giving risk to indefinitely more cells of the same type - other types of cell can arise from differentiation Goes through self-renewal - when it divides, it generates a new stem cell and a second progenitor cell which can generate new lineages
257
What is self-renewal?
Maintains stem cell population - forms a stem cell and progenitor cell Asymmetric division - mitotic division of progenator cell that gives rise to progeny with different rates
258
What are the types of stem cell?
Totipotent - can form whole new organism. Mammals - can form embryo, extraembryonic membrane and placenta Pluripotent - can form many cell types but not all. Mammals - can form embryo Multipotent - can form fewer but still many cell types e.g. haemopoietic stem cell
259
What are the sources of stem cells and their potentials?
Zygote - totipotent Blastocyst - pluripotent, self-renewal division type - embryonic stem cell Adult - multipotent, self-renewal division type - multipotent stem cell Organ - limited/no potential, limited self-renewal division = progenitor, division = committed progenitor, functional division = differentiated
260
How do we make embryonic stem cells from a mouse?
From the mouse blastocyst (the outside = trophoectoderm (makes the extraembryonic membrane and placenta)) take the inner cell mass Place in tissue culture conditions - some will turn into embryonic stem cells (not totipotent - pluripotent)
261
What are uses of embryonic stem cells?
Introduce genes - e.g. introduce human genes and study where protein is localised - fused to GFP or what its transcripts do Knockout genes - can study role of the gene Chromosomal rearrangements/deletions -generate disease models Make specific genome edits - can make mutation found in gene known to cause disease Need to see these changes in the mouse itself
262
Can we make transgenic mice from ESC?
Yes! But they are not totipotent Therefore, we need to take cultured ESCs and introduce them into a developing blastocyst (has an inner cell mass) This is then implanted into a pseudopregnant mouse Forms chimeric mouse - has cells of two genotypes from two different zygotes (not same as genetic mosaic as this is where it comes from one zygote)
263
How do we keep track of which cells were derived from the implanted ESCs and which were already present - I?
Genetic markers - common to use mouse coat colour genes Fur colour is determined by banding pattern on bristles and the wildtype colour requires A and C to give agouti (greyish) colour whereas albino mouse has homozygous for a and c genes So if ESCs are derived from a AACC blastocyst and are transported into a albino mouse blastocyst - 1st gen will be a mixed genotype - patchy colour - NOT HETEROZYGOUS MOUSE - a cell will either be AACC or aacc Cells in reproductive system will hopefully be chimeric and give rise to gametes that are either AC (from transgenic ESC) or ac (from albino blastocyst)
264
How do we keep track of which cells were derived from the implanted ESCs and which were already present - II?
The chimeric mouse would be crossed with an albino mouse again and you would look for WT (agouti) mice - will have genotype AaCc The A and C derive from cells which were originally the ESCs Can only happen if the reproductive organs in the 1st gen mouse had some cells from the ESCs
265
What are some ethical issues for embryonic stem cells?
Significant research tool and same method in mice require human blastocysts. Therefore would have to sacrifice human blastocyst However, have been approved for limited experimentation including human-animal hybrids - must terminate early
266
Can somatic nuclei be reprogrammed to be totipotent?
Yes! e.g. shown in xenopus laevis - nuclei from embryonic tissue was placed into enucleated embryos to generate tadpoles - he took nuclei from somatic cells and placed it into enucleated eggs - resulted in tadpoles Demonstrated that a somatic nuclei did contain all the genetic make up to produce all cells of an organism - in egg environment it can be reprogrammed to become totipotent This paved the way for the first animal cloning - nuclei was transported into enucleated embryo - dolly the sheep
267
What are induced pluripotent stem cells?
Yamanaka took adult fibroblast cells and using tissue culture and transcription factors - was able to reprogram a small percentage of cells to dedifferentiate and become like embryonic stem cells These can then be differentiated into different cell types
268
What is the genetics of drosophila melanogaster development?
4 pairs of chromosomes : X, Y, 2, 3, 4 ~ 16,000 genes ~ 140 Mb
269
What is the drosophila development?
Fertilised egg (0 days) Embryonic development Hatching (1 day) Larva (three larval stages, separated by molts) Pupation (5 days) Pupa Metamorphosis Adult (9 days)
270
How do we map adult segmentation in drosophila?
Can be mapped to larval segments e.g. head, prothorax, mesothorax, metathorax - easy to map so used as model organism - obvious polarity - formed basis for mutagenesis Body plan is determined by hierarchy of developmental genes - defects in segmentation pattern was easy to screen for and identified a hierarchy of genes
271
What are the segmentation genes?
Maternal effect genes: phenotype of progeny is determined - by maternal genotype Zygotic genes: phenotype of individual is determined by the individual's genotype. 1) segmentation genes (determine features of the fly) 2) homeotic genes (determine structures - wings, legs ect.)
272
What is early development in drosophila dependent on?
Maternal - Axis formation: Egg-polarity genes Zygotic - Segment identity: Gap genes Pair-rule genes Segment polarity genes
273
What are maternal effect genes?
Germ cell-less mutant = gcl - results in sterile adult flies The parental genotype does not affect the fertility of the parents - all parents are fertile The genotype of the progeny has no impact on progeny It is the maternal genotype that determines the fertility, if the mother is gcl/gcl then ALL PROGENY are sterile, irrespective of own genotype
274
What is anterior-posterior polarity determined in?
The oocyte The unfertilised oocyte is found in the egg chamber and is associated with Nurse cells and follicle cells. The nurse cells generate certain maternal transcripts that are then transported into unfertilised oocyte Has polarity before fertilised
275
What are two important maternal effect genes?
30 genes e.g. BICOID and NANOS BICOID and NANOS mRNA's are polarised before fertilisation. BICOID mRNA is located at anterior, NANOS mRNA at posterior pole.
276
What does a mutation in maternal effect genes do?
Result in large deletions of the body plan bicoid mutation - loss of head and thorax (anterior) nanos mutation - no abdomen sections
277
When do the bicoid and nanos mRNAs get translated?
After fertilisation Create concentration gradients of mRNA and proteins
278
What does the the ME proteins then do?
Regulate zygotic genes
279
What are homeotic genes?
Regulate the structures which develop on each segment of the adult fly Mutations can change positions e.g. segment 2 taking identity of segment 3 = two sets of wings (ultrabithorax) or antennae replacing legs (antennapedia)
280
How were homeotic genes identified?
Because in a mutation there is a transformation of organs in which the developmental fate of an organ is change of that of another - replacement not deletion
281
How are homeotic genes organised?
Into complexes - sequence correlates to order in which they are expressed on body axis 8 homeotic genes split into two groups on chromosome 3: - antennapedia complex - bithorax complex Collectively called Hom-C complex in drosophila Conserved but not universal
282
Are homeotic genes homeodomain transcription factors?
Yes Homeotic genes contain 180bp homeobox sequence Encodes 60aa homeodomain DNA (and RNA e.g. BICOID) biding domain Regulate genes that specify tissue/organ primordia Not all homeobox containing genes are homeotic genes Not all homeotic genes contain a homeobox
283
What are hox genes?
Mouse and human hox genes Homologous to drosophila Hom-C Share similar order Expression is anterior-posterior 4 hox clusters in mice and animals - A, B, C, D
284
What causes Syndactyly?
Two-fused together digits Autosomal dominant of Hox D13
285
Are most homeotic mutations dominant or recessive?
Recessive
286
Are homeotic mutations found in plants?
Yes! apetala1 - lacks sepals and petals pisillata - has only sepals and petals agamous - consists of sepals and petals
287