Disease Gene Mapping Flashcards

1
Q

What has made gene mapping in humans so difficult?

A

Inability to perform desired crosses and small number of progeny. This means research is restricted to analysing pedigrees with limited information.

Small offspring numbers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What gene is found to be linked to nail-patella syndrome?

A

The ABO blood type gene

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How can DNA markers be used?

A

They can help locate disease or other gene by linkage and physical location.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the advantages of DNA markers?

A

DNA markers are rapid (PCR and sequencing)

Microsatellite markers are multiallelic, SNVs are biallelic

No need to wait for phenotype expression.

Phenotypes are often hard to distinguish due to not being discrete charaters.

DNA markers allow distinction between homozygous and heterozygous.

DNA markers must be well spread out on all chromosomes to ensure that there are no gapsin chromosomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why is it important to know about variations?

A

Underlying phenotypic differences can be more well understood

They cause inherited diseases.

They allow tracking of ancestral human history

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the goal of a genetic study?

A

Identification of genetic risk factors causing human complex disease (Type II diabetes, hypertension, asthma)

Common complex disease and rare single gene (Mendelian) disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What evidence suggests that a disease or trait is genetic?

A

Twin studies (compare monozygotic to dizygotic twins)

Families segregation (Increased risk for disease among family members of an affected individual so frequency in first degree relatives is compared to frequency of disease in general population)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How are genes that are involved in complex disease phenotypes identified?

A

Functional cloning: Disease -> Function -> Gene -> Map

Positional cloning: Disease -> Map -> Gene -> Function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How are genes contributing to disease approached?

A

Candidates gene approach: Candidate genes are located in chromosome region suspected of being involved in the expression of disease traits. Can be identified by association and linkage with phenotypes.

Whole genome screen approach: Scanning whole genome without prior information, can discover potential genes playing roles for diseases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the advantages and disadvantages of using positional cloning to identify a gene?

A

Strengths:

No need to know the gene products required

Very strong track record in single-gene disorders.

Weaknesses:

Understanding of function not a certain outcome

Poor track record with multifactorial effects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can gene mapping be used to identify disease phenotypes?

A

Marker genes located near the disease genotype are identified allowing the identification of traits that are linked to the genotype of the disease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How are genes identified that contribute to disease?

A

Linkage mapping (Measures segregation of alleles and a phenotype within a family. Use crossover during meiosis II. (Genes that are physically close together are more likely to be inherited together)

Linkage disequilibrium (Evaluate evidence of correlation between marker allele and a disease risk allele)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What important gene was discovered by positional cloning?

A

1st gene responsible for human genetic disease discovered by positional cloning was cystic fibrosis. Geneticists attempted to identify the gene for CF and clones were isolated after linkage analysis found a region narrow enough.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How were clones from the cystic fibrosis region identified?

A

Via chromosome walking and jumping.

Analysis of DNA sequences within the clones revealed 4 candidate genes

Additional studies eliminated 3 of the candidate genes

DNA sequencing revealed presence of a 3-bp deletion in the gene of the CF patient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the recombination fraction?

A

The recombination fraction is the probability that in any meiosis there will be a recombination between them.

If θ=0.5 there is mendelian segregation and thus no linkage so recombination

If θ-0.0 there is very tight linkage and thus no recombination

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is required if there is a higher recombination fraction?

A

More meioses are needed to obtain evidence that 2 genes are linked. (eg. 0.40 requires 343 meioses minimum.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How is linkage mapping conducted?

A

Collect families with affected individuals

Genome scan - test markers evenly spaces across the entire genome.

Lod score (log of the odds) - What are the odds of observing the family marker data if the marker is linked to the disease (less recombination than expected) compared to if the marker is not linked to the disease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the problems with mapping in humans? How is this solved?

A

Can’t do controlled crosses

Crosses equivalent to tet-cross are extremely rare

Humans produce a very small number of progeny.

Solution - Combine results of many identical matings.(Lod score)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How is the Lod score calculated?

A

2 probabilities calculated:

  • First one assumes independent assortment
  • Second one assumes a specific degree of linkage

Ratio (odds) of probabilities is calculated

Log of ratio is the Lod value

Repeat calculation for range of different degrees of linkage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What Lod score indicates significant support for a specific RF value?

A

> 3

3 represents an RF value that is 10^3 times as likely as there being no linkage.

2

21
Q

What is multipoint linkage mapping?

A

Using 1000s of markers, genetic maps have been constructed across the whole genome.

Multipoint mapping uses several markers at once to localise a disease gene relative to the other markers in the map.

More efficient process than using 1 markers at a time.

22
Q

What have the established successes and failures of linkage mapping been thus far?

A

It has been widely used to identify chromosomal regions linked to diabetes, breast cancer, alzheimer’s disease, and bipolar disorder.

Not every finding that was deemed significantly linked has been replicated convincingly.

23
Q

What are the important study design categories for gene linkage mapping?

A

Linkage analysis analysis looks through families for co-segregation of disease and gene variants

Association studies detect association between genetic variants and disease across families: exploits linkage disequilibrium. (more appropriate for complex disease)

24
Q

What kind of studies are association studies?

A

Case-control studies

Cohort designs

Parents - affected child trios (TDT= Transmission disequilibrium testing)

25
Q

What do case-control studies do to identify markers for disease?

A

Collect affected and unaffected subjects. Then they compare frequency of genetic components between the 2 groups.

26
Q

What do TDTs do to identify disease?

A

Collect affected children and their parents.

Compare the distribution of the transmitted allele to the distribution of the non-transmitted allele from parents.

27
Q

What are the advantages and disadvantages to using population based case-control studies?

A

Advantages: Quite powerful to detect small genotypic effects, even in modest samples of case and controls.

Easy to collect the cases and controls for general population samples

Disadvantage: Population stratification. If there are underlying differences in the cases and controls that are unrelated to disease risk false positive results are more likely.

28
Q

What are the advantages and disadvantages of family-based association studies?

A

Advantage: Resistant to potential bias from population stratification

Disadvantage: Requires at least one parent to be heterozygous at the marker
being tested, therefore power of this approach is significantly lower

29
Q

What assumption is there when using association studies to identify disease causing mutations?

A

That the mutation has the same ancestral origin and over time it has gotten smaller until very short segments are conserved around the mutated locus.

30
Q

What is linkage disequilibrium?

A

Nonrandom association of alleles within a population.

Alleles at neighbouring loci tend to cosegregate.

31
Q

What are haplotypes?

A

polymorphisms within a single gene that form various combinations. (If there are 5 polymorphisms and alleles are equally frequent and independent then 2^5 kinds of haplotypes are produced

32
Q

Why are there less combinations of haplotypes produced than expected?

A

Because alleles are not independently assorted.

33
Q

What is linkage disequilibrium?

A

The occurrence in members of a population of combinations of linked genes in non-random proportions.

34
Q

How can SNP haplotypes be used for mapping?

A

A haplotype is a chromosomal region defined by a specific array of marker alleles it carries

Haplotypes are inherited through generations as blocks which eventually break up due to recombination.

35
Q

What are the issue with using association studies?

A

Spurious association is caused by 2 factors in population stratification:

A difference in proportion of individuals from 2 or more subpopulations in case and controls (confounding effect of differences between people of different subpopulations)

Subpopulations have different allele frequencies at the locus.

36
Q

How has the problem of population stratification been solved?

A

Using family based controls (eg. TDT - a type of family association study looking at proportion of children getting the disease from heterozygous parents)

Genetic control: Extra genotyping looking for evidence of background population substructure and account for it

37
Q

How are analyses different between linkage and association analyses?

A

Linkage analyses look for relationship between a marker and disease within a family

Association analyses look for relationship between a marker and disease between families

38
Q

How is localisation resolution different between linkage and association analysis?

A

Due to low number of generations involved in linkage analysis we can’t get to very specific variants. However, we can work with relatively few markers due to being in the same family.

Association analysis gives a more fine-scaled resolution due to marker genes being present in a huge number of people who are distantly related. However, due to how numerous the related markers are, they can be tedious (and expensive).

39
Q

What are genome wide association studies?

A

GWASs typically focus on associations between single-nucleotide polymorphisms (SNPs) and traits like major human diseases.

40
Q

What happens to shared segments with increasing number of generations between them and their common ancestor?

A

The more distant a common ancestor the smaller each shared segment is but number of people sharing the segment will be greater.

41
Q

How are genomewide association studies carried out?

A

Extract DNA - Genotype

Calculate which of 300 - 500k SNPs and/or haplotypes are more frequent in cases than controls.

Use whole genome chips

42
Q

What is a SNP chip?

A

Extension of microarray technology allowed automated genotyping of huge numbers of SNPs across the genome

43
Q

What assumption is made in Genome Wide Association studies?

A

Common complex diseases are caused by common variants (Now genomes can just be sequenced for this information)

44
Q

What are the pros and cons of using microsatellites as genetic markers?

A

Pros:

Highly multi-allelic/High hereozygosity/informative

Cons:

Much fewer in numbers and have complex genotyping assays

45
Q

What are the pros and cons of using SNPs as genetic markers?

A

Pros:

Very frequent

Simplified genotyping platforms

Cons:

Biallelic and not very informative.

46
Q

What do quantile-quantile plots tell us?

A

2 types of observed test statistics generated in GWAS

In case control studies a chi-squared comparison of absolute genotype counts is calculated for each variant

47
Q

Why haven’t GWS been as successful as had been hoped?

A

Proposed hypothesis:

Different combinations of variants at multiple loci aggregate in specific individuals to increase disease risk.

Common variants are expected to be of ancient origin, they are more susceptibility factors and so have typically weak deleterious effects (i.e mild missense mutations or changes in gene expression)

Even cumulative contributions of identified variants are small

48
Q

What is missing heritability and what is the explanation?

A

Small proportion of genetic variance in complex diseases is explained by available GWAS data this is missing heritability. This can be explained by:

Large numbers of common variants having weak effects (People chosen for studies having the condition are the people with the common variants)

Rare variants have large effect

Gene-gene and gene-environmental interactions.