linkage analysis Flashcards

1
Q

Define genetic variations.

A
  • differences in the DNA sequence between individuals in a population.
  • variation can be inherited or due to environmental factors.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What 4 different effects of genetic variation can have?

A
  1. alteration of the amino acid sequence.
  2. changes in gene regulation (gene expression)
  3. physical appearance of an individual (eg. eye colour, genetic disease risk)
  4. silent or no apparent effect.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why is genetic variation important?

A
  1. genetic variation underlies phenotypic differences among different individuals.
  2. genetic variation determines our predisposition to complex diseases and responses to drugs and environmental factors.
  3. genetic variation reveals clues of ancesteral human migration history.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are mechanisms of genetic variation?

A
  1. mutation/polymorphism
    - > germ line mutations : passed on to descents
    - > somatic mutations : not transmitted to descendants
    - > de novo mutations : new mutations not inherited by either parent.
  2. homologous recombination: shuffling of chromosomal segments between partner (homologous) chromosomes of a pair.
  3. gene flow: movement of genes from one population to another (eg. migration)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

compare and contrast rare variant (mutation) and common variant (polymorphism).

A
  • mutation is a rare change in DNA sequence that is different to the ref sequence (normal).
  • polymorphism is a DNA sequence variant that is common in the population. In this case no single allele is regarded as ‘normal’ allele, there are 2 or more equally acceptable alternatives.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do you determine if variant is mutation or polymorphism?

A
  • the arbitrary cut -off point between a mutation and a polymorphism is a minor allele frequency (MAF) of 1%
  • for a variant to be considered polymorphism the least common (minor) allele must be present in >/= 1% of the population.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is meiosis and recombination?

A

meiosis = creation of haploid gamete from sperm and egg.
=> Genetic recombination (stage I)
- homologous (maternal and paternal) chromosomes line up at the centre of nucleus and crossing over of arms of the chromosomes occurs, exchange in genetic material = introducing variation.
=> RANDOM recombination.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is crossing over?

A
  • reciprocal breaking and re-joining of the homologous chromosomes during meiosis.
  • results in exchange of chromosome segments and new allele combinations.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define a genotype.

A

genetic make up of an individual.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define phenotype.

A

physical expression of the genetic makeup.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define alleles.

A
  • alternative versions called alleles.

- an organism inherits 2 alleles one from each parents the alleles can be same ( homozygous) or different (heterozygous)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define haplotype.

A

a group of alleles that are inherited together from a single parent (haplo = single)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is homozygosity and heterozygosity?

A
homo = chromosomes with the same allele of gene on maternal and paternal chromosome at the same loci 
hetro= different allele of gene on maternal and paternal chromosome.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When can linkage analysis be applied?

A
  • mendelian/monogenic : disease caused by a single gene , with little or no impact from the environment (eg. PKD)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When can linkage analysis not be applied?

A
  • non-mendelian/polygenic disease or traits caused by the impact of many different genes, each having only a small individual impact on the final condition (e.g. psoriasis)
  • multifactorial : diseases resulting from interaction between multiple genes and often multiple environmental factors (eg. heart disease)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is linkage analysis?

A

method used to map the location of a disease gene in the genome.
‘linkage’ refers to the assumption of two things being physically linked to each other.

17
Q

What do we use genetic markers for?

A
  • to identify genetic ‘markers’ to identify the location of a disease gene based on physical proximity.
18
Q

What is the importance of genetic mapping?

A

like we know which way is north and south on a map we can work out what genes are proximal and distal in a chromosome by using genetic mapping to establish the locations of genes on the chromosomes.

19
Q

What is the importance of genetic mapping?

A

like we know which way is north and south on a map we can work out what genes are proximal and distal in a chromosome by using genetic mapping to establish the locations of genes on the chromosomes.

=> uses an observed locus (genetic marker) to draw inferences about an unobserved locus (disease gene)

20
Q

What are 2 types of maps?

A
  1. genetic maps = look at information in blocks like zones in a tube map
    => pre - 2001 , =>centimorgans = 1% chance of recombination.
    => because we had no genome sequence to refer to these types of maps used recombinant events to determine distance along the chromosome : further away = more likely crossing over.
  2. physical maps = look at information on the physical distances between landmarks (e.g. stations on a tube map)
    => post 2001
    => 1 megabase = 1 million base-pairs
21
Q

What are principles of genetic linkage?

A
  • genetic linkage is the tendency for alleles at neighbouring loci to segregate together at meiosis
  • cross -overs are more likely to occur between loci separated by some distance rather than between loci close together on the chromosome
  • therefore to be linked, two loci must lie very closely together.
22
Q

How does distance from gene marker and recombination link?

A

scenario 1: disease gene is a long distance away from a genetic marker => independent assortment => high likelihood of recombination

scenario 2: disease gene is close to gene marker => non-independent assortment => large proportion of non-recombinants expected (ie, greater likelihood of co-segregation of marker with the gene)

23
Q

What are some methods of genetic linkage?

A
  • genotype multiple genetic markers across the genome
  • genotype multiple family members from families with the genetic trait
  • identify which genetic markers co-segregate with the disease (phenotype) (ie which haplotypes are the same in all affected family members)
  • these genetic markers are therefore ‘linked’ to the disease gene is likely to be located.
24
Q

What micro-satellite markers?

A
  • less common now. Highly polymorphic short tandem repeats of 2 to 6 bp
  • micro-satellites may differ may differ in length between chromosome (heterozygous)
  • are relatively widely spaced apart.
=> 400 (200) microsatellite markers
=> average spacing 9cM (20cM)
=>PCR - based system
=> fluorescently -labelled primers
=>manual assignment of genotypes 
=>labour intensive
=>whole genome scan > 2-3 months
25
Q

What single nucleotide polymorphism?

A
  • now the genetic marker of choice. biallelic (a SNP will be one of two possible bases)
  • lower heterozygosity than micro satellites, but spaced much closer together
  • more informative
=>6,000 SNPs 
=> spaced throughout the genome 
=> micro-array based automatically 
=> highly automated 
=> data returned within <1-2 months
26
Q

What is microsatellite marker genotyping typically used for?

A
  • DNA fingerprinting from very small amounts of material
  • standard test uses 13 core loci making the likelihood of chance match 1 in three trillion
  • paternity testing
  • linkage analysis for disease gene identification.
27
Q

What is the mechanism of fluorescent marker genotyping?

A
  • fluorescently - tagged PCR primers
  • Allows for multiplexing of PCR products with different colours and fragment lengths
  • fragment sizes separated down to 1bp resolution
28
Q

What is SNP genotyping microarrays?

A
  • provides genome - wide coverage of SNP markers
  • SNPs are proxy markers ; NOT the casual disease variants
  • Can amplify thousands of markers in a single experiment
  • alleles are identified by relative fluroescence
    => homozygous for allele 1 = green signal
    => homozygous for allele 2 = red signal
    => heterozygous (1/2) = yellow signa
29
Q

what is SNP genotyping microarrays typically used for?

A
  • linkage analysis in families (affected vs unaffected relatives)
    => homozygosity mapping (autosomal recessive) and mapping of Mandelian traits
  • GWAS in populations (unrelated cases VS matched controls)
    => non- Mendelian disorders and multifactorial traits
30
Q

outline how linkage mapping using genetic markers is done?

A
  • uses an observed locus(genetic marker) to draw inferences about an unobserved locus (disease gene)
  • if a marker is linked to a disease locus, the same marker alleles will be inherited by two affected relatives more often than expected by chance.
  • if the marker and the disease locus are unlinked, the affected relatives in a family are less likely to inherit the same marker alleles.
31
Q

how do you build a haplotype trees?

A
  • determine which allele is inherited from which parent
  • paternal haplotype drawn on the left
  • maternal haplotype drawn on the right
32
Q

how do we use statistical analysis of linkage to identify disease genes.

A

=> linkage software
=> probability of linkage can be asses using LOD score
=> assesses the probability of obtaining test data if the two loci are linked, to the likelihood of observing same data purely by chance
=> i.e calculates a likelihood ratio of observed vs expected
=> higher the LOD score, the higher the likelihood of linkage.

33
Q

What do the LOD tell?

A
  • a LOD score >= 3is considered evidence for linkage
    => equivalent to odds of 1000: 1 observed linkage happened by chance
    =>translates to a p value of 0.05
  • a LOD score of <= -2 is considered evidence against linkage.
34
Q

What is an example of an autosomal dominant disease gene and what are associated features?

A

Adams-Oliver syndrome
=> terminal transverse limb defects (TTLD)
=> scalp aplasia cutis congenita (ACC)

associated features:

  • neurological anomalies
  • cardiac malformations
  • vascular defects (eg. cutis marmorata telangiectatica congenita, dilated veins)
35
Q

What are stages of variant validation by exome sequencing?

A
  1. variant profile
    - affected individual (SNVs, CVNs, short indels)
2.Filtering strategy 
 zygosity 
exclude known/common 
variants 
- dbSNP
-1000 genomes
- control variant profiles

Linkage

  • prior linkage scans
  • variant profiles of relatives

Predicted functionality
nonsense/missense/splice/silent

prior biological knowledge:
- candidate genes and pathway

  1. filtering parameters
    zygosity = heterozygous
    excludes known/ common variants = alleles frequency < 0.01 in control population
    linkage = chr 3q13 peak (115-128Mb)
    predicted functionality = nonsense > splice > missense
    biological knowledge = candidate genes and pathways (disease causing variants in ARHGAP31 were identified through a candidate gene analysis of genes in the minimal linkage interval
  2. causative variant