Lecture 3: Linkage Analysis Flashcards

(53 cards)

1
Q

Why do we care about variations?

A
  1. Allow tracking ancestral human history
  2. Underlie phenotypic differences
  3. Cause inherited diseases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

If you are interested in STUDYING A HUMAN DISEASE, HOW do
you FIND out WHICH GENE, when MUTATED, CAUSES THAT DISEASE?

A

You can find that POSITION by GENETIC MAPPING.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the EVIDENCE that a DISEASE OR TRAIT is GENETIC?

A
  1. TWIN STUDIES
  2. FAMILY SEGREGATION
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the EVIDENCE that a DISEASE OR TRAIT is GENETIC?

twin studies…

MS EXAMPLE

A

TWIN STUDIES:
COMPARE MONOZYGOTIC AND DIZYGOTIC TWINS
—- COMPARE THE CONCORDANCE RATES OF MZ AND DZ TWINS.

  1. Monozygotic Twins : Genetically identical
  2. Dizygotic Twins : Like siblings (1/2 GENOME shared)

***MS Example:
- Monozygotic Twins : Concordance rate 25-30%

  • Dizygotic Twins : Concordance rate 2-5%
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the EVIDENCE that a DISEASE OR TRAIT is GENETIC?

family segregation …

MS EXAMPLE

A

Family segregation: INCREASED RISK for DISEASE AMONG FAMILY MEMBERS OF AN AFFECTED INDIVIDUAL.

HOW?
COMPARE FREQUENCY OF DISEASE AMONG FIRST DEGREE RELATIVES OF AFFECTED INDIVIDUALS WITH THE FREQUENCY OF THE DISEASE IN THE GENERAL POPULATION.

MS Example:
- Risk to 1st-degree relatives (Parents, siblings, children): 2-5%
- General Population: 0.1-0.2%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the goal of a genetic study?

A

IDENTIFY GENETIC RISK FACTORS (genetic locations) causing human diseases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Types of Genetic (INHERTIABLE) Disease:

A
  1. SINGLE GENE (MENDELIAN) DISORDERS
  2. COMPLEX DISEASES/COMPLEX GENETIC DISORDERS
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain single gene mendelian disorders: 3

A
  1. rare
  2. gene segregation within pedigrees (families)
  3. obvious they are genetic
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

explain complex disease/complex genetic disorders: 4

A
  1. may be NO RECOGNISABLE PATTERN OF INHERITANCE
  2. likely due to ACTION OF MULTIPLE GENES
  3. GENES may be INTERACTING WITH EACH OTHER TO RESULT IN DISEASE PHENOTYPES
  4. ENVIRONMENTAL FACTORS/GENE-ENVIRONMENTAL INTERACTIONS may affect disease
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a complex disease?

A

complex disease: phenotypes, symptoms, endotypes
- disease features and nomenclature may change overtime

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are some factors which induce or affect complex diseases: 5

A
  1. Environmental drivers
  2. Epigenetics
  3. Social ‘determinants of health’
  4. Genetics
  5. other HOST factors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Explain/list Environmental Drivers: 6

A
  1. chemicals
  2. smoking
  3. sun exposure
  4. infection/infectious diseases
  5. diet
  6. alcohol
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

List/explain Genetics: 4

A
  1. molecular mechanisms of disease
  2. susceptibility (risk) or modifier gene/variants
  3. genomic and somatic variants
  4. SNPs, CNV, INdels, mode of inheritance, monogenic or polygenic
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Host factors : 9

A
  1. time
  2. age
  3. microbiome
  4. obesity
  5. nutrition
  6. inflammation
  7. immune system
  8. stage of development at exposure
  9. immune response
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Epigenetics: 6

A
  1. methylation
  2. miRNA
  3. retroviruses
  4. regulatory events
  5. gene rearrangements
  6. gene expression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

social determinants of health: 5

A
  1. social setting
  2. communication barriers
  3. access to healthcare
  4. cultural and or economic status
  5. adverse childhood experiences
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Explain the MONOGENIC DISORDER PATHWAY (SLIDE 10): 4

A
  1. Gene; MUTATION present
  2. Inheritance pattern (in family members) – DOMINANT OR RECESSIVE
  3. IMPACT of MUTATIONS in a SINGLE GENE on a DISEASE PHENOTYPE
    ( genetic risk in population… in percentage)
  4. Genetic risk in different families (in percent)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Explain the COMPLEX DISORDER PATHWAY (SLIDE 10): 4

A
  1. GENE VARIATIONS
    eg. gene A, gene B, gene C, gene D.
  2. Inheritance pattern is COMPLEX
  3. IMPACT OF VARIATIONS IN DIFFERENT GENES ON DISEASE PHENOTYPE
    - Genetic risk in population (epidemiological evidence..in percent)
  4. Affecting families (in percent).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are the 3 GENE IDENTIFICATION APPROACHES

A
  1. POSITIONAL CLONING
  2. CANDIDATE GENE APPROACH
  3. WHOLE GENOME SCREENING
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

GENE IDENTIFCATION APPROACH: POSITIONAL CLONING

WHAT?
EXAMPLE?
HOW?

A
  1. POSITIONAL CLONING (REVERSE GENETICS)
  2. DISEASE —> (not) FUNCTION <— GENE <— MAP

EXAMPLE: Cystic Fibrosis (CFTR)

HOW: identification of a gene BASED SOLELY on its POSITION IN THE GENOME

MOST WIDESPREAD STRATEGY IN HUMAN GENETICS IN THE LAST 20-35 YEARS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

STRENGTHS AND WEAKNESSES OF POSITIONAL CLONING: 4

A

STRENGTH:
1. No knowledge of gene product required

  1. very strong track record in single-gene disorders

WEAKNESSES
3. Understanding of FUNCTION not a CERTAIN OUTCOME

  1. POOR TRACK RECORD WITH MULTIFACTORIAL TRAITS
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Gene Identification Approaches

WHAT IS “CANDIDATE GENE APPROACH”: 3

A
  1. CANDIDATE GENES are genes LOCATED in a CHROMOSOME REGION SUSPECTED OF BEING INVOLVED IN THE EXPRESSION OF DISEASE TRAITS.

2.Limit to the known biological functions of a particular disease.

  1. Can be IDENTIFIED BY ASSOCIATION AAND LINKAGE WITH PHENOTYPES
23
Q

Gene Identification Approaches

WHAT IS “WHOLE GENOME SCREEN APPROACH”: 4

LINKAGE VS ASSOCIATION

A
  1. SCAN the WHOLE GENOME WITHOUT ANY PRIOR INFORMATION
  2. CAN DISCOVER POTENTIAL GENES PLAYING ROLES IN DISEASES
  3. LINKAGE: 6 or 10k SNPs enough
  4. ASSOCIATION: 500K OR 1 MILLION SNPSs are recently available
24
Q

SUMMARY OF GENE IDENTIFICATION APPROACHES

FOR “POSITIONAL CLONING”
- STARTING POINT
- KEY METHOD
- USE CASE

A

1.STARTING POINT:
genetic marker/location

  1. KEY METHOD:
    linkage analysis, chromosomal mapping
  2. USE CASE:
    identifying genes linked to specific chromosomal regions
25
SUMMARY OF GENE IDENTIFICATION APPROACHES FOR "CANDIDATE GENE APPROACH" - STARTING POINT - KEY METHOD - USE CASE
1. STARTING POINT suspected gene based on function 2. KEY METHOD genetic association studies 3. USE CASE investigation specific genes based on prior knowledge
26
SUMMARY OF GENE IDENTIFICATION APPROACHES FOR "WHOLE GENOME SCREENING" - STARTING POINT - KEY METHOD - USE CASE
1. STARTING POINT Entire genome 2. KEY METHOD GWAS, whole genome sequencing 3. USE CASE unbiased discovery of genetic factors for complex traits
27
Gene Mapping technique: 3
Disease genotype <---> Disease phenotype = BIOLOGY Disease genotype <---> Marker genotype = PROXIMITY Marker Genotype <---> Marker Genotype = GENE MAPPING; LINKAGE/ASSOCIATION ANALYSIS
28
GENE MAPPING TECHNIQUE PRINCIPLE:
"People who have similar phenotypic values (ie DISEASE) should have higher chance of SHARING OF GENETIC MATERIAL near the genes that influence those traits."
29
How to identify genes contributing to disease? = 2
1. LINKAGE MAPPING 2. LINKAGE DISEQUILIBRIUM (ASSOCIATION STUDY)
30
Explain LINKAGE MAPPING: 5
1. MEASURES THE SEGREGATION OF ALLELES AND A PHENOTYPE WITHIN A FAMILY 2. USE CROSSOVER OCCURRING DURING MEIOSIS II ---> 3. Genes that are PHYSICALLY CLOSE TOGETHER ARE MORE LIKELY TO BE CO-INHERITED ---> 4. * Genes that are PHYSICALLY FAR APART ARE LESS LIKELY TO BE CO-INHERITED 5. DETECT OVER BOARD CHROMOSOMAL REGIONS ON THE GENOME
31
EXPLAIN Linkage disequilibrium (Association Study): 2
1. EVALUATE the EVIDENCE of a DIRECT CORRELATION BETWEEN A MARKER ALLELE and a DISEASE RISK ALLELE 2. SHARING of GENETIC MATERIAL: ACTUAL SHARING OF THE SAME ALLELE (LINKAGE DISEQUILIBRIUM = LD)
32
What is a GENETIC MARKER? TYPES?
A genetic marker is a POLYMORPHIC DNA SEQUENCE with a KNOWN LOCATION ON A CHROMOSOME THAT CAN BE USED TO IDENTIFY INDIVIDUALS that can be used to identify individuals TYPES: 1. SNP 2. SSR (Short sequence repeat/micro satellite)
33
SNP: No of Loci: Advantage: Disadvantage:
1. NO. OF LOCI: High, occur ~1 in every 100-300 bp 2. ADVANTAGE: ...1. Abundant in the genome ...2. Provide high-resolution mapping ...3. Low mutation rate (stable) ...4. Many technologies available 3. DISADVANTAGE Often Bi-allelic (two possible alleles) which LIMITS INFORMATION FROM ONE LOCATION
34
SSR (short sequence repeat/micro satellite): No of Loci: Advantage: Disadvantage
1. NO. OF LOCI: High, occurs ~ every 2-30kb 2. ADVANTAGE: Multi-allelic: each SSR can have multiple alleles 3. DISADVANTAGE ...1. Lower abundance ...2. more labour-intensive ...3. Higher mutation rate
35
Meiosis and Recombination: 3
1. During MEIOSIS, the CHROMOSOMES DUPLICATE, then CROSS OVER (‘rRECOMBINE) to PRODUCE A HAPLOID GAMETE (sperm/ egg) 2. The gamete DERIVES GENETIC VARIANTS FROM BOTH PARENTS 3. MEIOSIS IS THE "BASIS" FOR "HEREDITY"
36
BIVALENTS IN PROPHASE OF MEIOSIS I:
SINGLE CROSSOVER ---> BIVALENTS IN PROPHASE OF MEIOSIS I: "A1 and A2 CROSSOVER" ---> CHROMATIDS IN GAMETE: NRRN DOUBLE CROSSOVER --->BIVALENTS IN PROPHASE OF MEIOSIS I: "A1 AND A2 CROSS OVER, B1 AND B2 CROSSOVER" ---> CHROMATIDS IN GAMETE: 1. 2 STRAND: NNNN 2. 3 STRAND: RNRN 3. FOUR STRAND: RRRR NEED TO LOOK AT SLIDE 18
37
Understanding Markers and Inheritance = 3
1. Polymorphic loci whose locations are known 2. Most often SNPs or micro-satellites 3. Inherited within the Chromosomes
38
For linkage analysis, we need informative meiosis
NEED NOT A WANT UNDERSTAND SLIDE 20 PEDIGREE
39
Markers and Inheritance
NEED NOT A WANT UNDERSTAND SLIDE 21 PEDIGREE 1 And 2
40
What is LINKAGE? = 4 "HOW DO WE MEASURE IT?"
1. Only ~1 recombination per chromosome/meiosis "→ Loci that are close together on the same chromosome tend to be inherited together (‘linked’ or ‘in LD’ = linkage disequilibrium)" 2. The closer the loci, the more linkage "→ Degree of linkage is a measure of genetic distance" 3. * Linkage is measured by the recombination fraction, θ = proportion of recombinants "θ = 0 complete linkage θ = 0.5: no linkage" 4. Two loci on the same chromosome, only a cross-over event will separate them.
41
What is recombinant fraction? = 5
1. θ is a measure of genetic distance 2. Further apart two loci, the more likely a crossover event will occur (θ value will increase) 3. Centimorgan (cM) is a genetic distance unit 4. 1 cM = 1% chance of recombination 5. 1 cM approx. 1000 kb = 1 m
42
Obtaining enough family material to test multiple meiosis is difficult for rare diseases
1. Higher RF (RECOMBINATION FRACTION) between two loci, MORE MEIOSIS needed to OBTAIN EVIDENCE that THEY ARE LINKED. 2. Scoring recombinants in human pedigrees not always simple
43
Linkage mapping: is a marker “linked” to the disease gene
1. Collect families with affected individuals 2. Genome Scan - Test markers evenly spaced across the entire genome (~every 10cM, ~400 markers) 3. Lod score (“log of the odds”)
44
WHAT IS LOD SCORE?
what are the odds of observing the family marker data if the marker is linked to the disease (less recombination than expected) compared to if the marker is not linked to the disease
45
Test to estimate whether the likelihood that TWO LOCI ARE LINKED is greater than likelihood that THE TWO LOCI BEING UNLINKED
Test to estimate whether the likelihood that TWO LOCI ARE LINKED is greater than likelihood that THE TWO LOCI BEING UNLINKED Z = LOG 10
46
Z = LOG10 FORMULA
Z = LOG10 * (LIKELIHOOD OF LINKAGE ( THETA <0.5)/ LIKLIHOOD LOCI ARE UNLINKED ( THETA = 0.5))
47
linkage mapping
1. parametric method 2. non-parametric method
48
parametric method
Estimate recombination fraction between a marker locus and an unobserved trait locus Out of 4 informative meioses, 2 are recombinants => 1/2 slide 26
49
NON PARAMETRIC METHOD
Count the number of alleles two affected sibs share identical by descent (IBD). "If the marker is linked to the disease locus, the affected sibs will tend to share the disease allele more often than they would at a marker unlinked to the disease locus."
50
Statistical significance of Lod Scores:
Z>3.0 EVIDENCE OF LINKAGE (with 5% chance of error) 2.0 EXCLUSION OF LINKAGE
51
Lod Scores: 4
1. Human families only produce small numbers of children. 2. To get statistically significant evidence for linkage, combine evidence from many families 3. A complex mathematical procedure, implemented by computer software, is used to generate "Lod scores". 4. Lod score is a statistic that describes the STRENGTH OF EVIDENCE for linkage, at any chosen value of the RF, given the family data available.
52
The task of linkage analysis is to:
find markers that are linked to the hypothetical disease locus
53
The task of linkage analysis is to find markers that are linked to the hypothetical disease locus HOW? = 7
1. Determine the approximate location of disease predisposing genes 2. Linkage is caused by loci (e.g. the risk gene and a genetic marker) being close to each other on a chromosome. 3. The recombination fraction is the probability that, in any meiosis, there will be a recombination between them between two markers. .......4. θ = 0.5 Mendelian segregation – no linkage, = recombination. .......5. θ = 0.0 very tight linkage – no recombination. 6. Great success in identifying genes for simple Mendelian diseases 7. Few successes in identifying genes contributing to complex diseases