Week 4.7.8: Genetic trait associations Flashcards

Question

**Different human populations have different patterns of linkage disequilibrium –** and this partly depends on there history and so the longer it is back to a common ancestor the more linkage disequilibrium you will find in a population

Answer 1

**Here is part of chromosome 7** around a gene that is involved in metabolic risk complication of obesity genes (MRC-OB) project cohort from Northen europe Each line is a SNP marker in that chromosome – the chart shows how associated the SNPs are – if it is RED it means the two SNPS are highly associated – (they are in linkage disequilibrium with each other) – whereas if it is in white that means there is NO linkage disequilibrium (thus in equilibrium) Imagine diagonal lines going up from each SNP, we can see that the big block is often found as one block – recombination doesn’t happen often within that block and if we look within that block it seems that recombination hardly every happens within that block and s what ever SNP apply is there if there are two variable SNPs they will always vary the same way – Within the block you only need to know what allele is present in one of these particular bases to be able to know what is present in all the blocks – given your knowledge of the variation in the population – because they are all closely linked and we call those haplotypes – a little block where all the variation is inherited together is known as a haplotype block – we can see that along chromosome 7 there are a few haplotype blocks We can infer the identify of ALL SNP alleles given knowledge given one of them, that is a process called imputation If you know one allele and you use that to infer what alleles are present at other loci that is called imputation

Answer 2

Collection based on people in Utah from people with North European heritage. To some extent the linkage plots are smaller in the MRCOB cohort smaller than is often found Tends to be broken up into 4 sub block in the Utah population Even within European population you can see slight differences that is even more the case when we look at the rest of Africa Bone-mass you can see in Europe two big blocks that are found in linakge disequilibrium where as in Africa they are smaller – Higher linage disequilibrim in Europe than in Africa

Answer 3

associated 3.2 billion alleles We are trying to identify one SNP per block at least, then try to associate different sections of the genome with different traits – each column is a different case – half are cases (trait) – half controls (no trait) Looking at 8 blocks – we want to know to what extent these different loci are present in these cases – the number 4 is always blank squares but controls are diamonds filled Whereas the one at the bottom is pretty much the same – one allele very slightly different but probably not the particular phenotype we are looing at GWAS looks at thousands of loci scattered across the genome, at least one per halpotyde block, and asks is there a particular type of allele associated

Answer 4

Common genetic variants on 5p14.1 associate with autism spectrum disorders, A lot of maths has gone into Manhattan plot to get the values, chromosomes 5, shows higher probability of being associated with autism – so we zoom in on chromosome 5

Answer 5

**GWAS significance** Null hypothesis · There is no difference between cases and controls · There is no relationship between a genetic variable and a quantitative trait Statistical significance: P-value Y axis is showing significance, not strength of effect Threshold must be set high due to multiple hypothesis testing Loci that just cross the significance threshold may have a stronger effect than loci that cross it comfortably, its not the highest ones it’s the ones with strongest effect Strength of effect in GWAS is normally given with an **odds ratio**

Answer 6

**Odds Ratio is calculated once we know a trait is significantly associated with a locus** **Odds** The **odds** is the **ratio** of the **probability** that the event of interest **occurs** to the **probability** that it **does** **not**. **The odds that a single throw of a die will produce a six are 1 to 5, or 1/5 = 0.2** The probability of a 6 is 1/6 = 0.166666667 The probability of a not 6 is 5/6 **(1/6)/(5/6)=0.2** **Odds ratio (OR) A ratio of two ratios** **OR= Odds** of having the trait given you have the trait associated allele / of having the trait given you have the trait associated allele

Answer 7

The odds of getting a disease given you have a T allele are 1 in 3 The odds of getting the disease if you have an A allele is 1 in 9 **Odds ratio is (1/3)/(1/9) = 3 (three times as likely to get the disease, relatively speaking)** **Odds Ratio (OR)** The odds ratio is an indicator of the strength of the relationship between a genetic variant and a trait OR = 1 It doesn’t make a difference which allele you have OR \> 1 You are more likely to have the trait if you have the allele OR \< 1 You are less likely to have the trait, but statistic is not directly interpretable

Answer 8

Diease A is highly heritable, B equally, C not so much D hardly at all Black box is the environmental content, that explains the environmental and the SNPs explain the genetic influence but the ? is unknown it is “missing heritability” It is found within almost every genus – many reasons why that can be found

Answer 9

Epistatic interactions among loci, if you change just one locus it can have effects on everything else Small effect variants that is hard to detect Rare variants Gene by environment (GxE) interactions Heritability was over-estimated in the first place

Answer 10

GWAS works best if common diseases are due to common variants It can’t pick up cases where multiple recent mutations give rise to the same disease phenotype (mutation-selection hypothesis). This is likely to happen because any mutation that gives rise to disease is likely to be selected against and so natural selection should weed it out of populations It should be caught by rare variants, but should be weeded out by natural selection – most disease are rare deleterious mutation it will be hard to find if each one is giving rise to the same phenotype

Answer 11

SNP rs7612463 is associated with Type 2 diabetes in East Asian populations, it does not have this association in Caucasian populations. Could be because there is a different linkage plot – it could be that there are other genes also involved in type 2 diabetes that also differ and those mean that the locus near SNP 761243 don’t have the same effect

Answer 12

You discover you carry an allele with a significant association with a disease and a high odds ratio What is your risk of getting that disease? No generally accepted way of calculating this – pretty ad hock

Answer 13

**Assume Hardy-Weinberg equilibrium to calculate population genotype frequencies** Ø CC à 19% x 19% = 3.6% Ø CG à 2 x 19% x 81% = 30.8% Ø GG à 81% x 81% = 65.6% Relative risk of ARMD for whole population = 2.43 (odds ratio) x 0.036 + 1.53 x 0.308 + 1 x 0.656 = **1.22** **Relative risk of ARMD for a CC individual =2.43/1.22 = 1.99** If average population incidence of ARMD is 8% **Overall risk of ARMD for a CC individual** **=0.08 x 1.99 = 16%**

Answer 14

**A ratio of two probabilities** Needs accurate measures of population frequencies of genotypes in affected and unaffected samples These are often not available even when a GWAS has been done You can often learn more about your probability of getting a disease by looking at the prevalence of the disease in your population than you can learn by looking at your genotype at disease-associated loci **Multilocus risk estimation** For a polygenic trait, we cannot assess our risk just from one locus We need to combine information from many markers **Multilocus risk estimation** Need to be sure each locus had been associated with exactly the same trait Need to be sure each locus’ association was determined rigorously Need to check loci are not linked in a haplotype block Need single OR for each locus even if different GWAS studies have given different ORs

Answer 15

**Mapping Crohn’s disease** Linkage analysis with parametric LOD score method LOD: Logarithm of Odds the likelihood of obtaining the test data if two loci/markers/traits are linked, compared to the likelihood of observing the same data purely by chance

Week 4.7.8: Genetic trait associations Flashcards

(40 cards)