What is genetic association?
The presence of a variant allele at a higher frequency in unrelated subjects with a particular disease (cases), compared to those that do not have the disease (controls).
What is an allele?
One form of a variant in the genome
What is a locus?
A position in the genome
What is a genotype?
Both alleles at a locus, e.g. locus 1: 1,4 and locus 2: 1,1
What is a haplotype?
This is the order of alleles along a chromosome
Why are case-control studies used?
What is case-control association?
A comparison of cases versus controls to test whether a gene variant is associated with disease
Describe how the case control study works
There are two groups: cases (subjects with the disease) and controls (subjects without the disease)
Then measure the genetic loci of interest
Statistical analysis to determine which genetic loci correlate with disease
Identify genomic region associated with disease
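As a rough illustration of the statistical analysis step, the sketch below compares allele counts at one locus between cases and controls using a Pearson chi-square test on a 2x2 table. The function name and the counts are hypothetical, and the chi-square test is only one common choice; the notes do not say which test is used.

```python
import math

def allelic_chi_square(case_variant, case_other, control_variant, control_other):
    """Pearson chi-square test (1 degree of freedom) on a 2x2 table of allele counts."""
    table = [[case_variant, case_other], [control_variant, control_other]]
    total = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [table[0][j] + table[1][j] for j in range(2)]

    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            # Count expected if allele frequency were the same in both groups
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (table[i][j] - expected) ** 2 / expected

    # For 1 degree of freedom the upper-tail probability is erfc(sqrt(chi2 / 2))
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value

# Hypothetical counts: variant allele seen 300/1000 times in cases, 200/1000 in controls
chi2, p_value = allelic_chi_square(300, 700, 200, 800)
print(f"chi2 = {chi2:.2f}, p = {p_value:.2e}")
```

A small p-value here only says the variant allele frequency differs between the groups at this locus; repeating the test at every measured locus gives the per-marker p-values used later.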
What is needed in a case-control genetic study?
What is the ideal genetic marker?
What is a SNP?
How do SNPs arise?
Where are SNPs located?
In the Gene coding region:
In the Gene non-coding region:
In the intergenic region
What is the dbSNP?
It is an online database at NCBI of single nucleotide polymorphisms (SNPs) and multiple small-scale variations that include insertions/deletions, microsatellites, and non-polymorphic variants.
What is the minor allele?
It is the less common allele. Each allele has a frequency in the general population, and the frequency of the minor allele is the minor allele frequency (MAF).
What does the Minor AF + Major AF add up to?
1
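A minimal sketch of this, assuming a SNP with two alleles and genotypes recorded as pairs of alleles (the sample data and function name are made up for illustration):

```python
from collections import Counter

def allele_frequencies(genotypes):
    """Return the frequency of each allele across a list of (allele, allele) genotypes."""
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}

# Hypothetical genotypes for four subjects at one SNP
genotypes = [("A", "A"), ("A", "G"), ("G", "G"), ("A", "A")]
freqs = allele_frequencies(genotypes)

minor_allele = min(freqs, key=freqs.get)  # the less common allele
maf = freqs[minor_allele]
major_af = max(freqs.values())

print(minor_allele, maf, major_af, maf + major_af)
```

With two alleles the frequencies are complementary, so knowing the MAF is enough to recover the major allele frequency.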
What is a genome wide association study (GWAS)?
Use markers across the whole genome
What do SNP microarrays do?
How is GWAS data presented?
It is presented as a single graph called a Manhattan plot.
What is the X-axis and Y-axis in a Manhattan plot?
- X-axis is the position of each marker on the chromosome
- Y-axis is -log10(p-value)
What is a Manhattan plot?
A simple way to visualise the markers across the genome associated with the disease.
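The coordinates of such a plot can be sketched as follows, assuming each marker comes with a chromosome, a base-pair position and a p-value. The results, chromosome lengths and function name are hypothetical; a plotting library would then draw the returned points.

```python
import math

# Hypothetical association results: (chromosome, base-pair position, p-value)
results = [(1, 1000, 0.2), (1, 5000, 1e-6), (2, 2000, 0.03)]
# Hypothetical chromosome lengths in base pairs
chrom_lengths = {1: 10000, 2: 8000}

def manhattan_points(results, chrom_lengths):
    """Map each marker to (cumulative genome position, -log10(p-value))."""
    offsets = {}
    running = 0
    for chrom in sorted(chrom_lengths):
        offsets[chrom] = running  # where this chromosome starts on the x-axis
        running += chrom_lengths[chrom]
    return [(offsets[chrom] + pos, -math.log10(p)) for chrom, pos, p in results]

points = manhattan_points(results, chrom_lengths)
for x, y in points:
    print(x, round(y, 2))
```

Taking -log10 turns very small p-values into tall points, which is why strongly associated markers stand out as peaks.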
What is the WTCCC?
It is the Wellcome Trust Case Control Consortium
What do the peaks indicate in Manhattan plots?
Significant p-values of p < 5 x 10^-5
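As a small sketch, markers that rise above the significance line can be picked out by comparing -log10(p) with the -log10 of the cutoff. The default threshold is the value stated above; it is left as a parameter because cutoffs differ between studies. Marker names and p-values are invented for illustration.

```python
import math

def significant_markers(p_values, threshold=5e-5):
    """Return {marker: -log10(p)} for markers whose p-value is below the threshold."""
    cutoff = -math.log10(threshold)  # height of the significance line on the plot
    scores = {marker: -math.log10(p) for marker, p in p_values.items()}
    return {marker: score for marker, score in scores.items() if score > cutoff}

# Hypothetical per-marker p-values
p_values = {"snp_a": 1e-7, "snp_b": 1e-3, "snp_c": 4e-5}
hits = significant_markers(p_values)
print(sorted(hits))
```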
What are some misconceptions of the peaks in GWAS results?
- The peak identifies the genomic region associated with the disease