HGA Studies Flashcards
(41 cards)
What are 4 categories of genetic variation?
- SNP - Single nucleotide polymorphism
- InDel - Insertion/Deletion
- SSR - Simple sequence repeat
- CNV/CNP - Copy number variation/polymorphisms
What are SSR’s?
Simple sequence repeats
- Microsatellites= small indels (micro – small; satellites = repeated sequence)
- Mutate faster than SNP’s – generally selectively neutral (Don’t change phenotype)
- Highly polymorphic because of their potential for faulty replication. Used in forensics, because is unique.
- Small base pairs, dinucleotides MAYBE trinucleotides.
- Means its unique
- SSR mutations can cause disease: Huntington’s disease (HD) caused by triplet expansion SSR in coding region.
o Normal Huntington <34 CAG repeats
o Diseased Huntington >42 CAG repeats
o Proteins folding which is related to function. So if it is much longer then the function will be affected
o Dominant autosomal disease.
What is an easy assay to find SSRs?
o Isolate buccal cells
o Amplify by defining region using 2 PCR primers on either end
o Electrophoresis to see length to see how many repeats there are (How long
What is a SNP?
Single Nucleotide Polymorphism
- Locations within the genome where nucleotide substitutions are present
- At least 1% of a population must have the alternative nucleotide variant to be considered a SNP
- Percentage of SNPs in a pop depends on the population (how old it is, how much migration has happened etc.)
- More SNP’s present in non-coding DNA’s & introns; often hit 3rd codon and don’t change function
- Give an idea of population structure and migration patterns because of the accumulation of mutations etc.
- Can be homozygous or heterozygous in an allele.
o Homo T would have +strand T on both chromosomes (we diploid) etc.
What are 2 molecular methods for SNP genotyping?
- RFLP where a SNP changes restriction enzyme site
a. Only looks at 1 SNP at a time, basically a waste of time - DNA microarrays where you can detect SNPs alleles at >1 000 000 loci
a. Make a tagged probe that is complementary to the sequence of interest.
b. One strand for each allelic version with different tags
c. Use that in the genome of the person, and 1 of the alleles will light up telling you which allelic version of the SNP the person has
d. Microarray is a microscope slide that is printed with specific DNA.
What is a CNV?
Copy Number Variant
- More than 1 copy of a variant, usually very large. Larger chunk of sequence repeated.
- Dose effect is very important (That’s why females turn off an X chromosome) – like microsatellite but piece of DNA repeated is MUCH (Megabases) bigger
- Very long deletions or duplication in the genome are associated with 30% increased risk of psychiatric disorder
- Deletion in some specific genomic regions are directly associated with autism, schizophrenia or mental retardation
- > 99% of CNVs are inherited & not derived from new mutation
- Chromosomal microarray using probes to find CNV’s
o (and therefore intensity of color) is linked to how many copies there are - Is associated with phenotype, but not always bad:
What is Complex inheritance?
- Most human traits do NOT have simple single gene inheritance patterns
What is Incomplete Penetrance?
Diseased genotype can occur in people who do not express the disease phenotype
What is Genetic Heterogeneity?
Different disease genotypes are responsible for the same disease in different families
o i.e. 2 families can have the exact same disease & same disease phenotype, but can have different disease genotypes. i.e. different reasons for the disease
o Different mutations in the same (large) gene
o Or mutations in different genes (if multiple genes are responsible for the disease)
What is polygenic determination?
Mutant alleles at more than 1 locus influence expression of the disease in a single person.
What is complete penetrance?
The genotype ‘penetrates’ and is expressed in phenotype.
Complex Inheritance in Breast Cancer
- Autosomal dominant inheritance of 2 unlinked disease loci predispose women to breast cancer
- Runs in families but can also mutate de novo.
- Polygenetic determination is at play (at least 2 genes associated)
- Mutated BRCA1 & BRCA2 genes cause cancer of women reproductive organs
o However, those genes aren’t mutated in all women with breast cancer:
• Aren’t the only genes responsible
• BRCA1&2 are genes with incomplete penetrance
o Only 66% of women with a mutant BRCA1 allele will develop cancer - 1/10 chance of any woman developing breast cancer (Normal BRCA)
- 4/10 chance with mutated BRCA2
- 6/10 chance with mutated BRCA1
- Child bearing and breastfeeding reduces risk of breast cancer because of the break it provides from hormonal cycling
What are the functions of the BRCA genes?
- BRCA1 & BRCA2 are in same DNA repair pathway
o Repair double strand DNA breaks
o Promote homologous recombination - They do not share any protein structure
- Both are tumor suppressors
o Involved in DNA repair and check point regulation
o Loss of function promotes cancer - Repair pathways contain other gene
What are common symptoms of Schizophrenia?
- May have difficulty distinguishing between reality & imaginary world
- May be unresponsive or withdrawn; have difficulty expressing normal emotions in social situations
- Majority of people with schizophrenia are not violent, nor do they pose a danger to others
- Symptoms are NOT identical for each person
What are some causes of SZ?
- Schizophrenia is NOT caused by: childhood experiences, poor parenting or lack of willpower
- Has a large heritable component where genetic makeup plays an important role
o Studied by looking at twins because they have identical DNA
o Has heritability score of ±80%; so if you have it your kids most probably will - Neurotransmitters may play a role in phenotype
o Worked this out by giving patients medication/neuroblockers and observed improvement - There is strong genetic evidence for schizophrenia (although we not sure what it is)
o Known CNV’s are implicated as risk factors in 2-3% of cases - There is evidence for association between schizophrenia and CNV’s
o Schizophrenia associated CNV’s also associated with autism, mental retardation, ADHD & epilepsy
o Schizophrenia-associated CNVs span multiple genes
o Exact mechanisms of pathogenesis is limited but some hints are emerging:
• E.g. deletions of NRXN1 code for presynaptic neuronal cell adhesion molecule neurexin which implies abnormal synaptic function
What were the aims of the study by Kirov, et al?
- Compare case (SZ) and controls in order to:
- Identify novel CNVs that increase risk of SZ
- Compare novel (de novo) CNVs vs inherited CNVs
- Compare SZ-linked CNV’s with CNV’s in other disorders
- Understand pathophysiology of SZ (mechanisms)
Why are multiple cohorts important in a GWAS?
- It is NB to compare data in the context of other data (populations etc.)
- Using so many data sets strengthens reliability of results. P vals become smaller & more reliable with increased data sets
What cohorts were used in the Kirov, et al paper?
- Bulgaria: Bulgarian parent-proband trios
o Used to determine if the CNV’s identified were de novo or inherited.
o Proband = person or animal being studied
o All cases had been hospitalized & met DSM-IV guidelines - Icelandic control: 2623 complete parent-offspring trios without the disease were the control
o To ensure their findings weren’t isolated to their population in Bulgaria
o Government sequenced most of their population, so they are the gold standard control - Autism case and control:
o Data from recent large study based upon a high-density array used
o To see if there were CNV’s that were shared between SZ population & those with autism – as well as with Icelandic control & non-autistic siblings that were used in both cases - Case-control data SZ sets:
o Used 4 large publicly available data sets
o Cases had SZ controls did not
o Important to compare results in context of their data (unique to Bulgaria?) REPLICATION
Describe a workflow from obtaining CNV’s associated with SZ to biological pathways associated with CNV’s
- Microarray analysis of DNA extracted from blood samples of cases vs controls
- Quality Control: Statistical analysis to check there were no statistical differences in the running of each experiment
- Did stats on >600 CNV’s to identify CNV’s associated with SZ
o Also recorded level of education each case had - confirmed that all cases were SZ singe it isn’t associated with decreased mental ability - Mapped variants to genes – then performed a Gene Ontology analysis which defines gene functions and how these functions are related
- Ran enriched pathway analysis:
a. Looked at which genes were enriched in cases relative to the control (Ones that weren’t enriched in controls) - Find pathways associated with SZ
a. Finds the pathways of the genes from 5. which are enriched.
What is a systems biology approach?
- Clinical work
- Genomic microarrays
- Stats
- Bioinformatic work in gene enrichment sets
What were the findings of the Kirov et al paper?
- Rate of de novo CNVs in cases (5.1%) > than controls (2.2%) – suggests at least ½ all discovered CNVs are associated with SZ
- Found 34 de novo CNV’s
o Didn’t show that older fathers correlated to SZ in offspring - 8 (of 34) CNVs were at 4 known SZ loci and some were known to be pathogenic in other disorders
- Some CNVs were in genes which are components of the postsynaptic density
- 2 CNVs affected genes known to directly regulate DLG (family of membrane-associated guanylate kinases) family members.
o Implies some epigenetic implications
What is the difference between a GWAS and a linkage study?
o Linkage: • Pedigree based • Everyone’s relatedness is known • Small number of individuals • Less complex diseases
o GWAS:
• NOT pedigree based.
• Array-based study
• Very large numbers of unrelated people are examined (case vs controls groups).
What is an example of a simple GWAS pathway?
- Collect samples – cases & controls
- Microchip array
- Produces raw data
- Genotype calling to produce genotype data
- Identify chromosomal markers – map geographical loci etc.
- Interpretation
What is a haplotype?
A haplotype is a unique set of common genetic markers (In this case SNPs) which are on a single chromatid and are inherited together. They have a high statistical association