week 10 - genetic analysis Flashcards
Genetic Variation Across Species
chatgpt
Genetic variation exists in all species and includes SNPs, indels, CNVs, structural variants, and novel mutations. These variations can be common or rare, and their frequency and effects can vary by species and population.
Plants: often show high structural variation (e.g. polyploidy, TE activity)
Animals: domesticated species have artificial selection-driven variation
Humans: millions of SNPs, rare variants, and de novo mutations per individual
Understanding Complex Traits
chatgpt
Complex traits:
Controlled by many genes (polygenic)
Influenced by environment and gene-environment interactions
Show continuous variation (e.g. height, yield)
Often have non-Mendelian inheritance and no clear genotype-phenotype map
QTL Mapping Approaches
linkage analysis
GWAS
pedigree analysis
QTL Mapping Approaches
linkage analysis
Population Type: Controlled crosses or pedigrees
Resolution: Low (Mb–cM)
Pros: Powerful for rare variants, family-based
Cons: Low resolution, limited by recombination
QTL Mapping Approaches
GWAS
Population Type: Natural populations
Resolution: High (kb–100 kb)
Pros: High resolution, no prior knowledge needed
Cons: Sensitive to confounding, missing heritability
QTL Mapping Approaches
Pedigree Analysis
Population Type: Human/animal pedigrees
Resolution: Medium
Pros: Tracks inheritance of traits
Cons: Relies on known family structure
Design and Challenges of GWAS
chatgpt
GWAS design principles include using large, diverse, well-matched populations; high-density SNP genotyping; and statistical thresholds to detect associations.
Challenges:
Missing heritability
Linkage disequilibrium confounds
Rare variant detection
Causal variant identification
Population stratification
From QTL to Causal Variant
chatgpt
Moving from QTL to causal variants involves:
Fine-mapping within associated regions (e.g., higher resolution studies)
Functional validation via expression data, reporter assays, or gene editing
Integration with other data (e.g., eQTLs, chromatin state, transcriptomics) It’s difficult because QTL peaks often span many genes, and some signals lie in non-coding or regulatory regions far from known genes.
Genetic variation and polymorphism
- Variation: the existence if two or more forms (alleles) of a section of DNA
- If a variation occur with frequency >0.5% then it is a polymorphism
- Genetic variation could lead to observable effects, but the majority do not
Genetic variation and polymorphism
example
- Example of an Arabidopsis dwarf mutant that has a SNP which converts a normal looking plant – like this one – to a dwarf plant.
o This is a simple case where there is a 1:1 mapping between genotype and phenotype.
o The dwarf phenotype is caused by mutation in a single gene
Classes of genetic variants:
Single Nucleotide Variant (SNV/SNP)
Change of one base (A→G, T→C); most common variant; may be silent or impactful
Classes of genetic variants:
- Insertion-deletion variant
o INDEls occur where one or more bases are present in some genome and absent in others.
o Generally only a few bases long but can be up to 80kb in length!
Addition or loss of one or more bases; can shift reading frames in coding regions
Classes of genetic variants:
- Block substitution
o a string of adjacent nucleotides varies between individuals
Classes of genetic variants:
- Inversion variant
o the order of the bases is reversed in a defined section of the genome.
A segment of DNA is reversed within the chromosome
Classes of genetic variants:
- Copy number variant
Large DNA segments are duplicated or deleted; affect gene dosage
o identical or nearly identical sequences are repeated in some genomes and not others.
Genetic variation and polymorphism
frequency?
- Human genetic variations are referred to as either COMMON or RARE, to denote the frequency of the minor allele – the less frequent allele in the population
o Rare variants are population-specific
Genetic variation and polymorphism
common variant
- Common variants have minor allele frequency (MAF)
o >1%
o E.g. a C/T SNP with 5% frequency of the T allele
Genetic variation and polymorphism
rare variant
- Rare variants have minor allele frequency
o <0.5%
Genetic variation and polymorphism
- Novel/de novo variant
- Novel/de novo variants occur only in a single family/individual
o E.g. a variant that we do not share with our parent
Genetic variation and polymorphism:
Single nucleotide polymorphism (SNP)
- Single base pair substitutions
- Arise through mistakes in DNA replication or caused by mutagens
o E.g. mutation rate in Arabidopsis is 7x10^-9 base substitutions per site per generation - Biallelic – 2 alleles (in diploids)
Genetic variation and polymorphism:
Single nucleotide polymorphism (SNP):
frequency
- Minor allele frequency can range from <1% to 50%
Genetic variation and polymorphism:
Single nucleotide polymorphism (SNP):
methods for detecting
- Many methods for detecting SNPs
o SNP microarrays
Genetic variation and polymorphism:
Single nucleotide polymorphism (SNP):
common?
- SNPs are the most common
o Which is why they are used a lot
Deletions, duplications and insertions
- Expand or contract the length of non-repetitive DNA
- Small deletions and duplications arise by unequal crossing over
- Small insertions can arise through the activity of transposable elements