Viva prep all Flashcards
(135 cards)
What is the equation for causal effects on a phenotype?
P = G x E, where P is phenotype, G is genetics, and E is environment.
Why can heritability estimates from twin studies be inflated upwards?
Due to the assumption of shared environment across monozygotic and dizygotic twins, which may not hold true in all cases.
What percentage of AD SNPs are found in non-coding regions?
98%
What is an example of an extreme distal enhancer?
limb enhancer region which regulates expression of sonic hedgehog (SHH) gene. This enhancer, the Zone of Polarizing Activity Regulatory Sequence (ZRS), is approximately 800,000 to 1,000,000 base pairs away (in mouse and human) and found within the non-coding, intron of a neighbouring gene but when eliminated, causes truncation of limbs in mice. Moreover, this regulation can be altered by single base pair changes in humans.
What percentage of the genome is protein coding?
1.5%
Which protein-coding genes are implicated in the pathogenesis of AD?
APOE, APP, PSEN1 and PSEN2. APOE produced predominantly by astrocytes and activated microglia in the brain - role in transporting lipids between cells and organs, are the greatest genetic risk factor for AD.
Why is RNA-Seq considered better than microarrays?
RNA-Seq offered a full view of the whole transcriptome as profiles RNA from the sequencing of complementary DNA (cDNA) , i.e. the whole repertoire of transcripts for the particular tissue of cells, including allele-specific expression and alternative splicing. Microarrays which relied on pre-defined transcripts or genes
What is the methodological approach of scRNA-Seq?
isolating cells using methods such as microfluidics, microwells, droplet‐based or in situ barcoding to co-encapsulate it with a unique DNA barcoded bead. This meant mRNA could be linked back to the cell of origin.
What are the steps in processing scRNA-Seq data?
scFLOW Nextflow pipeline steps -
1. emptydroplets (distinguish true nuclei from empty droplets and determine ambient RNA profile i.e. cell-free mRNA - Models the distribution of UMIs in empty droplets to establish a background distribution. Classifies droplets with significant deviation from the background as cell-containing.)
2. Nuclei filtering based on total read counts and total genes expressed (200 minimum for each) or with > 4 median absolute deviation (MAD) for either
3. Mitochondrial reads - 10% or greater -> indicative of cell death
4. DoubletFinder - doublets detection - gen random doublets from input data, lower dim space with real, use nearest neighbour alg to calculate doublet scores. Repreat iteratively
5. LIGER - Calculate integrative factors across samples
6. UMAP - Two-dimensional embeddings of the LIGER integrated factors were calculated using UMAP
7. Leiden community detection algorithm - detect clusters of cells from the 2D UMAP (LIGER) embeddings
8. Cell-typing of clusters - using EWCE, top five marker genes for each automatically annotated cell-type were determined using Monocle 3 and validated against canonical cell-type markers
How does LIGER work?
Uses integrative non-negative matrix factorization to identify factors shared among data sets. NMF formula V = WxH, from this you get lower representation.W -> Feature Matrix, H -> Coefficient Matrix (Weights associated with W)
How does the Leiden community detection algorithm work?
a hierarchical clustering algorithm, that recursively merges communities into single nodes by greedily optimizing the modularity and the process repeats in the condensed graph. It modifies theLouvainalgorithm to address some of its shortcomings, namely the case where some of the communities found by Louvain are not well-connected. This is achieved by periodically randomly breaking down communities into smaller well-connected ones.
What are pseudoreplication approaches?
ROTS, which takes cells as independent replicates.
What are pseudobulk approaches?
Pseudobulk + EdgeR LRT, aggregate a cell type’s reads to an individual often by sum or mean to help avoid issues with cell dropout and low sequencing depths.
What are mixed model approaches? Give example?
generalised mixed models (GLMs) account for both subject and cell-level information often using a random effect for samples to account for the group’s subject-level heterogeneity
What is the motivation for looking into scRNA-Seq in AD?
Many studies but DEGs vary dramatically as approaches changes massively across. Need consensus on approach.
What is the definition of epigenetics?
changes in gene function that are mitotically (somatic - non-reproductive cells) and/or meiotically (germline - reproductive cells) heritable and that do not entail a change in DNA sequence i.e. dynamic cellular regulation.
Can epigenetic marks be inherited?
recent study found certain ones can perhaps due to partial escape of complete reprogramming, resulting in cross-generational changes in developmental and cellular features.
What are some biological roles of epigenetics?
cell differentiation and specialisation (as shown in Waddington’s landscape), cell cycle entry and exit and immune cell activation.
Can tumorigenesis occur without genetic alteration?
Recent study found tumorigenesis to arise even in the absence of any genetic mutations following transient loss of epigenetic gene silencing (could be DNA methylation as located in a gene promoter, DNA methylation typically acts to repress gene transcription)
What are the types of epigenetics?
DNA methylation, DNA-binding proteins, non-coding RNAs and histone modifications
What is the order of epigenetic modifications?
pioneer transcription factors binding to compacted chromatin, inducing nucleosome structural changes and recruiting histone marks and removing methyl marks.
What is the ChIP-Seq approach?
Chromatin immunoprecipitation followed by sequencing - cross-linking proteins to DNA, cleaving the chromatin, immunoprecipitating with protein-specific antibodies and finally, amplifying and sequencing the associated DNA fragments
What is the ATAC-Seq approach?
assay for transposase-accessible chromatin using sequencing - Open chromatin regions are tagged with sequencing adaptors and cleaved by hyperactive Tn5 transposases. The tagged DNA fragments are next purified, amplified and sequenced
What did Nott 2019 find?
Histone modifications associated with noncoding regulatory regions such as enhancers have also been assayed and found to harbour AD variants in cell type-specific analyses (AD & Microglia)