Week 12 (GTEx and ENCODE) Flashcards

1
Q

GTEx

A

Genotype Tissue Expression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what was the purpose of GTEx?

A

create a comprehensive resource to understand how genetic variants influence gene expression across various human tissues

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

how were tissues collected for GTEx?

A

pieces and parts were collected from (dead) humans

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what conclusion did GTEx draw about using blood as a sample to assay things?

A

blood is a bad tissue to use because it has the fewest transcribed genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

GTEx used a small sample size so that examined _____ that act in cis to the gene as the expected effect size

A

eQTLs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what tissue shows that the variant is affecting it?

A

the heart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

the number of ________ identified showed no signs of plateauing at current sample sizes in any of the tissues

A

eGenes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what are eGenes?

A

These are genes whose expression levels are influenced by genetic variants, meaning that different versions of a gene can lead to different levels of expression in cells

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what type of genes would have a DNA variant that effects the expression in ALL tissues?

A

house keeping genes (genes necessary for normal operations)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

why are some variants found in all tissues while some are only found in one type of tissue?

A

some genes are only expressed in specific tissues

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what do we refer to the quantitative variation of gene structure due to genetic variation as?

A

splicing QTLs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

eQTLs

A

expression quantitative trait loci

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what are eQTLs?

A

genetic variants that are significantly associated with changes in gene expression levels in different human tissues, eQTLs are like “switches” that can affect how much a gene is expressed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the difference between cis and trans eQTLs?

A
  • cis (within the same gene)
  • trans (affecting a different gene)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is the significance of eQTLs?

A

eQTLs provide valuable information about the genetic basis of complex traits and diseases, as well as the regulatory mechanisms that control gene expression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

genetic variants affecting gene expression and splicing patterns have been shown to fall within regulatory elements, providing a potential ______ ______ for their effects

A

molecular basis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what is the value of eQTL information?

A

to establish functional links between regulatory elements and genes in the genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

how can you increase the power of eQTL information?

A

adding additional tissues and subjects to the data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what is the purpose and future goals of individual-level transcriptome analysis?

A

it is likely to become a crucial addition to the personalized assessment of an individuals biology and likely disease status

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

what was the ultimate goal of the GTEx project?

A

to provide a framework for biological interpretation of disease related variants
(a framework to help understand disease-related genetic changes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

in GTEx, what could you increase to likely increase the number of detected GWAS-eQTL loci?

A
  • sample sizes
  • range of tissues
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

GTEx found that ~13% of candidate genes proposed by GTEx eQTLs and in LD to genome wide significant GWAS SNPs, are _________ genes

23
Q

when doing anything in genomics, why does more mean better?

A

the more more samples or individuals tested allow you to see more effects, see the variations and notice when things vary

24
Q

what does ENCODE stand for?

A

Encyclopedia of DNA Elements

25
what was the purpose of the ENCODE Project?
the project provides new insights into the organization and regulation of out genes and the genomes and is an expansive resource of functional annotations for biomedical research
26
define a functional element
a discrete genome segment that encodes a defined product or displays reproducible biochemical signature
27
what is a discrete genome segment?
something that has a defined beginning and end (there are coordinates)
28
ENCODE abbreviations: RNA-seq
isolation of RNA sequences (different purification techniques to isolate different fractions of RNA followed by high throughput sequencing)
29
ENCODE abbreviations: CAGE
capture of the methylated cap at the 5' end of RNA
30
ENCODE abbreviations: ChiP-seq
chromatin immunoprecipitation sequencing. a method used to identify the binding sites of DNA-associated proteins, often transcription factors, on a genome-wide scale.
31
ENCODE abbreviations: RRBS
reduced representation bisulphoe sequencing. It's a method used to analyze DNA methylation patterns, particularly focusing on CpG islands and other CpG-rich regions of the genome.
32
what is currently the preferred method for ENCODE?
ATAC-seq
33
ENCODE abbreviations: ATAC-seq
Assay for Transposase-Accessible Chromatin sequencing. It's a technique used in molecular biology to map chromatin accessibility, meaning which parts of the DNA are open and accessible to proteins.
34
major findings of the ENCODE project
- the discovery that a vast majority (80.4%) of the human genome is transcribed into functional RNA molecules, challenging the traditional view of "junk DNA" - identified numerous regulatory elements
35
what did ENCODE reveal about noncoding regions of the genome?
ENCODE has further shown that non-coding regions are often functional and that many non-coding variants associated with disease lie within these functional regions
36
____% of the genome relies within 8 kilo bases of a DNA-protein interaction
95%
37
there are 5.6 times more enhancer regions that promoter-like regions. why?
likely due to the increased need for fine-tuning gene expression. Enhancers, which can be located far from their target promoters, allow for more complex and nuanced control of gene activity by interacting with a wider range of transcription factors and other regulatory elements. Promoters, on the other hand, are more directly involved in the initiation of transcription and are typically located closer to the gene they regulate
38
how many protein coding genes in mammals?
20,000
39
how many bp in mammalian genome?
3 bp
40
what percentage of bp are protein coding?
2%
41
there are 5.6 times more ________ than _______ like regions in the genome
enhancer; promoter
42
there is more variation in non-coding regions than coding regions. Why?
coding regions carry the instructions for protein synthesis so they have to be able to maintain their functionality, so there is less variation while non-coding regions do not directly encode proteins so they do not have as significant of an impact on the organisms survival if a variant occurs, it does not break the function of the organism
43
T/F: there are more "disease" variants outside coding regions than in coding regions
True
44
on average, each "gene" has 3-4 alternative transcription start sites (TSS). why?
for regulation purposes
45
there are lots of RNAs produced that don't code for proteins. What might these non-coding RNAs do?
Non-coding RNAs (ncRNAs) are functional RNA molecules that do not code for proteins but play crucial roles in regulating gene expression, maintaining cellular structures, and other cellular processes
46
have we identified all the "genes"?
NO
47
in ENCODE, wat did they use to identify transcription start sites at high confidence?
CAGE-seq
48
why are there regions of histone modification?
integration of the different histone modification information can be used systematically to assign functional attributes to genomic regions
49
ENCODE found that ____% of the human genome has a "functional" element
80%
50
what was the 80% "functional" elements made up of?
- 62% RNA - 56% histone modifications - 15% open chromatin - 8% transcription factor binding
51
what is the broadest element class that covers 62% of the genome?
RNA
52
How is it that humans have about the same number of protein coding genes as “simpler” organisms but are more complex?
the use of alternative splicing, non-coding DNA, and complex gene regulation
53
GWAS SNPs are particularly enriched in the segmentation classes associated with _______ and _______ across several cell types
enhancers and TSSs
54
GWAS analysesbtypically associate disease to SNPs in large regions, what can a comparison to ENCODE non-coding functional elements do?
can help pinpoint casual variants