Proteomics & Transcriptomics Flashcards
Expression profiling
Comparing gene expression levels in 2 or more samples using thousands of genes
Microarrays
Assay expression of thousands of genes
Gene chips
slides or chips with oligonucleotides
either ~25 bp, ~70bp, correspond to each gene;
multiple per gene;
attached to chip
How do microarrays show expression levels
cDNAs from 2 samples are fluorescently labelled with different dyes & competitively hybridized to gene chips
cDNA hybridized to chips, chips are scanned & fluorescence detected; data analyzed to determine expression level
Microarray – RNA-seq comparisons for expression profiling:
Microarray advantages
cheaper, less computational intensive than RNA-Seq
Microarray – RNA-seq comparisons for expression profiling:
Microarray Disadvantages
- Need a reference genome or transcriptome to construct arrays
- Need to verify results with qRT-PCR
- Cross-hybridization to genes with highly similar sequences
Microarray – RNA-seq comparisons for expression profiling:
RNA-seq, advantages
can detect…
- rare transcripts, can detect
- allele-specific expression
- exon-intron boundaries
- alternative splicing
Don’t need reference Genome or transcript one
Don’t need to construct an array
- Works with non-model organisms
- No cross hybridization but can Multi-map reads
Microarray – RNA-seq comparisons for expression profiling:
RNA-seq, disadvantages
- Very expensive
- Very large sequence files
- Need scripting Knowledge (Python, R)
transcriptome profiling
Profiling: comparing expression levels in different samples
Transcriptome profiling platforms
Illumina is currently best
Transcriptome sequencing to get a reference transcriptome
Reference transcriptome sequencing:
all the RNAs present in the sample, multiple tissue/organs are used – can mix or not mix
RNA-seq data analysis
1) Align reads to the genome or reference transcriptome
2) count reads
3) quantify expression for profiling using analysis programs
For read alignment: Bowtie (older), GSNAP, STAR
Cufflinks (older) — used to estimate transcript abundance & test for differential expression
Measuring gene expression levels with Illumina RNA-seq
RNA-seq reads aligned to a genome: read coverage in exons is often not uniform
Normalize the number of reads to the length of the gene
Ribosome profiling: Ribo-Seq
- Capture ribosome-mRNA transcript complexes in action
translating mRNAs, by cycloheximide treatment - 28-30 nt of a transcript captured
- Allows for high-throughput analysis of translational activity
- Measure the rate of protein production, but not abundance in contrast to MS proteomics
Hi-C
Method to study the three-dimensional architecture of genomes by sequencing long range interactions of DNA regions
How to study the three-dimensional architecture of genomes?
Steps of Hi-C?
- Cells are fixed with formaldehyde, causes interacting loci to be bound to one another by covalent DNA-protein cross links
- DNA is digested with a restriction enzyme, the linked loci remain intact
- Blunt-end ligation is performed to ligate cross linked DNA fragments
- Results in genome-wide library of ligation products, corresponding to pairs of fragments that were originally in close proximity to each other in the nucleus
- Library is sheared, junctions are pulled down with streptavidin beads (bind to biotin)
- Purified junctions are sequenced
Transcriptome sequencing platforms
454
Pacific Biosciences (PacBio)
Ion Torrent
Oxford
Nanopore’s Minion – longer reads
Short reads: Illumina
Genomic studies of cis-regulatory elements
DNase1-seq: identifies protein binding sites that are potential regulatory elements
Chromatin immunoprecipitation followed by tiling array (ChIP-chip) or Illumina sequencing (ChIP-seq): analyzes binding sites of an individual TF
DAP-seq: DNA affinity purification sequencing
DNase1-seq
Used to identify protein binding sites that are potential regulatory elements
Chromatin immunoprecipitation followed by tiling array or Illumina sequencing
ChIP-chip or ChIP-seq
To analyzes binding sites of an individual TF
To characterize histone modifications
- Can reveal his tone marks in gene coding vs. regulatory sequences
DAP-seq
DNA affinity purification sequencing
- TF binding site discovery assay
- Couples affinity-purified TFs with Illumina sequencing of
a gDNA library - Identifies genome-wide binding locations for each TF assayed
How DNAse1-seq works
- DNase 1 cuts at open chromatin sites where no protein is bound, locations of bound proteins are protected
- Sequencing DNA to show sites of bound proteins
Assumes bound proteins are regulatory factors
- Does not reveal identity of the proteins
What DNase1-seq reveals
- Reveals protein binding sites on a genome-wide scale at single base resolution
- Can reveal new cis regulatory elements in genes
How to ChIP-chip or ChIP-seq
- Cross-link complex with formaldehyde
- Purify nuclei
- Fragment chromatin proteins bound to DNA
- Immuno-enrich protein-DNA complex antibody to a specific TF
- Reverse cross-links (65 degrees C)
- Hybridized to array or Illumina Sequence!