What is the transcriptome and what does it include?
The transcriptional output of a genome
includes ribosomal RNA, messenger RNA, transfer RNA and regulatory RNAs
Translation of the transcriptome generates the proteome
What is gene expression data and how is it obtained?
Expression data refers to RNA:
Obtained through various experimental means, usually Reverse Transcription
Traditional molecular biological techniques for gene-by-gene analysis?
Northern Blotting
In situ hybridisation
RT-PCR (Reverse transcription- PCR)
Describe the process of northern blotting
Northern blotting: involves extracting RNA and comparing transcript abundance from different samples
Describe process of in-situ hybridisation
in-situ hybridisation: allows detection directly in tissues
What are expressed-sequence tags (ESTs)?
Expressed sequence tag = short sub-sequence of a cDNA sequence. Represent a portion of an expressed gene
ESTs may be used to identify gene transcripts, useful in gene sequence determination
What is the purpose of Unigene?
Unigene collates EST data
ESTs organised into clusters - each represent one unique expressed human gene
Allows comparing of gene expression via Digital Differential Display
What is Digitial Differential DIsplay (DDD) and what is it used for?
DDD compares the presentation of Unigene clusters in multiple cDNA / EST libraries
Allows for analysis of significant differences in gene expression
Particularly useful in disease vs ‘normal’ comparison
Pros and cons of EST analysis
Pros - All genes analysed including novel transcript variants - Fast and CHeap - EST clones available to community (I.M.A.G.E Consortium)
Cons
What is the purpose of ‘Serial analysis of gene expression’ (SAGE)?
Technique developed to further quantify ESTs
- increases efficiency of EST profiling
cDNA is synthesised from mRNA , cut into short (10-17bp) fragments with enzymes and concatenated (joined together)
Each molecule is included in concatamer at a rate proportional to abundance
Qualitative (presence/absence) and quantitative (count of tags) data
What database collates SAGE data?
SAGEgenie
What is a microarray? How is gene expression visualised using them?
Patches (features) of DNA molecules on a glass/silicon support
Gene expression visualised via hybridisation of fluorescently labelled cDNA/mRNA
- data collected by fluorescence microscopy scanning
2 main types of micro array and their differences?
Spotted array
In situ synthesised DNA array
Problems with microarray data - how do scientists make sense of it?
Problems
Large amounts of information, few samples, many genes (sparse data)
+ changes in gene expression is correlated
Solution
- Cluster genes / samples by expression patterns, interactions, regulation etc. across samples/genes e.g. using Volcano plots to visualise data
Volcano plots show what?
Volcano plots identify genes with particular fold change and levels of statistical significance
Which major database collates data on gene expression? What data does it accept?
Gene expression omnibus (GEO)
-contains info from SAGEgenie and Unigene
Only accepts MIAME compliant data