PART III FROM GENOTYPE TO PHENOTYPE Flashcards
(123 cards)
Which areas of the genome are genes concentrated in ?
-G/C rich areas
What percentage of the genome encodes protein or non-coding RNA?
<2%
What percentage of the genome is regulatory/introns?
- 25%
What rough % of the genome is junk DNA?
- > 50%
What is the genes and gene product mismatch problem?
- That there are about 20 000 genes but more than 500 000 proteins
What is a putative gene?
- A gene whose protein and function is not known but it is based on an ORF and believed to be a gene
What is the trasncriptome?
- COMPLETE collection of RNA produced from a genome BUT not every RNA is present in every cell and eukayotic RNAs are spiced
What does alternative splicing give rise to?
- Different protein isoforms from the SAME gene (this partially explains the gene product mismatch problem)
How can an RNA sequence be deduced?
- By making and analysing cDNA
What are cDNAs and ESTs (Expressed Sequence Tags) used to analyse?
- Used to analyse gene structure, and presence + levels of specific RNA in cells
What is transcriptomics?
- The study of THOUSANDS of RNAs simultneously
Is the whole transcriptome produced in cells?
NO
- Because only a subset of genes is active in any cell
What are the 3 major classes of RNAs that make up the eukaryotic transcriptome?
- Ribosomal RNAs trancribed by RNA pol I
- Protein encoding RNAs (mRNA) and microRNAs (miRNA) transcribed by RNA polymerase II
- Small RNAs (including tRNA) trsanscribed by RNA pol III
Are genes organised into operons in eukaryotes?
-NO
What is the splicing process?
- Where eukaryotic mRNA is produced by excision of non-coding segments (introns) from precursor (pre-mRNA)
Is splicing SEQUENCE specific and if so what can be found out from this?
- YES!
- Intron/exon boundaries can be predicted using bioinformatics genomic sequence analyses
- But there is NO specific splice seuqence that is cut out…more an overall general pattern
What is the key to gene identification in eukaryotic genome analyses?
- Accurately predicting splice junctions
Via what process can related but DIFFERENT polypeptides be generated from the same primary transcript?
- Alternative splicing
What allows for different isoforms of a transcript specifically?
- Different EXONS being incorperated OR omitted from the final mRNA
What process explains why relatively few genes in genome can give rise to vastly greater number of proteins?
- Alternative splicing
Can splicing errors cause disease via mutations?
- YES!
- Mutations can occur in splice donor or acceptor sequences OR generate NEW (cryptic) splice sequences
e. g. Exons being omitted (skipped) deletes a section of protein –> severely affects the structure
How can the use of false (cryptic) acceptor or donor sites sseverely affect the protein strucutre?
- By truncating (shortening) or lengthening exons
What is the old definition and 2 new definitions for the gene repectively?
OLD: One gene encodes one protein
NEW 1: Single transcription unit (gene) encodes one set of protein isoforms
NEW 2 (newest): A single polypeptide is the product of a single gene
What 3 things do we need to know from each gene in terms of RNA?
- Where and when it is transcribed into RNA
- How it is spliced, and how many spliceoforms there are
- Whether particular spliceoforms are restricted to particular cells or growth stage