Microbial Genomics Flashcards
(33 cards)
Genome
entire complement of genetic info including genes, regulatory sequences and noncoding DNA
Genomics
discipline of mapping, sequencing, analyzing and comparing genes
Sanger Dideoxy Sequencing Method
- uses radioactively labeled bases in PCR
- uses gel electrophoresis to sequence the DNA
Illumina Sequencing Method
- 200 million reads/run; 300 bp length
- Steps:
- put on double stranded adapters
- have complimentary strands on flowell (600 million clusters)
- creates build up/bridges on flowell
- bridge amplification, stationary phase PCR
- clusters are identified w/ fluorescence
PacBio Sequencing
capable of 30,000 reads/run; 2000 bp length
454 Sequencing
- 1 million reads/run; 700 bp length
- generates data 100x faster than Sanger method
- relies on 2 major advances
- use of robotics
- pyrosequencing
- Steps:
- chop up DNA, attach things w/ ligase, undergo emulsion PCR which attaches DNA to bead
- everything placed in picotiter plate (1.7 million wells)
- nucleotides are run over DNA and will attach
- releases PPi which get transferred to ATP which then admits light
- light identifiable for each base
How do you assemble the information from sequencing into a genome?
- align all reads w/ software to generate longer sequencing (contigs)
- gaps remain btwn contigs when don’t match up = “draft” genomes
- orientation can be assumed if know related organisms
- can amplify gaps by sequencing PCR amplicons
- most laborous and expensive part of sequencing
Annotation
- coverting raw sequence data into a list of genes present in genome
- majority of genes encode mRNAs (open reading frames)
- some encode functional RNAs (rRNAs, tRNAs, ncRNAs)
- Functional ORF = Open Reading Frame (encodes proteins)
- computers search for start/stop codons and shine-dalgarno sequences
- identify ORF
- computers search for start/stop codons and shine-dalgarno sequences
What is the correlation of gene size to genome size?
- gene = 1,000 bp
- 1,000 genes = 1 genome
- 1 Mbp = 1,000 genes = 1,000,000 bp
What is the fractino of noncoding DNA in microbes?
- Eukaryotic microbes = >80%
- Bacteria = <15%
How many kbp does Nanoarcheaum equitans have?
139 kbp
How many kbp does *Mucobacterium tuberculosis *have?
4,400 kbp
What is the estimated minimum # of genes needed for cell viability?
250 - 400 genes
Which prokaryote has the larges genome to date at 13.0 Mbp?
Sorangium cellulosm
Do archaeal genomes tend to be smaller or larger than bacterial ones?
smaller at only an estimated ~5 Mbp
What is the general gene distribution in prokaryotes?
- the most abundant class is metabolic genes
- a minor class is made up of DNA replication and transcription genes
- prevalent class is those genes for nontranslated RNAs (rRNA, tRNA, ncRNA)
Compare gene distribution between Bacteria and Archaea:
- archaea devote higher % to energy and coenzyme production
- bacteria devote higher % to carbohydrate metabolism and cell membrane fxn
Metagenome
- total gene content of the organisms present in an environment
- usually used to analyze all of 16S genes, and then compare to determine present organisms
Transcriptome
- the entire complement of RNA produced under a given set of conditions
Why are Transcriptomes studied?
- to analyze …
- global gene expression
- expression of specific groups of genes under diff conditions
- expression of genes w/ unknown fxn; can yield clues to its role
- comparison of gene content in closely related organisms
- ID of specific organisms
How is the Transcriptome measured?
- qRT-PCR = Qualitative Real Time - PCR (only for individual genes)
- Microarrays (for whole transcriptomes)
- RNA-seq (for whole transcriptomes)
Microarrays
- small, solid supports to which genes or segments of genes are fixed and arrayed spatially in known pattern
- gives a relative estimation of RNA abundance btwn 2 conditions
Steps of using Microarrays:
- isolate RNA
- cDNA generation w/ PCR
- labeling of the diff samples
- hybridization of microarray (indicates transcribed gene)
- imaging
- when A>B
- when B>A
- when A=B
RNA-seq
- used to compare 2 conditions
- collect all RNA from sample
- convert to cDNA
- sequence w/ nex-gen
- align sequences to genome
- count reads for each gene
- compare samples