Heaphy 12 Genomics Flashcards
(34 cards)
Genomics:
study of genomes and genome sequences
First used by Hans Winkler in 1920 to mean all the genes in a haploid set. Modern usage encompasses all the cellular DNA in an organism.
term genomics was coined by
Thomas Roderick in 1986, for mapping, sequencing and characterising genomes
Genomics Scope:
Viral genomics; archaeal and bacterial (prokaryotic) genomics; eukaryotic genomics.
Species genomics (mouse, human…); mammalian, avian…; organellar.
Metagenomics:
Soil, water, ocean; human gut, mouth, teeth, skin; an object or surface such as a shower curtain.
DNA sequencing
Sanger dideoxy sequencing: 1980 to date. Used for the human genome sequence, with technological improvements over time. Very accurate, long sequences, but relatively slow and expensive: 1000s of nucleotides per day, genomes in months or years.
DNA sequencing
capillary fluorescence:
Much faster; machine read and filed straight to a computer. 1990s.
DNA sequencing
radioactive gel-based:
Slow, manually read in the 1980s
next generation sequencing:
2007 onwards: Illumina, Roche 454, Ion Torrent and many more technologies. Short sequences, less accurate but massively parallel: millions to billions of nucleotides per day, genomes in minutes or overnight. Technology still developing.
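The throughput figures on the sequencing cards can be checked with simple arithmetic. The sketch below is illustrative only: the two rates are assumed values picked from inside the ranges quoted on the cards ("1000s" and "billions" of nucleotides per day), and the 5 Mb genome is the E. coli size from the genome sizes card. It assumes a single instrument reading each nucleotide once.

```python
# Rough time to read a genome once on a single instrument.
# Both rates are ASSUMED illustrative values taken from within the
# ranges quoted on the cards; they are not measured figures.
GENOME_NT = 5_000_000              # E. coli, ~5 Mb
SANGER_NT_PER_DAY = 5_000          # assumed: "1000s of nucleotides per day"
NGS_NT_PER_DAY = 1_000_000_000     # assumed: "billions of nucleotides per day"

def days_to_read(genome_nt: int, nt_per_day: int) -> float:
    """Days needed to read every nucleotide once (1x coverage)."""
    return genome_nt / nt_per_day

sanger_days = days_to_read(GENOME_NT, SANGER_NT_PER_DAY)
ngs_minutes = days_to_read(GENOME_NT, NGS_NT_PER_DAY) * 24 * 60

print(f"Sanger: {sanger_days:,.0f} days (about {sanger_days / 365:.1f} years)")
print(f"NGS:    {ngs_minutes:.1f} minutes")
```

Under these assumptions the results fall in the "months or years" and "minutes or overnight" ranges given on the cards. Real projects need several-fold coverage and run many instruments in parallel, so treat this as an order-of-magnitude check only.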
Shotgun sequencing
Genome assembly relies on identifying overlapping sequences
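A minimal sketch of overlap-based assembly, assuming error-free reads from one strand and a greedy merge strategy. The function names, the toy reads and the `min_len` threshold are all hypothetical and chosen for illustration; real assemblers use far more sophisticated graph-based methods and must handle sequencing errors and repeats.

```python
def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of `a` that equals a prefix of `b`."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0

def greedy_assemble(reads: list[str], min_len: int = 3) -> list[str]:
    """Repeatedly merge the pair of sequences with the longest overlap."""
    contigs = list(reads)
    while True:
        best_n, best_i, best_j = 0, -1, -1
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_n:
                        best_n, best_i, best_j = n, i, j
        if best_n == 0:
            return contigs  # nothing left that overlaps
        merged = contigs[best_i] + contigs[best_j][best_n:]
        contigs = [c for k, c in enumerate(contigs)
                   if k not in (best_i, best_j)] + [merged]

# Toy reads (made up for illustration); each overlaps the next by 4 nts.
reads = ["ATGGCGT", "GCGTACC", "TACCTGA"]
contigs = greedy_assemble(reads)
print(contigs)  # ['ATGGCGTACCTGA']
```

The three reads collapse into a single contig because each shares a 4-nucleotide overlap with the next one; reads with no overlap of at least `min_len` would be returned as separate contigs.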
Human genome
3 billion base pairs, 23 chromosome pairs
first sequenced gene
bacteriophage MS2 coat protein gene, 1972
first sequenced genome
bacteriophage MS2, 3569 nts, 1976, RNA genome
first sequenced DNA genome
bacteriophage φX174, 5386 nts, 1977
first bacterial genome
Haemophilus influenzae, 1,830,140 nts, 1995
first eukaryotic genome sequence
S. cerevisiae, 12,495,682 nts, 1996
Human genome sequence reported
~3.3 billion nts in 2003
Largest genome sequence
loblolly pine, 22.18 billion nucleotides
Genome sizes
E. coli 5 Mb
S. cerevisiae 12.5 Mb
H. sapiens 3.3 Gb
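To compare these sizes directly, it helps to put them all in base pairs. The snippet below is a small sketch of that conversion; the helper name `to_bp` and the `UNIT_BP` table are hypothetical, and the three values are the rounded figures from the card above.

```python
# Multipliers for the base-pair units used on the cards.
UNIT_BP = {"Kb": 1_000, "Mb": 1_000_000, "Gb": 1_000_000_000}

def to_bp(value: float, unit: str) -> int:
    """Convert a genome size such as (12.5, 'Mb') to base pairs."""
    return round(value * UNIT_BP[unit])

sizes = {
    "E. coli": to_bp(5, "Mb"),
    "S. cerevisiae": to_bp(12.5, "Mb"),
    "H. sapiens": to_bp(3.3, "Gb"),
}

for name, bp in sizes.items():
    ratio = bp / sizes["E. coli"]
    print(f"{name}: {bp:,} bp ({ratio:.1f}x E. coli)")
```

With these rounded figures the yeast genome comes out at 2.5 times the size of E. coli and the human genome at 660 times.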
Genome sizes from graph
Viroids 300 nts; viruses 3.5 Kb-2.5 Mb; bacteria and archaea 150 Kb-10 Mb; eukaryotes 13 Mb-22 Gb.
Free-living organisms have larger genomes than endosymbionts.
Genome sizes overlap between domains.
Microbial
General features of genomes: size, form, protein-coding regions, operons, RNAs, gene transfer, evolution
- Small
- Circular & plasmids
- Gene density high, >90% (short intergenic regions, introns rare, little repetitive or non-coding sequence)
- Protein-coding regions short, <1 Kb
- Operons with promoter just upstream
- few non-coding RNAs
- Frequent horizontal gene transfer
- rapidly evolving
Eukaryotic
General features of genomes: size, form, protein-coding regions, operons, RNAs, gene transfer, evolution
- Large (Gb scale)
- Linear chromosomes
- Gene density low, <25% (long intergenic regions, introns common, repetitive non-coding sequence common)
- Protein-coding regions larger, 30 Kb+
- Average of 3 splice variants
- many non-coding RNAs
- Infrequent horizontal gene transfer (HGT)
- Less rapid evolution?
What can genomics do?
- Reveal everything about an organism and its developmental program
- Metabolic capabilities
- Phylogeny: position in the tree of life
- Disease resistance or susceptibility
- Understand what life is, to create and manipulate it
- Metagenomics: how an ecosystem functions and an organism's role in that ecosystem
importance of bioinformatics
Individuals & populations:
DNA -> RNA -> protein -> phenotype -> selection -> evolution
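The first two arrows of this flow (DNA -> RNA -> protein) are the part most often handled computationally. The sketch below is illustrative only: it assumes the input is the coding strand read 5' to 3', uses a deliberately partial codon table (only the codons needed for the toy sequence plus the three stop codons), and the function names and the example sequence are made up.

```python
# Partial codon table (standard genetic code); only a few codons are
# included for illustration. "*" marks a stop codon.
CODON_TABLE = {
    "AUG": "M",  # methionine (start)
    "GCC": "A",  # alanine
    "UGG": "W",  # tryptophan
    "AAA": "K",  # lysine
    "UAA": "*", "UAG": "*", "UGA": "*",
}

def transcribe(coding_dna: str) -> str:
    """DNA -> RNA: on the coding strand, T is replaced by U."""
    return coding_dna.upper().replace("T", "U")

def translate(mrna: str) -> str:
    """RNA -> protein: read codons in frame until a stop codon."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE.get(mrna[i:i + 3], "X")  # X = not in table
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

dna = "ATGGCCTGGAAATAA"   # made-up toy sequence
mrna = transcribe(dna)
protein = translate(mrna)
print(mrna)     # AUGGCCUGGAAAUAA
print(protein)  # MAWK
```

Codons missing from the partial table are reported as "X" rather than raising an error, which keeps the sketch short; a full implementation would carry all 64 codons.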
Computational goals of bioinformatics:
- Learn & generalise: conserved patterns
- Prediction
- Organise & integrate: systematic & genomic approach
- Simulate: model gene expression
- Engineer: construct
- Target: mutations, drugs