L15: Genomics & Bioinformatics Flashcards
What is genomics?
Study of entire genomes, including gene structure, function & evolution
What can genomics seek to identify?
Similarities & differences between genomes of different species
- Compare genomes of diff species
- Compare genoms of diff individual humans
Define bioinformatics
Collecting & analysing complex biological data e.g genetic codes
What role does bioinformatics play in genomics?
It plays a role in the comparison of DNA sequences from different organisms
Advancements in genome sequencing
- Allow rapid genome sequencing
- Large-scale genome analysis led to bioinformatics development
4 uses of bioinformatics
1) Organising & analysing genome sequences
2) Identifying genes & predicting protein functions
3) Studying gene regulation & evolution
4) Storing & retrieving genetic data
Major aim of genomics
Identify protein coding genes present in the genome
Define genome assembly
Stitching together DNA fragments to form a complete genome
What is comparative genomics?
Comparing gene sequences between species
What is gene identification?
Locating protein-coding regions in DNA
What is regulatory region identification?
Finding promotors & enhancers
What is evolutionary analysis?
Understanding species relationships
What is gene expression studies?
Determining when & where genes are active
4 steps in genome sequencing
1) DNA Mapping: identifying fragment locations
2) Fragment Sequencing: determining nucleotide order
3) Sequence Overlap Detection: aligning overlapping fragments
4) Genome Assembly: using software to generate a complete sequence
What is the Shotgun Sequencing Method?
Breaks DNA into random fragments, sequences them & reassembles the genome using overlapping regions
FASTER
What does genome annotation involve?
Finding functional elements in DNA
Key DNA sequences used for gene identification
1) TATA box: promotor region
2) CAAT box: promotor region
3) Start codon (ATG): signals start of synthesis
4) Stop codons (TAA, TAG, TGA): makrs end of gene
5) Poly-A signal (AATAAA): RNA processing
6) Exons & Introns: coding/non-coding region
Example of genome annotation
BLAST (basic local alignment search tool): compares DNA & protein sequences between species to find genes
What can bioinformatics predice once a gene is identified?
- Amino acid sequence of the protein
- Protein function, based on similarities to known protenis
Example of hormone that can predict structure & function
Human Growth Hormone
Tools used in prediction
1) CLUSTAL-W: identifies gene families with similar sequences
2) DeepMind AlphaFold: uses AI to predict 3D protein structures
What allows the mapping of genes to each of the chromosomes?
Characterisation of the human genome sequence
- locate position of genes coding for specific proteins sequences
What can protein structure be identified using?
X-ray diffraction patterns generated from protein crystals
Key findings from the Human Genome Project (2003)
1) Genome size: 3 billion nucleotides
2) Protein-coding genes: only 2% of genome codes for proteinns
3) Alternative splicing: allows 1 gene to make multiple proteins