Exam Flashcards
(94 cards)
What is the primary goal of functional genomics?
To describe gene/protein functions and interactions, focusing on dynamic aspects like transcription and protein-protein interactions, rather than static aspects like DNA sequence. Functional genomics uses data from genomic projects.
What is the difference between genomics and functional genomics?
Genomics studies the structure of genomes (DNA sequence), while functional genomics focuses on the dynamic aspects such as gene transcription, translation and protein-protein interactions.
What are the steps in a typical genome sequencing project?
Isolate chromosomes.
Fragment DNA into smaller pieces.
Sequence the fragments (reads).
Assemble the reads to reconstruct the genome.
Annotate the genome, identifying genes and other elements.
Validate annotation results using BLASTp.
What is the purpose of a gene finder?
To identify and label different features in the genomic sequence, such as coding regions (genes), regulatory elements, and other functional elements. Gene finders are more sophisticated than ORF finders. They can detect elements like the TATA box. Examples include GeneMark and Augustus.
What is the difference between an ORF and a gene?
A gene has a scientifically demonstrated associated function, while an ORF (Open Reading Frame) is a DNA sequence that might encode for a protein but does not have a proven function.
What are the two main approaches to genome assembly?
Guided Assembly: Uses an existing reference genome to map reads.
De Novo Assembly: Constructs the genome solely from sequencing data when no reference genome is available.
What is hybrid assembly?
Hybrid assembly combines long and short reads to create a more accurate and complete genome representation. It uses data from different sequencing technologies like PacBio (long reads) and Illumina (short reads).
Why is paired-end sequencing useful?
Paired-end sequencing sequences both ends of a DNA fragment, providing mate reads. This increases read length and improves alignment, especially in repetitive regions.
What is the difference between vertical and horizontal coverage?
Vertical coverage is the number of reads that align to a given position of the genome. High vertical coverage is important for comparing strains.
Horizontal coverage refers to how much of the entire genome has been sequenced. High horizontal coverage is preferred when sequencing an entire genome.
What is the main advantage of PacBio sequencing?
PacBio produces long reads, which facilitates de novo assembly, and provides good horizontal coverage of the genome. Also, PacBio does not require amplification.
Why is Illumina sequencing often used in combination with PacBio?
Illumina provides high accuracy and high vertical coverage, while PacBio provides long reads and high horizontal coverage. Combining the two can improve genome assemblies, especially for small genomes. Illumina uses bridge PCR for amplification and reversible termination sequencing.
What is a contig?
A contig is a contiguous sequence of DNA, assembled from overlapping reads. Contigs represent parts of a genome sequence that are assembled but might not be in their correct order.
How can duplicated regions in a genome affect assembly?
Duplicated regions in the genome can cause a spike in the vertical coverage and may cause mis-assembly, leading to a higher than expected number of contigs.
What is the purpose of karyotyping?
Karyotyping allows researchers to determine the number and size of chromosomes in an organism. It can also be used to determine the genome size.
What is the purpose of BLASTp?
BLASTp is used to validate gene models by comparing predicted protein sequences to known protein sequences in databases. The result with the highest similarity for the highest coverage is usually selected.
What is the goal of comparative genomics?
Comparative genomics compares the genomes of different species to study evolutionary processes and genetic diversity. It identifies variations in chromosome number and size, and large-scale genomic modifications.
How can comparative genomics be used to study phenotypes?
By comparing genomes of strains that differ in a particular phenotype, comparative genomics can be used to identify genetic changes responsible for adaptive traits.
What are the key steps in transcriptomics analysis?
- Define experimental conditions.
- Extract RNA from cells.
- mRNA enrichment or RNA Depletion (only in eukaryote) - mRNA enrichment is done through oligo(dT) to enrich poly(A) tails; RNA depletion is done through removing rRNA with Rebo-Zero
- Convert RNA to cDNA.
- Sequence the cDNA.
- Quantify gene expression by comparing read coverage.
What is the difference between general and specific transcription factors?
General TFs form the basic machinery needed for transcription.
Specific TFs regulate the expression of individual genes.
What is a microarray and how is it used?
A microarray contains immobilized DNA probes designed to bind specific transcripts. Labelled cDNA samples are hybridized to the probes and the signal intensity corresponds to the quantity of transcript present. It is used to quantify gene expression.
What are the controls used in microarray experiments?
Positive Controls: Samples with known concentrations of the target to validate the results.
Negative Controls: Samples without DNA/RNA or non-specific sequences to determine background noise and non-specific bindings.
What is the proteome?
The entire set of proteins expressed by an organism, tissue, cell, or organelle at a given time under defined conditions.
What are the steps in a typical expression proteomics experiment?
Select a biological model and question.
Define experimental conditions.
Collect samples and obtain protein extracts.
Separate proteins using 2D gel electrophoresis.
Visualise the proteins.
Quantify protein abundance using softwares.
Identify proteins through mass spectrometry.
What is the purpose of trypsin in mass spectrometry?
Trypsin cleaves proteins at specific amino acid residues, generating peptides of different sizes that can be analyzed by mass spectrometry.