Genomics Flashcards

1
Q

What does the tree of life assume ?

A

A monophyletic view of life

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does the tree of life highlight ?

A

The first divergence events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a synapomorphy ?

A

A characteristic or trait present in an ancestral species that is only shared by its descendants in that distinguishes a clade from other organisms.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What changed the traditional classification of prokaryotes being a single kingdom

A

Studies on methanobacterium and methanosarcina

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What allowed the classification of the new taxon archaebacteria

A

Methanogens that are equidistant from eukaryotes and bacteria

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the differences between archaebacteria and eubacteria ?

A

Their membranes are composed of glycerol-ether phospholipids.
They have a L-glycerol
Phospholipids consist of isoprenoid side chains with multiple side branches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a paralogous gene ?

A

Homologous gene that occurred due to duplication event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a homolog ?

A

Homologs denote genes that derive from the same ancestral sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between orthologs and paralogs ?

A

Orthologsare corresponding genes in different lineages and are a result of speciation, whereasparalogsresult from a gene duplication.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does reticulate evolution refer to ?

A

An evolutionary processes that allows some lineages to merge and produce new lineages. The evolutionary process cannot be modelled by trees

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Who first described prokaryotes ?

A

Robert hooke

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the types of first generation sequencing ?

A

Maxam-Gilbert Method

Sanger Dideoxy Method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the types of second generation sequencing ?

A

Illumina sequencing

Ion Torrent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the types of third generation sequencing ?

A

Pacific biosciences

Nanopore

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What method can determine DNA molecules of up 500bp and relies on toxic chemicals ?

A

Maxam-Gilbert Method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the key steps in the maxim-gilbert method ?

A
  1. Attain single strand of DNA
  2. Label the DNA radioactively with 32 p to the 5’ end
  3. Separate DNA into 4 tubes and add chemicals that cleave specific nucleotides
  4. This gives different sized DNA strands
  5. Run gel electrophoresis on acrylamide gel
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the key element of Sanger sequencing ?

A

2’,3’ dideoxynucleoside triphosphates, which lack the hydroxyl group at the 3’ position.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What do ddNTPs do ?

A

They terminate DNA synthesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are the key steps in the Sanger Dideoxy Method ?

A
  1. Denature DNA and add radioactive primer
  2. Seperate into 4 tubes and add ddNTP polymerase and dNTPs
  3. Run through denaturing gel
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does a dNTP do ?

A

Extends DNA strand

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are the 4 limitations of the initial Sanger dideoxy method ?

A

Cant run everything in one lane
Unable to read sequences at the top of gel
Cost
Low throughput

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

How are the limitations of Sanger dideoxy method over come ?

A

fluorescently labelled ddNTPs which can be read by a laser

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How many bp does a single Sanger sequencing cover ?

A

Up to 1000

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Why do we need to assemble sequences into Contigs ?

A

Genomes are too big

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is a contig ?
DNA sequence built up from a number of smaller overlapping sequences during a sequencing project.
26
What is scaffolding ?
is a technique used to link together a non-contiguous series of genomic sequences into a scaffold, consisting of sequences separated by gaps of known length.
27
What is coverage
the number of reads that include a given nucleotide in the reconstructed sequence
28
What is a read ?
each sequenced piece of a genome
29
What are the two types of Sanger sequencing ?
Hierarchal shotgun sequencing | Whole genome shotgun sequencing
30
What are the steps in whole genome sequencing ?
``` DNA extraction Library construction Random sequencing phase Gap closure phase Annotation ```
31
What do the 2nd generation sequencing methods have in common ?
1. The DNA has to be fragmented | 2. Clonal amplification (different PCR methods)
32
What is the problem with Illumina sequencing ?
Slow and produces short sequence lengths
33
What sequencing methods are synthesis based ?
Illumina | Ion torrent
34
What sequencing methods use the ion semiconductor approach ?
Oxford Nanopore Ion torrent Illumina
35
What does illumina sequencing rely on ?
Reversible terminators
36
What is a benefit of illumina sequencing and what was a major advancement ?
Don't have to move DNA around | Paired end sequencing
37
What does illumina sequencing generate in the flow cell ?
millions of dense clusters of dsDNA
38
What are the key steps in illumina sequencing ?
1. Fragment DNA and attach to surface of flow cell using adaptors which contains a primer 2. Initiate bridge PCR with unlabelled NTPs 3. When PCR complete, millions of clusters are formed, add 4 labelled reversible terminators, primers and DNA pol to begin sequencing. 4. Excite clusters with a laser allowing fluorescents to be captured 5. Remove reversible terminators and repeat cycle by adding more ddNTPs 6. data aligned and compared to reference sequences
39
What is a key aspect of ion torrent sequencing ?
Exploits the fact that addition of a dNTP | to a DNA polymer releases an H+ ion
40
What are the steps in ion torrent sequencing ?
1. The input DNA is fragmented, Adaptors are added and one molecule is placed onto a bead. 2. molecules are amplified on the bead by emulsion PCR 3. Each bead is placed into a single well of a slide. The slide is flooded with a single species of dNTP, along with buffers and polymerase, one NTP at a time. 4. The pH is detected in each of the wells, as each H+ ion released will decrease the pH.
41
How is third generation sequencing characterised ?
The lack of DNA or RNA amplification in template library preparation.
42
What are the benefits of third generation sequencing ?
1. Avoid polymerase chain reaction-introduced error and amplification bias. 2. Real time measurements 3. Much higher throughput than first and second generation techniques. 4. Lower price per Mbp generated sequence 5. Longer reads
43
What is a ZMW ?
zero-mode waveguide wells- creates an illuminated observation volume that is small enough to observe only a single nucleotide being incorporated by the DNA polymerase
44
What are the steps within Pacific bioscience sequencing ?
1. The platform uses a DNA polymerase anchored to the bottom surface of a ZMW. 2. Differentially labelled nucleotides enter the ZMW via diffusion and occupy the 'detection volume' . 3. During an incorporation event, the labelled nucleotide is 'held' within the detection volume by the polymerase for tens of milliseconds. 4. As each nucleotide is incorporated, the label located on the terminal phosphate is cleaved off and diffuses out of the ZMW
45
What are the benefits of Pacific bioscience sequencing ?
Very efficient: fewer expensive chemicals have to be used Very sensitive Long reads (10-15 Kbp)
46
What are the positive and negatives of nanopore sequencing ?
Strengths: cheap, fast, simple, large reads, scalable, … | Problem: rather high error rate
47
What is a Q score ?
measures the probability that a base is incorporated incorrectly (Higher Score indicates a smaller probability of error).
48
What is Phredd 33 ?
A method of quality encoding during genome assembly.
49
What are the 2 methods of assembling contigs ?
De novo= No scaffold sequence to guide alignment. | Read mapping=Reads are aligned to a reference sequence.
50
What are the 2 assembly algorithms ?
Greedy method- Incremental build. Builds using highest scoring overlap. Graph based: Pre-processes the data to produce a graph structure of pairwise overlap info.
51
What type of graph deals with repetitive DNA regions and how does it work ?
de Bruijn- Split short reads into shorter uniform reads.
52
What is the K value ?
The length of the string
53
What does velvet do and what is it ?
A genome assembler that use de Bruijn graph based algorithm. Takes short reads and removes errors to produce high quality contigs.
54
What is the N50 statistic ?
The N50 value informs researcher that 50% of the genome is assembled in contigs larger than the N50 value.
55
What is genome annotating ?
Attaching biological meaningful info to genome sequences.
56
What is an ORF ?
Open reading frame- Part of the reading frame that contains no stop codons. UAA UGA UAG
57
What are the 3 methods for gene prediction ?
AB initio method- Uses info from genomic sequence such as GC content and ORFs Database similarity - Uses public databases to identify sequences that are similar Combiners: Use both techniques
58
What does hierarchical shotgun sequencing require ?
A genome map as preliminary data
59
What are the 9 steps in hierarchical shotgun sequencing?
1. Fragmentation of genome 2. Inserted into plasmid 3. Plasmid Cloned in e.coli 5. Investigate order of clones in genome using hybridisation, to give full genome map. 6. Take each clones and fragment DNA 7. Apply sequencing method 8. Form contigs 9. Merge contigs
60
What are the steps in bridge PCR ?
1. initiated by the addition of unlabelled NTPs. 2. DNA amplification occurs and then DNA folds and form a bridge due to hybridisation of adapters. 3. Add polymerase and primers to make a complementary strand. 4. Increase the temp to break bridge connection leaving two ssDNA, the forward and reverse strand. 5. This form millions of clusters for illumina sequencing