MPS Technology Part 2 Flashcards

Question 1

Q

What are the common features of massively parallel sequencing technologies

Even though they have similarities are all of them diff

Answer

A

Random fragmentation of DNA

Ligation with custom adaptors to make a library

Paired-end read capability (read the seqeunce from either end)

“Digital” reads that enable quantitative comparison between one sample and the next

Yes they are all very different these are just similarities

Question 2

Q

What is an illumina sequencer

How does it work

Answer

A

Second (next) generation sequencing: amplification and cluster based sequencing

Don’t use it anymore

In vivo:
1. DNA fragmentation of the thing you want to seqeunce

Cloning and amplification (in vivo) or adaptor ligation (in vitro), then denature the two strands by changing PH
The adaptors on the seqeunce bind to oligonucleotide probes fused to the glass in the flow cell
Each nucleotides added has a diff colour fluorescent probe
As nucleotide get added to the adapter on the chip a single fluorescent dot of a single colour is on the chip.
Via PCR that one dot gets amplified to made a bundle of them in one place. This makes a polony (PCR colony)

Question 3

Q

What does third generation sequencing mainly focus on

Answer

A

Individual/ single molecule seqeuncing

Question 4

Q

Question 5

Q

Explain in depth the sequencing by synthesis in illumina sequencing

Answer

A

So you have the ss fragmented DNA, it binds to the surface of the flow cell

Bridge amplification: one adapter on the ssDNA on the chip flips and compliments a oligonucleotide adaptor coupled to the chip

Then labelled nucleotides added to adaptor and PCR type reaction happens where nucleotides make bridge bundles

The the double strand bundle dna is denatured for sequencing reaction to happen

Flush on the probes nucleotide that only one nucleide added at a time , next nucleotide in the strand is incorporated

Capture Fluor image using microscopy, since on nucleotide added to the single bundle you can see the instense Fluor

Unblock the nucleotide (so another can get added), flush in new nucleotides then image

Question 6

Q

Question 7

Q

What does sequencing by synthesis mean

Answer

A

Using enzymatic (dna pol) synthesis Kd DNA to carry the reaction forward

Question 8

Q

Challenges with illumina sequencing data

Answer

A

Lots of depth but cant easily assemble reads

Its short read sequecning (getting a bunch of little mini reads) so you get lots of contigs

However, the genome has a lot of repeats, so if you have a repeat of 500bp but the machine can only give 200bp, you can assemble the rest of the repeat

If try to map the contigs over a reference genome (finished genome) it’s very hard because so many diff small peice to put together

Question 9

Q

What is SMART: Pacbio

Answer

A

Single molecule real time sequencing

Massively parallel since diff templates in each well

Produced very long readd which allow you to computationally assemble the genome easier than illumina

Still big machine not that accessible

Sequencing by synthesis at the level of single molecules

Question 10

Q

How is the sample prepared for pacbio SMART

Answer

A

Do shotgun sequencing or PCR amplification

Then ligate adapters to the repaired ends of the fragmented DNA

Then you get adapters at the ends of the DNA that can be denatured and circularized at a certain temperature

Question 11

Q

Explain in depth how SMART sequencing works

Answer

A

Nanofludic chambers in each well, at the bottom of each well is an immobilized DNA pol

The fragmented dna with the circular adaptors is threaded through the pol

Since circular, it can keep seqeuncing and repeating to get better consensus

The wells diameter is smaller than the wavelength of light, so since it’s so small only space for one nucleotide to be detected

There are labelled nucleotides in the well, when a nucleotide gets added there is a single pulse for a quick second of fluoresces at the bottom of the well (because nucleotide is fluorescent) then can identify what was added

Then the fluorophore detaches and pulse is gone, next nucleotide comes in

In each well there is a diff fragmented dna strand so you get long read of each fragment

Question 12

Q

What would be a source of error for SMART seqeluencing PACBIO

Answer

A

Irregular fluroecenrce rate, some nucleotides get added slowly and some really fast, Irregular rate causes problems in how the fluor signal is detected

Also if you have homopolymeric tracks of nucleotides, the same Fluor fluorophore gets added mutiple times in a row: if this happens really quickly the computer can’t tell how many of those nucleotide repeat there were

This leads to misjudging the number of bases/length of the homopolymeric tract

Question 13

Q

How do you fix the homopolymeric tract error in SMART PACBIO

Answer

A

use statitsitc, if 20% of readd are wrong then keep repeating the sequencing of the one molecule

Then make a consensus read to have a really high average

Question 14

Q

What is the length order of readd in Pacbio, Sanger, illumina

Answer

A

Illumina shortest

Sanger (600-1000)

Pacbio longest (more than 10,000bp)

Question 15

Q

How do you filter out the data from PACBIO SMART

What does coverage mean

Answer

A

Throw out the lowest 75% of readd and keep the best 25% of reads

Coverage means that nucleotide has been read many times so it’s accurate

Question 16

Q

Explain the HGAP

Answer

Study These Flashcards

A

Hierarchical genome assembly process

Take the longest reads, assemble them as the longest seed reads, then assemble those seed reads into a finished genome using a reference

Does this in a heirachical way

Question 17

Q

Short read and long reads

Answer

Study These Flashcards

A

Illumina short

Pacbio nanopore long

Question 18

Q

How does nanopore sequencing work

Answer

Study These Flashcards

A

A nanometer sized pore inserted into a membrane

Current devices use E.coli CsgG porin as a nanopore

Single DNA strand goes into the pore, ions flow with it

The ions cause a change in conductance of the membrane that corresponds to the base going through the pore

1D reading (only one strand of DNA read)

Question 19

Q

What are the characteristics of nanopore

What is the read size in nanopore limited by

Answer

Study These Flashcards

A

Lowest accuracy , but can make up for this but repeating the experiment many times, also use hybrid assemblies to get better quality genomes

Read size is limited only by the shearing of DNA during the extraction

Gives accessibility: low cost , $1000 for starter kit

Bioinformatics solutions for it are more improved (used to be more limited)

Readd >100,000 bp

Question 20

Q

Flognle and nanopore designed for what

What to consider for illumina and Pacbio

Answer

Study These Flashcards

A

To be smaller scale and operated in a single lab

But don’t have the biggest output (might need more data than they actually give)

Ex. Want to finish a bacterial genomes, Pacbio is good for that but so much through put that you wouldn’t be able to seqeunce just a single bacterial genomes, you can seqeunce if many time over

But for a human genome Pacbio may not be sufficient because now too high BP

If want many reads and doing a counting of individual read to see what molecules are there, illumina is good

Question 21

Q

What technology to use for a particular work flow

Answer

Study These Flashcards

A

Okay

Question 22

Q

Answer

Study These Flashcards

A

MPS Technology Part 2 Flashcards

(22 cards)