DNA sequencing Flashcards

(46 cards)

1
Q

What is DNA sequencing?

A

DNA sequencing is the process of determining the order of nucleotides (A, T, C, G) in a DNA molecule, which is critical for understanding gene function, mutations, and evolutionary relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is DNA sequencing important?

A

It helps study genes, identify mutations causing diseases, diagnose infections, understand evolutionary relationships, and trace ancestry.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the three generations of DNA sequencing?

A

First generation: Sanger sequencing (up to 1000 bases)
Second generation: Next-generation sequencing (NGS, short reads, millions of fragments at once)
Third generation: Long-read sequencing (reads over 10,000 bases).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Sanger sequencing and how does it work?

A

Sanger sequencing uses chain-terminating nucleotides to stop DNA synthesis at random points, creating fragments that can be read using capillary electrophoresis and fluorescence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the key steps in Sanger sequencing?

A

Denature DNA into single strands
Add primer and polymerase for DNA extension
Incorporate chain-terminating nucleotides (ddNTPs) randomly
Separate fragments by size and visualize with fluorescence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the pros and cons of Sanger sequencing?

A

Pros: Simple data analysis, accurate, longer reads (up to 1000 bases), affordable for small sample sizes.
Cons: Low throughput, impractical for large numbers of samples, requires significant DNA input.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is next-generation sequencing (NGS)?

A

NGS involves sequencing millions to billions of DNA fragments simultaneously, generating short reads (~500 bases), and assembling them into larger sequences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the process for preparing samples for NGS?

A

DNA is fragmented into 1 KB pieces, then adapters are added to each fragment for amplification and sequencing using machines like Illumina

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the advantages of NGS?

A

It can sequence large numbers of DNA fragments quickly and is ideal for samples with low DNA quantity or for sequencing entire genomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does third-generation sequencing (long-read) differ from previous methods?

A

Third-generation sequencing generates long reads (>10,000 bases), providing more accurate assembly of repetitive regions and larger genomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the purpose of adaptor sequences in NGS?

A

To anchor DNA fragments to a flow cell and allow primers to bind for sequencing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is “bridge amplification”?

A

PCR-based amplification of immobilised DNA fragments to form clusters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why are DNA fragments amplified into clusters in Illumina?

A

To generate strong fluorescent signals for detection.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is “sequencing by synthesis”?

A

DNA polymerase adds fluorescently labelled, chain-terminating nucleotides one at a time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are paired-end vs. single-end reads?

A

Paired-end sequences both ends of DNA; single-end sequences one end.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What happens during preprocessing of reads?

A

Adaptors are removed, low-quality ends are trimmed, and poor reads are filtered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is de novo assembly?

A

Building a genome from overlapping short reads without a reference genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is reference mapping?

A

Aligning reads to a known genome to reconstruct a sample’s sequence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Key Illumina pros?

A

High accuracy (~0.01% error), cost-effective.

20
Q

Key Illumina cons?

A

Short reads (~200–300 bp), poor resolution in repetitive regions, large data storage needs.

21
Q

Difference between genome, exome, and targeted panel sequencing?

A

Whole genome: all DNA; exome: only expressed genes; targeted panel: specific known genes.

22
Q

Key feature of long-read sequencing?

A

Sequences very long fragments (10 kb to >1 Mb), often single molecules.

23
Q

What is SMRT sequencing?

A

Polymerase reads circular DNA in real time using fluorescent bases in zero-mode waveguides.

24
Q

Benefit of circular DNA templates?

A

Enables multiple passes of the same DNA for error correction.

25
How does Nanopore sequencing detect bases?
Measures changes in ionic current as DNA bases pass through a nanopore.
26
What makes ONT useful in the field?
It’s portable (e.g., MinION) and fast (can run on a laptop).
27
Key benefit of ONT for epigenetics?
Detects methylation and modifications on native DNA.
28
Main ONT disadvantage?
Higher raw error rates than short-read NGS (mitigated by high coverage).
29
What are common sources of sequencing errors?
Degraded DNA, contamination, low-quality samples.
30
How is sequencing accuracy improved despite errors?
Increased coverage (e.g., 30–50x) improves consensus confidence.
31
Why is contamination a concern in genome sequencing?
Non-target DNA (e.g., from fungi/bacteria) can be accidentally sequenced and incorporated into the genome assembly.
32
What happens if you submit a contaminated genome to a database?
It may be rejected due to mixed or impure sequences; you must remove contaminants and resubmit.
33
Why is sufficient DNA input important for sequencing?
Low DNA amounts produce weak signals, increasing error rates during sequencing.
34
How are low-DNA samples (e.g., ancient DNA) handled?
Special extraction and sequencing protocols are used.
35
When is Sanger sequencing appropriate?
For small projects targeting 1–2 genes; low-throughput, high accuracy.
36
When to use Illumina?
For high-throughput sequencing where DNA can be degraded or fragmented (short reads).
37
What is PacBio best suited for?
Creating high-quality reference genomes, especially for previously unsequenced species.
38
Why is PacBio costly?
Full genome sequencing and assembly can cost ~£20,000 due to depth and quality.
39
How is PacBio used in agriculture?
Sequencing wild wheat relatives to create reference genomes for breeding programs.
40
What makes Oxford Nanopore suitable for fieldwork?
t's portable (e.g., MinION), fast, and supports real-time, on-site sequencing.
41
How can ONT be used for pathogen surveillance?
Field samples can be sequenced to identify and compare pathogen virulence genes in real-time.
42
Which platforms can detect methylation?
ONT excels in methylation detection; PacBio also supports it but is less effective in plants.
43
Why do we need a heart in larger organisms?
Diffusion is too slow over large distances; the heart enables bulk transport of nutrients, gases, and waste.
44
What happens if you rely on diffusion alone?
Transport over >1 mm becomes inefficient—e.g., 1 cm takes ~15 hours for glucose to diffuse.
45
Why did circulatory systems evolve early?
To overcome diffusion limitations in multicellular organism
46