Shane 3 (AI) Molecular Diagnostics - Whole Genoms/Whole Exome Sequencing Flashcards

(118 cards)

1
Q

What percentage of the human genome is similar across individuals?

A

99.9%

The differences in the human genome are only found in 0.1% of the genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the significance of genetic mutations in cancer?

A

Mutations are usually found in many different genes.

An example is the BRCA1 mutation, which often occurs alongside other mutations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the relationship between mutations and cancer development?

A

Multiple mutations occur before someone develops cancer.

This indicates that cancer is often the result of cumulative genetic changes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define Mendelian disorders.

A

Mutations in one gene.

Examples include Cystic Fibrosis (CF) and Prader-Willi Syndrome (SNRPN gene).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are some characteristics of Mendelian disorders?

A

Some remain mysterious with unknown causative genes.

This highlights gaps in current genetic understanding.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a method for prenatal diagnosis?

A

Cell-free fetal DNA from the mother.

This method is less invasive compared to traditional techniques.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What was previously used for prenatal diagnosis but is no longer commonly done?

A

Amniocentesis.

It was considered too risky for routine use.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Fill in the blank: Genetic mutations associated with cancer usually occur in _______.

A

[many different genes].

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

True or False: The majority of human genome differences are found in more than 0.1% of the genome.

A

False.

Differences are only in 0.1% of the genome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What year did Watson and Crick infer DNA’s structure?

A

1953

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What significant event in 1977 involved nucleotide sequencing?

A

The first nucleotide sequence of the gene encoding yeast alanine tRNA was reported

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Who introduced initial DNA sequencing methods in 1980?

A

Sanger, Maxam, and Gilbert

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What was the first human linkage map based on in 1983?

A

Restriction fragment length polymorphism

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What significant discovery was made in 1985?

A

Mullis discovered the PCR technique

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What idea was proposed in 1986?

A

The idea of human genome sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What was cloned in 1990?

A

The first human disease gene

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What was accomplished in 1993 regarding homozygosity?

A

The first homozygosity mapping was done

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What was the focus of the first positional cloning done in 1995?

A

A recessive disease gene on the basis of linkage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What was reported in 1996 regarding genome sequencing?

A

The first genome sequence of an organism (Hemophilus influenza)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What major milestone occurred in 2001?

A

The first human chromosome was sequenced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What was the first organism whose genome was sequenced in 2000?

A

Fruit fly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

In what year was the first draft of the human genome sequence published?

A

2001

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What key event occurred in 2003 regarding the human genome?

A

The human genome sequence was completed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What technology emerged in 2005 that transformed sequencing?

A

Massively parallel sequencing platforms (next-generation sequencing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What year saw the introduction of the first NGS instrument?
2009
26
What was the first application of WES reported in 2010?
Proof of principle: disease gene identification by WES
27
Fill in the blank: The PCR technique was discovered by _______.
Mullis
28
True or False: The first individual genome based on NGS was published in 2010.
True
29
What percentage of Watson's genome was published in 2008?
33%
30
What major advancement in genetics was made in 1985?
PCR technique
31
Who received a Nobel Prize for the development of the PCR technique?
Mullis
32
What did Watson's genome reveal about his health?
Predisposed to lots of conditions
33
What is the primary purpose of Whole Genome Sequencing?
To determine the complete DNA sequence of an organism's genome ## Footnote This includes identifying all of the organism's genes and their functions.
34
What is a BAC in the context of genome sequencing?
Bacterial Artificial Chromosome ## Footnote BACs are used to clone large fragments of DNA for genomic studies.
35
What is the first step in the Hierarchical Shotgun Approach?
Fragmenting genomic DNA into large pieces ## Footnote This initial step allows for the organization and mapping of the genome.
36
What does the term 'minimal tiling path' refer to in genome sequencing?
The process of using restriction enzymes to digest DNA into smaller fragments ## Footnote This approach helps to create a more manageable set of sequences for assembly.
37
Fill in the blank: In the shotgun approach, DNA is fragmented into different sizes like _______.
shotgun pellets ## Footnote This metaphor illustrates the random nature of fragmenting DNA.
38
What is the significance of overlapping reads in genome assembly?
They allow for the reconstruction of the original sequence ## Footnote Overlapping reads are crucial for accurate genome assembly.
39
True or False: The process of preparing chromosome spreads is simple and quick.
False ## Footnote Preparing chromosome spreads is described as a tedious process.
40
What is the role of restriction enzymes in the shotgun sequencing process?
To cut DNA into fragments of varying sizes ## Footnote This facilitates the cloning and sequencing of DNA.
41
What are shotgun clones made from?
BAC clones that have been digested into smaller fragments ## Footnote Shotgun clones are the actual sequences that are read during sequencing.
42
What was a major reason why the Human Genome Project took approximately 10 years to complete?
The tedious process of assembling sequences from overlapping reads ## Footnote The complexity of genome sequencing contributed to the lengthy timeline.
43
What does the term 'reads' refer to in the context of genome sequencing?
Short sequences of nucleotides obtained from shotgun clones ## Footnote Reads are the data points used to assemble the genome.
44
What is the primary technique used in whole genome sequencing known as the shotgun approach?
The shotgun approach involves randomly breaking up DNA into smaller fragments and then sequencing those fragments to reconstruct the entire genome.
45
True or False: The shotgun sequencing method requires the genome to be assembled in a specific order before sequencing.
False
46
Fill in the blank: In shotgun sequencing, the DNA is first ______ into smaller fragments.
fragmented
47
Which of the following is a key advantage of the shotgun sequencing method? A) It is labor-intensive B) It requires prior knowledge of the genome C) It allows for rapid sequencing of large genomes
C) It allows for rapid sequencing of large genomes
48
What are the main steps involved in the shotgun sequencing process?
The main steps are fragmentation of DNA, sequencing of the fragments, and computational assembly of the sequences.
49
What does BAC stand for in BAC libraries?
Bacterial Artificial Chromosome
50
True or False: BAC libraries are used to clone large fragments of DNA for genomic studies.
True
51
Fill in the blank: BAC libraries can contain inserts of DNA fragments up to _____ kilobases in size.
300
52
How are BAC libraries utilized in whole genome sequencing?
BAC libraries are used to organize and sequence large sections of the genome, allowing for more efficient mapping and assembly.
53
Which of the following is a primary advantage of using BAC libraries in genomic research? A) Small insert size B) Stability of cloned DNA C) High mutation rate D) Low cloning efficiency
B) Stability of cloned DNA
54
What is a minimal tiling path (MTP) in whole genome sequencing?
A minimal tiling path is the shortest sequence of overlapping DNA fragments that covers the entire genome of an organism.
55
True or False: A minimal tiling path includes gaps between DNA fragments.
False
56
Fill in the blank: The primary purpose of a minimal tiling path is to ______ the genome efficiently for sequencing.
cover
57
Which of the following best describes the utility of a minimal tiling path in genome sequencing? A) Reducing redundancy B) Increasing fragment size C) Enhancing computational complexity D) None of the above
A) Reducing redundancy
58
How does a minimal tiling path contribute to genome assembly?
It provides a framework for assembling overlapping sequences into a complete genome, ensuring all regions are represented.
59
What is the Celera Genomics Approach?
A private method for faster DNA sequencing that skips mapping of clones and goes straight to shotgun clones. ## Footnote This approach allowed for quicker sequencing compared to public methods.
60
What sequencing method did Celera Genomics utilize?
Shotgun sequencing. ## Footnote This method involves directly sequencing random DNA fragments.
61
What is the main advantage of the Celera Genomics Approach over public methods?
It took a lot less time to sequence DNA. ## Footnote This efficiency was due to the direct sequencing of shotgun clones.
62
What percentage of the genome was Celera Genomics attempting to patent?
28%. ## Footnote This indicates their focus on a specific portion of the genome for commercial purposes.
63
What is the significance of skipping the mapping of clones in the Celera Genomics Approach?
It allows for a more rapid sequencing process. ## Footnote Mapping typically requires significant time and resources.
64
Fill in the blank: The Celera Genomics Approach went straight to _______ clones.
shotgun. ## Footnote Shotgun clones are fragments of genomic DNA that are sequenced directly.
65
What is the starting material for Whole Genome Sequencing?
Double-stranded genomic DNA.
66
How are DNA fragments prepared for sequencing?
DNA is fragmented using a sonicater, such as Covaris.
67
What is the size range of DNA fragments for sequencing?
Small fragments, typically 200-500 nucleotides long.
68
How do you verify the size of DNA fragments?
Run the fragments on a gel alongside a size marker.
69
What is added to the DNA fragments to facilitate ligation?
A nucleotides (deoxy version of A nucleotide) to create an A overhang.
70
Why is an A overhang important?
It allows for easier ligation of an adaptor starting with a T.
71
What can you do if the DNA fragments are not the correct size?
Run the fragments on a gel again to check their size.
72
What technique can be used to recover a specific DNA sequence from a gel?
Cut the sequence out of the gel using a blade and melt down the agarose.
73
What can be designed using the adaptor sequences?
Primers for sequencing the DNA.
74
What are the three types of genetic variation among people?
* Single nucleotide variants (SNVs) * Short insertions/deletions (indels) * Structural variation (SV) ## Footnote Each type represents different forms of genetic differences that can occur in the human genome.
75
Define single nucleotide variants (SNVs).
Variations at a single nucleotide position in the genome, with approximately 3.7-4.1 million occurring per person. ## Footnote SNVs can affect gene function, but many do not result in changes to the amino acid sequence.
76
What is the range of short insertions/deletions (indels) in base pairs?
1-100 bp ## Footnote Individuals can have approximately 300-600,000 indels in their genomes.
77
What is structural variation (SV) in genetics?
Large blocks (1000 bp or more) of DNA that are deleted, inserted, or inverted, with over 2,500 differences relative to the reference human genome sequence. ## Footnote SVs can be associated with various genetic conditions.
78
What technology is primarily used for sequencing complete genomes?
Massively parallel DNA sequencing, with Illumina being the most efficient. ## Footnote Other sequencing technologies can also be employed.
79
What percentage of people have structural variations associated with genetic conditions?
26% ## Footnote Structural variations can lead to serious genetic conditions, especially if DNA is deleted.
80
What is the significance of short insertions/deletions in the human genome?
They are very common, and a person can have thousands of these variations. ## Footnote Many occur in non-coding regions, leading to no observable effects.
81
True or False: Most single nucleotide variants change the amino acid that is expressed.
False ## Footnote Many single nucleotide variants do not result in changes to the protein produced.
82
Fill in the blank: A person can have approximately _______ differences in structural variation compared to the reference human genome.
2500 ## Footnote This includes large changes such as deletions, insertions, and inversions.
83
What notable aspect is mentioned about Watson's genome?
It contains lots of mutations and was sequenced when he was about 80 years old. ## Footnote This highlights the accumulation of genetic variations over a lifetime.
84
What was the first complete genome sequenced using massively parallel DNA sequencing?
The genome of James D. Watson ## Footnote Sequenced by Wheeler DA et al. in 2008
85
How long did it take to sequence James D. Watson's genome?
Two months ## Footnote Using massively parallel sequencing technology
86
What was the redundancy level achieved in sequencing James D. Watson's genome?
7.4-fold redundancy ## Footnote Indicates the depth of coverage during sequencing
87
How many single nucleotide polymorphisms were identified in Watson's genome?
3.3 million ## Footnote Includes 10,654 that cause amino acid substitutions
88
What is the range of base pairs for copy number variations identified in Watson's genome?
26,000 to 1.5 million base pairs ## Footnote Refers to large-scale gain and loss of chromosomal segments
89
What percentage of James D. Watson's genome variants caused amino acid substitutions?
Approximately 0.3% ## Footnote 10,654 out of 3.3 million variants
90
What types of polymorphisms were identified in Watson's genome?
Small-scale insertion and deletion polymorphisms ## Footnote Ranging from 2 to 40,000 base pairs
91
True or False: James D. Watson was about 80 years old when his genome was sequenced.
True ## Footnote Indicates the age of Watson at the time of sequencing
92
Fill in the blank: The sequencing of Watson's genome revealed _______ variants.
[3.3 million] ## Footnote Includes a significant number of mutations
93
What is the significance of the amino acid substitutions found in Watson's genome?
They can be very similar or very different, potentially pathologic ## Footnote The impact of substitutions varies based on the properties of the amino acids
94
What was the primary goal of the 1000 Genomes Project?
To sequence the genomes of individuals from populations around the world ## Footnote The project aimed to provide a comprehensive map of human genetic variation.
95
How many individuals' genomes were sequenced in the 1000 Genomes Project?
1,092 individuals ## Footnote These individuals were selected from diverse populations globally.
96
What types of genetic variations were mapped in the 1000 Genomes Project?
SNPs, short insertions and deletions, larger deletions ## Footnote Specifically, the project identified 38 million SNPs, 1.4 million short insertions and deletions, and more than 14,000 larger deletions.
97
What is the significance of the data obtained from the 1000 Genomes Project?
Used to screen variants in genetic disorders and cancer genome projects ## Footnote This data helps researchers understand the genetic basis of diseases.
98
What is a SNP?
Single Nucleotide Polymorphism ## Footnote SNPs are the most common type of genetic variation among people.
99
What is considered normal variation in humans according to the 1000 Genomes Project?
Genetic variations found in healthy individuals ## Footnote These variations include common SNPs, insertions, and deletions.
100
What is the current status of the 1000 Genomes Project?
Morphed into 10,000 genomes and aims for 100,000 genomes ## Footnote The project continues to expand its research on genetic variation.
101
Fill in the blank: The 1000 Genomes Project provides a map of _______ variations in the human genome.
genetic ## Footnote This includes SNPs, insertions, deletions, and other variations.
102
True or False: The 1000 Genomes Project only sequenced individuals with genetic disorders.
False ## Footnote The project focused on healthy individuals to establish normal genetic variation.
103
What is the purpose of screening out common SNPs in genetic research?
To focus on rarer SNPs ## Footnote Rarer SNPs may have more significant implications for understanding diseases.
104
Where is the information from the 1000 Genomes Project stored?
In libraries like Ensembl ## Footnote Ensembl is a genome browser that provides access to genomic data.
105
What is a shotgun library in molecular biology?
A collection of DNA fragments used for sequencing and mapping ## Footnote Shotgun libraries are created to facilitate the sequencing of large genomes by breaking them into smaller pieces.
106
What percentage of the human genome is made up of exons?
Approximately 1% ## Footnote Exons constitute about 180,000 exons and 30 million base pairs of the human genome.
107
What percentage of disease-causing mutations are found in protein-coding genes (exons)?
85% ## Footnote This highlights the importance of exome sequencing in identifying genetic diseases.
108
What system is used for hybrid capture of target sequences in exome sequencing?
Agilent SureSelect Capture Array ## Footnote This system utilizes a solution-based approach for capturing specific DNA sequences.
109
What is the purpose of the shearing step in preparing genomic DNA?
To produce small DNA fragments ## Footnote This step is crucial for creating a library suitable for sequencing.
110
What are biotinylated RNA library baits used for?
To hybridize with sample DNA for target sequence capture ## Footnote These baits are designed to be complementary to specific exon sequences.
111
What is the role of streptavidin in the capture process?
It binds to biotin to isolate target DNA sequences ## Footnote Streptavidin has a high affinity for biotin, allowing for effective separation of bound and unbound sequences.
112
How are non-exons removed during the capture process?
By washing away unbound DNA after magnetic pull-down ## Footnote This ensures that only the desired exonic sequences remain for further analysis.
113
Fill in the blank: The process of _______ involves digesting RNA probes to isolate pure DNA.
RNase treatment ## Footnote RNase is an enzyme that specifically digests RNA, leaving behind only DNA.
114
What type of assay is used to pull down target sequences in the capture process?
Magnetic pull-down assay ## Footnote This assay utilizes magnetic beads coated with streptavidin to isolate biotin-labeled DNA.
115
True or False: Most of the human genome is coding DNA.
False ## Footnote Most of the human genome is non-coding; only about 1% consists of exons.
116
What is the primary goal of whole exome sequencing?
To identify mutations in protein-coding regions of the genome ## Footnote This approach focuses on the parts of the genome that are most likely to affect disease.
117
What are cRNA baits?
Complementary RNA baits used to capture specific DNA sequences ## Footnote These baits are critical for targeting exons during exome sequencing.
118
What is the significance of high-temperature conditions during hybridization?
To maintain stringency and prevent unwanted binding ## Footnote High temperature helps ensure that only perfectly matched sequences hybridize.