Di Resta Flashcards
(43 cards)
Positives of genetic testing
Carrier screening (eg. in pregnancy), predictive testing, identification of susceptibility genes
Negatives of genetic testing
Privacy issues
Ethical testing - consanguinity
incidental findings, which can lead to psychological impact
Informed consent is needed
Mendelian disorders
One causative gene for a specific gene
Eg. DMD for Duchenne
Causative relationships, because the presence of a mutation on that specific gene gives the disease phenotype
Opposite in oligogenic disorders
Utility of genetic testing
- Mendelian disorders
- Susceptibility genes - associated with increased risk of having the disease
- Chromosomal alterations
- Carrier screening
When is It possible to perform genetic testing
- Pre-natal : analysis of specific mutations in the fetus. Either invasive or non-invasive techniques
- Post-natal
DNA extraction samples
- Peripheral blood
- Chorionic villi
- Amniocytes
- Saliva
- Bone and teeth - rare, used in autopsies
Genetic counseling
Multidisciplinary process involving patient, the referring clinician and the genetical, and the lab doctor
Evaluates clinical data, pedigrees and family history to identify the appropriate genetic test
Genetic lab
Pre PCR area –> DNA is extracted, ready for amplification
PCR area
Post-PCR area –> after the amplification we have amplicons, which undergo sequencing
Storage
issue of privacy - physical storages are used
Supercomputers are needed to store the terabytes
First generation sequencing
PCR amplification
1. Sanger or direct sequencing (chain terminator sequencing)
highest accuracy
Second generation sequencing
Library preparation (amplification) for the region of interest
1. Roche 454 (pyrosequencing)
2. Illumina (bridge amplification) - used for diagnostic testing
3. Ion torrent (label free technology)
The read length is shorter than in Sanger sequencing (200bp)
Its data accuracy depends on coverage
Library amplification depends on coverage
It Is used to perform re-sequencing, which is seuqnecing against the reference genome
Third generation sequencing
No amplification needed, and NOT diagnostic
Avoids error by sequencing the entire molecule of DNA, but has the least accuracy
1. Heliscope (sequencing by synthesis)
2. Pacific bioscience (real time sequencing)
3. Nanopore technology
Sanger sequencing
- Amplification of the DNA belonging to the region of interest
- uses chain terminators, which are linked with a fluorophore
- Obtain different amplicons of different lengths
- Can use an electrophoretic gel to measure the migration of the molecule based on its molecular weight
Used to obtain long reads (max. 1000 base pairs)
highest accuracy
Uses ddNTPs
Limitations of sanger sequencing
Can’t be used for the diagnosis of oligogenic disorders eg. cardiomyopathies (which have a panel of 20-50 genes associated with the phenotype)
Sanger sequencing also takes around 3 months
Used only for the analysis of main causativee genes (the gene with the highest penetrance)
Thus, it has no clinical Utility in oligogenic disorders
Coverage
The proportion of the genome, or the targeted region, that has been sequenced at least once
Google: It is the proportion or percentage of a genome that has been sequenced at a certain depth. It gives an idea of how much of the entire genome has been effectively read and is usually expressed as a multiple of the genome’s size.
Mean of the total number of reads that cover the target DNA
it is influenced by the number of clusters generated on the flow cell:
- the size of the target region analyzed
- the number of samples analyzed
Ion torrent
Second gen. sequencing technique based on hydrogen ions
The incorporation of a nucleotide releases a H+, which is detectable at a specific voltage chain
Uses flow chips, with microwells.
The concentration of the hydrogen ion in solution changes the pH
In a homopolymeric region, if we have more than one nucleotide of the same type the change in pH is proportional to the number of nucleotides incorporated
Illumina sequencing
- Library prep
- Bridge amplification
- Sequencing by synthesis
- Alignment and data analysis
The index, a short nucleotide sequence identify the molecule of DNA belonging to the specific sample. It allows the analysis of multiple samples in the same analysis without confusing them - called pooling
External to the indexes are the adaptors, which are complementary to the adaptors on the flow cell
Forms bridges
The natural competition between dNTPs reduces incorporation bias
Using a higher number of reads allows us to discriminate artifacts
Quality score
Prediction of th probability of an error in base calling
- Q30: Error occurs 1/1000 times
- the lower the Q core, the higher the false positive variant calling, leading to inaccurate conclusions and higher costs of validation
Read
Sequence of a single enriched fragment of sample DNA
Read depth
Total number of bases sequenced and aligned at a given reference base position
NGS data analysis
After sequencing, it involves forming a file that is readable
consists of three phases: Primary, secondary and tertiary
Primary analysis
Image capturing and processing - fastQ file
FastQ file
Master file of all the sequencing testing. Formed at the end of the primary analysis
Name of the sequencer, number of clusters, all the reads of the specific sample with all the other information about the testing
must be stored forever
BAM file
Secondary analysis –> data filtering and variant calling
It involves comparing the reads with the reference genome
uses IGV software
Heterozygous variation - when 50% of the alleles of the sequence doesn’t match the allele of the reference genome
If all reads, however, are detecting one allele, this is a homozygous variant. In this case, the reference genome contains the minority allele
Problem: Correct alignment also in the presence of a pseudogene, and computation problems during alignment