Genomics & Genetic Sequencing Flashcards

1
Q

How frequent are SNPs?

A

~1 SNP per 1,000 nucleotides, between any two individuals

Any two genomes are separated by ~ 3 million SNPs - still 99.9% identical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Genome composition

A

1.5% protein coding
5% regulatory (gene expression, development)
20-25% genes (including regulatory sequences)
40-50% repetitive DNA (total)
50% unique (single copy) DNA (total)

*does not add up to 100% because of reasons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Short Interspersed Repetitive Elements (SINEs)

A

Ex: Alu family - ~300bp related members, 500,000 copies in genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Long Interspersed Repetitive Elements

A

Ex: L1 family - ~6kb related members, 100,000 copies in genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Satellite DNAs

A

Short repeat sequences organized tandemly (head-to-tail) and in clusters; comprise 10-15% of genome

Families vary with regard to location in genome, total length of tandem array, and length of constituent repeat units that make up the array

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Hot spots for human-specific evolutionary changes

A

Satellite DNAs, including a specific pentanucleotide sequence, found as part of human-specific heterochromatic regions on the long arms of Chr 1, 9, 16, and Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Alpha Satellite Repeats

A

171 bp repeat unit found near centromeric region of all human chromosomes; likely important for chromosome segregation in mitosis and meiosis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Alu family

A

Short interspersed nuclear element (SINE); related repeats, ~300 bp long, dispersed throughout the genome; ~10% of genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

L1 Family

A

Long Interspersed Nuclear Element (LINE); related repeats, ~ 6kb long, dispersed throughout the genome; ~20% of genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Pseudogenes

A

DNA sequences that closely resemble known genes/gene families but are nonfunctional, either as a result of inactivation mutations (nonprocessed) or by retrotransposition (processed)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Retrotransposition

A

Process involving transcription, followed by reverse transcription and integration of that cDNA back into the genome; formation of non-functional, processed pseudogenes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Single Nucleotide Polymorphisms (SNPs)

A

A difference in a single DNA nucleotide base, within a particular gene, that gives rise to 2 discreet alleles; allele frequencies differ in different ethnic groups/populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Simple insertion-deletion polymorphisms (indels)

A

Variation caused by insertion or deletion of segments between 2 and 100 nucleotides; gives rise to 2 discreet alleles - presence or absence of the inserted or deleted segment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Short Tandem Repeat Polymorphisms (Microsatellites)

A

Stretches of repetitive DNA consisting of units of 2-4 nucleotides, repeated between 1 and 30 times; different alleles result from differing numbers of repeated nucleotide units; often used as a marker in forensics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Minisatellites

A

A class of variable number tandem repeats (VNTR); results from insertion of varying numbers of copies of a DNA sequence 10-100bp in length; many alleles, due to variation in the number of copies of the tandem repeat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Variable Number Tandem Repeats (VNTR)

A

Variation in the number of minisatellites that are repeated in tandem

17
Q

Copy Number Polymorphisms (CNPs)

A

Recurring deletions or insertions of larger sections of a chromosome, leading to gaps or duplications; may have two alleles (related to presence or absence of the copy) or multiple alleles (due to number of copies of the segment present)

18
Q

Short Tandem Repeat Polymorphisms (STRPs)

A

Different alleles resulting from variation in the number of nucleotide repeat units within a microsatellite region

19
Q

Polymorphic DNA markers

A

Scoreable differences at known genomic positions; most often microsatellites, SNPs, and CNVs

20
Q

Haplotype Blocks

A

10-50 kb chromosomal regions located in-between rearrangement hot-spots, where recombination is rare; SNPs and marker alleles within blocks tend to be co-inherited and are said to be in linkage disequilibrium (LD)

21
Q

Candidate gene association study

A

Hypothesis-driven approach relying on a priori biological or positional hypothesis; done by genotyping markers within a candidate gene and then comparing allele frequencies in cases versus controls; association does not prove causation by the associated variant but implies at least LD with casual mutation

22
Q

Population stratification

A

Genetic variation within grossly homogenous populations; variation may be linked to disease and therefore cause false positive results in case-control candidate gene studies

23
Q

Genetic-linkage studies

A

Used to study genome segments that are disproportionately co-inherited along with disease to determine if the loci are linked; “multiplex” families with multiple cases of a disease are studied to determine the frequency of recombination events between two loci of interest; a LOD score (logarithm of the odds) of +3 or greater is evidence that two loci are linked

24
Q

Genome-wide association studies (GWAS)

A

Tests many markers (SNPs) across entire genome searching for significantly different allele frequences in cases vs. controls; can identify many genes that contribute to a certain disease

Same as candidate gene case-control association study but tests MANY markers across entire genome; requires larger sample size (>1,000 cases/controls) to correct for multiple testing problem

Can discover new mutations that contribute to disease

25
Q

Chromosomal Analysis

A

Detects anomalies in chromosome number as well as large structural rearrangements (deletions, insertions, etc.) ex: Trisomy 21

26
Q

FISH

A

Flourescence In-Situ Hybridization; flourescently labeled, locus-specific probe (several Kb) can identify micro-deletions or multiple copies of loci of interest in patient DNA (isolated in metaphase or interphase)

Detects chromosomal microdeletion/duplication, chromosomal rearrangements (in cancers), and gene copy numbers

27
Q

Chromosomal microarray analysis (CMA)

A

Test and reference DNA samples are labeled with different colors, mixed, and affixed to an array; oligonucleotide probes containing DNA fragments can examine many different loci simultaneously; abnormal ratios of the colors are indicative of deletions or duplications

Cannot detect balanced translocations, inversions, or point mutations

May also detect benign CNVs (vs. test DNA) creating “noise” on the array

28
Q

Diagnostic Testing

A

A positive genetic test result in a patient with signs or symptoms of a genetic disease can confirm an already suspected disease or diagnose the underlying and current disease

29
Q

Predictive Testing

A

A positive genetic test result in a patient with NO signs or symptoms of genetic disease provides estimate of future disease risk

30
Q

Chaperone-Based Therapy

A

Useful in genetic disorders characterized by a misfolded mutant protein; chaperone can help the mutant protein fold appropriately, restoring protein function

Ex: Small percentage of Fabry disease patients benefit from administration of galactose, a chemical chaperone

31
Q

Retroviral Approach to Gene Therapy

A

Advantages: Vector can accomodate up to 8 kb of added DNA and the integrated DNA is stable

Limitations: Target cell must undergo division for integration of recombinant DNA into host genome; limited use in nondividing cells (i.e. neurons)

32
Q

Adenoviral Approach to Gene Therapy

A

Advantages: Can accommodate inserts of 30 to 35 kb and infect dividing or nondividing cells

Limitations: One case of strong immune response resulting in death

33
Q

Non-Viral approach to gene therapy

A
  1. Naked DN?a (ex: cDNA with regulatory elements in a plasmid)
  2. DNA packaged in liposomes
  3. protein-DNA conjugates
  4. Artificial chromosomes
34
Q

Positional cloning

A

Identifies disease genes by first determining the gene’s location through linkage analysis, followed by attempts to identify the gene on the basis of it’s map position

35
Q

Candidate gene sequencing

A

Hypothesis-driven approach depending on an a priori biological or positional hypothesis; gene is identified and sequenced directly

36
Q

Case-control genetic association test

A

Compares allele frequencies between cases and controls in an ethnically-matched sample; association does not imply causation by the associated variant but does suggest LD with a causal mutation

Population stratification may lead to false positives