Genome organization Flashcards

(44 cards)

1
Q

number of human chromosome pairs

A

23; 22 autosomal, 1 pair sex

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

base pairs in haploid genome

A

3X10^9

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what percent of the genome codes proteins

A

<1.5%, only about 5% contains regulatory information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

point of attachment for cell microtubules for cell division

A

centromere

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

centromeric DNA sequence

A

alpha satellite family, and appx 171-bp unit critical for the attachment of microtubules of the spindle apparatus

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

the Alu family and the LINE (L1) family

A

two salient examples of repetitive DNA sequences that have implications in disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

define: single nucleotide polymorphism

A

occurs when a single nucleotide differs in a sequence that ultimately accounts for two alleles.
average of 1 SNP for every 1000 bp

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

the three gene poor chromosomes

A

13, 18, 21; these are the only ones that can have viable life

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

unstable dynamic regions

A

common mutations, so common sites for disease

eg. SMA (Chr 5q13); DiGeorge syndrome (Chr 22q); 12 diseases (1q21)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

the dynamic nature of the human genome

A

~30 new mutations for each new individual, Shuffling of regions at each meiosis due to recombination
Can produce somatic DNA changes as well as germ-line DNA changes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Gene-rich regions/chromosomes

A

chromosome 19

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

the non-random distribution) of GC-rich (38% of genome) and AT-rich (54%) regions is called:

A

clustering. this is the basis for chromosomal banding patterns (cytogenetics, karyotype analysis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

euchromatic regions

A

more relaxed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

heterochromatic regions

A

more condensed; repeat rich

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

percentage of DNA represented by genes (exons, introns, flanking sequences involved in regulating gene expression)

A

20-25%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

percentage of DNA that is “single copy” sequences

A

50%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

percentage of DNA that are classes of “repetitive DNA”

A

40-50%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

how big is the dystrophin gene (on the X chromosome)

A

big, over 2 million base pairs, less than 1% are coding exons. mutations lead to Duchenne muscular dystrophy

19
Q

very large arrays of tandemly repeating, non coding DNA

A

satellite DNA; the main component of functional centromeres (alpha satellite), and heterochromatin, or even in human-specific heterochromatic regions on the long arms of Chr 1, 9, 16 and Y (hotspots for human-specific evolutionary changes)

20
Q

of SINEs: Short Interspersed repetitive Elements

A

Alu family, - ~300 bp related members

- 500,000 copies in genome

21
Q

of LINES: Long Interspersed repetitive Elements

A

L1 family,- ~6 kb related members

- 100,000 copies in genome

22
Q

Repeats may facilitate aberrant ______________ between different copies of dispersed repeats leading to diseases

A

Non-allelic homologous recombination (NAHR)

23
Q

Retrotransposition may cause __________ inactivation of genes

24
Q

giemsa banding

A

A,T rich regions (gene poor) take up the dark stain,

25
Insertion-deletion polymorphisms (indels):
Minisatellites and microsatellites
26
Minisatellites
tandemly repeated 10-100 bp blocks of DNA VNTR (variable number of tandem repeats) dozens in the genome
27
Microsatellites
-di-, tri-, tetra-nucleotide repeats ->5 x 104 per genome STRPs (Short Tandem Repeat Polymorphisms) thousands in the genome
28
Copy number variations (CNVs)
- variation in segments of genome from 200 bp – 2 Mb - can range from one additional copy to many - array comparative genomic hybridization (array CGH) - implicated in lots of diseases
29
typical gene
intron containing
30
retroposed gene
intronless
31
evolutionary hotspots on chr 1, 9, 16, Y
a particular pentanucleotide sequence on human specific heterochromatic regions
32
SNPs can easily be detected by
PCR
33
describe gene families
Gene family is composed of genes with high sequence similarity (e.g. >85-90%) that may carry out similar but distinct functions
34
how do gene families arise?
through gene duplication, thus is a major mechanism for evolutionary change
35
beta globin # of exons? mutations can cause?
3 exons; hemoglobin deficiencies
36
BRCA1 # of exons? mutations can cause?
24 exons; inherited breast and ovarian cancer
37
MYH7 # of exons? mutations can cause
40 exons; inhereted cardiomyopathy
38
CNV loci may cover ________% of the genome
12
39
examples of CNV implicated in human disease
1q21.1; 9p13.3-9q21.12, 5q13.3
40
CNV example of a Link between evolutionarily adaptive copy number increases and increase in human disease
1q21.1 macrocephaly, autism
41
1q21.1 deletions
microcephaly, schizophrenia
42
genome wide association studies
focuses on SNPs
43
nextgen sequencing limitations
relies on short read sequences; Complex, highly duplicated regions are typically unexamined. Such regions are implicated in numerous diseases, e.g. 1q21
44
explain "missing heritability"
Many large-scale studies implicate loci (e.g. SNPs) that account for only a small fraction of the expected genetic contribution. answer may lie in unexamined DNA