Human Genome Organization Flashcards

(35 cards)

1
Q

size of haploid human genome sequence

A

3 x 10^9 bp

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Human genomic DNA is distributed on __________ chromosomes

A

46

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The chromosomes are found in __________ pairs:

1) 22 \_\_\_\_\_\_\_\_\_\_ 
2) 1 \_\_\_\_\_\_\_\_\_\_
A

23
autosomes (1-22)
pair of sex chromosomes (XX or XY)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Each chromosome is believed to consist of a __________

A

single, continuous DNA double helix

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Human Genome

A
  • is a record of human evolutionary history
  • Reflects results of different selection pressures that have occurred over evolutionary time and shaped our genome (and shaped us)
  • Genes and genomic features that have been adaptive have been retained
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Genotype + environment = __________

A

phenotype

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Random genomic variation

A
  • is the fuel of evolution
  • Random variation in a highly ordered structure = almost always deleterious consequences
  • Genetic disease is the price we pay as a species to continue to have a genome that can evolve, i.e., that can adapt to new and changing environments
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The human genome is __________ and continues to evolve

  • ~30 new mutations occur in each individual
  • Shuffling of regions at each meiosis due to __________
  • Can produce __________ DNA changes as well as __________ DNA changes
A

dynamic
recombination
somatic
germ-line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Organization of the genome

A
  • Gene-rich regions/chromosomes
  • Gene-poor regions/chromosomes
  • Stable regions: majority of genome
  • Unstable, dynamic regions; many are disease-associated
  • GC-rich regions (38% of genome), AT-rich regions (54% of genome)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

__________ (i.e. non-random distribution) of GC-rich and AT-rich regions is basis for chromosomal banding patterns (cytogenetics, karyotype analysis)

A

Clustering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Clustering (i.e. non-random distribution) of GC-rich and AT-rich regions is basis for __________ (cytogenetics, karyotype analysis)

A

chromosomal banding patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Euchromatic regions

A

more relaxed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

heterochromatic regions

A

more condensed; more repeat-rich

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Genome sequencing effort focused on __________ region

A

euchromatic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

__________ regions essentially unsequenced

A

Heterochromatic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Genome composition

A

1) 1.5% is translated (protein coding)
2) 20-25% is represented by genes (exons, introns, flanking sequences involved in regulating gene expression)
3) 50% “single copy” sequences
4) 40-50% classes of “repetitive DNA” - Sequences that are repeated hundreds to millions of times

17
Q

Tandem repeats

A

i.e. “satellite DNAs”

18
Q

examples of tandem repeats

A
  • Some are in different parts of genome, e.g. used as the basis for cytogenetic banding
  • Some (a particular pentanucleotide sequence) are found as part of human-specific heterochromatic regions on the long arms of Chr 1, 9, 16 and Y (hotspots for human-specific evolutionary changes)
  • “α-satellite” repeats (171 bp repeat unit) found near centromeric region of all human chromosomes; may be important to chromosome segregation in mitosis and meiosis.
19
Q

Dispersed repetitive elements

A
  • Alu family
  • L1 family
  • Alu’s and L1’s can be of significant medical relevance
20
Q

In dispersed repetitive elements, __________ may cause insertional inactivation of genes

A

Retrotransposition

21
Q

In dispersed repetitive elements, __________ may facilitate aberrant recombination events between different copies of dispersed repeats leading to diseases

22
Q

Alu family

A

e. g. of SINEs: Short Interspersed repetitive Elements
- ~300 bp related members
- 500,000 copies in genome

23
Q

L1 family

A

e. g. of LINES: Long Interspersed repetitive Elements
- ~6 kb related members
- 100,000 copies in genome

24
Q

Insertion-deletion polymorphisms

A

Minisatellites

Microsatellites

25
Minisatellites
- tandemly repeated 10-100 bp blocks of DNA | - VNTR (variable number of tandem repeats)
26
Microsatellites
- di-, tri-, tetra-nucleotide repeats - ->5 x 104 per genome - STRPs (Short Tandem Repeat Polymorphisms)
27
Single Nucleotide Polymorphisms (SNPs)
- frequency of 1 in ~1000 bp | - PCR-detectable markers, easy to score, widely distributed
28
Copy number variations (CNVs)
- variation in segments of genome from 200 bp – 2 Mb - can range from one additional copy to many - array comparative genomic hybridization (array CGH) - loci may cover 12% of genome
29
Gene family is
composed of genes with high sequence similarity that may carry out similar but distinct functions
30
__________ arise through gene duplication, a major mechanism underlying evolutionary change
Gene families
31
Some CNV regions are involved in rapid & recent evolutionary change. Such regions are often
- enriched for human specific gene duplications - enriched for genome sequence gaps - enriched for recurrent human diseases
32
Role of genome architecture
Link between evolutionarily adaptive copy number increases and increase in human disease
33
Limitations of genome sequencing and genotyping platforms
Nextgen DNA sequencing and Genome-wide association studies (GWAS)
34
Nextgen DNA sequencing
– No mammalian genome has been completely sequenced & assembled – Nextgen sequencing relies on short read sequences ---Complex, highly duplicated regions are typically unexamined ---Such regions are implicated in numerous diseases, e.g. 1q21
35
Genome-wide association studies (GWAS)
– “Missing heritability” for complex diseases: Many large-scale studies implicate loci (e.g. SNPs) that account for only a small fraction of the expected genetic contribution – Many regions of the genomes are unexamined by available “genome-wide” screening technologies: is this where the “missing heritability” lies?