Why sequence the human genome Flashcards
(24 cards)
why was the human genome sequenced
- identify all human genomes and their roles
- analyze genetic variation among humans
- sequence the genomes of several model organisms used in genetics
- develop new sequencing techniques and computations analysis
- to share genome information with scientists and the general public as fast as possible
what is a genome
a complete set of DNA of ana organism, including all its genetics
what is genomics
the study of genomes
what are key findings of the Human Genome
- There are fewer genes than expected
- Less than 2% of our genome codes for
proteins - The genome is dynamic
- We still don’t know what many of our protein
coding genes do - Most human genes are related to those of
other animals - All humans are 99.9% similar at sequence level
what have they identified so far about the human protein coding genes and their roles
- define a gene and search for things that look like genes in sequence
- less than 2% are coding sequences (exons)
- 20% are introns
- there are more of less 20 000 genes
- many (about 20%) still have an unknown function
human genomes are 99.99% the same, so what does that little bit of difference mean
there is still a lot of variation between humans, which is important
how can that 0.01% change range between humans
can range from single base to chromosome arrangements
what are single nucleotide polymorphisms (SNPs)
are sites in the DNA that commonly vary within a population
how common are SNPs
around 1 in every 300 nucleotides
where does an individual get their SNPs from
mostly from their parents
SNPs don’t do anything, so why are they still important
they are inherited variants which can tell a lot about your common variants as the diversity in genome sequencing adds to variation on record
when analyzing common variants (genotyping), what can it tell you
- who you are related too
- where (some of) your ancestors come from
- disease risk/association (masked data outside the USA)
- if you will lose your hair
- your muscle type
- how you may respond to drugs
- data can also be used in crime solving
what does SNPs reveal about of our ancestral species
that interbreeding between them and other hominins occurred
what are short tandem repeats (STRs)
repeats of 2-5 nucleotides, found in specific regions of genome
are the 2 alleles a person inherits from each parent the same
no - they can have different lengths
what can STRs be used for
the create genetic profiles or DNA fingerprints
what are InDels
small insertions or deletions
what is a common disease of InDels
the genetic disease Cystic Fibrosis - which is caused by CFTR deltaf508 which is a 3 nucleotide deletion
what type of variant is copy number variants
structural variant
what are copy number variants (CNVs)
chunks of DNA (more than 500 base pairings) that are present at different amounts of copy numbers relative to a reference genome
what can be the cause of CNVs
deletion or duplication
how big can CNVs span
can span over multiple genes
how many CNVs can be found between and within human genes
10 000 CNVs
what are the genes associated with that are found in CNVs
sensory perception and immunity