Genomics Flashcards

Question

Read depth/coverage

Answer 1

- The average number of times each base appears in the final assembly. - A coverage of 10X means that each base is on average found in 10 reads. - The deeper the coverage, the more clearly any sequence or structure changes can be discerned from sequence error

Answer 2

- The number of copies of the genome in the organism. - Bacteria =1; Human=2; Potato=4; Strawberry=8 - The higher the ploidy, the harder it is to accurately assemble.

Answer 3

- to look for a variant - identify differences between strains/organisms/individuals - assembly against a reference is much easier than de-novo sequecing - may impact how you are treated medically in the future/potential of personalised medicine

Answer 4

- different to reference sequence - gap compared to reference sequence - duplicated gene or region?

Answer 5

- deleting a whole genome, hard to look for something that's not there - duplication is same kind of problem - inversion, if sequence is short its hard to tell

Answer 6

- long read (10kb+) - high error rate (14%) - cyclising the template means it can be read many times and an accurate consensus obtained - iontorrent works in a similar way but detects the pH change on nucleotide addition

Answer 7

- as DNA is passed through the nanopore by the molecular motor under the influence of a potential - the current changes in a detectable way depending on the bases occluding the pore - the current can be interpreted to read the DNA sequence - to improve accuracy, a hairpin adapter is ligated to the end of the DNA fragment - this causes both strands of DNA to be read sequentially

Answer 8

Accuracy - at present around 95-98% Throughput - much slower than Illumina (5Gb/48hr vs 150Gb/96hr) Toolset - these are new technologies and the analysis tools are still being developed

Answer 9

- most sequencing is by synthesis - current sequencing technologies can produce terabases per day - assembly is a challenge, especially for large genomes - repetitive regions are challenging - small changes compared to a reference are challenging - new technologies are helping to solve the challenge - careful experimental design can help solve the challenge - long reads - lower throughput but better for genome structure - short reads - higher throughput but better for sequence accuracy

Answer 10

we differ from each other in small polymorphisms and structural variation

Answer 11

A standard sequence against which we can compare other sequences

Answer 12

- The reference is from a very small subset of donors and is a mosaic - People vary, in some regions far more than others. (GRCh38 has 261 alternate scaffolds) - The reference is incomplete (603 gaps)

Answer 13

a collection of genomic variation for human and other species

Answer 14

- substitution - deletion - insertion

Answer 15

changes in the overall structure of the genome - duplication - loss - translocation - inversion - repeat - deletion

Answer 16

- a human pathogen that lives in the upper intestinal tract - can cause conjunctivitis and meningitis - sequenced in 1995 - first whole genome sequences using a shotgun method - since then 28 different strains have been sequences and the pathogens have been identified

Answer 17

the virulence of the organism

Answer 18

the degree of pathogenicity within a group or species of parasites as indicated by case fatality rates and/or the ability of the organism to invade the tissues of the host

Answer 19

genes which produce products essential for virulence

Answer 20

- in clustered regions on the chromosome | - provided support for the concept of lateral gene transfer

Answer 21

In some cases the genes coding for a specific cluster of genes can arise from a different source. Instead of progressive stepwise evolution, a cluster of genes from ‘foreign’ DNA is incorporated as a plasmid or integrated into the genome

Answer 22

- smallest known free living organism- commensal in the genitourinary tract

Answer 23

- human pathogen - causes trachoma (blindness), pharyngitis, bronchitis - obligate intracellular parasite transmitted by sexual contact

Answer 24

- prevents fusion of phagosome and lysosome | - takes in ATP from host cells as it cannot produce it itself

Answer 25

- metabolic pathways are patchy - part of TCA missing - cannot synthesise ATP - doesn't appear to synthesise amino acids - contains a well defined recombinase pathway - reported to recombine and reshuffle the genome quite readily - contains many fatty acid and phospholipid synthesis

Answer 26

- resources of the host - taking things is more efficient than making them - their genomes have adapted and lost key metabolic processes

Answer 27

there is no competitive advantage for a bacterium to keep DNA that is of no use or redundant

Answer 28

- take a small genome and make it smaller by knocking out genes

Answer 29

- 'mobile' DNA element that codes for enzymes that allow it to relocate in the genome - can be many 10's of kb long and include many genes

Answer 30

- reduced essential gene count from 482 to 389 - synthesised the entire genome and transplanted into an empty cell - the new synthetic organism grew

Answer 31

- virus - ingest DNA. foreign DNA can be taken up by a variety of mechanisms Phage infection. direct introduction of DNA into cells (competent cells). - ingest organism. by ingesting an organism then using its DNA - conjugation (mating). Exchanging DNA with a related organism. Inside the cell the DNA could stay as a plasmid or integrate into the host genome via viral integrases - OR transposon jumps from ingested DNA to genome (mobile DNA element)

Answer 32

- using phylogeny (horizontally acquired gene cluster) - using sequence properties e.g. GC content - genes incorporated from different sources may have different baseline GC content, or different kmer usage

Answer 33

genes are transferred laterally between species e.g. up and down between C and D

Answer 34

- apparent close relationship of lineages inferred from sequences of x reflects the lateral transfer of this gene rather than the phylogeny of the organisms

Answer 35

Based on multiple genes more accurately reflects the organismal phylogeny

Answer 36

- the phylogeny of four hypothetical prokaryote species, two of which have been involved in a lateral transfer of gene x - a tree based only on gene x shows the phylogeny of the laterally transferred gene, rather than the organismal phylogeny - a consensus tree based on multiple genes is more likely to reflect the true organismal phylogeny, especially if those genes come from a stable core of genes involved in fundamental processes

Answer 37

``` Cell attachment - adhesins, fimbrae etc. Capsules - prevent attack by macrophages and digestion Degrading enzymes - hyaluronidase, proteases, lipases ```

Answer 38

Toxins - endotoxins and exotoxins Immunosuppressants - e.g. anti-immunoglobulin proteases

Answer 39

part of the bacterial structure e.g. lipopolysaccharide

Answer 40

secreted by bacteria e.g. shiga toxin, pertussis toxin, cholera, botox

Answer 41

toxins etc and where they are coded

Answer 42

- they have incorporated the virulence factors into their genome - do not have a plasmid

Answer 43

- two plasmids - pXO1 contains the toxins - pXO2 produces the capsule, preventing phagocytosis and is used for immunization of domesticated animals worldwide

Answer 44

has lost the pXO2 plasmid

Answer 45

In 2010 a group of researchers identified a bacterium responsible for a fatal anthrax-like disease in chimpanzees which had closer sequence similarity to B.cereus and B.turingiensis than to B.anthracis but contained the pXO1 and pXO2 plasmids (and a third plasmid.)

Answer 46

sequencing all the RNA molecules in a cell

Answer 47

Sequence every organism in the environment

Answer 48

if a gene isn't required then it tends to be lost

Answer 49

the genes tell us about the life of the organism

Answer 50

they are reshaped with additional plasmids, transposons etc. to add new functions

Answer 51

... new areas for study

Answer 52

identify disease loci through genome wide association studies

Genomics Flashcards

(76 cards)