6.21 - Manipulating genomes Flashcards
(41 cards)
genome
all the genetic material an organism contains. Includes the base sequence in the nucleus and in mitochondria
exons
coding regions of DNA
introns
non-coding regions of DNA. Removed from mRNA before it is translated into a polypeptide chain. Contains satellite DNA
DNA profiling
producing an image of the patterns of DNA in an individual to help identify an individual in forensics or determine familial relationships through the similarity of satellite DNA patterns
Satellite DNA patterns
- short sequences of DNA that are repeated many times
- always appear in the same positions on each chromosome
- the number of repeats of each minisatellite or microsatellite DNA varies between each person
- only identical twins have the same satellite pattern
VNTRs (minisatellites)
variable number tandem repeats
20-50 base pairs repeated 50 to several hundred times
STRs (microsatellites)
short tandem repeats
2-4 base pairs repeated 5-15 times
What is gel electrophoresis
- a technique used to separate DNA fragments according to their size
- DNA samples are loaded into wells (indentations) at one end of a gel, and an electric current is applied to pull them through the gel.
- DNA fragments are negatively charged, so they move towards the positive electrode
- Because all DNA fragments have the same amount of charge per mass, small fragments move through the gel faster than large ones.
- When a gel is stained with a DNA-binding dye, the DNA fragments can be seen as bands, each representing a group of same-sized DNA fragments.
how to produce a DNA profile
- Extraction
DNA is extracted from the sample. DNA from a small sample can be replicated using PCR - Digestion
The strands of DNA are cut into small fragments by restriction endonucleases. Scientists use a mixture of endonucleases that cut DNA at specific points that leave the satellite DNA intact - Separation
The fragments of DNA are separated to form a clear recognisable pattern using gel electrophoresis. The gel is then immersed in alkali in order to separate DNA into single strands. DNA pattern transferred from gel to nylon membrane using Southern blotting - Hybridisation
Radioactive or fluorescent DNA probes are added (that are complementary to a known DNA sequence) and bind to complementary strands of DNA under particular conditions of pH and temperature. These DNA strands identify microsatellite regions as they are more varied than minisatellite regions. The excess probes are washed off - Seeing the evidence
X-ray images or UV light gives a pattern of bars which are unique to everyone except identical twins. This is a DNA profile
PCR
A version of the natural process in which DNA is replicated, so allows scientists to produce a lot of DNA from a small original sample
1. Separating the strands
- temperature in PCR increased to 90-95°C for 30 seconds
- this denatures the DNA by breaking the hydrogen bonds holding the DNA strands together so they separate
2. Annealing the primers
- the temperature is decreased from 55-60°C and the primers bind (anneal) to the ends of the DNA strands
- the primers are required for the replication of strands to occur
3. Synthesis of DNA
- temperature increases to 72-75°C for one minute, as it is the optimum temperature for DNA polymerase
- DNA polymerase adds bases to the primer, building up complementary strands of DNA and so producing double-stranded DNA identical to the original sequence. Taq polymerase is used, obtained from thermophilic bacteria
uses of DNA profiling
- forensic science in criminal investigations. DNA left at a crime scene matched to suspects/criminal databases
- prove paternity of child when in doubt or in immigration cases
- identifying species to which an organism belongs
- identifying people who are at risk of developing particular diseases as certain microsatellite patterns can be linked to an increased incidence of particular diseases such as heart disease
origins of DNA sequencing
- Frederick Sanger developed techniques for sequencing nucleic acids from viruses and then bacteria
- it involved radioactive labelling of bases and then gel electrophoresis
- Sanger sequencing can read 500-800 bases at a time and has gone on to sequence whole genomes
The human genome project (HGP)
An international project in which scientists from a number of different countries worked to map the entire human genome, making the data freely available to scientists all over the world
- once a genome has been assembled, scientists identify the parts of a genome that code for specific characteristics or regions that are linked to a particular disease
capillary sequencing
- where DNA sequences are separated by length in minute capillary tubes (line gel electrophoresis)
next generation sequencing
- Sanger sequencing can be difficult and time consuming even for small sequences of DNA
- technological developments have led to new automated sequencing processes
- sequencing reaction takes place on a plastic slide called a flow cell (instead of electrophoresis)
- millions of fragments of DNA are attached to the slide and are replicated
using PCR - the replication produces clusters of identical DNA fragments that have the same coloured terminator
- all of the clusters are sequenced and imaged at the same time (massively parallel sequencing)
- it is integrated with computer technology which is highly efficient, so can sequence genomes quickly
terminator bases
- modified versions of the 4 nucleotide bases that stop DNA synthesis when they are included
- the terminator bases are given different colour tags
- A is green, G is yellow, T is red, C is blue
- for example, an A base terminator would stop DNA synthesis at the location an A base would be added.
The process of DNA sequencing
- the DNA strand is cut into fragments and mixed with primer, bases, DNA polymerase and terminator bases
- the mixture is placed into a thermal cycler which separates DNA strands at 96°C and anneals primer to strands at 50°C
- at 60°C, DNA polymerase adds complementary bases to create a new DNA strand
- terminator bases are added at random, ending the strands
- after many cycles, all possible DNA strands are produced and then separated by size using capillary sequencing
- lasers detect the colour of the fluorescent tags that correspond to the terminator base. This therefore determines the sequence of bases
- DNA analysis gives the correct DNA sequence (it produces a complementary strand)
Bioinformatics
- the developments of the software, computing and statistical tests needed to organise and analyse raw biological data.
- used to generate data on RNA and DNA sequences and the relationship between genotype and phenotype
Computational biology
- the use of data from bioinformatics to build theoretical models of biological systems
- can be used to predict what will happen in different circumstances
- important in the analysis of data from sequencing genomes to studying evolutionary relationships, epidemiology and structures of proteins
- high powered computers are used to make databases
genomics
applying DNA sequencing methods and computational biology to analyse the structure and function of genomes
why analyse the genomes of pathogens
- find out the source of an infection
- identify antibiotic-resistant strains of bacteria so antibiotics are only used when effective and making it easier to track the spread
- track the progress of an outbreak
- identify genetic markers for use in vaccines and regions of the genome that may be useful to target when developing new drugs
DNA barcoding / International Barcode of Life project (IBoL)
- scientists use relatively short sections of DNA from a conserved region of the genome to identify species
- the section of DNA is small enough to be sequenced cheaply, yet varies enough to give clear differences between species
- it is easier than traditional methods of observation
- scientists have not yet come up with suitable regions to classify fungi and bacteria
searching for evolutionary relationships
- DNA sequences of different organisms can be compared
- the base mutation rate of DNA can be calculated, so scientists can calculate how long ago two species diverged from a common ancestor
- scientists can build more accurate evolutionary trees
proteomics
the study and amino acid sequencing of an organism’s full range of proteins produced by the genome (protein complement)
- the DNA sequence of the genome should in theory enable scientists to predict the sequence of amino acids and therefore the proteins it produces, but this is not always true