Lecture 1: Intro to Nucleic Acids Flashcards by Zoe Phillips

_______ - a polymer of deoxyribonucleotides
* double stranded molecule that is twisted into a
helix

DNA

How well did you know this?

Not at all

Perfectly

where is DNA found?

chromosomes, mitochondria/chloroplasts, plasmids

How well did you know this?

Not at all

Perfectly

what does DNA do?

carry genetic information

How well did you know this?

Not at all

Perfectly

what does each strand of DNA consist of?

sugar-phosphate backbone
bases attached in pairs (adenine, cytosine, thymine, guanine)

How well did you know this?

Not at all

Perfectly

the primary structure of DNA is…

the sequence!

How well did you know this?

Not at all

Perfectly

DNA is written in the ____ direction

5’ - 3’

How well did you know this?

Not at all

Perfectly

what is the secondary structure of DNA?

double helix!

How well did you know this?

Not at all

Perfectly

______ structure:
Two anti-parallel polynucleotide
chains wound around the same axis.
* Sugar-phosphate chains wrap around
the periphery.
* Bases (A, T, C and G) occupy the
core, forming complementary A · T
and G · C Watson-Crick base pairs

double helix

How well did you know this?

Not at all

Perfectly

DNA binding proteins
bind to _______ groove, why?

major

we can actually see the base-pairs, easier access and more information (some proteins are specifically designed to work in the major groove)

How well did you know this?

Not at all

Perfectly

what kinds of proteins interact with the major groove (2 examples)

helix-turn-helix configuration
zinc-fingers (both minor and major)

How well did you know this?

Not at all

Perfectly

Most of the DNA is in the classic Watson-Crick model
simply called as ________

B-DNA or B-form DNA

How well did you know this?

Not at all

Perfectly

In certain conditions, different forms of DNAs are found… what are the two kinds and why?

A-DNA upon dehydration or protein binding (viral packaging, biochemical assays).

Z-DNA is “reverse” helix found in very salty condition
(4M NaCl), or to relieve supercoiling strain – disease
association, seen in Alzheimers and Lupus (SLE)

these are both found when we change the ionic concentration of the environment

How well did you know this?

Not at all

Perfectly

A-DNA has a ______ helix, what does it look like?

right-handed helix

LARGE major groove, almost no minor groove

How well did you know this?

Not at all

Perfectly

Z-DNA has a ______ helix, what does it look like?

left-handed helix

flips backwards from supercoiling, symmetrical (no LARGE distinction between major and minor grooves)

How well did you know this?

Not at all

Perfectly

what is the tertiary structure of DNA?

DNA packaging inside the cell

How well did you know this?

Not at all

Perfectly

_______→reduces the space
and allows for DNA to be
packaged into cell (bacterial
DNA is ~500 x length of cells)

Supercoiling

How well did you know this?

Not at all

Perfectly

what kind of supercoils do prokaryotes have?

plectonemic
supercoils→ negative twist in
DNA, coils back onto itself

How well did you know this?

Not at all

Perfectly

what kind of supercoils do eukaryotes have?

Solenoidal, with
proteins (higher compaction
needed!)
since there’s so much more DNA!

How well did you know this?

Not at all

Perfectly

can prokaryotes have proteins similar to histones for DNA organization and compacting?

yes, there can be proteins that provide a bit more structure but they’re not nearly as organized or developed as those found in eukaryotes

How well did you know this?

Not at all

Perfectly

In Eukaryotic cell: DNA is folded into ______

Chromatin

How well did you know this?

Not at all

Perfectly

explain how DNA double-helical structure form chromosomes, which structures are intermediate?

double helix
wound on histones to create nucleosomes (euchromatin)
chromatin fiber (heterochromatin)
chromosomes

How well did you know this?

Not at all

Perfectly

why does DNA unwind from chromosomes to 10-30 nm fibers?

very hard to transcribe, need better access to the DNA so when not actively dividing we unwind!

How well did you know this?

Not at all

Perfectly

if we all have the same genes- why are we different?

epigenetics!

How well did you know this?

Not at all

Perfectly

epigenetic mechanisms are affected by what kinds of things?

development
envirpnmental chemicals
drugs
aging
diet

How well did you know this?

Not at all

Perfectly

what are the two modifications we can make to DNA to control epigenetics?

DNA methylation histone modification

________ components: 1.Genes = [mostly] ORF = open reading frame (ATG-Stop) regulatory regions: 2a. Promoters = RNApol binding sites 2b. Operators = protein binding sites in DNA to regulate transcription; Others – translational regulation (for example)

Bacterial genomes

promoter sequences are ______ elements, why?

CIS-acting they must physically be in the area they're acting upon

regulatory factors are ______ elements, why?

trans-acting they're movable!

what are some main examples of regulatory factors?

proteins and RNA molecules metabolites that use allosteric interactions

what is gene finding?

identifying regions of DNA that encode genes—particularly protein-coding genes—within a genome and figuring out their function!

what are the key elements of gene finding? (i.e. what are we looking for?)

ORFs start/stop codons promoters and regulatory sequences splice sites (eukaryotes only)

what are the advantages and disadvantages of prokaryote genomes?

Advantages  Simple gene structure  Small genomes (0.5 to 10 million bp)  No introns  High coding density (>90%)  Disadvantages  Some genes overlap (nested)  Some genes are quite short (<60 bp)

_____: Complex gene structure  Exons and Introns  Large genomes (0.1 to >100 billion bases)  Low coding density (<30%)  3% in humans, 25% in Fugu, 60% in yeast  Alternate splicing (40-60% of all genes)  Considerable number of pseudogenes

eukaryote genome

what is an open reading frame?

a continuous stretch of DNA or RNA that could potentially encode a protein. It starts with a start codon and ends with a stop codon, without any stop codons in between.

An open reading frame is defined within ___________

one specific reading frame

T/F: Since codons are three nucleotides long, any sequence of DNA has three possible reading frames on a single strand (and six total if you count the reverse complement).

true!

T/F: there are six possible reading frames for each DNA strand (3 for each strand), but only one open reading frame

true!! our ORF needs to start with the start codon, so the six different frames helps us find the exact 3-base codon we need to start the ORF

what are the four ORF/gene finding approaches?

rule-based (start/stop) feature-based (recognizable elements) content based (GC/codon ratio) similarity based (orthologs)

________: Look for putative start codon (ATG)  Staying in same frame, scan in groups of three until a stop codon is found  If # of codons >=50, assume it’s a gene (>150 bps)  If # of codons <50, go back to last start codon, increment by 1 & start again  At end of chromosome, repeat process for reverse complement

rule-based gene finding

T/F: the number of codons in rule-based gene finding is somewhat arbitrary

true!! the statistics of something being really true or just being chance after 50 codons significantly decreases... why we picked 50!

what is fasta format?

>NAME sequence

do bacteria have alternate start codons?

yes!! they don't always use ATG for their ORF, allows them to be more flexible!! there are some Class I and Class II Class I: ATG, GTG, TTG Class II: CTG, ATT, ATA, ACG

T/F: When applied to whole genomes, simple ORF finding programs tend to overlook small genes and tend to over predict the number of long genes

true!! we either have to prioritize one or the other

what do we look for when using feature-based gene finding?

Cis-acting regulatory features (transcription or translation initiation or termination signals) * RNA polymerase binding (promoter) site * Shine-Dalgarno sequence (Ribosome binding site-RBS) * Transcriptional terminators

the ______ consists of two short sequences at -10 and - 35 positions upstream from the transcription start site.

promoter

The sequence at -10 is called the ________, or the -10 element, and usually consists of the six nucleotides TATAAT

Pribnow or TATA box - where we first pull apart DNA sequence because of the weaker bonds

what is the shine-dalgarno motif?

ribosome bindign site located 13 bases upstream of AUG start codon (sequence: 5'- AGGAGGU-3' almost always there!! ribosome has similar sequence to match- the motif recruits the ribosome to the right place- ensuring accurate translation

what kind of structures are stem-loop terminators?

cis-acting elements

______: Mechanism utilized to terminate transcription via release and dissociation of RNA polymerase

stem-loop terminators rho-independent!!

T/F: rho-dependant termination is nearly impossible to predict

true!!

T/F: An organism may have changes in relative proportion of A/T to G/C bases in coding regions

true!

do organisms have codon preference?

yes!! based on tRNA availability and ratios

_______: Take all known genes from a related genome and compare them to the query genome via BLAST

similarity-based gene finding

what are the disadvantages to similarity-based gene finding?

Orthologs/paralogs sometimes lose function and become pseudogenes. * Not all genes will always be known in the comparison genome * The best species for comparison isn’t always obvious

how has tech improved to mitigate the issues with similarity-based gene finding?

more sophisticated data allows us to pck out likely pseudogenes, many unidentified genes appear repeatedly in different species, can now compare large databases covering many different species at once!

______: Modern genomics typically uses a combination of homology, content/feature and rule based methods to identify genes in new genomes

gene annotation

T/F: homology replaces experimental characterization

FALSE!!! homology is not infalliable and we still need experimental testing to confirm results

why do eukaryote genes have low coding density? what does that mean?

most are not useful and meaningless... lots of junk there's also lots of pseudogenes present... basically glorified fossils from evolution

what are the five ways we find eukaryotic genes?

rule-based content-based feature-based similarity-based methods pattern-based

why don't we use rule-based gene finding for eukaryotes as much?

not as applicable- too many false positives

________: CpG islands, GC content, hexamer repeats,, codon frequencies in eukaryotic genomes

content-based gene finding

_______: donor sites, acceptor sites, promoter sites, start/stop codons, polyA signals in eukaryotic genomes

feature-based methods

what program do we use to perform sequence homology in similarity-based gene finding

BLAST

_______: uses HMMs, Artificial Neural Networks (“AI”)

pattern-based gene finding

what is the best way to perform gene finding on eukaryotes?

a combination of all five gene finding methods! eventually have to go to the lab and find out experimentally (physically sequence if no homology)

can we use AI to sequence genes?

yes! becoming increasingly popular! lots of different programs available to combine many different search methods

Lecture 1: Intro to Nucleic Acids Flashcards

(66 cards)