CHAP 10: BIOINFORMATICS Flashcards
interdisciplinary science of collecting, analyzing, and interpreeting biodata
BIOINFORMATICS
primarily deals with data derived from GENOM SEQUENCE, PROTEIN STRUCTURES, and GENE EXPRESSION
BIOINFORMATICS
bridges gab bet BIOLOGY and COMP SCIENCE
BIOINFORMATICS
it represents raw or processed info
BIOLOGICAL DATA
5 TYPES OF BIOLOGICAL DATA
- PROTEIN SEQUENCE
- NUCLEOTIDE SEQUENCE
- GENOMIC DATA
- EXPRESSION DATA
- STRUCTURAL DATA
DNA/RNA sequences
NUCLEOTIDE SEWURNES
amino acid chains or proteins
PROTEIN SEQUENCES
whole genome seq
GENOMIC DATA
data from gene expression
EXPRESSION DATA
3D structures of biomolecules
STRUCTURAL DATA
organized collection of biological data
BIOINFORMATICS DATABASE
TYPES OF BIOLOGICAL DATABASE
- PRIMARY DATABASE
- SECONDARY DATABASE
- TERTIARY DATABASE
contain raw data like nucleotide seq (GenBank)
PRIMARY DATABASE
contain analyzed and interpreted data (Pfam)
SECONDARY DATABASE
provide curated data (OMIM)
TERTIARY DATABASE
arranges DNA, RNA, or protein seq to identify regions of similarity
SEQUENCE ALIGNMENT
essential for gene annotation, etc
ALIGNMENT
TYPES OF SEQUENCE ALIGNMENT:
- PAIRWISE ALIGNMENT
- MULTIPLE SEQUENCE ALIGNMENT (MSA)
comparison bet 2 alignment (BLAST, etc)
PAIRWISE ALIGNMENT
aligning more than 2 sequences (MUSCLE)
MULTIPLE SEQUENCE ALIGNMENT
used to score alignments
PAM and BLOSUM
identify locations of genes in a genome
GENE PREDICTION
assigning functions to the genes
ANNOTATIONS
can identify OPEN READING FRAMES (ORF)
tools