Finals Flashcards
(100 cards)
Functional genomics
The functional annotation of genes is a large field that uses extensive experimentation to describe the functions and interactions of genes and gene products.
For functional annotation, what are BLAST and the InterPro software framework based on?
Sequence similarity
Examples of functional classification schemes
Gene Ontology (GO)
Enzyme Commission (EC) Numbers
Kyoto Encyclopedia of Genes & Genomes (KEGG) BRITE
How many classification schemes have been devised for protein structures and what are they?
Three:
SCOP (Structural Classification of Proteins)
CATH (Class, Architecture, Topology, Homologous superfamily)
FSSP (Families of Structurally Similar Proteins)
What is the success or reliability of functional prediction influenced by?
Accuracy of the alignment of homologous characters in two or more sequences
What is the twilight zone?
The range where sequence similarity between two protein sequences is 15-25%; the prediction that the two proteins are homologous (evolutionarily related) is only 10% reliable
What is the percent identity that might occur between two protein sequences of longer than 100 amino acids simply by chance?
10-20%
What is the reliability of prediction that two protein sequences are homologous when the sequence identity is above 30%?
90%
What percentage of the amino acids in a sequence determines the protein fold, which in turn determines the general structure of the protein?
3-4%
What is the likely sequence similarity of proteins with similar structure?
> 33%
What is the midnight zone?
The range where sequence identity is very low (<15%): the sequences are so different that their relationship is nearly invisible at the sequence level, yet they may adopt very similar 3D structures
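Study aid: the thresholds on the twilight zone, 30% identity, and midnight zone cards can be checked against a pair of aligned sequences with a short Python sketch. This is only an illustration under stated assumptions: the function names are hypothetical, percent identity is computed over ungapped aligned columns (other denominators are also in use), and the 15-25% range is treated as percent identity for simplicity. The cards do not say what happens between 25% and 30%, so that range is reported as unclassified.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where both sequences have a residue.
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


def identity_zone(pct: float) -> str:
    """Map a percent identity to the zones named on the cards (assumed cut-offs)."""
    if pct < 15:
        return "midnight zone"
    if pct <= 25:
        return "twilight zone"
    if pct > 30:
        return "homology likely"
    return "unclassified (25-30%)"


if __name__ == "__main__":
    pct = percent_identity("ACDE-GH", "ACDFWGH")
    print(round(pct, 1), identity_zone(pct))
```

Note that a high percent identity over a very short alignment can still arise by chance, so the length of the alignment matters as well as the percentage.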
What percentage of gene annotations in public databases are incorrect or misleading?
5-63%
How are the errors in gene annotations in public databases propagated?
Via analyses of new genomes
Where do the errors in gene annotations arise from?
They originate from various sources including genome assembly and gene prediction.
Genome assembly: erroneous or incomplete genome assembly, leading to truncated or chimeric genes
Gene and gene function prediction: single-nucleotide errors
Which databases are the best-curated for protein functional annotations and why?
RefSeq
UniProt/SwissProt
They require multiple lines of experimentally derived evidence
Which sequence databases are integrated in the InterPro framework?
HAMAP, Panther, PIRSF, TIGRFAM
Which method to predict signal peptide is integrated in the InterPro framework?
SignalP
Which method to predict transmembrane region is integrated in the InterPro framework?
TMHMM
Which fingerprint databases are integrated in the InterPro framework?
PRINTS
Which motif databases are integrated in the InterPro framework?
ProSite
Which domain databases are integrated in the InterPro framework?
Gene3D, Pfam, ProDom, ProSite (Profile), SMART, Superfamily
The sensitivity of BLAST is comparable to what algorithm?
Smith-Waterman
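Study aid: for reference, Smith-Waterman is the dynamic-programming algorithm for optimal local alignment. The Python sketch below computes only the best local alignment score. It is a minimal illustration, not how BLAST works internally: the function name, the match/mismatch scores, and the linear gap penalty are assumptions chosen for readability, whereas real searches use substitution matrices such as BLOSUM62 and affine gap penalties.

```python
def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -2) -> int:
    """Best local alignment score of a and b (linear gap penalty, assumed scores)."""
    prev = [0] * (len(b) + 1)  # previous row of the DP matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    print(smith_waterman_score("XXACDEXX", "YYACDEYY"))
```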
How can BLAST recognize distant homologues?
PSI-BLAST implements an iterative search that uses a position-specific score matrix (PSSM). The matrix is rebuilt at each iteration from the sequences found in the previous iterations.
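Study aid: the idea of a position-specific score matrix can be sketched in Python as log-odds scores computed column by column from a set of aligned sequences. This is a simplified illustration and not PSI-BLAST's actual procedure: the function names are hypothetical, the background distribution is assumed uniform, and a flat pseudocount stands in for the sequence weighting and substitution-matrix-based pseudocounts used in practice.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_pssm(aligned_seqs, pseudocount=1.0):
    """Return one dict per alignment column mapping residue -> log2 odds score."""
    length = len(aligned_seqs[0])
    if any(len(s) != length for s in aligned_seqs):
        raise ValueError("all aligned sequences must have the same length")
    background = 1.0 / len(AMINO_ACIDS)  # assumed uniform background frequency
    pssm = []
    for col in range(length):
        # Count residues in this column, ignoring gaps and unknown symbols.
        counts = Counter(s[col] for s in aligned_seqs if s[col] in AMINO_ACIDS)
        total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
        pssm.append({
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        })
    return pssm


def score(pssm, seq):
    """Sum the position-specific scores of seq (same length as the matrix)."""
    return sum(column[aa] for column, aa in zip(pssm, seq))


if __name__ == "__main__":
    hits = ["ACD", "ACE", "ACD"]  # toy alignment standing in for one round of hits
    pssm = build_pssm(hits)
    print(score(pssm, "ACD") > score(pssm, "WWW"))
```

In an iterative search, the sequences that score well against the current matrix would be added to the alignment and the matrix rebuilt, which is what lets conserved positions guide the search toward more distant relatives.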
What could lead to an erroneous transfer of function in BLAST-based annotation methods?
Homologues may align over only a small portion of their overall lengths.
The homologue may have been wrongly annotated in the first place.
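Study aid: the first pitfall above (a hit that aligns over only a small portion of either sequence) can be screened for by checking alignment coverage before transferring an annotation. The Python sketch below is an illustration only: the function names are hypothetical, coordinates are assumed to be 1-based and inclusive, and the 0.8 coverage cut-off is an arbitrary assumption rather than a recommended value.

```python
def alignment_coverage(aln_start: int, aln_end: int, seq_length: int) -> float:
    """Fraction of a sequence covered by an alignment (1-based, inclusive)."""
    if seq_length <= 0 or aln_start < 1 or aln_end < aln_start or aln_end > seq_length:
        raise ValueError("invalid alignment coordinates")
    return (aln_end - aln_start + 1) / seq_length


def covers_both(q_start: int, q_end: int, q_len: int,
                s_start: int, s_end: int, s_len: int,
                min_coverage: float = 0.8) -> bool:
    """True if the alignment spans most of BOTH the query and the subject."""
    return (alignment_coverage(q_start, q_end, q_len) >= min_coverage
            and alignment_coverage(s_start, s_end, s_len) >= min_coverage)


if __name__ == "__main__":
    # A hit covering 40 of 100 query residues and 40 of 400 subject residues.
    print(covers_both(1, 40, 100, 1, 40, 400))
```

A coverage check does not address the second pitfall: if the matched homologue was itself wrongly annotated, the error is copied regardless of how well the sequences align.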