Flashcards in Bioinformatics Deck (32)
Loading flashcards...
1
What is transciptomics?
study of gene expression data
2
What is GenBank?
Large datasets of DNA sequence
3
What is in vivo?
traditional biological experiments within an organism
4
What is in vitro?
traditional biological experiments in an artificial environment
5
What is in silico?
On the computer
6
What is bioinformatics?
Application of computer science for the management and analysis of biological information
7
What are computers used for in bioinformatics?
GATHER biological data
STORE intelligently and efficiently
Provide TOOLS to allow extraction of meaningful biological information
MERGE information from several sources to increase understanding
8
What is a primary database?
information about e.g. DNA sequences
9
What is a secondary database?
contain results of analysis of primary resources e.g. sequence patterns or mutations
10
What are the three main nucleotide sequence databases?
GenBank (National Center for Biotechnology Information -NCBI)
EBI -EMBL (European Molecular Biology Laboratory)
DDBJ (DNA Data Bank of Japan)
11
What is easier to obtain protein or DNA sequences?
Protein sequences
12
What is UniProtKB
(Universal Protein Resource Knowledgebase) – Comprehensive catalogue of information about proteins.
13
What was the first protein to have its structure determined?
myoglobin
14
What is Protein Data Bank (RCSB PDB)?
a protein data bank that stores over 100,000 structures
15
How is bioinformatics used to annotate genomes?
identify genes, protein coding regions
predict structures and functions
16
True or False?
Bioinformatics is a new research tool which will replace the need for laboratory experiments in the future
False
Experiments will still be needed as proof
17
True or False?
A bioinformatics database is an archive or store of biological data
True
18
True or False?
This is a nucleotide sequence ACTRFCGTRECATCGNKL
False
Protein sequence
19
True or False?
Protein structures are more difficult to determine than protein sequences
True
20
What is BLAST?
commonly used bioinformatics tool for rapidly comparing new sequences with known sequences e.g. All known nucleotide sequences
21
What can you find out if you have the complete sequence of a gene?
Gene name, organism, complete gene + protein sequences
Information about protein’s function, structure, evolution
22
Even if you don't have the complete sequence of the gene you can still find out stuff about it.
What can you find out and how?
function, evolution etc from looking at related sequences
Because similar sequences may
have similar structure and/or function
have evolved from common ancestor
23
What can use to find out more about the human genome?
http://www.genecards.org/
24
What is http://www.ensembl.org/ ?
A whole genome browser
“software system which produces and maintains automatic annotation on selected eukaryotic genomes”
Can browse genome at different levels: chromosome ->gene->exon->nucleotide
25
What is sequence alignment?
‘line up’ sequences so that similar features are in same columns
26
What are ClustalW2 and Clustal omega
Most commonly used multiple sequence alignment program
Can align DNA or protein sequences
Requires fixed format input e.g. file with sequences in FASTA format
27
What is shotgun sequencing?
copy DNA many times
chop each up into many random fragments
sequence each little piece (“read”)
assemble fragments where they overlap
28
What is functional annotation?
Finding out what genes are for by looking at what the proteins do
29
Why would you rather use bioinformatics over experimental approaches to research functional annotation of genes?
Bioinformatics is a scaled up automatic approach making it much faster
30
During functional annotations what should you find out about the proteins?
Are there related proteins? (BLAST search) – cautiously infer function from these
Search secondary databases (e.g. SMART) for functional domains
Predict 3D structure – help to determine function where sequences are only weakly related to known proteins
31
What can you use genome sequencing for from a single genome?
What is necessary for life?
Genes that control economically important traits e.g. crops
Personalised medicine
32