Bioinformatics Flashcards

(23 cards)

1
Q

Bioinformatics

A

The use of computers to collect, analyse and
store biological data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Genome

A

all the genetic
material in the chromosomes
of a particular organism

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Genomics

A

the study of genes and their function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q
  1. Sequence Assembly
A
  • DNA sequencers generate short overlapping sequences (~500bp)
  • These fragments must be assembled into their correct order
    along the chromosome
  • This generates a single consensus sequence of DNA
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q
  1. Finding the genes
A
  • Less than 2% of the human genome encodes proteins
  • How do we find the regions (i.e. genes) that do?
    Gene Prediction tools look for sequence features that occur at
    defined sites within eukaryotic genes that perform specific roles
    during transcription/translation.
    These include:
  • Promoter sites
  • Translation start sites
  • Termination codons
  • Intron/Exon splice sites
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
  1. Assigning function
A
  • First translate DNA into a theoretical protein sequence
  • Then find the open reading frame (ORF)
  • Use the translated protein sequence to look for similar
    sequences with known function – similarity searching
  • Many protein domains are evolutionarily conserved (i.e. will
    look the same even in un-related species; human vs worm)
  • Proteins that are similar in sequence across several species are
    likely to have a similar function
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q
  1. Annotation
A
  • The process of attaching biological information to sequences
  • Provides a record of valuable information for scientists trying
    to understand gene function
  • Can include:
  • Location of ORFs
  • Gene structure (introns/exons)
  • Regulatory elements (e.g. promoters)
  • Biochemical function of the encoded protein
  • Conserved domains
  • Protein interactions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Dotplot

A

a simple matrix that gives an overview of the
similarities between two sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Global Alignments

A

compare sequences in their entirety

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Local alignments

A

find the regions of highest similarity between two sequences and extend the alignment outwards

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Multiple Sequence Alignment (MSA)

A

used to compare
three or more sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Percentage Sequence Identity

A

a simple scoring method which
indicates the extent to which two sequences are invariant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

BLAST

A

Basic Local Alignment Search Tool
a search program that finds regions of local similarity
between amino acid or nucleotide sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Protein Family

A

a group of proteins that share a common evolutionary origin, reflected by their related functions and similarities in sequence or structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Domains

A

distinct functional and/or structural units in a
protein.
- Domains are responsible for a particular function or
interaction, contributing to the overall role of a protein
- Similar domains can be found in proteins with different
functions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Sequence features

A

small groups of amino acids that confer some biochemical property upon a protein

17
Q

Active sites

A

catalytic residues of enzymes

18
Q

Binding sites

A

residues that directly bind molecules or ions

19
Q

Post-Translational modifications (PTM) sites

A

chemically modified residues

20
Q

Sequence Motifs

A

short conserved
regions of amino acid (or DNA)
sequence that are important
structurally & functionally

21
Q

Sequence Profiles

A

describe motifs using quantitative information captured in a position specific scoring matrix

22
Q

Signal Sequences

A

deliver proteins to specific sites within or outside the cell

23
Q

Signal Peptides

A

short sequences that direct proteins for secretion outside the cell