Lecture 8 Flashcards

(10 cards)

1
Q

What are 3 features of UniProt?

A

Integrates:

  • Sequence
  • Structure
  • Function data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do secondary databases enhance the PDB (Protein Data Bank)?

A

They add annotation about:

  • Structure
  • Function
  • Evolution of protein domains
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the main obsticle in Natural Language Processing (NLP)?

A

Natural language is ambiguous and context dependent and requires syntactic analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the result of machine pairwise associations?

A

Can form chains of reasoning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is indexing?

A

Extraction of possible search terms is compiled and searched in leu of full text which is too slow

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are 3 issues with indexing?

A
  • English/American spelling
  • Synonyms
  • Contamination (Such as author’s names being locations)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are 3 benefits of Electronic Lab Notebooks (ELNs)?

A
  • Backed up
  • Non-linear organisation
  • Easily searchable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are 3 approaches of machine learning?

A
  • Statistical techniques; clustering, classification, principle components analysis; Hidden Markov model
  • Artificial Neural Networks; protein structure prediction, gene prediction
  • Support Vector Machines; classification algorrithms
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is BLAT?

A

Very rapid genomic sequence searching algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do you avoid the challenge of synonyms?

A

Use of official and standardised names; ie the use of HGNC names is increasing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly