Question 1

What are the requirements of a good descriptor?

Accepted Answer

- correlated with the target properties
- obeys the relevant physics
- applicable to molecules of different sizes
- cheap to compute
- not correlated with other descriptors

Question 2

What is a zero-dimensional (0D) descriptor?

Accepted Answer

- provides atom and bond counts
- gives the molecular weight
- provides no information about molecular structure or connectivity
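A minimal sketch of the idea, assuming the molecule is given as a hypothetical element-to-count mapping and using rounded atomic weights. It illustrates why a 0D descriptor cannot tell isomers apart: ethanol and dimethyl ether share the formula C2H6O and so get identical values.

```python
# Approximate standard atomic weights in g/mol (rounded values).
ATOMIC_WEIGHTS = {"H": 1.008, "C": 12.011, "N": 14.007, "O": 15.999}


def zero_d_descriptors(composition):
    """Return atom count and molecular weight from an element -> count mapping."""
    n_atoms = sum(composition.values())
    mol_weight = sum(ATOMIC_WEIGHTS[el] * n for el, n in composition.items())
    return {"n_atoms": n_atoms, "mol_weight": mol_weight}


# Ethanol and dimethyl ether are isomers: same formula, different connectivity.
ethanol = {"C": 2, "H": 6, "O": 1}
dimethyl_ether = {"C": 2, "H": 6, "O": 1}

same = zero_d_descriptors(ethanol) == zero_d_descriptors(dimethyl_ether)
```

Here `same` is true, which is exactly the limitation named on the card: the descriptor carries no connectivity information.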

Question 3

What is a one-dimensional (1D) descriptor?

Accepted Answer

- a list of substructures/fragments, such as functional groups

Question 4

What is a two-dimensional (2D) descriptor?

Accepted Answer

- provides information on molecular topology
- often based on the graph representation of the molecule

Question 5

What is a 3D descriptor?

Accepted Answer

- provides information about the spatial coordinates of the atoms in a molecule

Question 6

What is a 4D descriptor?

Accepted Answer

- a grid-based descriptor that adds a fourth dimension to a 3D descriptor, typically by sampling multiple conformations of the molecule

Question 7

What is SMILES?

Accepted Answer

Simplified Molecular Input Line Entry System: a line-notation specification for describing the structure of chemical species using short ASCII strings

Question 8

What is a molecular graph?

Accepted Answer

a connected, undirected graph in one-to-one correspondence with the structural formula
- vertices = atoms
- edges = chemical bonds
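As a rough illustration, the heavy-atom graph of ethanol can be stored as an adjacency mapping. The atom indices and the helper names `build_adjacency` and `is_connected` are assumptions made for this sketch, and hydrogens are omitted for brevity.

```python
from collections import deque

# Ethanol heavy-atom graph: C0 - C1 - O2.
atoms = {0: "C", 1: "C", 2: "O"}   # vertices = atoms
bonds = [(0, 1), (1, 2)]           # edges = chemical bonds


def build_adjacency(atoms, bonds):
    """Build an undirected adjacency mapping: each bond is stored both ways."""
    adjacency = {i: set() for i in atoms}
    for a, b in bonds:
        adjacency[a].add(b)
        adjacency[b].add(a)
    return adjacency


def is_connected(adjacency):
    """Breadth-first search: the graph is connected if every vertex is reached."""
    if not adjacency:
        return True
    start = next(iter(adjacency))
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbour in adjacency[node]:
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append(neighbour)
    return len(seen) == len(adjacency)


adjacency = build_adjacency(atoms, bonds)
```

Storing each bond in both directions is what makes the graph undirected, and `is_connected(adjacency)` checks the "connected" part of the definition.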

Question 9

What is a Morgan fingerprint?

Accepted Answer

- a representation of a molecule that identifies the presence of substructures/fragments within it
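The sketch below shows the general idea behind circular fingerprints in a deliberately simplified form: each atom starts with an identifier, the identifier is repeatedly updated from its neighbours, and every identifier sets a bit. This is not the algorithm used by any particular cheminformatics library; the function names, the SHA-256 hashing, and the 64-bit length are all assumptions chosen for illustration.

```python
import hashlib


def stable_hash(text):
    """Deterministic integer hash (Python's built-in hash() is salted per run)."""
    return int(hashlib.sha256(text.encode("utf-8")).hexdigest()[:16], 16)


def morgan_like_bits(atoms, adjacency, radius=2, n_bits=64):
    """Return the set of 'on' bit positions for a simplified circular fingerprint."""
    # Radius 0: each atom's identifier depends only on its element symbol.
    ids = {i: stable_hash(symbol) for i, symbol in atoms.items()}
    on_bits = {ident % n_bits for ident in ids.values()}
    for _ in range(radius):
        new_ids = {}
        for i in atoms:
            # Combine the atom's identifier with its sorted neighbour identifiers,
            # so the result does not depend on neighbour ordering.
            neighbour_ids = sorted(ids[j] for j in adjacency[i])
            new_ids[i] = stable_hash(f"{ids[i]}|{neighbour_ids}")
        ids = new_ids
        on_bits.update(ident % n_bits for ident in ids.values())
    return on_bits


# Ethanol heavy atoms: C0 - C1 - O2 (hydrogens omitted).
atoms = {0: "C", 1: "C", 2: "O"}
adjacency = {0: {1}, 1: {0, 2}, 2: {1}}
bits = morgan_like_bits(atoms, adjacency)
```

Each iteration widens the environment around every atom by one bond, so a set bit signals the presence of some substructure up to the chosen radius.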

Question 10

What is a Coulomb matrix?

Accepted Answer

a simple global descriptor that mimics the electrostatic interaction between nuclei
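A minimal sketch using the commonly cited definition: diagonal entries are 0.5 * Z_i^2.4 and off-diagonal entries are Z_i * Z_j / |R_i - R_j|, with nuclear charges Z and positions R in atomic units. The function name and the H2 geometry (1.4 bohr, an approximate illustrative value) are assumptions of this example.

```python
import math


def coulomb_matrix(charges, positions):
    """Build the Coulomb matrix from nuclear charges and Cartesian positions."""
    n = len(charges)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                # Diagonal: fitted self-interaction term of atom i.
                matrix[i][j] = 0.5 * charges[i] ** 2.4
            else:
                # Off-diagonal: Coulomb repulsion between nuclei i and j.
                distance = math.dist(positions[i], positions[j])
                matrix[i][j] = charges[i] * charges[j] / distance
    return matrix


# H2 with the two nuclei placed 1.4 bohr apart along the z axis.
h2 = coulomb_matrix([1, 1], [(0.0, 0.0, 0.0), (0.0, 0.0, 1.4)])
```

The matrix is symmetric and describes the whole molecule at once, which is why the card calls it a global descriptor.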

Question 11

What does SOAP stand for?

Accepted Answer

Smooth Overlap of Atomic Positions

Question 12

What is SOAP?

Accepted Answer

a descriptor that encodes regions of atomic geometries by using a local expansion of a Gaussian-smeared atomic density with orthonormal functions based on spherical harmonics and radial basis functions

Question 13

Why is feature selection important?

Accepted Answer

- model explainability
- model debugging
- improved model performance

Question 14

What is Gini importance (mean decrease in impurity)?

Accepted Answer

- evaluates how much a feature reduces the impurity of the nodes in a decision tree
- Gini impurity (0 to 0.5 for two classes) = likelihood of new data being misclassified
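A small sketch of the quantities involved, assuming the usual definition of Gini impurity as 1 minus the sum of squared class fractions. The helper names are hypothetical; in a full tree model, a feature's Gini importance would accumulate these decreases over every split that uses the feature.

```python
from collections import Counter


def gini_impurity(labels):
    """Gini impurity of a node: 1 - sum of squared class fractions."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def impurity_decrease(parent, left, right):
    """Impurity of the parent minus the size-weighted impurity of its children."""
    n = len(parent)
    weighted_children = (
        len(left) / n * gini_impurity(left)
        + len(right) / n * gini_impurity(right)
    )
    return gini_impurity(parent) - weighted_children


parent = ["a", "a", "b", "b"]          # evenly mixed node: impurity 0.5
left, right = ["a", "a"], ["b", "b"]   # a perfect split: both children pure
decrease = impurity_decrease(parent, left, right)
```

For two classes the impurity peaks at 0.5 for an even mix and drops to 0 for a pure node, so a perfect split of the example parent removes all of its impurity.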

Question 15

What is permutation feature importance?

Accepted Answer

- measures the increase in the model's prediction error after a feature's values are shuffled, which breaks the relationship between the feature and the true output
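The procedure can be sketched in plain Python as follows. The `predict` function stands in for an already trained model and is purely hypothetical, as are the toy data; the error metric here is mean squared error, though other metrics could be used.

```python
import random


def mean_squared_error(y_true, y_pred):
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def permutation_importance(predict, X, y, feature, n_repeats=10, seed=0):
    """Average increase in error after shuffling one feature column."""
    rng = random.Random(seed)
    baseline = mean_squared_error(y, [predict(row) for row in X])
    increases = []
    for _ in range(n_repeats):
        # Shuffle the chosen column to break its link with the true output.
        column = [row[feature] for row in X]
        rng.shuffle(column)
        X_perm = [
            row[:feature] + [value] + row[feature + 1:]
            for row, value in zip(X, column)
        ]
        permuted = mean_squared_error(y, [predict(row) for row in X_perm])
        increases.append(permuted - baseline)
    return sum(increases) / n_repeats


def predict(row):
    """Hypothetical trained model that only uses feature 0."""
    return 2.0 * row[0]


# Toy data: the target depends on feature 0 only; feature 1 is irrelevant.
X = [[float(i), float((i * 7) % 5)] for i in range(20)]
y = [2.0 * row[0] for row in X]

importance_used = permutation_importance(predict, X, y, feature=0)
importance_unused = permutation_importance(predict, X, y, feature=1)
```

Shuffling the feature the model relies on raises the error, while shuffling the ignored feature leaves it unchanged, so only the first receives a positive importance.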

topic 4 Flashcards

(25 cards)