Midterm 2 Flashcards
All content necessary for midterm 2 (71 cards)
What Is artificial intelligence?
A computer program that mimics the intelligence of humans
What is machine learning?
technique which a computer can use to learn from data using complex rules
What is deep learning?
A technique for machine learning based on the neurons in the brain
List the three types of Machine learning and explain them
- Unsupervised learning - no feedback given to algorithm
- supervised learning - every example has a label
- Reinforcement - reward or punishment per action
How does supervised learning training work?
Training is a collection of labelled examples {xi, yi} where xi is a feature vector with D dimensions and y is a label
What is k-nearest neighbors?
Looks at neighbors closest to the data (similar feature values) and what they are classified as
What is linear regression?
Supervised machine learning used on continuous numerical data. It enables us to identify a linear trend and outliers
What is binary classification
Supervised learning where the objective is to organize everything into one of 2 classes (logistic regression)
what is multi-class classification
supervised learning into 3 or more discrete classes. can be transformed into binary
- one vs all (OvA)
Explain one vs all
separate binary classifier for each class. each classifier labels one class as positive and all others as negative. final assignment is based on the classifier with the highest confidence score
What is a decision boundary?
A boundary which partitions the underlying feature space into regions corresponding to different class labels
What is linearly separable data?
data is linearly separable when 2 classes can be perfectly separated by a single linear boundary (line for 2d, plane for 3d, hyperplane for >3d)
what is the difference between a simple decision boundary and a complex one?
simple is when the boundary comes from a polynomial function.
complex is an irregular decision boundary generated by decision trees
What is logistic regression?
it is a binary (0,1) classification algorithm which determines the probability that a given instance xi belongs to the positive class
Explain the logistic function
maps a real valued input to the open interval 0-1. It is called a squashing function because it maps a wide input domain to a constrained output
What is underfitting?
Machine learning concept where the model is too simple to accurately classify the data. It is underfitting if it has poor performance on both training and test data and adding more data doesn’t correct the issue
What is overfitting?
When the model is too complex for a given classification problem (tall decision tree, deep and wide neural networks). Too many features creates excellent performance on the training set but poor performance on the testing set
Explain learning curves?
Displays the performance of our model by using Root mean square error (RMSE) on both the training and test sets
What is the Bias/Variance trade off?
Bias -> error created by overly simplistic models, high bias = underfitting
Variance -> error from overly complex models that is sensitive to fluctuations in the training data. High variance = overfitting
Tradeoff -> aim for a model that generalizes new data well
Explain the confusion matrix
A matrix which displays the true positives, false negatives, false positives and true negatives for all labels
What is accuracy?
The ratio of correctly predicted instances and the total number of predictions
What is precision
ratio of true positives (TP) / total number of positives
Explain the holdout method
Allocate roughly 80% of your dataset for training and reserve the remaining 20% for testing
- Training error generally low otherwise there is something wrong
- Generalization error - error rate observed when the model is evaluated on new unseen data
What is cross validation?
method to evaluate models and improve performance. Involves partitioning the dataset into multiple subsets