Your Deep Learning Journey Flashcards
(30 cards)
Label
The data that we’re trying to predict, such as “dog” or “cat”
Architecture
The template of the model that we’re trying to fit; i.e., the actual mathematical function that we’re passing the input data and parameters to
Model
The combination of the architecture with a particular set of parameters
Parameters
The values in the model that change what task it can do and that are updated through model training
Fit / Train
(1) Update the parameters of the model such that the predictions of the model using the input data (2) match the target labels
Pretrained model
A model that has already been trained, generally using a large dataset, and will be fine-tuned
Fine-tune
Update a pretrained model for a different task
Epoch
One complete pass through the input data; the model has seen every item in the training set.
Loss
A measure of how good the model is, chosen to drive training via SGD (Stochastic Gradient Descent)
Metric
A measurement of how good the model is using the validation set, chosen for human consumption
Validation set
A set of data held out from training, used only for measuring how good the model is
Training set
The data used for fitting the model; does not include any data from the validation set
Overfitting
Training a model in such a way that it remembers specific features of the input data, rather than generalizing well to data not seen during training
CNN
Convolutional neural network; a type of neural network that works particularly well for computer vision tasks
Deep learning is…
…a specialty within machine learning that uses neural networks with multiple layers.
Machine learning is…
…a discipline in which we define a program not by writing it entirely ourselves, but by learning from data.
What is distinctive about Deep Learning architectures?
They are based on neural networks. (e.g.: CNN, RNN, Transformers)
What is segmentation?
At its core, segmentation is a pixelwise classification problem. We attempt to predict a label for every single pixel in the image. This provides a mask for which parts of the image correspond to the given label.
Tabular Data
Data that is in the form of a table, such as from a spreadsheet, data‐base, or a comma-separated values (CSV) file.
Categorical Value
contain values that are one of a discrete set of choices, such as occupation
Continuous Value
contain a number that represents a quantity, such as age
What are “hyperparameters”?
choices regarding network architecture, learning rates, data augmentation strategies, and other factors […] Training models require various other parameters that define how the model is trained. For example, we need to define how long we train for, or what learning rate (how fast the model parameters are allowed to change) is used. These sorts of parameters are hyperparameters.
What was the name of the first device that was based on the principle of the artificial neuron?
Mark I perceptron built by Frank Rosenblatt
Why is it hard to understand why a deep learning model makes a particular prediction?
Deep neural networks have thousands of layers. It is hard to determine which factors are important in determining the final output. The neurons in the network interact with each other. All of this makes it very difficult to understand why a neural network makes a given prediction.