What is deep learning?
It is one of the many branches of machine learning where operations are applied one after another.
It is a mathematical framework to learn representations from data
Operations in deep learning are structured into models called _____, deep learning models are typically stacks of _____ (same word)
Layers
What is the goal of the process of learning for deep learning?
To find good values for weights in the layers (i.e. values that minimise a loss function)
What are the 2 limitations of machine learning, as compared to deep learning?
ML cannot extract features for complex problems like object recognition
e.g. face detection
> ML : We need to define/specify features like eyes, ears etc and ML program will identify which features are more important for different people
> DL : The deep learning framework will automatically find out features which are important for foace detection, with large amount of data
Why did Deep Learning become popular since the 2000s, and why do we use DL now? [3]
**1. Better algorithms and its understanding
What are pretrained models?
Pretrained models are machine learning models that has been trained on a large amount of data and pre-existing knowledge. It has the optimal parameters and users just need to input their data into the model.
List some applications of DL in computer vision (other than object recognition/classification in pictures)
** What are the 2 properties of CNN?
What is the core of CNN (main components of CNN?
Each pixel is multiplied by the corresponding kernel value in the kernel, result is summed up for a single value representing a grid cell (like a pixel) in the output feature map
What are the no of channels for input images for CNN?
RGB images — 3 channels
Black and white images - 1 channel
How can we normalise pixel values of images in CNN?
What is the purpose of normalising or standardising image pixels?
For RGB images, can normalise by dividing each pixel by 255.
Data normalization is an important step which ensures that each input parameter (pixel, in this case) has a similar data distribution. This makes convergence faster while training the network. It caan also avoid the possibility of exploding gradients.
What is the goal of the pooling layer?
Whaat functions are used?
What are the 4 different layers in a CNN?
What does the fully-connected / dense layer do in CNN?
it classifies the image into its class
- like a normal neural network
- e.g. there a are 9 different classes of animals. Dense layer has neurons of 9 different output animal classes, and last dense layer has a single neuron to classify which animal the image is.
What does NLP deal with?
It deals with building computational algorithms to automatically analyze and represent human language.
It allows machines to have the ability to perform complex natural language related tasks.
Give some examples of applicatons of DL in NLP
What kind of problems are RNNs designed for and what is a charateristic of RNN that differentiates it from an ANN?
what are the 2 limitations of RNN?
What is the structure of LSTMs (Long Short Term Memory)?
What does an LSTM consist of?
An LSTM have memory blocks that are connected into layers
3 gates of LSTM
1. Input gate – decides which info from input to be used to update the memory state
How are words represented in one hot encoding?
How are words represented in word embedding?
They are also represented as vectors, but lower dimensional as relationship between words are learnt