CNN and RNN Flashcards
(24 cards)
What is deep learning?
A subset of machine learning using multi-layered neural networks to learn hierarchical representations from data.
What are the three historical waves of neural networks?
Cybernetics (1940s–1960s), Connectionism (1980s–1990s), and Deep Learning (2006–present).
What was the major contribution of the 1986 backpropagation paper?
An efficient algorithm to compute gradients in multi-layer neural networks, enabling end-to-end training.
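A minimal illustration of the idea, sketched with PyTorch's autograd (the 1986 paper of course predates such libraries): one backward pass computes the gradient of the loss with respect to every weight via the chain rule.

```python
import torch

# A tiny two-layer network: y = W2 @ tanh(W1 @ x).
x = torch.randn(4)
W1 = torch.randn(3, 4, requires_grad=True)
W2 = torch.randn(1, 3, requires_grad=True)

y = W2 @ torch.tanh(W1 @ x)
loss = (y - 1.0).pow(2).sum()

# One call propagates gradients back through both layers.
loss.backward()
print(W1.grad.shape, W2.grad.shape)  # torch.Size([3, 4]) torch.Size([1, 3])
```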
What is the purpose of LSTM networks?
To handle long-term dependencies in sequences by mitigating vanishing gradients through gated memory units.
What problem did LSTM solve?
The vanishing gradient problem of standard RNNs when learning over long sequences.
What is LeNet-5?
An early convolutional neural network developed in 1998 for handwritten digit recognition.
What were the key features of LeNet-5?
Convolutional layers with shared weights, subsampling (pooling) layers, and fully connected layers, giving a degree of translation invariance.
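A sketch of the LeNet-5 stack in modern PyTorch terms; the original used tanh-style squashing units, trainable subsampling rather than plain average pooling, and sparse C3 connectivity, all simplified here.

```python
import torch
import torch.nn as nn

# LeNet-5-style network for 32x32 grayscale digit images.
lenet5 = nn.Sequential(
    nn.Conv2d(1, 6, kernel_size=5),   # C1: 6 feature maps, 28x28
    nn.Tanh(),
    nn.AvgPool2d(2),                  # S2: subsample to 14x14
    nn.Conv2d(6, 16, kernel_size=5),  # C3: 16 feature maps, 10x10
    nn.Tanh(),
    nn.AvgPool2d(2),                  # S4: subsample to 5x5
    nn.Flatten(),
    nn.Linear(16 * 5 * 5, 120),       # C5
    nn.Tanh(),
    nn.Linear(120, 84),               # F6
    nn.Tanh(),
    nn.Linear(84, 10),                # 10 digit classes
)

print(lenet5(torch.randn(1, 1, 32, 32)).shape)  # torch.Size([1, 10])
```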
What was AlexNet?
A deep CNN that won the 2012 ImageNet competition and popularized deep learning.
What innovations did AlexNet introduce?
ReLU activation, GPU training, data augmentation, and dropout regularization.
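Two of those innovations reduce to one-liners in today's frameworks; a hypothetical AlexNet-style classifier head combining ReLU and dropout might look like this (layer sizes follow AlexNet's fully connected stage):

```python
import torch.nn as nn

# Hypothetical AlexNet-style classifier head. Dropout randomly zeroes
# activations during training (AlexNet used p=0.5); ReLU replaced the
# saturating tanh/sigmoid units of earlier networks.
head = nn.Sequential(
    nn.Dropout(p=0.5),
    nn.Linear(4096, 4096),
    nn.ReLU(),
    nn.Dropout(p=0.5),
    nn.Linear(4096, 1000),  # 1000 ImageNet classes
)
```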
What is supervised learning in the context of deep learning?
Learning from labeled input-output pairs to minimize a loss function.
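A minimal supervised training loop, sketched in PyTorch with synthetic labeled pairs (model, sizes, and learning rate are placeholder choices):

```python
import torch
import torch.nn as nn

# Synthetic labeled data: 100 inputs, each with one of 3 class labels.
inputs = torch.randn(100, 8)
labels = torch.randint(0, 3, (100,))

model = nn.Linear(8, 3)
loss_fn = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

for epoch in range(10):
    optimizer.zero_grad()
    loss = loss_fn(model(inputs), labels)  # compare predictions to labels
    loss.backward()                        # gradients via backpropagation
    optimizer.step()                       # update weights to reduce loss
```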
What are CNNs used for?
Visual recognition tasks like image classification, object detection, and segmentation.
What are the key ideas behind CNNs?
Local receptive fields, weight sharing, and hierarchical feature extraction.
What makes CNNs efficient for images?
They reuse the same filters across space and reduce spatial dimensions via pooling.
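Weight sharing is what keeps the parameter count independent of image size; a quick comparison (the 224x224 input size is just an illustrative assumption):

```python
import torch.nn as nn

# A 3x3 conv mapping 3 channels to 16 uses the same 448 weights at
# every spatial position, no matter how large the image is.
conv = nn.Conv2d(3, 16, kernel_size=3)
print(sum(p.numel() for p in conv.parameters()))  # 448

# A fully connected layer over a 224x224 RGB image has no such sharing:
# 224*224*3 inputs to even a single unit already needs 150,529 weights.
fc = nn.Linear(224 * 224 * 3, 1)
print(sum(p.numel() for p in fc.parameters()))  # 150529
```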
What does a typical CNN architecture consist of?
Convolutional layers, ReLU activation, pooling layers, and fully connected layers.
What is a recurrent neural network (RNN)?
A neural network designed to process sequences by maintaining hidden state across time steps.
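The recurrence is one update applied at every step; a minimal sketch of a vanilla RNN cell (shapes chosen arbitrarily for illustration):

```python
import torch

def rnn_step(x_t, h_prev, W_x, W_h, b):
    # New hidden state mixes the current input with the previous state.
    return torch.tanh(x_t @ W_x + h_prev @ W_h + b)

# Run a length-5 sequence of 4-dim inputs through an 8-dim hidden state.
W_x, W_h, b = torch.randn(4, 8), torch.randn(8, 8), torch.zeros(8)
h = torch.zeros(8)
for x_t in torch.randn(5, 4):
    h = rnn_step(x_t, h, W_x, W_h, b)  # h carries context across steps
print(h.shape)  # torch.Size([8])
```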
What are common applications of RNNs?
Text generation, speech recognition, image captioning, and video analysis.
Why do RNNs struggle with long sequences?
Because backpropagation through time multiplies gradients across many steps, so they can vanish or explode, making long-range dependencies hard to learn.
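The effect is easy to see numerically: backpropagating through T steps multiplies roughly T per-step factors together, so factors slightly below or above 1 shrink or blow up geometrically.

```python
# Toy stand-in for the product of per-step gradient factors over
# 100 time steps (real Jacobians behave similarly in norm).
print(0.9 ** 100)  # ~2.7e-05: the gradient effectively vanishes
print(1.1 ** 100)  # ~1.4e+04: the gradient explodes
```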
What are LSTM and GRU used for?
They are variants of RNNs that mitigate the vanishing gradient problem with gating mechanisms.
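Both come ready-made in PyTorch; a minimal usage sketch:

```python
import torch
import torch.nn as nn

# Gated recurrent layers: drop-in replacements for a vanilla RNN.
lstm = nn.LSTM(input_size=4, hidden_size=8, batch_first=True)
gru = nn.GRU(input_size=4, hidden_size=8, batch_first=True)

x = torch.randn(1, 20, 4)   # batch of 1, sequence of 20 steps
out, (h_n, c_n) = lstm(x)   # the LSTM also carries a cell state c
out, h_n = gru(x)           # the GRU folds its gates into one state
print(out.shape)            # torch.Size([1, 20, 8])
```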
What is image captioning?
A task where a model generates a natural language description of an image.
How does image captioning work?
A CNN encodes the image into a feature vector, which an RNN decodes into a sentence one word at a time.
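A schematic of that encoder-decoder wiring; all names and sizes here are illustrative assumptions, not a specific published model:

```python
import torch
import torch.nn as nn

vocab_size, embed_dim, hidden_dim = 1000, 256, 256

# Encoder: a CNN reduces the image to a single feature vector.
cnn = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(16, hidden_dim),
)

# Decoder: an LSTM conditioned on the image feature emits word scores.
embed = nn.Embedding(vocab_size, embed_dim)
decoder = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
to_vocab = nn.Linear(hidden_dim, vocab_size)

image = torch.randn(1, 3, 64, 64)
h0 = cnn(image).unsqueeze(0)                  # image feature as initial state
words = torch.randint(0, vocab_size, (1, 7))  # 7 previously generated tokens
out, _ = decoder(embed(words), (h0, torch.zeros_like(h0)))
print(to_vocab(out).shape)  # torch.Size([1, 7, 1000]): scores per word slot
```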
What are some failure modes of image captioning systems?
Incorrect object identification, hallucinated relationships, or vague descriptions.
What is the function of softmax in a classification network?
It converts the final layer’s raw scores (logits) into a probability distribution over the classes.
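The formula is softmax(z)_i = exp(z_i) / sum_j exp(z_j); a numerically stable sketch:

```python
import torch

logits = torch.tensor([2.0, 1.0, 0.1])  # raw scores from the final layer

# Subtracting the max before exponentiating avoids overflow without
# changing the result, since the shift cancels in the ratio.
z = logits - logits.max()
probs = z.exp() / z.exp().sum()
print(probs, probs.sum())  # tensor([0.6590, 0.2424, 0.0986]), sums to 1.0
```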
What role do benchmarks like ImageNet and COCO play in deep learning?
They provide standardized datasets and evaluation metrics for comparing models.
What is the significance of the ImageNet 2012 competition?
It demonstrated the power of deep CNNs: AlexNet’s top-5 error of roughly 16% beat the runner-up’s roughly 26% by an unprecedented margin.