Technical Flashcards

Question

What is Deep Learning?

Answer 1

A subset of machine learning that uses artificial neural networks to enable systems to learn like humans, with multiple 'deep' layers.

Answer 2

In machine learning, features are manually selected, while in deep learning, the model automatically determines important features.

Answer 3

A small amount of data.

Answer 4

A large amount of data.

Answer 5

High-end machines with significant computing power.

Answer 6

By breaking the problem into parts and solving them individually before combining the results.

Answer 7

In an end-to-end manner.

Answer 8

Email Spam Detection, Healthcare Diagnosis, Sentiment Analysis, and Fraud Detection.

Answer 9

By training a model on labeled emails categorized as spam or not spam.

Answer 10

By training a model on labeled images to detect diseases.

Answer 11

Using algorithms to analyze documents and determine if their sentiment is positive, neutral, or negative.

Answer 12

By training a model to recognize suspicious patterns to identify possible fraud cases.

Answer 13

A learning method where the training data contains a small amount of labeled data and a large amount of unlabeled data.

Answer 14

Supervised learning uses completely labeled data, while semi-supervised learning uses a mix of labeled and mostly unlabeled data.

Answer 15

Clustering and Association.

Answer 16

Dividing data into subsets (clusters) where data points in each cluster are similar to each other.

Answer 17

Grouping customers based on purchasing behavior for targeted marketing.

Answer 18

Identifying patterns of association between different variables or items.

Answer 19

E-commerce websites suggesting other items based on your previous purchases and other customers' habits.

Answer 20

Supervised learning uses labeled data for training, while unsupervised learning uses unlabeled data and lets the algorithm find patterns on its own.

Answer 21

A learning method that observes instances based on principles to draw conclusions.

Answer 22

Explaining to a child to avoid fire by showing a video where fire causes damage.

Answer 23

A learning method that concludes from direct experiences.

Answer 24

Letting a child touch fire, and after getting burned, they learn it’s dangerous.

Answer 25

Unsupervised learning.

Answer 26

A clustering algorithm.

Answer 27

Supervised learning.

Answer 28

A classification algorithm.

Answer 29

It groups data points into K clusters where points within each cluster are similar.

Answer 30

It classifies an unlabeled observation based on the majority class of its K nearest neighbors.

Answer 31

Because it assumes that all features are independent of each other given the class label.

Answer 32

A fruit might be classified as a cherry if it’s red and round, assuming these features are independent of each other, even if other fruits share them too.

Answer 33

When your target variable is categorical, such as predicting yes/no, gender, or animal breed.

Answer 34

When your target variable is continuous, like estimating sales, prices, or rainfall.

Answer 35

A supervised machine learning algorithm that builds multiple decision trees during training and outputs the majority decision for classification problems.

Answer 36

The error introduced when a model makes assumptions about data, causing predicted values to be far from actual values.

Answer 37

Underfitting — the model misses important relationships between features and target outputs.

Answer 38

The amount a model’s predictions would change if trained on different data.

Answer 39

Overfitting — the model captures random noise in the training data instead of the actual pattern.

Answer 40

Making a model more complex reduces bias but increases variance. The goal is to balance both to minimize total error.

Answer 41

It will be consistent but inaccurate on average (underfitting).

Answer 42

It will be accurate on training data but inconsistent across different datasets (overfitting).

Answer 43

The ratio of true positive predictions to the total predicted positives. Precision = TP / (TP + FP)

Answer 44

The ratio of true positive predictions to the actual total positives. Recall = TP / (TP + FN)

Answer 45

A supervised algorithm that builds a tree-like model by splitting the dataset into subsets based on feature values, handling both categorical and numerical data.

Answer 46

It breaks the dataset into smaller subsets recursively, developing a tree structure with decision nodes and branches based on feature conditions.

Answer 47

A technique to reduce the size of decision trees by removing sections that provide little power, to reduce complexity and prevent overfitting.

Answer 48

It can be done top-down from the root or bottom-up starting from the leaf nodes.

Answer 49

A pruning method where nodes are replaced with their most popular class starting at the leaves, and the change is kept if accuracy is not affected.

Answer 50

It simplifies the model and improves speed while reducing overfitting.

Answer 51

A classification algorithm that predicts a binary outcome (0 or 1) based on independent variables.

Answer 52

Using a threshold value, typically 0.5 — values above 0.5 are considered 1, and below 0.5 are considered 0.

Answer 53

A classification algorithm that assigns a new data point to the class most common among its K nearest neighbors.

Answer 54

The new data point is assigned to the class with the majority vote among the K neighbors.

Answer 55

It is an integer value greater than 1, often selected based on experimentation or cross-validation.

Answer 56

Classifying a black ball based on whether its five nearest neighbors are more like tennis balls, basketballs, or footballs — and assigning it to the majority class.

Answer 57

When the null hypothesis is true, but we reject it.

Answer 58

When the null hypothesis is false, but we accept it.

Answer 59

A measure of how strongly two random variables are related, with values ranging from -1 to +1.

Answer 60

Between -1 and +1.

Answer 61

A measure indicating the direction of the linear relationship between two random variables.

Answer 62

It can range from negative infinity to positive infinity (-∞ to +∞).

Answer 63

Data points nearest to the hyperplane that influence its position and orientation in a Support Vector Machine.

Answer 64

Because removing them would alter the position of the hyperplane and change the model’s decision boundary.

Technical Flashcards

(89 cards)