Flashcards (25 cards)
What is the importance of data transformation in the preprocessing pipeline?
Data transformation converts raw data into a format suitable for analysis or machine learning. It helps standardize data, handle outliers, normalize scales, and encode categorical variables, ensuring that algorithms can process the data efficiently and accurately. Without transformation, models may misinterpret features, leading to poor performance or biased results.
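A minimal sketch of one such transformation step, assuming scikit-learn and pandas; the column names and values are illustrative only:

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import StandardScaler, OneHotEncoder

# Illustrative raw data: one numeric and one categorical feature
df = pd.DataFrame({
    "income": [35000, 52000, 120000, 47000],
    "city": ["Oslo", "Bergen", "Oslo", "Trondheim"],
})

# Scale the numeric column and one-hot encode the categorical one
transform = ColumnTransformer([
    ("scale", StandardScaler(), ["income"]),
    ("encode", OneHotEncoder(), ["city"]),
])

X = transform.fit_transform(df)
print(X)  # numeric values standardized, categories expanded to binary columns
```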
What are the challenges of data preprocessing in real-world machine learning projects and how can data quality be ensured?
Real-world data is often messy, containing missing values, inconsistencies, outliers, and irrelevant features. Challenges include handling incomplete or noisy data, integrating data from multiple sources, and ensuring privacy and security. Data quality can be ensured through validation checks, deduplication, consistent formats across sources, and documenting each cleaning step.
What is the impact of improper handling of missing data and what are common imputation methods?
Improper handling of missing data can introduce bias, reduce model accuracy, or cause algorithms to fail. Common imputation methods include:
- Mean/Median/Mode Imputation: Simple but may distort distributions
- K-Nearest Neighbors (KNN) Imputation: Uses similarity between samples; more accurate but computationally expensive
- Regression Imputation: Predicts missing values using other features
- Dropping Missing Values: Only suitable if missingness is rare and random
The choice of method affects model performance; inappropriate imputation can lead to misleading patterns or overfitting.
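A hedged sketch comparing two of these approaches with scikit-learn; the array values are made up for illustration:

```python
import numpy as np
from sklearn.impute import SimpleImputer, KNNImputer

# Toy feature matrix with missing entries (np.nan)
X = np.array([[1.0, 2.0],
              [np.nan, 3.0],
              [7.0, 6.0],
              [4.0, np.nan]])

# Mean imputation: fast, but can distort the column's distribution
mean_imputed = SimpleImputer(strategy="mean").fit_transform(X)

# KNN imputation: fills values from the most similar rows; costlier but often more faithful
knn_imputed = KNNImputer(n_neighbors=2).fit_transform(X)

print(mean_imputed)
print(knn_imputed)
```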
What is the role of feature engineering and what are some examples?
Feature engineering, in data science, refers to manipulation (addition, deletion, combination, mutation) of your data set to improve machine learning model training, leading to better performance and greater accuracy.
Examples include data binning, scaling, and combining existing features into new ones.
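As a small illustration of one of these techniques, data binning with pandas; the bin edges and labels are arbitrary:

```python
import pandas as pd

ages = pd.Series([5, 17, 24, 36, 52, 71])

# Bin a continuous feature into ordered categories (an engineered feature)
age_group = pd.cut(ages, bins=[0, 18, 35, 60, 100],
                   labels=["child", "young adult", "adult", "senior"])
print(age_group)
```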
Compare label encoding and one-hot encoding for categorical data.
Label encoding assigns each category a unique integer (best for ordinal data); one-hot encoding creates a new binary column for each category (best for nominal data). Label encoding is simple and memory-efficient but imposes an artificial ordinal relationship on nominal data, which can mislead models; one-hot encoding avoids this at the cost of more columns.
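A minimal comparison of the two encodings, assuming scikit-learn; the category values are invented:

```python
import numpy as np
from sklearn.preprocessing import LabelEncoder, OneHotEncoder

colors = np.array(["red", "green", "blue", "green"])

# Label encoding: one integer per category (implies an order: blue < green < red)
labels = LabelEncoder().fit_transform(colors)
print(labels)

# One-hot encoding: one binary column per category, no implied order
onehot = OneHotEncoder().fit_transform(colors.reshape(-1, 1)).toarray()
print(onehot)
```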
What is an activation function and what is its purpose in neural networks?
An activation function maps a neuron's combined input to its output; it acts as a switch for artificial neurons. Its purpose is to introduce non-linearity and to control when a neuron activates.
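Two common activation functions written in plain NumPy, to make the "switch" behaviour concrete:

```python
import numpy as np

def relu(x):
    # Passes positive inputs through unchanged, zeroes out negative ones
    return np.maximum(0, x)

def sigmoid(x):
    # Squashes any input into (0, 1), acting like a soft on/off switch
    return 1 / (1 + np.exp(-x))

z = np.array([-2.0, -0.5, 0.0, 1.5])
print(relu(z), sigmoid(z))
```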
What are the main types of gradient descent and when should each be used?
Gradient descent is an optimization algorithm used to minimize a function (normally a cost/loss function in machine learning) by iteratively moving in the direction of steepest descent. It is foundational for training models like neural networks and linear regression.
Vanilla (batch) gradient descent (like reviewing all the notes at once):
uses the entire dataset to compute the gradient in each iteration.
Pros: stable, precise result. Cons: heavy computation, slow, may get stuck in a local minimum.
Stochastic gradient descent (like a student reviewing notes with random flashcards):
uses a single randomly chosen sample per iteration to compute the gradient (like a chef tasting the soup while cooking so it can adjust immediately).
Pros: fast updates, can escape local minima due to noise. Cons: noisy convergence, may overshoot the global minimum.
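A rough sketch contrasting the two update styles on a one-parameter least-squares problem; the data, learning rate, and iteration counts are arbitrary:

```python
import numpy as np

# Toy data: y is roughly 3 * x
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([3.1, 5.9, 9.2, 11.8])
lr = 0.01

# Batch gradient descent: gradient computed from the whole dataset each step
w = 0.0
for _ in range(100):
    grad = np.mean(2 * (w * x - y) * x)   # full-dataset gradient
    w -= lr * grad
print("batch:", w)

# Stochastic gradient descent: gradient from one random sample per step
w = 0.0
rng = np.random.default_rng(0)
for _ in range(400):
    i = rng.integers(len(x))
    grad = 2 * (w * x[i] - y[i]) * x[i]   # single-sample (noisy) gradient
    w -= lr * grad
print("sgd:", w)
```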
What is the difference between random search and grid search for hyperparameter tuning?
Grid search exhaustively tries every combination of the provided hyperparameter values to find the best configuration for the model.
Random search samples configurations at random from the search space.
The search space is a space in which each dimension represents a hyperparameter and each point represents one model configuration.
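A short sketch of both approaches with scikit-learn; the model, parameter values, and dataset are placeholders:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV

X, y = load_iris(return_X_y=True)
model = RandomForestClassifier(random_state=0)
params = {"n_estimators": [50, 100, 200], "max_depth": [3, None]}

# Grid search: tries every combination in the grid (3 x 2 = 6 configurations)
grid = GridSearchCV(model, params, cv=3)
grid.fit(X, y)

# Random search: samples a fixed number of points from the same space
rand = RandomizedSearchCV(model, params, n_iter=4, cv=3, random_state=0)
rand.fit(X, y)

print(grid.best_params_, rand.best_params_)
```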
How are neural networks trained?
Step 1: Input data is divided into mini-batches and passed through the network (forward pass).
Step 2: The network produces outputs, which are compared to the true labels.
Step 3: A loss function calculates the error between the network’s output and the true label.
Step 4: The error is obtained, i.e. the difference between the prediction and the target, and propagated backward to adjust the weights.
Step 5: Repeat to reduce the error and optimize the model.
Cross Entropy is the distance between what the model believes the output distribution should be & what the original distribution is.
What are the main differences between artificial neurons and neural networks?
An artificial neuron is the basic mathematical unit that processes inputs and produces an output; a neural network is a group of artificial neurons connected and organized into layers.
What are the main types of artificial neural network topologies?
Feedforward neural networks connect neurons of one level to the next without backward connections.
Recurrent neural networks have feedback connections and are suitable for sequence data.
What is imputation of missing values and what are the main methods?
1) Listwise deletion: delete any case (row) that has a missing value.
2) Pairwise deletion: exclude a case only from analyses that require the missing variable.
3) Hot-deck imputation: replace missing values with values from similar cases in the same dataset.
4) Cold-deck imputation: replace missing values with values from similar cases in a different dataset.
5) Mean substitution: replace missing values with the mean of the variable across all other cases.
What is the intuition behind neural network training?
Training involves a forward pass (input passed through the network to produce a prediction), a loss calculation comparing the prediction to the target, a backward pass that assigns blame for the error to each weight, and small weight updates that gradually reduce the error.
What is the “one versus all” strategy for multiclass classification?
It is a strategy for multiclass problems: it provides a way to use binary classification as a series of yes/no predictions across the possible labels.
During training, the model runs through a sequence of binary classifiers, training each to answer a separate classification question.
Example for a pear: is it an apple, yes or no? Is it an orange, yes or no? Is it a pear, yes or no? Is it a grape, yes or no?
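A small sketch of the strategy using scikit-learn's OneVsRestClassifier wrapping a binary classifier; the iris dataset stands in for the fruit example:

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier

X, y = load_iris(return_X_y=True)

# One binary "is it class k, yes or no?" classifier is trained per class
ovr = OneVsRestClassifier(LogisticRegression(max_iter=1000)).fit(X, y)

print(len(ovr.estimators_))   # one fitted binary classifier per class (3 for iris)
print(ovr.predict(X[:5]))
```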
What is backpropagation and how does it work?
Backpropagation propagates the error backward through the network to compute how much each weight contributed to the error.
It propagates the error from the output of the ANN back toward its input. We cannot directly compute the derivative of the loss function with respect to the weights of the earlier layers, so the chain rule is applied layer by layer, starting from the output.
Backpropagation is a local process: each neuron only needs its own inputs and the gradient passed back to it; neurons are completely unaware of the complete topology of the network.
What is the simplest form of neural network and what does it do?
The perceptron is the simplest neural network—a single artificial neuron that makes yes/no decisions for linearly separable data. Its limitation is that it cannot handle non-linearity.
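A minimal NumPy perceptron learning the AND function (which is linearly separable); the learning rate and number of passes are arbitrary:

```python
import numpy as np

# AND gate: linearly separable, so a single perceptron can learn it
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 0, 0, 1])

w = np.zeros(2)
b = 0.0
lr = 0.1

for _ in range(20):                          # a few passes over the data
    for xi, target in zip(X, y):
        pred = int(np.dot(w, xi) + b > 0)    # step activation: fire or not
        w += lr * (target - pred) * xi       # perceptron learning rule
        b += lr * (target - pred)

print([int(np.dot(w, xi) + b > 0) for xi in X])  # expected: [0, 0, 0, 1]
```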
How can neural networks be trained?
- Forward Propagation:
- Input data is passed through the network’s layers, with each neuron applying weights and an activation function to produce an output.
- Loss Calculation:
- The output is compared to the true target values using a loss function (e.g., mean squared error for regression, cross-entropy for classification).
- Backpropagation:
- The gradient of the loss function with respect to each weight is computed and propagated backward through the network.
- Weight Update:
- The weights are adjusted to reduce the loss, typically using an optimization algorithm like stochastic gradient descent (SGD), Adam, or others.
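A compact sketch of these four steps as a PyTorch training loop; the network size, random data, and learning rate are placeholders:

```python
import torch
import torch.nn as nn

# Placeholder data: 64 samples, 10 features, 3 classes
X = torch.randn(64, 10)
y = torch.randint(0, 3, (64,))

model = nn.Sequential(nn.Linear(10, 16), nn.ReLU(), nn.Linear(16, 3))
loss_fn = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

for epoch in range(10):
    logits = model(X)            # forward propagation
    loss = loss_fn(logits, y)    # loss calculation
    optimizer.zero_grad()
    loss.backward()              # backpropagation: gradients of loss w.r.t. each weight
    optimizer.step()             # weight update
```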
What is the measure used for calculating error in neural networks?
Cross-entropy loss is commonly used.
Cross-entropy loss quantifies the difference between the predicted probability distribution (your model’s output) and the actual true distribution (the correct labels)
For each data point, it compares the predicted probability for the correct class with the actual label (which is 1 for the true class and 0 for all others).
* The loss is calculated using a logarithmic penalty:
* If the model is confident and correct, the loss is small.
* If the model is confident but wrong, the loss is large.
* If the model is uncertain, the loss is moderate.
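A small NumPy illustration of the logarithmic penalty on a single 3-class example; the predicted probabilities are made up:

```python
import numpy as np

def cross_entropy(predicted, actual):
    # -sum(actual * log(predicted)); only the true class's predicted probability matters
    return -np.sum(actual * np.log(predicted))

actual = np.array([0, 1, 0])  # true class is the second one

print(cross_entropy(np.array([0.05, 0.9, 0.05]), actual))   # confident and correct -> small loss
print(cross_entropy(np.array([0.9, 0.05, 0.05]), actual))   # confident but wrong   -> large loss
print(cross_entropy(np.array([0.34, 0.33, 0.33]), actual))  # uncertain             -> moderate loss
```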
What are the two main inputs to the cross-entropy loss measure?
The prediction (output from the neural network) and the actual label (ground truth) are the two main inputs.
How does mini-batch gradient descent address the limitations of stochastic gradient descent?
Mini-batch gradient descent addresses key limitations of Stochastic Gradient Descent (SGD) by balancing computational efficiency with stable convergence.
Mini-batch GD strikes a practical balance—retaining SGD’s speed while mitigating its instability through batch-averaged gradients, making it the default choice for modern deep learning.
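Continuing the earlier gradient-descent sketch, a mini-batch variant that averages the gradient over a small random batch each step; batch size and learning rate are arbitrary:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0])
y = 3.0 * x + np.array([0.1, -0.2, 0.1, 0.0, -0.1, 0.2, -0.1, 0.1])
lr, batch_size = 0.01, 4

w = 0.0
rng = np.random.default_rng(0)
for _ in range(200):
    idx = rng.choice(len(x), size=batch_size, replace=False)  # random mini-batch
    grad = np.mean(2 * (w * x[idx] - y[idx]) * x[idx])        # batch-averaged gradient
    w -= lr * grad
print(w)  # close to the true slope of about 3
```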
What are the limitations of multi-layer perceptrons and how can they be overcome?
A Multi-Layer Perceptron (MLP) is a foundational type of artificial neural network (ANN) consisting of an input layer, one or more hidden layers, and an output layer. Each layer is fully connected, with neurons applying nonlinear activation functions (e.g., ReLU, sigmoid) to process data. MLPs excel at learning complex, non-linear relationships and are widely used for tasks like classification, regression, and pattern recognition
Limitations:
1. Overfitting:
- MLPs with excessive hidden neurons/layers tend to memorize training data, reducing generalization.
- Example: High accuracy on training data but poor performance on test data.
2. Computational Intensity:
- Training deep MLPs requires significant computational resources, especially with large datasets.
3. Hyperparameter Sensitivity:
- Performance heavily depends on tuning parameters like learning rate, layer size, and activation functions.
These limitations can be addressed by:
- Regularization (L1/L2):
- L1 regularization (Lasso) adds a penalty proportional to the absolute values of the weights, which can help eliminate less important features.
- L2 regularization (Ridge) adds a penalty proportional to the square of the weights, encouraging smaller weights and reducing model complexity.
- Dropout:
- Randomly deactivates a subset of neurons during each training iteration, preventing the network from relying too heavily on specific neurons and thus reducing co-adaptation.
- Early Stopping:
- Monitors validation performance during training and halts the process when performance on the validation set begins to degrade, preventing the model from overfitting to the training data.
- Model Simplification:
- Reduces the complexity of the network by decreasing the number of hidden layers or neurons, thereby limiting the model’s capacity to memorize noise in the training data.
- Data Augmentation:
- Increases the effective size of the training set by generating new, slightly modified instances of the data (especially useful for image or sequence data), forcing the model to learn more robust features.
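A hedged PyTorch sketch combining two of these remedies, dropout and L2 regularization (weight decay); the layer sizes and penalty strength are placeholders:

```python
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(20, 64),
    nn.ReLU(),
    nn.Dropout(p=0.5),   # randomly deactivates half the neurons at each training step
    nn.Linear(64, 2),
)

# weight_decay adds an L2 penalty on the weights to every update
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4)
```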
What are CNNs and RNNs and what are they used for?
CNN: Convolutional Neural Network
- Meaning: A Convolutional Neural Network (CNN) is a type of deep learning model designed primarily for processing grid-like data, such as images.
RNN: Recurrent Neural Network (used where context matters, e.g. NLP, speech recognition)
- Meaning: A Recurrent Neural Network (RNN) is a deep learning architecture designed for processing sequential or time-series data.
- How it works:
- Recurrent connections: Neurons have connections that loop back, allowing the network to maintain a “memory” of previous inputs.
- Hidden state: Stores information from previous steps and combines it with current input.
- Processing: Data is processed one step at a time, with each step influenced by the sequence history.
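Minimal PyTorch definitions of the two layer types, just to show the data shapes involved; channel counts and sizes are placeholders:

```python
import torch
import torch.nn as nn

# CNN building block: a convolution slides small filters over grid-like input (e.g. an image)
conv = nn.Conv2d(in_channels=3, out_channels=8, kernel_size=3, padding=1)
image = torch.randn(1, 3, 32, 32)   # one 32x32 RGB image
print(conv(image).shape)            # torch.Size([1, 8, 32, 32])

# RNN building block: processes a sequence step by step, carrying a hidden state
rnn = nn.RNN(input_size=10, hidden_size=16, batch_first=True)
sequence = torch.randn(1, 5, 10)    # one sequence of 5 steps, 10 features each
output, hidden = rnn(sequence)
print(output.shape, hidden.shape)   # torch.Size([1, 5, 16]) torch.Size([1, 1, 16])
```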
What is CRISP-DM?
Cross-Industry Standard Process for Data Mining:
a non-proprietary, documented, and freely available data mining model.
It can be divided into six steps:
- business understanding
- data understanding
- data preparation
- modeling
- evaluation
- deployment
What is data collection?
Data collection is the process of gathering information on targeted variables in an established system.