Flashcards for exam

1
Q

Linear Regression (y = mx + c)

A

It’s a basic form of regression analysis. ‘m’ represents the slope of the line, indicating how much ‘y’ changes for a unit change in ‘x’. ‘c’ is the y-intercept, showing where the line crosses the y-axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Residue (ith value)

A

The difference between the observed value (yi) and the predicted value (ŷi). It’s a measure of the error in predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Unsupervised Learning Examples

A

Clustering (like K-means), Association (like Apriori algorithm), and Dimensionality Reduction (like PCA).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Applications of Computer Vision

A

Object detection, facial recognition, medical image analysis, autonomous vehicles, and surveillance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Logistic Regression (Sigmoid Curve)

A

Used for binary classification problems. The sigmoid function outputs a value between 0 and 1, representing the probability of a particular class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Examples of Unstructured Data

A

Text files, images, videos, social media posts, and emails.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Range for Classification

A

In binary classification, the output is typically in the range of 0 to 1, indicating the probability of belonging to a certain class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Image Extraction

A

Involves processing and analyzing images to derive meaningful information from them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Relationship Between AI and ML

A

Machine Learning is a subset of Artificial Intelligence. AI is a broader concept of machines being able to carry out tasks in a smart way, while ML is a current application of AI based on the idea that we should be able to give machines access to data and let them learn for themselves.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Steps in Machine Learning

A

Typically include data collection, data preprocessing, model selection, training the model, model evaluation, and model tuning/deployment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Data Collection for Machine Learning

A

This is often referred to as ‘Data Mining’ or ‘Data Gathering’.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Improving Facial Recognition Accuracy

A

Techniques include using more diverse datasets, applying robust algorithms, and incorporating 3D facial recognition technologies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Turing Test

A

A test of a machine’s ability to exhibit intelligent behavior equivalent to, or indistinguishable from, that of a human.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Regression Models

A

Examples include Linear Regression, Logistic Regression, and Polynomial Regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Classification Model

A

A type of model that is used to separate data into different classes. This can be binary classification (like spam detection) or multi-class classification (like image categorization).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Linear Function vs. Logistic Regression

A

Linear functions are used for predicting continuous values(relationship between height and weight), whereas logistic regression is used for binary classification (yes or no). The choice depends on the nature of the problem (regression vs classification).

17
Q

Supervised Machine Learning

A

Involves training a model on a labeled dataset. Steps include data collection, data cleaning, choosing a model, training the model, and evaluating its performance.

18
Q

Process of Classification:

A

It involves taking some kind of input, processing it, and categorizing it into a certain class or category.

19
Q

Confusion Matrix Terms:

A

True Positive: Correctly predicted positive class.
True Negative: Correctly predicted negative class.
False Positive (Type I Error): Incorrectly predicted positive class.
False Negative (Type II Error): Incorrectly predicted negative class.

20
Q

Confusion Matrix and Accuracy:

A

A tool to measure the performance of a classification model. Accuracy is calculated as (True Positives + True Negatives) / Total number of samples.

21
Q

ML for Natural Language Processing (NLP):

A

Involves applying ML techniques to understand and manipulate human language.

22
Q

Syntax and Semantics in NLP

A

Syntax refers to the arrangement of words in a sentence to make grammatical sense. Semantics refers to the meaning conveyed by a text.

23
Q

Statistical Machine Translation

A

An approach to machine translation that uses statistical models based on bilingual text corpora.

24
Q

Linear Regression Interpretation on a Graph:

A

Part 1 (Data Cleaning): It’s the process of correcting or removing inaccurate records from a dataset, improving its quality.
Part 2 (Line Interpretation): In y = mx + c, ‘m’ represents the slope (rise over run), and ‘c’ the y-intercept. The line of best fit minimizes the sum of the squares of the vertical distances of the points from the line.

25
Q

Point Residual:

A

The difference between an observed value and the value predicted by a model.

26
Q

Mean Absolute Error

A

Average of the absolute errors between predicted and actual values.

27
Q

Root Mean Square Error:

A

Square root of the average of squared differences between prediction and actual observation.

28
Q

Coefficient of Determination (R²):

A

Measures how well future outcomes are likely to be predicted by the model.

29
Q

Relative Squared Error

A

Sum of the squared differences between the actual and predicted values, normalized by the total variation in the dataset.