CAIC 9.5 Flashcards

(136 cards)

1
Q

What are common marketing tactics employed by retailers?

A

Direct marketing emails, digital advertisements, incentives, discounts

These tactics are often based on customer demographics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the goal of using ML models in marketing campaigns?

A

To optimize effectiveness, target the right customers, achieve high conversion rates, minimize costs

This involves analyzing customer data and demographics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is unsupervised clustering in customer segmentation?

A

A method to group customers based on data, such as basic demographics

This helps create unique marketing campaigns for each segment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What do highly personalized marketing campaigns utilize?

A

Accurate individual profiles using behavior data, historical transaction data, social media data

This leads to higher conversion rates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is contextual advertising?

A

A targeted marketing technique that displays ads relevant to web page content

Example: Cooking product ads on cooking recipe websites.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does generative AI enhance targeted marketing?

A

By creating dynamically personalized content, such as customized images and text

This is tailored to individual customer preferences and interests.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is sentiment analysis?

A

A text classification problem that determines if sentiment is positive, negative, or neutral

It uses labeled text data, such as product reviews.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What techniques do retailers use to assess brand perception?

A

Soliciting feedback, monitoring social media channels

This helps retailers understand customer emotions and sentiments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the main purpose of inventory planning and demand forecasting?

A

To manage inventory costs while maximizing revenue and avoiding out-of-stock situations

Traditional methods have limitations in accuracy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which techniques do retailers use for demand forecasting?

A

Statistical techniques, ML techniques such as regression analysis and deep learning

These approaches create accurate demand forecasts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the three main stages of the autonomous driving system architecture?

A

Perception and localization, decision and planning, control

Each stage plays a crucial role in the functioning of autonomous vehicles.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the role of the perception stage in autonomous driving?

A

To gather information about surroundings and determine the vehicle’s position

It uses sensors like RADAR, LIDAR, and cameras.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the function of the decision and planning stage in autonomous vehicles?

A

Controls motion and behavior based on data from the perception stage

It analyzes data to determine the optimal path for the vehicle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do AI and ML enhance the control module in autonomous vehicles?

A

By translating decisions into physical actions and optimizing the vehicle’s performance

This includes adaptive control systems and reinforcement learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the purpose of Advanced Driver Assistance Systems (ADAS)?

A

To enhance driving experience and safety by detecting hazards and issuing warnings

Examples include lane departure warnings and automatic emergency braking.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the main role of ML solutions architects?

A

To understand common ML algorithms and design technology infrastructure for deployment

This knowledge helps in selecting suitable data science solutions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is an objective function in ML algorithms?

A

A metric used to minimize or maximize, such as the disparity between projected and actual sales

It guides the optimization process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the purpose of gradient descent in ML?

A

To optimize model parameters by calculating the rate of error change

This iterative approach helps reduce errors in predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the primary purpose of gradient descent?

A

To optimize neural networks and various ML algorithms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does gradient descent calculate to update model parameters?

A

The rate of error change (gradient) associated with each input variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the role of the learning rate in gradient descent?

A

Controls the magnitude of parameter updates at each iteration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

List the key steps involved in the gradient descent optimization process.

A
  • Initialize the value of W randomly
  • Calculate the error (loss) using the assigned value of W
  • Compute the gradient of the error with respect to the loss function
  • Update the value of W to reduce the error
  • Repeat until the gradient becomes zero
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the normal equation in relation to machine learning?

A

A one-step analytical solution for calculating the coefficients of linear regression models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are some factors to consider when selecting a ML algorithm?

A
  • Problem type
  • Dataset size
  • Number and nature of features
  • Computational requirements
  • Interpretability of results
  • Assumptions about data distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is classification in machine learning?
A task that assigns categories or classes to data points
26
What is regression in machine learning?
A technique used to predict continuous numeric values
27
What is linear regression used for?
To predict a scalar output based on a linear function of input variables
28
What does logistic regression estimate?
The probability of an event occurring
29
What is the primary output of logistic regression?
A probability score between 0 and 1
30
What are the advantages of logistic regression?
* Fast training speed * Interpretability
31
What is the main advantage of decision trees over linear models?
Ability to capture non-linear relationships and interactions between features
32
What algorithms are used for splitting data in decision trees?
* Gini purity index * Information gain
33
What is a limitation of decision trees?
Prone to overfitting, especially with noisy data
34
What is the primary benefit of random forests?
Improved accuracy by combining predictions from multiple trees
35
How do random forests reduce overfitting?
By introducing randomness and using diverse subsets of features
36
What distinguishes gradient boosting from random forests?
Gradient boosting sequentially aggregates results while random forests use parallel independent learners
37
What is a key advantage of gradient boosting?
Ability to handle imbalanced datasets effectively
38
What is a disadvantage of gradient boosting?
Lacks parallelization capabilities, making it slower in training
39
What is gradient boosting?
A machine learning algorithm that can achieve higher performance than other algorithms when properly tuned.
40
What is a key advantage of gradient boosting?
It supports custom loss functions, providing flexibility in modeling real-world applications.
41
What is one limitation of gradient boosting?
It lacks parallelization capabilities, making it slower in training compared to parallelizable algorithms.
42
How does gradient boosting handle noisy data?
It is sensitive to noisy data, including outliers, which can lead to overfitting and reduced generalization performance.
43
What is XGBoost?
A widely-used implementation of gradient boosting that enables training a single tree across multiple cores and CPUs.
44
What are some improvements XGBoost offers over traditional gradient boosting?
Faster training times and powerful regularization techniques to mitigate overfitting.
45
What is K-NN?
A versatile algorithm used for both classification and regression tasks based on the proximity of data points.
46
What distance metric is commonly used in K-NN?
Euclidean distance.
47
What is the majority voting process in K-NN classification?
The most frequent class among the K nearest neighbors is assigned to the new data point.
48
What is one advantage of K-NN?
Its simplicity and lack of the need for training or tuning with hyperparameters.
49
What is a significant drawback of K-NN?
It is not suitable for high-dimensional datasets due to the diminished meaning of proximity.
50
What is an artificial neuron?
A computational unit that processes inputs and produces an output, similar to a biological neuron.
51
What does the activation function in an artificial neuron do?
Modifies the output of the linear function, capturing non-linear relationships.
52
What is a Multi-Layer Perceptron (MLP)?
An artificial neural network with multiple layers of interconnected neurons.
53
What is backpropagation?
The process of adjusting the weights of neurons based on the total error propagated back through the network.
54
What is the purpose of an MLP in machine learning?
To perform classification and regression tasks by capturing intricate nonlinear patterns.
55
What is clustering in data mining?
A method of grouping items together based on shared attributes.
56
What is the K-means clustering algorithm used for?
Grouping similar data points into clusters based on proximity to centroids.
57
What is a time series?
A sequence of data points recorded at successive time intervals.
58
What are the key characteristics of time series data?
* Trend * Seasonality * Stationarity
59
What does the ARIMA model stand for?
Auto-Regressive Integrated Moving Average.
60
What is the autoregressive component of ARIMA?
The value of a variable in a given period is influenced by its own previous values.
61
What does DeepAR utilize for forecasting?
A recurrent neural network (RNN) to capture patterns in target time series.
62
What is a major disadvantage of DeepAR?
The black-box nature of deep learning models makes forecasts difficult to explain.
63
What is a recommender system?
A machine learning system designed to suggest items to users based on various data inputs.
64
What is a significant drawback of DeepAR?
The black-box nature of the deep learning model, which lacks interpretability and transparency.
65
What is the primary function of a recommender system?
Predicting a user's preference for items based on user or item attribute similarities or user-item interactions.
66
In which industries has the recommender system gained widespread adoption?
* Retail * Media and entertainment * Finance * Healthcare
67
What is collaborative filtering?
A recommendation algorithm that predicts user preferences by analyzing the collective experiences and behaviors of different users.
68
What is a major benefit of collaborative filtering?
It provides highly personalized recommendations matched to each user’s unique interests.
69
What is the cold-start problem in collaborative filtering?
Collaborative models struggle when new users or items with no ratings are introduced.
70
What does matrix factorization do in collaborative filtering?
It learns vector representations for both users and items to predict missing entries in the user-item interaction matrix.
71
What is the primary function of a convolutional neural network (CNN)?
Processing and analyzing image data.
72
What role does the pooling layer play in a CNN?
It reduces the dimensionality of the extracted features.
73
What are two commonly used pooling techniques?
* Max pooling * Average pooling
74
What is the vanishing gradient problem in CNNs?
Signals from initial inputs diminish as they traverse through multiple layers.
75
How does ResNet address the vanishing gradient problem?
By implementing a layer-skipping technique with skip connections.
76
What does natural language processing (NLP) focus on?
The relationship between computers and human language.
77
What is the purpose of embedding in NLP?
To generate low-dimensional representations for words or sentences that capture semantic meaning.
78
What are the two components of TF-IDF?
* TF (Term Frequency) * IDF (Inverse Document Frequency)
79
What is a limitation of Bag of Words (BOW) and TF-IDF?
They lack the ability to capture the semantic meaning of words and often result in large and sparse input vectors.
80
What does the term 'embedding' refer to in machine learning?
Creating numerical representations for entities that capture their semantic similarity.
81
What is the primary advantage of using CNNs for image data?
Efficient training due to high degrees of parallelism.
82
What is a key challenge faced by MAB algorithms?
Striking the right balance between exploration and exploitation.
83
What is a practical application of computer vision technology?
* Object identification * Image classification * Face recognition * Activity detection
84
What is embedding?
A technique used to generate low-dimensional representations (mathematical vectors) for words or sentences that capture the semantic meaning of the text.
85
What does the underlying idea of embedding suggest?
Words or sentences with similar semantic meanings tend to occur in similar contexts.
86
How are semantically similar entities represented in embedding space?
They are closer to each other than those with different meanings.
87
What is cosine similarity?
A metric that measures how similar two vectors are by calculating the cosine of the angle between them.
88
What are the two techniques for learning embedding in Word2Vec?
* CBOW (Continuous Bag of Words) * Continuous-skip-gram
89
How does CBOW work?
It tries to predict a word for a given window of surrounding words.
90
How does continuous-skip-gram work?
It tries to predict surrounding words for a given word.
91
What is the purpose of a sliding window in CBOW?
To run across running text and choose one of the words as the target and the rest as inputs.
92
What type of network is used to train Word2Vec embeddings?
A straightforward one-hidden-layer MLP network.
93
What is the purpose of the hidden layer's weights in Word2Vec?
They serve as the actual embeddings for the words after training.
94
What is the term for using embeddings as features for downstream tasks?
Transfer learning.
95
What limitation does Word2Vec have regarding word meanings?
It produces a fixed embedding representation for each word, disregarding contextual variations.
96
What does BERT stand for?
Bidirectional Encoder Representations from Transformers.
97
What is the main advantage of contextualized word embeddings?
They consider the surrounding words or overall context, allowing for more nuanced representations.
98
What are the two main tasks BERT performs?
* Predicting randomly masked words in sentences * Predicting the next sentence from a given sentence.
99
What is the significance of subword level embeddings in BERT?
They allow BERT to handle out-of-vocabulary (OOV) words more effectively.
100
What component of the transformer architecture does BERT primarily use?
The encoder part.
101
What are some NLP tasks that BERT can be used for?
* Question answering * Named entity extraction * Text summarization
102
What is fine-tuning in the context of BERT?
Adding an additional output layer to the BERT network for a specific task and updating the pre-trained model weights.
103
What is a GAN?
A type of generative model designed to generate realistic data instances, such as images.
104
What are the two networks in a GAN?
* Generator * Discriminator
105
What is the role of the Generator in a GAN?
To generate instances of data.
106
What does the Discriminator in a GAN do?
Learns to distinguish between real and fake instances generated by the Generator.
107
What is few-shot learning?
A process where a model learns how to perform a task with just a few examples.
108
What is the main difference between GPT and BERT?
GPT uses the Transformer decoder block while BERT uses the Transformer encoder block.
109
What is the primary training approach used by GPT?
Next word prediction.
110
What does GPT-3 exemplify in terms of model parameters?
It has 175 billion parameters after training.
111
What are foundation models?
Models that are pre-trained on massive datasets and can handle multiple tasks.
112
What is the architecture called that PaLM uses?
Pathways.
113
What is LLaMA?
An LLM available in multiple sizes from 7 billion to 65 billion parameters.
114
What is a key advantage of LLaMA compared to larger models?
It requires fewer computational resources.
115
What is the parameter range of LLaMA?
From 7 billion parameters to 65 billion parameters
116
What advantages does LLaMA offer compared to larger models?
Requires fewer computational resources
117
What capabilities does LLaMA provide?
* Generating creative text * Answering questions * Solving mathematical problems
118
What type of license has Meta issued for LLaMA?
A noncommercial license emphasizing usage in research contexts
119
How does LLaMA perform when fine-tuned?
It performs extremely well with additional training data
120
What is the parameter count of BLOOM?
176 billion parameters
121
In how many languages can BLOOM generate text?
46 different languages and 13 programming languages
122
How many researchers contributed to the development of BLOOM?
More than 1,000 researchers
123
What is the Responsible AI License associated with BLOOM?
Individuals and institutions can use and build upon the model under agreed terms
124
Where is BLOOM easily accessible?
Within the Hugging Face ecosystem
125
What are domain-specific LLMs?
Models specifically trained for industries to solve tough domain-focused problems
126
What is BloombergGPT?
A domain-focused LLM specifically trained for the finance industry
127
What financial NLP tasks does BloombergGPT enhance?
* Sentiment analysis * Named entity recognition * News classification * Question answering
128
How many tokens comprise the comprehensive dataset used for BloombergGPT?
Over 700 billion tokens
129
What are some significant limitations of LLMs?
* Generating misinformation (hallucinations) * Toxic content * Potential bias * High resource consumption
130
What common problems have existing NLP techniques solved?
* Named entity extraction * Document classification * Sentiment analysis
131
What recent advancements have AI made in image generation?
High-resolution, photorealistic images and generative art
132
What is the new type of deep learning model used for image generation?
Diffusion model
133
How does a diffusion model generate realistic data?
By adding noise to input data until unrecognizable, then reversing the process
134
What is the process of adding noise to data in a diffusion model called?
Diffusion steps
135
What method does a diffusion model use to learn data generation?
Optimizing a set of learnable parameters through backpropagation
136
What does the iterative process of a diffusion model allow for?
Capturing complex dependencies, intricate patterns, and structures