Vocabulary Bank Flashcards
(31 cards)
Backpropagation through time (BPTT)
A gradient-based technique for training RNNs by unfolding them in time and applying
backpropagation to update all of the network's shared parameters
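A minimal NumPy sketch of the idea (toy sizes, and names such as Wx and Wh, are assumptions, not a reference implementation): the forward pass unfolds the RNN across T time steps, and the backward pass walks back through those same steps, accumulating gradients for the shared weights.

```python
import numpy as np

T, n_in, n_h = 5, 3, 4                         # sequence length, input size, hidden size
rng = np.random.default_rng(0)
Wx = rng.normal(scale=0.1, size=(n_h, n_in))   # input-to-hidden weights (shared across steps)
Wh = rng.normal(scale=0.1, size=(n_h, n_h))    # hidden-to-hidden weights (shared across steps)
xs = rng.normal(size=(T, n_in))                # toy input sequence
target = rng.normal(size=n_h)                  # toy target for the final hidden state

# Forward pass: unfold the RNN in time, caching every hidden state.
hs = [np.zeros(n_h)]
for t in range(T):
    hs.append(np.tanh(Wx @ xs[t] + Wh @ hs[-1]))
loss = 0.5 * np.sum((hs[-1] - target) ** 2)

# Backward pass ("through time"): propagate the error back step by step,
# accumulating gradients for the shared parameters.
dWx, dWh = np.zeros_like(Wx), np.zeros_like(Wh)
dh = hs[-1] - target                           # dLoss/dh_T
for t in reversed(range(T)):
    dpre = dh * (1.0 - hs[t + 1] ** 2)         # back through tanh
    dWx += np.outer(dpre, xs[t])
    dWh += np.outer(dpre, hs[t])
    dh = Wh.T @ dpre                           # hand the gradient to h_{t-1}

Wx -= 0.01 * dWx                               # one gradient-descent update
Wh -= 0.01 * dWh
```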
Batch size
The number of training examples processed in one forward/backward pass through the
network before the loss and, subsequently, the gradients are calculated
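A quick illustration with made-up numbers: 1,000 training examples and a batch size of 32.

```python
# Illustrative only: 1,000 training examples split into mini-batches of 32.
import numpy as np

data = np.arange(1000)            # stand-in for 1,000 training examples
batch_size = 32
batches = [data[i:i + batch_size] for i in range(0, len(data), batch_size)]

print(len(batches))               # 32 batches (the last one holds only 8 examples)
# The loss and gradients are computed once per batch, not once per example.
```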
Bag-of-words
A text representation method in NLP where a document is represented as a
vector of word frequencies, ignoring grammar and word order.
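A hand-rolled sketch with toy documents: each document becomes a vector of counts over a shared vocabulary, and word order disappears.

```python
from collections import Counter

docs = ["the cat sat on the mat", "the dog sat"]
vocab = sorted({w for d in docs for w in d.split()})          # shared vocabulary
vectors = [[Counter(d.split())[w] for w in vocab] for d in docs]

print(vocab)    # ['cat', 'dog', 'mat', 'on', 'sat', 'the']
print(vectors)  # [[1, 0, 1, 1, 1, 2], [0, 1, 0, 0, 1, 1]]
```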
Biases
Systematic errors in a dataset that can lead to unfair outcomes in a model
Types of Biases
Confirmation
Historical
Labeling
Linguistic
Sampling
Selection
Dataset
A collection of data used for training or evaluating machine learning models
Deep learning
A subset of machine learning involving neural networks with many layers that
can learn representations of data
Graphics processing unit (GPU)
A specialized hardware component designed to handle and
accelerate parallel processing tasks, particularly effective for rendering graphics and training
deep learning models by performing simultaneous computations across multiple cores
Hyperparameter tuning
The process of optimizing the configuration settings (hyperparameters) that govern the
training process of machine learning models, such as the learning rate or batch size, to
improve performance
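One common approach is a grid search over candidate settings; in this sketch, train_and_eval is a hypothetical stand-in for a real training loop that returns a validation score.

```python
from itertools import product

def train_and_eval(lr, batch_size):
    # Hypothetical stand-in: pretend this trains a model and returns
    # validation accuracy for the given hyperparameters.
    return 1.0 - abs(lr - 0.01) - batch_size / 10_000

# Try every combination of learning rate and batch size, keep the best.
best = max(product([0.1, 0.01, 0.001], [32, 64, 128]),
           key=lambda cfg: train_and_eval(*cfg))
print(best)  # (0.01, 32) under this toy scoring function
```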
Large language model (LLM)
A type of AI model trained on vast amounts of text data to
understand and generate human-like text
Latency
The delay between the input to a system and the corresponding output.
Learning rate
Controls the size of the steps the model takes when updating its parameters during
training; if the learning rate is increased, the network's weights and biases are updated
more significantly in each iteration
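A minimal sketch of the effect, minimizing f(w) = w**2 with plain gradient descent (all values illustrative):

```python
def grad(w):
    return 2 * w             # derivative of f(w) = w**2

w, lr = 5.0, 0.1
for _ in range(50):
    w -= lr * grad(w)        # a larger lr means a larger update per step
print(w)                     # ~7e-05, close to the minimum at w = 0
```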
Long short-term memory (LSTM)
A type of RNN designed to remember information for long
periods and mitigate the vanishing gradient problem
Long-term dependency
The challenge in sequence models, such as recurrent neural networks (RNNs), of capturing
and using information from early in the input sequence to make accurate predictions at
later time steps
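A toy numeric illustration of why this is hard: in a plain RNN, the gradient signal reaching early time steps is scaled by roughly the recurrent weight raised to the number of steps, so it vanishes (or explodes) quickly.

```python
w = 0.5                      # recurrent weight with |w| < 1 (toy value)
grad = 1.0
for t in range(20):          # backpropagate across 20 time steps
    grad *= w
print(grad)                  # ~9.5e-07: the signal from early steps has vanished
```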
Loss function
A function that measures the difference between the predicted output and the
actual output, guiding model training.
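For example, mean squared error is a common loss for regression; this small sketch uses made-up predictions and targets.

```python
# Mean squared error: smaller values mean predictions are closer to the targets.
def mse(predicted, actual):
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)

print(mse([2.5, 0.0, 2.0], [3.0, -0.5, 2.0]))  # 0.1666...
```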
Memory cell state
In LSTM networks, the cell state carries long-term memory through the
network, allowing it to retain information across time steps
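A toy illustration (assumed gate values) of the standard cell-state update c_t = f_t * c_{t-1} + i_t * c̃_t: the forget gate scales the old memory and the input gate admits new candidate information.

```python
import numpy as np

c_prev  = np.array([0.8, -0.3])   # previous cell state (long-term memory)
f_gate  = np.array([0.9,  0.1])   # forget gate (close to 1 = keep the memory)
i_gate  = np.array([0.2,  0.7])   # input gate (how much new info to admit)
c_tilde = np.array([0.5, -1.0])   # candidate values from the current input

c_t = f_gate * c_prev + i_gate * c_tilde
print(c_t)                        # [ 0.82 -0.73]
```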
Natural language processing
The field of AI focused on the interaction between computers
and human language
Discourse integration
Understanding and maintaining coherence across multiple
sentences or turns in conversation
Lexical analysis
The process of breaking text into words (tokens) and examining their structure
Pragmatic analysis
Understanding language in context, including the intended
meaning and implications
Semantic analysis
The process of understanding the meaning of words and sentences
Syntactical analysis (parsing)
Analyzing the grammatical structure of sentences
Natural language understanding (NLU)
A modular set of systems that sequentially process text input to better represent its
meaning before it is fed into a neural network such as a transformer or an LSTM
Pre-processing
The process of cleaning and preparing raw data for analysis or model training
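A small illustrative pipeline, assuming a toy stop-word list: lowercase the text, strip punctuation, tokenize, and drop stop words.

```python
import string

def preprocess(text, stop_words=frozenset({"the", "a", "an"})):
    # Lowercase, remove punctuation, split into tokens, drop stop words.
    text = text.lower().translate(str.maketrans("", "", string.punctuation))
    return [tok for tok in text.split() if tok not in stop_words]

print(preprocess("The cat sat on the mat!"))  # ['cat', 'sat', 'on', 'mat']
```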