NLP & RNN Flashcards

1
Q

What does NLP stand for?

A

Natural Language Processing

2
Q

What are LLMs?

A

Large Language Models

3
Q

List three tools used in NLP.

A
  • N-grams
  • TF-IDF
  • Bag of Words
4
Q

What is the main goal of NLP?

A

Make machines understand and interpret human language (spoken or written)

5
Q

What are the two major tasks of NLP?

A
  • Natural Language Understanding (NLU)
  • Natural Language Generation (NLG)
6
Q

Define Natural Language Understanding (NLU).

A

Get meaning from language

7
Q

Define Natural Language Generation (NLG).

A

Produce human-like text

8
Q

What is lexical ambiguity?

A

Word-meaning ambiguity, e.g., ‘bank’ (financial institution vs. riverbank)

9
Q

What is semantic ambiguity?

A

Sentence-meaning ambiguity, e.g., ‘I saw him with a telescope’ (who has the telescope?)

10
Q

What is anaphoric ambiguity?

A

An ambiguous reference back to something mentioned earlier, e.g., ‘He told his dog to sit, and it did’ (what does ‘it’ refer to?)

11
Q

List three applications of NLU.

A
  • Search
  • Word prediction
  • Text classification (e.g., spam detection)
12
Q

What is the first step in the NLP pipeline?

A

Sentence Segmentation

13
Q

Fill in the blank: The process of breaking sentences into words is called _______.

A

Tokenization

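A minimal sketch of these two pipeline steps, assuming NLTK is installed and its ‘punkt’ sentence models have been downloaded:

```python
# Sentence segmentation followed by tokenization with NLTK
# (assumes: pip install nltk, then nltk.download('punkt')).
from nltk import sent_tokenize, word_tokenize

text = "NLP is fun. Tokenize me!"
for sentence in sent_tokenize(text):   # step 1: sentence segmentation
    print(word_tokenize(sentence))     # step 2: break each sentence into words
# ['NLP', 'is', 'fun', '.']
# ['Tokenize', 'me', '!']
```
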
14
Q

What is the difference between stemming and lemmatization?

A
  • Stemming: Crudely chops off suffixes
  • Lemmatization: Uses dictionary rules to return the proper base form (lemma)
15
Q

Provide an example of stemming.

A

‘drove’ → ‘drov’

16
Q

Provide an example of lemmatization.

A

‘drove’ → ‘drive’

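A minimal sketch contrasting the two with NLTK's PorterStemmer and WordNetLemmatizer (exact outputs vary by stemmer; assumes the ‘wordnet’ corpus has been downloaded):

```python
# Stemming chops suffixes by rule; lemmatization looks up the base form.
# (assumes: pip install nltk, then nltk.download('wordnet'))
from nltk.stem import PorterStemmer, WordNetLemmatizer

stemmer = PorterStemmer()
lemmatizer = WordNetLemmatizer()

print(stemmer.stem("studies"))                 # 'studi'  (crude suffix chopping)
print(lemmatizer.lemmatize("studies"))         # 'study'  (dictionary lookup)
print(lemmatizer.lemmatize("drove", pos="v"))  # 'drive'  (needs the POS hint)
```
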
17
Q

What are stop words?

A

Common words with little meaning on their own (e.g., ‘the’, ‘is’, ‘and’)

18
Q

Why are stop words removed in text analysis?

A

Removing them reduces noise in the analysis

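A minimal stop-word-removal sketch; the three-word stop list here is illustrative (real pipelines typically use NLTK's or spaCy's lists):

```python
# Filter out common low-information words before analysis.
stop_words = {"the", "is", "and"}   # toy stop list for illustration
tokens = ["the", "movie", "is", "great", "and", "fun"]
print([t for t in tokens if t not in stop_words])  # ['movie', 'great', 'fun']
```
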
19
Q

What does POS tagging stand for?

A

Part-of-Speech tagging

20
Q

List four types of labels assigned in POS tagging.

A
  • Noun
  • Verb
  • Adjective
  • Adverb
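
A minimal POS-tagging sketch with NLTK (assumes the ‘averaged_perceptron_tagger’ model is downloaded; tag names follow the Penn Treebank convention):

```python
# Assign a part-of-speech label to each token.
# (assumes: nltk.download('averaged_perceptron_tagger'))
from nltk import pos_tag

print(pos_tag(["the", "dog", "runs", "fast"]))
# [('the', 'DT'), ('dog', 'NN'), ('runs', 'VBZ'), ('fast', 'RB')]
```
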
21
Q

What does Bag of Words (BoW) do?

A

Converts text into a vector of word counts, ignoring word order

22
Q

What is a limitation of the Bag of Words model?

A

Loses grammar & order info

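A minimal Bag of Words sketch with scikit-learn's CountVectorizer; the two example sentences get identical vectors, which illustrates exactly the limitation above:

```python
# Each document becomes a vector of word counts; word order is discarded.
from sklearn.feature_extraction.text import CountVectorizer

docs = ["the cat sat on the mat", "the mat sat on the cat"]
vec = CountVectorizer()
X = vec.fit_transform(docs)
print(vec.get_feature_names_out())  # ['cat' 'mat' 'on' 'sat' 'the']
print(X.toarray())                  # identical rows: order information is lost
```
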
23
Q

What is the purpose of Information Retrieval Models?

A

Rank documents based on similarity to a search query

24
Q

What are the components of the TF-IDF formula?

A
  • TF (term frequency) = how often the word appears in a document
  • IDF (inverse document frequency) = how rare the word is across the whole corpus
  • TF-IDF = TF × IDF
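
A minimal sketch with scikit-learn's TfidfVectorizer (note that libraries apply smoothing and normalization on top of the raw TF × IDF formula):

```python
# Words that appear in every document (like 'the') get low weights;
# rarer, more distinctive words get higher ones.
from sklearn.feature_extraction.text import TfidfVectorizer

docs = ["the cat sat", "the dog sat", "the cat ran"]
X = TfidfVectorizer().fit_transform(docs)
print(X.toarray().round(2))  # one row of TF-IDF weights per document
```
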
25
Q

What does TF-IDF emphasize?

A

Unique, meaningful words

26
Q

What does an N-gram predict?

A

Next word using previous N-1 words

27
Q

What is a Bigram?

A

2 words

28
Q

What is a Trigram?

A

3 words

29
Q

What is a limitation of N-grams?

A

High memory usage with large N

30
Q

True or False: N-grams can handle unseen sequences.

A

False
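
A toy bigram sketch built from raw counts; the empty result for an unseen context illustrates card 30's point (no smoothing is applied here):

```python
# Count bigrams, then "predict" the next word as the most frequent follower.
from collections import Counter, defaultdict

corpus = "the cat sat on the mat the cat ran".split()
counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    counts[prev][nxt] += 1

print(counts["the"].most_common(1))  # [('cat', 2)] -> most likely next word
print(counts["dog"])                 # Counter() -> unseen context, no prediction
```
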
31
Q

What do LLMs use to overcome the limitations of N-grams?

A

Neural networks

32
Q

What is the Bag of Words model?

A

Counts word occurrences, but ignores order

33
Q

What does TF-IDF measure?

A

Word importance across documents

34
Q

What is an N-Gram model?

A

Predicts the next word based on the previous N-1 words

35
Q

What is a Bigram?

A

A two-word sequence

36
Q

What is a limitation of the Bag of Words model?

A

Ignores long-term relationships (word order & meaning fade fast)

37
Q

How many total reviews are in the IMDB dataset?

A

50,000 total reviews

38
Q

What is the distribution of positive and negative reviews in the IMDB dataset?

A

50% positive, 50% negative

39
Q

What type of classification is used with the IMDB dataset?

A

Binary classification (0 = negative, 1 = positive)

40
Q

What is the first step in building the IMDB classifier with a Fully Connected Network?

A

Data preprocessing

41
Q

What transformation is applied to word indices in data preprocessing?

A

Each review is converted to a 10,000-length one-hot (strictly, multi-hot) binary vector
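
A minimal sketch of this step, following the standard Keras IMDB recipe (every word index that occurs in the review is set to 1):

```python
# Turn lists of word indices into fixed-length 10,000-dim binary vectors.
import numpy as np

def vectorize(sequences, dim=10_000):
    out = np.zeros((len(sequences), dim))
    for i, seq in enumerate(sequences):
        out[i, seq] = 1.0  # mark every word index present in this review
    return out

x = vectorize([[3, 5], [1, 3, 7]])
print(x.shape, x[0, 3], x[0, 4])  # (2, 10000) 1.0 0.0
```
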
42
Q

What is the structure of the Fully Connected Network (FCN) used for IMDB classification?

A

A Sequential model with three layers: Dense(16, activation='relu'), Dense(16, activation='relu'), Dense(1, activation='sigmoid')

43
Q

What optimizer is used when compiling the FCN model?

A

Adam

44
Q

What loss function is used in the FCN model?

A

Binary crossentropy
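
Cards 42-44 pin down the whole model, so it can be sketched directly in Keras (assumes TensorFlow 2.x; the input shape matches the 10,000-length vectors from card 41):

```python
# The fully connected IMDB classifier from cards 42-44.
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    layers.Dense(16, activation="relu", input_shape=(10_000,)),
    layers.Dense(16, activation="relu"),
    layers.Dense(1, activation="sigmoid"),  # P(review is positive)
])
model.compile(optimizer="adam",
              loss="binary_crossentropy",
              metrics=["accuracy"])
```
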
45
Q

What is a major issue with Fully Connected Networks in text classification?

A

They ignore word order

46
Q

What do Recurrent Neural Networks (RNNs) retain across time steps?

A

Memory

47
Q

What is the key idea behind RNNs?

A

The input at time t produces an output and passes a hidden state (h_t) on to the next step

48
Q

What does the hidden state (h) in RNNs do?

A

Carries context forward
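
A toy numpy sketch of a vanilla RNN rolled forward through time; the tanh update rule h_t = tanh(W_x x_t + W_h h_{t-1} + b) and the dimensions here are the textbook convention, not from the cards:

```python
# The same hidden state h is updated at every time step, carrying context forward.
import numpy as np

rng = np.random.default_rng(0)
W_x = rng.normal(size=(4, 3))        # input-to-hidden weights
W_h = rng.normal(size=(4, 4))        # hidden-to-hidden (recurrent) weights
b = np.zeros(4)

h = np.zeros(4)                      # initial hidden state
for x_t in rng.normal(size=(5, 3)):  # 5 time steps of 3-dim inputs
    h = np.tanh(W_x @ x_t + W_h @ h + b)
print(h.shape)  # (4,) final hidden state summarizing the sequence
```
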
49
Q

What is Backpropagation Through Time (BPTT) in RNNs?

A

Learning via a forward pass through time, with the loss computed at the final step and a backward pass through each time step

50
Q

What is used to ensure all sequences are the same length in RNNs?

A

Padding sequences
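
A minimal padding sketch using Keras's pad_sequences (by default it pads with zeros at the front):

```python
# Pad variable-length sequences to a common length.
from tensorflow.keras.preprocessing.sequence import pad_sequences

seqs = [[5, 8, 2], [7, 1]]
print(pad_sequences(seqs, maxlen=4))
# [[0 5 8 2]
#  [0 0 7 1]]
```
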
51
Q

What are word embeddings?

A

Dense vectors that encode meaning instead of one-hot vectors

52
Q

What does Word2Vec learn?

A

Word relationships

53
Q

What are the two models of Word2Vec?

A
  • CBOW: Predict the center word from its context
  • Skip-Gram: Predict the context from the center word
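
A minimal gensim sketch of both variants (gensim 4.x API; the sg flag switches between them, and the other parameters here are illustrative):

```python
# Train tiny CBOW and Skip-Gram models on a toy corpus.
from gensim.models import Word2Vec

sentences = [["the", "cat", "sat"], ["the", "dog", "sat"]]
cbow = Word2Vec(sentences, vector_size=50, window=2, min_count=1, sg=0)
skip = Word2Vec(sentences, vector_size=50, window=2, min_count=1, sg=1)
print(cbow.wv["cat"].shape)  # (50,) dense embedding vector
```
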
54
Q

What is the first layer in the IMDB + RNN model?

A

Embedding layer

55
Q

What does LSTM stand for?

A

Long Short-Term Memory

56
Q

What problem do LSTMs solve in RNNs?

A

Vanishing gradients

57
Q

What are the three gates in LSTMs and their roles?

A
  • Input Gate: Lets new information in
  • Forget Gate: Discards old information
  • Output Gate: Outputs the current state
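
A sketch tying cards 54-57 together: the IMDB recurrent classifier starts with an Embedding layer and uses an LSTM for the gated memory (assumes TensorFlow 2.x; layer sizes here are illustrative, not from the cards):

```python
# IMDB-style classifier: Embedding -> LSTM -> sigmoid output.
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    layers.Embedding(input_dim=10_000, output_dim=32),  # dense word vectors
    layers.LSTM(32),            # gated memory mitigates vanishing gradients
    layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
```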