NLP & RNN Flashcards
What does NLP stand for?
Natural Language Processing
What are LLMs?
Large Language Models
List three tools included in NLP.
- N-grams
- TF-IDF
- Bag of Words
What is the main goal of NLP?
Make machines understand and interpret human language (spoken or written)
What are the two major tasks of NLP?
- Natural Language Understanding (NLU)
- Natural Language Generation (NLG)
Define Natural Language Understanding (NLU).
Get meaning from language
Define Natural Language Generation (NLG).
Produce human-like text
What is lexical ambiguity?
Word meaning ambiguity, e.g., ‘bank’ (money vs river)
What is semantic ambiguity?
Sentence meaning ambiguity, e.g., ‘I saw him with a telescope’
What is anaphoric ambiguity?
Referring to something earlier, e.g., ‘He told his dog to sit, and it did’
List three applications of NLU.
- Search
- Word prediction
- Text classification (e.g., spam detection)
What is the first step in the NLP pipeline?
Sentence Segmentation
Fill in the blank: The process of breaking sentences into words is called _______.
Tokenization
What is the difference between stemming and lemmatization?
- Stemming: Chops suffixes crudely
- Lemmatization: Uses dictionary rules
Provide an example of stemming.
‘drove’ → ‘drov’
Provide an example of lemmatization.
‘drove’ → ‘drive’
What are stop words?
Common words with little meaning on their own (e.g., ‘the’, ‘is’, ‘and’)
Why are stop words removed in text analysis?
Helps reduce noise in text analysis
What does POS tagging stand for?
Part of Speech tagging
List four types of labels assigned in POS tagging.
- Noun
- Verb
- Adjective
- Adverb
What does Bag of Words (BoW) do?
Converts text into a vector of word counts, ignoring word order
What is a limitation of the Bag of Words model?
Loses grammar & order info
What is the purpose of Information Retrieval Models?
Rank documents based on similarity to a search query
What are the components of the TF-IDF formula?
- TF = How often word shows up in a doc
- IDF = How rare word is in whole corpus
- TF-IDF = TF × IDF