Week 6 - Language Flashcards
(38 cards)
Computational Linguistics
The science of how language works, studied using math and logic.
NLP (Natural Language Processing)
AI tools for working with language (like translation, chatbots)
Chomsky’s Generative Grammar
Early formal models of language structure.
ELIZA
One of the first chatbot programs; mimicked a psychotherapist but lacked true understanding.
Neural Networks
AI models inspired by how neurons work.
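A minimal sketch of the idea in Python (the weights, bias, and inputs are made-up values): an artificial "neuron" computes a weighted sum of its inputs and passes it through an activation function.

```python
import math

def neuron(inputs, weights, bias):
    # Weighted sum of inputs plus a bias term
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    # Sigmoid activation squashes the result into (0, 1)
    return 1 / (1 + math.exp(-z))

print(neuron([0.5, 0.2], [0.8, -0.4], 0.1))  # ~0.60
```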
Word Embeddings
Words represented as vectors positioned in a multidimensional space; words with similar meanings end up near each other.
Example: the “dog” and “cat” vectors are placed close together.
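A minimal sketch of the idea (the 3-D vectors here are hand-made toy values; real embeddings are learned from data and have hundreds of dimensions): cosine similarity is high for nearby vectors like "dog" and "cat", low for distant ones.

```python
import math

emb = {
    "dog": [0.9, 0.8, 0.1],
    "cat": [0.85, 0.75, 0.15],
    "car": [0.1, 0.2, 0.95],
}

def cosine(a, b):
    # Cosine similarity: 1.0 = same direction, 0.0 = unrelated
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

print(cosine(emb["dog"], emb["cat"]))  # ~0.999: similar meanings
print(cosine(emb["dog"], emb["car"]))  # ~0.29: dissimilar
```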
Transformers
A sequence-to-sequence model based on deep neural networks (deep learning) with multi-head attention.
E.g., used in translation:
input (encoder): English
output (decoder): French
Self-attention mechanism:
The model looks at all words in a sequence, not just the last ones.
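A quick way to try English-to-French translation with an encoder-decoder Transformer, assuming the Hugging Face `transformers` library is installed (the default model it downloads is one public example, not necessarily the course's):

```python
from transformers import pipeline

# Loads a default English-to-French translation model
translator = pipeline("translation_en_to_fr")
print(translator("The cat sits on the mat."))
# Output is a list like: [{'translation_text': '...'}]
```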
The Transformer consists of two main parts, the encoder and the decoder, linked by an embedding:
encoder
embedding
decoder
encoder
Takes the input sequence and maps it into an embedding.
embedding
An n-dimensional vector representing the sequence.
decoder
Takes the embedding and turns it into the output sequence.
Self-attention
Allows the Transformer to look at other positions in the input sequence for clues that can lead to a better encoding of each word.
How self-attention differs from previous models
It can attend to all the words in the sequence, not just the last ones.
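A minimal sketch of the core computation, scaled dot-product attention, using NumPy. The Q, K, V matrices here are random stand-ins for the learned query/key/value projections of a 4-token sequence (real Transformers add learned weights, multiple heads, and more):

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, d = 4, 8                      # 4 tokens, 8-dim representations
Q = rng.normal(size=(seq_len, d))      # queries
K = rng.normal(size=(seq_len, d))      # keys
V = rng.normal(size=(seq_len, d))      # values

# How much each token attends to every other token
scores = Q @ K.T / np.sqrt(d)
# Softmax turns scores into attention weights (each row sums to 1)
weights = np.exp(scores) / np.exp(scores).sum(axis=-1, keepdims=True)
# Each token's output mixes in information from ALL positions
output = weights @ V

print(weights.round(2))
```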
Applications of Transformers:
Machine translation
Summarization
Document generation
Named Entity Recognition (NER)
Biological sequence analysis
Computer vision
Protein folding
Code generation
Language model
Predicts the probability of the next word in a sequence.
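A minimal sketch of the idea using a bigram model over a toy corpus (real LLMs learn next-word probabilities from billions of tokens, not from counts like this):

```python
from collections import Counter, defaultdict

corpus = "the cat sat on the mat the cat ran".split()

# Count which word follows each word in the corpus
following = defaultdict(Counter)
for w1, w2 in zip(corpus, corpus[1:]):
    following[w1][w2] += 1

def next_word_probs(word):
    # Turn counts into probabilities for the next word
    counts = following[word]
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

print(next_word_probs("the"))  # {'cat': 0.67, 'mat': 0.33} (approx.)
```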
Language model examples
GPT, PaLM, LLaMA, Bard, Claude
Language model training
Pre-trained on massive datasets (hundreds of billions of tokens).
GPT-style Models
Decoder-only models.
Predict one word at a time.
Example: ChatGPT (Generative Pre-trained Transformer).
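A minimal sketch of decoder-only generation, where `most_likely_next` is a made-up stand-in for a trained model: the output grows one predicted word at a time, each prediction feeding the next.

```python
def most_likely_next(words):
    # Toy stand-in for a real model's next-word prediction
    table = {"the": "cat", "cat": "sat", "sat": "on", "on": "the"}
    return table.get(words[-1], "<end>")

words = ["the"]
while len(words) < 8 and words[-1] != "<end>":
    words.append(most_likely_next(words))  # one word at a time

print(" ".join(words))  # "the cat sat on the cat sat on"
# Greedy decoding loops on a toy table; real models sample from probabilities.
```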
Prompt Engineering
Crafting prompts to guide LLM behavior.
RLHF (Reinforcement Learning from Human Feedback)
Fine-tuning models using human evaluations.
In-context learning
Teaching the LLM how to respond by including an example in the prompt (one example = "one-shot").
(Prompt engineering, in contrast, is the art of asking the right question to get the best output from an LLM; it enables direct interaction with the LLM using only plain-language prompts.)
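A minimal sketch of a one-shot prompt (`ask_llm` is a hypothetical stand-in for any LLM API): the single labeled example inside the prompt teaches the model the desired format.

```python
prompt = """Classify the sentiment of each review as Positive or Negative.

Review: "I loved this movie, a total joy."
Sentiment: Positive

Review: "The plot was dull and the acting was worse."
Sentiment:"""

# response = ask_llm(prompt)  # a well-behaved model should answer "Negative"
print(prompt)
```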
LLM problems
performance disparities
bias
misinformation
privacy and security
ethical concerns
environmental concerns
Toxicity
Anything that is rude or offensive.
A chatbot could reply with a toxic response.