LLM Flashcards
(23 cards)
What is subword tokenization?
Splitting words into smaller meaningful units (subwords)
Common in modern tokenizers such as BPE and WordPiece
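A minimal sketch of subword splitting, assuming the Hugging Face transformers library; bert-base-uncased is an illustrative model choice, not one named in these cards:

```python
from transformers import AutoTokenizer

# bert-base-uncased uses WordPiece subword tokenization.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# A longer word is split into meaningful sub-parts,
# e.g. ['token', '##ization'].
print(tokenizer.tokenize("tokenization"))
```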
What is the purpose of TrainingArguments() in fine-tuning?
Customize training settings
See documentation for all parameters
What does the output_dir parameter in TrainingArguments represent?
Directory where checkpoints and the fine-tuned model are saved
What is the significance of num_train_epochs in TrainingArguments?
Number of training epochs
What does learning_rate control in training?
The initial learning rate for the optimizer (step size of weight updates)
What do per_device_train_batch_size and per_device_eval_batch_size define?
The batch size for training and evaluation respectively
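A sketch tying the previous cards together; the parameter names are real TrainingArguments options, but the values and the output path are hypothetical examples:

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./finetuned_model",   # where checkpoints and outputs are saved
    num_train_epochs=3,               # number of passes over the training data
    learning_rate=2e-5,               # initial learning rate for the optimizer
    per_device_train_batch_size=8,    # batch size for training
    per_device_eval_batch_size=8,     # batch size for evaluation
)
```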
What is the role of the Trainer class?
Orchestrates the training loop: batching, loss computation, weight updates, and evaluation
What is the eval_dataset used for in the Trainer class?
The data used for evaluation during training
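A sketch of wiring the Trainer together, assuming training_args from the example above; tokenized_train and tokenized_eval are assumed pre-tokenized datasets, and bert-base-uncased is again an illustrative choice:

```python
from transformers import AutoModelForSequenceClassification, Trainer

model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized_train,  # data the model learns from
    eval_dataset=tokenized_eval,    # held-out data scored during training
)
trainer.train()
```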
In the training output, what does eval_loss indicate?
The loss computed on the evaluation dataset during training
How are predicted labels derived from model outputs?
Using torch.argmax on outputs.logits
What does the label_map dictionary represent?
A dictionary mapping predicted label indices to human-readable sentiment categories
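A runnable sketch of both steps; in practice the logits come from outputs.logits after a forward pass, and this label_map is a hypothetical example (the real label order depends on the model):

```python
import torch

# Stand-in logits so the snippet runs on its own; normally
# these are outputs.logits from the model's forward pass.
logits = torch.tensor([[0.1, 2.3], [1.8, -0.4]])
predicted = torch.argmax(logits, dim=-1)  # highest-scoring class per row

label_map = {0: "negative", 1: "positive"}  # hypothetical mapping
print([label_map[i.item()] for i in predicted])  # ['positive', 'negative']
```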
What is the function of the pipeline() method?
Streamlines tasks with automatic model and tokenizer selection
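A minimal example; "sentiment-analysis" is an illustrative task, and pipeline() will pick a default model and tokenizer for it:

```python
from transformers import pipeline

classifier = pipeline("sentiment-analysis")
print(classifier("I loved this movie!"))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```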
What is full fine-tuning?
All of the model's weights are updated
Computationally expensive
What is partial fine-tuning?
Only task-specific layers are updated; some layers are fixed
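One common way to get partial fine-tuning (as opposed to full fine-tuning, where every weight is updated) is to freeze the pre-trained encoder and train only the task-specific head; a sketch, again assuming bert-base-uncased:

```python
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

# Freeze the pre-trained base model: these layers stay fixed.
for param in model.base_model.parameters():
    param.requires_grad = False

# Only the task-specific classification head remains trainable.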
Define transfer learning in the context of fine-tuning.
Adapting a pre-trained model to a different but related task
What is zero-shot learning?
No examples are provided for the model to learn from
Fill in the blank: In one-shot learning, the model is provided with _______.
one example
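A hedged sketch of the difference in prompt form; the wording is illustrative, not taken from these cards:

```python
# Zero-shot: no examples, just the task.
zero_shot = "Classify the sentiment of this review: 'Great film!'"

# One-shot: exactly one solved example before the real input.
one_shot = (
    "Review: 'Terrible plot.' Sentiment: negative\n"  # the single example
    "Review: 'Great film!' Sentiment:"                # model completes this
)
```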
What does load_dataset() do?
Loads a dataset for fine-tuning
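A minimal example, assuming the Hugging Face datasets library; "imdb" is an illustrative dataset, not one named in these cards:

```python
from datasets import load_dataset

dataset = load_dataset("imdb")
print(dataset["train"][0])  # a dict with "text" and "label" fields
```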
What is the purpose of tokenization in machine learning?
Converts text into a format suitable for model training
What does the tokenizer function do to the input data?
Converts raw text into numeric tensors (token IDs, attention masks) for model input
What is the output of the tokenization process?
A dictionary containing input_ids, attention_mask, etc.
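A quick look at that output, using the same illustrative bert-base-uncased tokenizer (the exact keys vary by model):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoded = tokenizer("Great film!", return_tensors="pt")

# For this model: input_ids, token_type_ids, attention_mask.
print(encoded.keys())
```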
What does the tokenize_function do?
Applies tokenization to text data
What is the difference between tokenizing in batches and tokenizing row by row?
Batch tokenization processes multiple rows per call (typically much faster), while row-by-row tokenization processes one example at a time
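A sketch combining the last few cards, with the same illustrative bert-base-uncased and imdb choices as above:

```python
from datasets import load_dataset
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
dataset = load_dataset("imdb")

def tokenize_function(examples):
    # Pad/truncate so every row yields equal-length tensors.
    return tokenizer(examples["text"], padding="max_length", truncation=True)

# batched=True passes many rows per call (fast); batched=False
# would call tokenize_function once per row.
tokenized = dataset.map(tokenize_function, batched=True)
```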