week 11 - visual digital turn Flashcards

(30 cards)

1
Q

Quantification & The
Humanities
* Mainly limited by

A

Compute
▪ Digitizatio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Gold and Klein1 define DH as the following

A

digital archives, quantitative analysis, and tool-building
projects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

AI

A

Consist of systems that can handle given tasks on their
own

  • search algorithms, rule based system
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

ML

A

omputer Algorithms that automate actions without
explicit programming. Can learn and improve

Clustering algorithms, K-Means, Gradient Boosting, PC

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

DEEP LEARNING

A

Deep Learning is part of ML and makes use of ‘Deep’
Neural Networks

Convolutional Neural Nets, Recurrent Neural Networks,
LLM’s (ChatGPT)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Supervised Machine Learning

A

Model learns by example
* Human labelled datasets
* Model learns based on examples
* Measures how ‘wrong’ it is and parameters get adjusted.
* Parameters start of randomly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Unsupervised Machine
Learning

A

Does not rely on labelled data
* Aims to identify structures or patterns in data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Training a Network - STEPS

A
  1. Data goes through the model.
  2. Calculate how ‘wrong’ the model is from the labels.
  3. Calculate what parts in a network need to be adjusted to
    perform slightly better.
  4. Change the parameters
  5. Repeat until model no longer improves.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Gradient Descent

A

: ‘Direction’ to reduce ‘wrongness’.

It helps AI models learn better by slowly improving.

It’s used in training almost all machine learning models (like ChatGPT, image recognition, etc.).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

CNNs (Convolutional Neural Networks):

A

Specialized for image processing

Identify features like edges, shapes, textures

Use layers of convolutions to detect increasingly complex patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Wevers and Smits

A

focus on the potential of Computer Vision in the
Humanities.
* Shows three different ways to use CNN

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Wevers and Smits highlight three simple uses of CNNs in the humanities

A

Detecting
Medium (1)

Clustering
with CNNs (2)

Your own
classifier (3)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

detecting medium

A

They used CNNs to tell the type of media in an image — like whether it’s a photo, drawing, or printed ad. This helps researchers sort and study different kinds of visual content in archives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
  1. Clustering with CNNs
A

They grouped images that look visually similar (like sorting ads with the same layout or style) by using the features from a pre-trained CNN. This helps spot visual trends and design patterns in things like advertisements

*used pre trained CNN

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
  1. Your Own Classifier
A

They trained a custom CNN to recognize recurring image types in newspapers (like weather icons or political sketches). With only a few examples, the model got good at picking out these common image categories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What has traditionally been the main focus of Digital Humanities?
A) Visual content analysis
B) Text-based analysis using OCR and computational tools
C) Audio and video processing
D) Network security

17
Q

What is meant by the “visual turn” in Digital Humanities?
A) Moving from text analysis to also include visual data
B) Focusing only on photographs in archives
C) Using VR technology for humanities research
D) Ignoring textual data completely

18
Q

Which type of machine learning uses labeled data to train models?
A) Supervised Learning
B) Unsupervised Learning
C) Reinforcement Learning
D) Deep Reinforcement Learning

19
Q

What is a Convolutional Neural Network (CNN) primarily used for?
A) Text translation
B) Image processing and pattern recognition
C) Audio synthesis
D) Database management

20
Q

In Wevers & Smits’ study, what shift did the CNN detect in newspaper images after 1900?
A) More illustrations than photos
B) A surge in photo usage, overtaking illustrations by 1925
C) No change in image types
D) A decline in image usage altogether

21
Q

What does clustering with CNNs involve?
A) Grouping visually similar images based on extracted features
B) Labeling images with text metadata
C) Translating images into text descriptions
D) Removing noise from images

22
Q

Which dataset did Wevers & Smits use to analyze advertisements?
A) CHRONIC
B) SIAMESET
C) ImageNet
D) MNIST

23
Q

. What is a limitation of clustering CNNs mentioned in the study?
A) It can’t process color images
B) It struggles with abstract or text-heavy ads
C) It requires millions of labeled images
D) It only works on handwritten documents

24
Q

How many labeled images did Wevers & Smits use to retrain their custom classifier?
A) About 100
B) About 500
C) About 10,000
D) Over 100,00

25
What is one major challenge in using CNNs for humanities research? A) CNNs cannot process black and white images B) Training data bias can affect results C) CNNs only work on video data D) Lack of computing power
b
26
What is the main focus of your research on colonial Korean print shops? ) Analyzing the sound quality of radio broadcasts B) Studying how the visual style of printed texts changed over time C) Translating Korean texts into English D) Mapping geographic locations of print shop
b
27
Which question is central to understanding print shops’ design choices? A) How did they adapt to historical events? B) How fast were their printing machines? C) What languages did they print in? D) How many workers did each shop have?
a
28
What types of changes are you trying to detect in the printed texts? A) Changes in paper quality B) Changes in fonts, layouts, or printing style C) Changes in ink composition D) Changes in shipping routes
b
29
. How large is the dataset you are working with from Korean newspapers? A) About 1,000 pages B) Over 150,000 pages C) 10,000 pages D) 500 pages
b
30
Which tool do you plan to use for parsing the layout of newspaper pages? A) OCR Only B) SAM (Segment Anything Model) C) Manual inspection only D) Excel spreadsheets
b