{ "@context": "https://schema.org", "@type": "Organization", "name": "Brainscape", "url": "https://www.brainscape.com/", "logo": "https://www.brainscape.com/pks/images/cms/public-views/shared/Brainscape-logo-c4e172b280b4616f7fda.svg", "sameAs": [ "https://www.facebook.com/Brainscape", "https://x.com/brainscape", "https://www.linkedin.com/company/brainscape", "https://www.instagram.com/brainscape/", "https://www.tiktok.com/@brainscapeu", "https://www.pinterest.com/brainscape/", "https://www.youtube.com/@BrainscapeNY" ], "contactPoint": { "@type": "ContactPoint", "telephone": "(929) 334-4005", "contactType": "customer service", "availableLanguage": ["English"] }, "founder": { "@type": "Person", "name": "Andrew Cohen" }, "description": "Brainscape’s spaced repetition system is proven to DOUBLE learning results! Find, make, and study flashcards online or in our mobile app. Serious learners only.", "address": { "@type": "PostalAddress", "streetAddress": "159 W 25th St, Ste 517", "addressLocality": "New York", "addressRegion": "NY", "postalCode": "10001", "addressCountry": "USA" } }

DS Foundations Part 4 Flashcards

(23 cards)

1
Q

What is the law of large numbers?

A

As sample size increases, sample statistics converge to population parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a population in statistics?

A

The entire group that you want to draw conclusions about.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a sample in statistics?

A

A subset of the population used to make inferences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why do we use samples instead of populations?

A

Because it is often impractical or impossible to measure the whole population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is confirmation bias?

A

The tendency to seek or interpret data in a way that confirms existing beliefs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is survivorship bias?

A

Focusing only on ‘successful’ cases and ignoring those that didn’t survive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is anchoring bias?

A

Relying too heavily on the first piece of information encountered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Dunning-Kruger effect?

A

When people with low ability overestimate their competence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why are consistent units important in data analysis?

A

Inconsistent units can lead to incorrect comparisons or aggregation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is derived data?

A

Data calculated from raw measurements, such as ratios or rates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why convert all values to common scale before analysis?

A

To ensure fair comparisons and model compatibility.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is framing in data analysis?

A

The way a question or result is posed, which can influence interpretation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why does reframing a question matter?

A

Different framing can lead to different insights or decisions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is cherry-picking in data communication?

A

Selecting only results that support a desired narrative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is data accessibility?

A

The ease with which data can be retrieved and used by those who need it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why is documentation essential for usability?

A

It helps users understand structure, purpose, and limitations.

17
Q

What is data discoverability?

A

How easily data can be found and identified for a purpose.

18
Q

What is data governance?

A

Policies and processes for managing data availability, usability, integrity, and security.

19
Q

What is anonymization?

A

Removing identifying information to protect privacy in a dataset.

20
Q

What is the difference between anonymization and pseudonymization?

A

Anonymization is irreversible; pseudonymization can be reversed with a key.

21
Q

What is exploratory thinking?

A

Asking open-ended questions to investigate data and generate hypotheses.

22
Q

What is explanatory thinking?

A

Explaining observed phenomena using tested relationships and models.

23
Q

What is iteration in data work?

A

Revisiting steps (e.g., cleaning, modeling) based on new findings or feedback.