{ "@context": "https://schema.org", "@type": "Organization", "name": "Brainscape", "url": "https://www.brainscape.com/", "logo": "https://www.brainscape.com/pks/images/cms/public-views/shared/Brainscape-logo-c4e172b280b4616f7fda.svg", "sameAs": [ "https://www.facebook.com/Brainscape", "https://x.com/brainscape", "https://www.linkedin.com/company/brainscape", "https://www.instagram.com/brainscape/", "https://www.tiktok.com/@brainscapeu", "https://www.pinterest.com/brainscape/", "https://www.youtube.com/@BrainscapeNY" ], "contactPoint": { "@type": "ContactPoint", "telephone": "(929) 334-4005", "contactType": "customer service", "availableLanguage": ["English"] }, "founder": { "@type": "Person", "name": "Andrew Cohen" }, "description": "Brainscape’s spaced repetition system is proven to DOUBLE learning results! Find, make, and study flashcards online or in our mobile app. Serious learners only.", "address": { "@type": "PostalAddress", "streetAddress": "159 W 25th St, Ste 517", "addressLocality": "New York", "addressRegion": "NY", "postalCode": "10001", "addressCountry": "USA" } }

DS Foundations Part 2 Flashcards

(23 cards)

1
Q

What is the purpose of data visualization?

A

To communicate patterns, relationships, and insights effectively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When should you use a bar chart?

A

To compare quantities across categories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When is a pie chart misleading?

A

When comparing many categories or values with small differences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the best chart for time series data?

A

A line chart.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a heatmap useful for?

A

Visualizing correlation matrices or high-dimensional categorical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the four levels of measurement?

A

Nominal, ordinal, interval, and ratio.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What distinguishes interval from ratio data?

A

Ratio data has a true zero; interval data does not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Is temperature in Celsius interval or ratio?

A

Interval — because zero doesn’t represent absence of heat.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Is weight a ratio scale?

A

Yes — it has a true zero and equal intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is sampling?

A

Selecting a subset of a population for analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is selection bias?

A

When the sample is not representative of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a random sample?

A

Each individual has an equal chance of being selected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is stratified sampling?

A

Dividing population into subgroups and sampling from each.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a convenience sample?

A

A non-random sample taken from a readily available group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Simpson’s paradox?

A

A trend appears in groups but disappears or reverses when groups are combined.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why is domain knowledge important in data analysis?

A

It helps interpret results and choose relevant features.

17
Q

What is confounding?

A

When a third variable affects both the independent and dependent variables.

18
Q

Why does correlation not imply causation?

A

Because the relationship may be coincidental or due to a third variable.

19
Q

What does ‘garbage in, garbage out’ mean?

A

Poor quality input data leads to unreliable analysis or results.

20
Q

What is metadata?

A

Data that describes other data, like column types or source.

21
Q

Why is metadata important?

A

It provides context, improves transparency, and aids reproducibility.

22
Q

What is data provenance?

A

A record of the origin, lineage, and transformation of data.

23
Q

Why is tracking data provenance valuable?

A

It supports auditing, trust, and reproducibility.