Data Visualization Flashcards

1
Q

What are bar plots?

A

Bar plots are the most common way of displaying the distribution of a qualitative (categorical) variable.

2
Q

How do we generate a bar plot?

A

import seaborn as sns
# Draw one bar per 'Maternal Smoker' category; bar height is the row count
sns.countplot(data=births, x='Maternal Smoker');
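
To make the bar heights concrete, here is a minimal sketch of the tally that a count-based bar plot displays. The list `maternal_smoker` is a made-up stand-in for the `'Maternal Smoker'` column of `births`, not real data.

```python
from collections import Counter

# Hypothetical stand-in for the births['Maternal Smoker'] column.
maternal_smoker = [False, True, False, False, True, False]

# One bar per category; the bar height is the number of rows in that category.
counts = Counter(maternal_smoker)

for category, count in counts.items():
    print(category, count)
```

Each category in `counts` corresponds to one bar, and its count is the height of that bar.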

3
Q

How do we generate a histogram?

A

import seaborn as sns
sns.histplot(data=births, x='Maternal Pregnancy Weight');

4
Q

What is skewed right?

A

If a distribution has a long right tail, we call it skewed right. The mean is typically to the right of (greater than) the median.
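
A small sketch of the mean-versus-median relationship, using a made-up sample (`values` is hypothetical, chosen so that one large value forms a long right tail):

```python
import statistics

# Hypothetical right-skewed sample: most values are small, one is very large.
values = [1, 2, 2, 3, 3, 3, 4, 4, 5, 30]

mean = statistics.mean(values)      # pulled upward by the large value
median = statistics.median(values)  # barely affected by the large value

print("mean:", mean, "median:", median)
```

The single large value drags the mean toward the tail while the median stays near the bulk of the data.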

5
Q

What is skewed left?

A

If a distribution has a long left tail, we call it skewed left. The mean is typically to the left of (less than) the median.

6
Q

What is unimodal?

A

A distribution with one clear peak (mode).

7
Q

What is bimodal?

A

A distribution with two peaks, even if one is taller; it's like two mountain peaks right next to each other.

8
Q

How do we plot a histogram for density?

A

# kde=True overlays a density curve; stat="density" scales the bars so their total area is 1
sns.histplot(data=births, x='Maternal Pregnancy Weight',
             kde=True, stat="density");
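
As a rough illustration of density scaling, the sketch below computes density-scaled bar heights by hand: each height is the bin's count divided by (number of points × bin width), so the bar areas add up to 1. The `weights` values and `bin_edges` are made up for the example and are not taken from `births`.

```python
# Hypothetical weights, standing in for births['Maternal Pregnancy Weight'].
weights = [100, 105, 110, 112, 118, 120, 125, 130, 135, 150]

bin_edges = [100, 110, 120, 130, 140, 150]  # five bins, each 10 wide
n = len(weights)

densities = []
for left, right in zip(bin_edges[:-1], bin_edges[1:]):
    is_last = right == bin_edges[-1]
    # Bins are half-open [left, right); the last bin also includes its right edge.
    count = sum(1 for w in weights if left <= w < right or (is_last and w == right))
    # Density height = count / (total points * bin width).
    densities.append(count / (n * (right - left)))

# Area of each bar is height * width; the areas together should be 1.
total_area = sum(
    d * (right - left)
    for d, left, right in zip(densities, bin_edges[:-1], bin_edges[1:])
)

print(densities)
print(total_area)
```

Because the heights are divided by the bin width, it is the bar areas that represent proportions of the data, not the bar heights.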

9
Q

How do we plot a violin plot?

A

sns.violinplot(data=births, x='Maternal Smoker', y='Birth Weight')

10
Q

What is KDE?

A

Kernel Density Estimation (KDE) is used to estimate a probability density function (or density curve) from a set of data.
Just like a histogram drawn with density scaling, the total area under a density curve must equal 1.
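
A minimal sketch of the idea, assuming a Gaussian kernel (one common choice) and a hand-picked bandwidth: place a small bell curve on each data point, average them, and check numerically that the area under the result is about 1. The helper `gaussian_kde`, the `data` values, and the bandwidth are all hypothetical choices for illustration.

```python
import math

def gaussian_kde(data, bandwidth):
    """Return a function that estimates the density as an average of Gaussian bumps."""
    n = len(data)

    def density(x):
        total = 0.0
        for xi in data:
            z = (x - xi) / bandwidth
            # Gaussian bump centered on the data point xi, with width `bandwidth`.
            total += math.exp(-0.5 * z * z) / (bandwidth * math.sqrt(2 * math.pi))
        return total / n

    return density

data = [2.0, 3.0, 3.5, 6.0, 7.0]  # hypothetical observations
density = gaussian_kde(data, bandwidth=1.0)

# Approximate the area under the curve with a Riemann sum over [-10, 20).
step = 0.01
area = sum(density(-10 + i * step) * step for i in range(3000))

print(round(area, 3))
```

A smaller bandwidth gives a bumpier curve and a larger one gives a smoother curve; in either case the area stays at 1 because each bump has area 1 and the bumps are averaged.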
