Summarizing Data and Deducing Probabilities Flashcards

1
Q

Which of the following is a univariate descriptive statistic that measures dispersion?

Covariance matrix

Inter-quartile range

Correlation

Covariance

A

Inter-quartile range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which of the following is a multivariate descriptive statistic?

Inter-quartile range

Covariance matrix

Correlation

Covariance

A

Covariance matrix

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which of these is the best description of Azure notebooks?

Data science virtual machine running on Azure in notebook format

Azure’s special serverless notebook IDE for Python

Hosted Jupyter notebooks on an Azure VM

A

Hosted Jupyter notebooks on an Azure VM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which of the following are Python libraries that can help you work with data?

A. NumPy
B. Pandas
C. Statsmodel
D. Scipy

B and C only

A, B, C and D

A, B, and C only

A and B only

A

A, B, C and D

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which probabilities do you need to know to apply Bayes’ rule?

A. A priori probabilities

B. Independent probabilities

C. Conditional probabilities

D. Dissimilar probabilities

C and D only

A and C only

A and B only

A

A and C only

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is the classification algorithm that uses Bayes rule called Naive?

It is overly simplistic in its predictions

It makes strong assumptions about the independence of features

It takes interactions between variables into account

A

It makes strong assumptions about the independence of features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which of the following charts is most appropriate for visualizing univariate distribution of data?

Strip or Swarm plot

Histogram

Pie chart

Line chart

A

Histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Which of the following charts is most appropriate for finding a smooth probability distribution from a dataset?

Line chart

KDE plot

Histogram

Gantt chart

A

KDE plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the standard deviation of a series of constant data?

1

0

-1

Infinite

A

0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How are variance and standard deviation linked to each other?

Given N data points, variance is computed using N as the denominator, while standard deviation is computed using N – 1 as the denominator

Variance is the square root of standard deviation

Given N data points, variance is computed using (N – 1) as the denominator, while standard deviation is computed using N as the denominator

Variance is the square of standard deviation

A

Variance is the square of standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly