Building Features from Nominal and Numeric Data in Microsoft Azure Flashcards

1
Q

A distrubtion of data refers to which of the following?

The skewness of that data

The density of the data

The shape it is in when you graph it

The mean value of that dataset

A

The shape it is in when you graph it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Much of the field of statistics is predicated on understanding what distribution?

Gaussian

Bernoulli

Exponential

Poisson

A

Gaussian

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The measure of the thickness of the tails of a distribution is known as what?

Mesokurtic

Platykurtic

Kurtosis

Leptokurtic

A

Kurtosis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

There are four core steps in the machine learning process. What are they and what is the order of that process?

Model, Source, Wrangle, Production

Source, Model, Wrangle, Production

Wrangle, Source, Model, Production

Source, Wrangle, Model, Production

A

Source, Wrangle, Model, Production

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A rare chance of occurrence within a dataset is often referred to as what?

Imputation

Outlier

Kurtosis

Platykurtic

A

Outlier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In Python, missing values often have which entry?

EMPTY

SAN

NAN

NULL

A

NAN

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

If the distribution is not Gaussian or the standard deviation is very small, which common scaler might be your best option?

Robust Scaler

Normalizer

Standard Scaler

Min-Max Scaler

A

Min-Max Scaler

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The process of binning manually based on your own personal insight of the data and setting ranges we would like to bin our data into is referred to as what?

Fixed-Width Binning

Detailed Binning

Binning by Instinct

Quantile Binning

A

Binning by Instinct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Whole numbers that can’t be divided are known as what?

Discrete

Diminished

Qualitative

Continous

A

Discrete

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which type of scale is used for labeling data?

Ratio

Nominal

Numeric

Discrete

A

Nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly