1-3 exam terms and concepts Flashcards by Talia Aguilar

What is the definition of data?

Numbers in context

What is data analysis?

It tries to summarize and explain the data

What are variables in statistics?

They are the characteristics being measured that differ from one person or thing to another

What is a data set/sample?

It’s a collection of data

What is a sample?

It is a subset of the whole population

What are the 2 main types of variables?

Numerical and categorical

What is numerical data?

Data that involves numbers such as heights or ages

What is categorical data?

Data that involves qualities such as color or location

What is the difference between stacked and unstacked data?

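Stacked data puts each individual in a row and each variable in a column, so the category is recorded in its own column, often coded as 0 or 1 (ex: male = 1, female = 0). Unstacked data puts each group's values in a separate column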
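
The two layouts can be sketched in a few lines of Python. This is only an illustration: the heights are made-up values, and the names `stacked`, `unstack`, and `unstacked` are hypothetical, not from the course.

```python
# Hypothetical heights (inches); the values are made up for illustration.
# Stacked: one row per individual, with the group coded in its own column.
stacked = [
    {"height": 70, "male": 1},
    {"height": 64, "male": 0},
    {"height": 72, "male": 1},
    {"height": 66, "male": 0},
]

def unstack(rows, value_key, group_key):
    """Split stacked rows into one list of values per group code."""
    groups = {}
    for row in rows:
        groups.setdefault(row[group_key], []).append(row[value_key])
    return groups

# Unstacked: one collection of values per group.
unstacked = unstack(stacked, "height", "male")
print(unstacked)  # {1: [70, 72], 0: [64, 66]}
```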

What type of experiment is least likely to prove a correlation?
A) Anecdotes
B) Observational
C) Controlled

What is a distribution?

A way of displaying data where each value has a frequency

What’s the difference between a histogram and a relative frequency histogram?

A histogram shows the raw count in each bin, while a relative frequency histogram divides each bin's count by the total number of entries. Ex: a bin holding 4 of 11 entries is displayed as 4/11 ≈ 0.36
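
The division can be checked with a short Python sketch. The bin counts are hypothetical (chosen so they add up to 11, matching the 4-out-of-11 example), and `relative_frequencies` is an illustrative helper name.

```python
def relative_frequencies(counts):
    """Divide each bin count by the total number of observations."""
    total = sum(counts)
    return [count / total for count in counts]

# Hypothetical bin counts that add up to 11 observations.
counts = [2, 4, 3, 2]
rel = relative_frequencies(counts)
print([round(r, 2) for r in rel])  # [0.18, 0.36, 0.27, 0.18]
```

The relative frequencies always add up to 1, which is why the shape of the histogram stays the same and only the vertical scale changes.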

What is the formal definition of an outlier?

Haha psych, there isn’t one

How can you tell if a data set is left skewed?

If the “tail” goes to the left

How can you tell if a data set is right skewed?

If the “tail” goes to the right

What is the difference between a bar chart and a histogram?

A bar chart has gaps between the bars and a histogram does not

What makes a bar chart “pareto”?

If the bars go from largest to smallest, aka it goes downhill
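
As a rough sketch, the bar order for a Pareto chart can be produced with Python's `collections.Counter`. The color data below is made up for illustration.

```python
from collections import Counter

# Hypothetical categorical observations.
colors = ["red", "blue", "red", "green", "red", "blue"]

# most_common() returns (category, count) pairs from largest count to smallest,
# which is the left-to-right bar order of a Pareto chart.
pareto_order = Counter(colors).most_common()
print(pareto_order)  # [('red', 3), ('blue', 2), ('green', 1)]
```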

What does a z-score measure?

How many standard deviations a data point is away from the mean
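
A minimal Python sketch of the calculation, z = (value - mean) / standard deviation. The data set is hypothetical, and the sketch assumes the sample standard deviation (n - 1 divisor); a course that uses the population version would get a slightly different number.

```python
import statistics

def z_score(x, data):
    """Number of standard deviations that x lies from the mean of data."""
    mean = statistics.mean(data)
    sd = statistics.stdev(data)  # sample standard deviation (n - 1 divisor)
    return (x - mean) / sd

# Hypothetical data with a mean of 5.
data = [2, 4, 4, 4, 5, 5, 7, 9]
z = z_score(9, data)
print(round(z, 2))  # 1.87
```

A value equal to the mean has a z-score of 0, and values below the mean have negative z-scores.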

What are the best ways to show data if it is categorical?

With a bar chart, pareto chart, or pie chart

What are the best ways to show data if it is numerical?

With a dotplot, histogram, or stemplot

What does standard deviation measure?

It measures the spread of the data around the mean

What does the IQR measure?

It measures the spread of the middle 50% of the data

Is standard deviation or IQR a better measure of spread for a skewed data set?

IQR
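
One way to see why: a single extreme value changes the standard deviation a lot but can leave the IQR untouched. The sketch below uses made-up data and a hypothetical helper `iqr`; it relies on `statistics.quantiles` with its default method, and other quartile conventions can give slightly different quartiles.

```python
import statistics

def iqr(data):
    """Interquartile range: Q3 minus Q1."""
    q1, _median, q3 = statistics.quantiles(data, n=4)
    return q3 - q1

# Hypothetical data: the second list swaps the largest value for an extreme one.
symmetric = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]
skewed = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 100]

sd_change = statistics.stdev(skewed) - statistics.stdev(symmetric)
iqr_change = iqr(skewed) - iqr(symmetric)
print(sd_change, iqr_change)  # the SD jumps, the IQR does not move
```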

Is standard deviation or IQR a better measure of spread for a symmetrical data set?

Standard deviation

How do you find the upper boundary for outliers?

Q3 + 1.5(IQR)

How do you find the lower boundary for outliers?

Q1 - 1.5(IQR)
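
Both boundaries can be computed together. This is a sketch with made-up data; `outlier_fences` is an illustrative name, and the quartiles come from `statistics.quantiles` with its default method, which may differ slightly from the quartile rule used in a particular textbook.

```python
import statistics

def outlier_fences(data):
    """Return (Q1 - 1.5*IQR, Q3 + 1.5*IQR) for the data."""
    q1, _median, q3 = statistics.quantiles(data, n=4)
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

# Hypothetical data with one suspiciously large value.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 30]
lower, upper = outlier_fences(data)

# Anything below the lower boundary or above the upper boundary is flagged.
outliers = [x for x in data if x < lower or x > upper]
print(lower, upper, outliers)  # -6.0 18.0 [30]
```

Here Q1 = 3 and Q3 = 9, so the IQR is 6 and the boundaries sit 9 units beyond each quartile.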

What are the 5 numbers needed for a five number summary?

Minimum, Q1, median, Q3, maximum (on a boxplot, the whiskers stop at the lowest and highest numbers that are not outliers)

Is a lower or a higher z-score more unusual?

A higher z-score (in absolute value), because that means the data point is farther from the mean

Fill in the blank: For a distribution that is skewed __________, the mean tends to be larger than the median

right
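
A quick check with made-up numbers: one very large value in the right tail pulls the mean up but barely affects the median. The `incomes` list is hypothetical.

```python
import statistics

# Hypothetical right-skewed data: most values are small, one is very large.
incomes = [30, 32, 35, 38, 40, 45, 200]

mean = statistics.mean(incomes)      # 420 / 7 = 60
median = statistics.median(incomes)  # middle value of the sorted list = 38
print(mean > median)  # True
```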

Fill in the blank: The _________________ measures the average distance of the observations from the mean

Standard deviation

Fill in the blank: The interquartile range (IQR) is the best measure of variation for a skewed distribution, because it tells us how much space the middle _____% of the data occupy.

50%

Fill in the blank: The _______ describes the "typical" value in a qualitative data set

mode
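
For qualitative data the mode is simply the most frequent category. A small sketch with made-up eye colors, using `statistics.mode` from the Python standard library:

```python
import statistics

# Hypothetical qualitative (categorical) data.
eye_colors = ["brown", "blue", "brown", "green", "brown", "blue"]

typical = statistics.mode(eye_colors)
print(typical)  # brown
```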

Fill in the blank: To make cause-and-effect conclusions about relationships, a researcher needs to set up a ____________ experiment.

Controlled

1-3 exam terms and concepts Flashcards

(33 cards)