Unit 1 Flashcards

(47 cards)

1
Q

what is the study of statistics?

A

is the set of methods for obtaining, organizing, summarizing, presenting and analyzing data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is data?

A

a set of measurements or observations taken on a group of objects
ex. the people taking the survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is a population?

A

the totality of individuals or units about which we want information
ex. all people in manitoba taking the survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is a variable?

A

a characteristic or property of an individual or unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are some examples of variables?

A

hair color
height
your grade in this course
marital status

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is a sample?

A

a subset of units in a population that we examine in order to gather information about the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what does categorical ordinal mean?

A

data follows a natural order and the order makes sense

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what are examples of categorical nominal?

A

gender (female or male)
marital status (married, widowed, divorced)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what are quantitative variables?

A

have values that are a count or are obtained by measurement
it makes sense to take the average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what are examples of quantitative variables?

A

distance ran in 45 mins
measurement in cm of all the students in this classroom
square footage of your house

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what does the distribution of a variable tell us?

A

what values it takes and how often it takes on these values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what type of charts can we use with categorical variables?

A

bar charts
pie charts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what type of charts can we use with quantitative variables?

A

histograms
timeplots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the difference between a bar chart and a histogram?

A

in a bar chart the bars don’t touch
in a histogram the bars do touch

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what do pie charts give us a visual representation of?

A

relative frequency
proportion of the observed values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what does a frequency distribution table look like

A

31 37 40 44 49 50 51 53 56 56
62 64 67 67 68 68 69 70 71 72
73 73 74 75 77 78 78 81 82 84

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what are the two types of quantitative variables?

A

continuous variable
discrete variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what values can a continuous variable take?

A

any value within a given range
ex. weight and distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what values can a discrete variable take?

A

only take a countable number of values
ex. number of children in a family and the number of days of rain in a month

20
Q

what do we look for in respect to histograms? (5)

A

shape
any gaps
peaks (center)
spread (how variable the values of the data are)
outliers (observations that fall away from the overall pattern)

21
Q

what are the 3 types of shapes a histogram can have?

A

approximately symmetric
skewed to the left
skewed to the right

22
Q

why would we use a timeplot?

A

if we gather data that comes to us in a sequence over a period of time

23
Q

what is a trend?

A

a time series with a persistent long term rise or fall

24
Q

what is seasonal variation?

A

a pattern that repeats itself at certain intervals

25
what do we use to measure the centre of our data?
we use a measure of central tendency
26
what are the two measures of central tendency?
mean median
27
mean
average value
28
median
in a set of ordered data the median is the value that splits the data into two equal parts
29
what is the sum of deviations always equal to?
0
30
what is an outlier?
a point that falls far away from the majority of the data
31
is the median robust (resistant) or not robust (not resistant) to outliers?
median is robust to outliers (not affected) mean is affected by outliers
32
when does symmetric distribution occur?
when the mean and median are equal (exactly symmetric is ideal)
33
when does skewed to the left occur?
when the mean is less than the median
34
when does skewed to the right occur?
when the mean is greater than the median
35
how do we calculate the measure of spread?
by using the range
36
what is the range?
R a measure of spread and is simply calculated as maximum-minimum
37
what are characteristics about range? (3)
the larger the value of R the more variable the data are R measures the length of the interval containing 100% of the data Range is affected by outliers
38
what is IQR (interquartile range)?
Q3 - Q1
39
when do we use the five number summary?
when describing our distributions with numbers
40
what does the five number summary consist of?
minimum first quartile (Q1) median third quartile (Q3) maximum
41
what does a boxplot consist of?
rectangle that is formed by using the quartiles and whiskers extending from the rectangle to the maximum and minimum values
42
what is the standard deviation?
written as s measure of spread around the mean
43
what is the variance?
written as s^2 the square of the standard deviation
44
what is degrees of freedom?
n-1 in the denominator
45
when do we use deviation as a measure of spread?
when x̄ is the measure of centre
46
when does the standard deviation equal zero?
when there is no spread about the mean
47
is standard deviation affected by outliers?
yes!