STATS T1 Flashcards

(57 cards)

1
Q

Categorical variable

A

Variable with scores that are not on a numeric scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Descriptive statistics –

A

Summarise samples – giving someone the main points in a simple form To describe data, we will use graphical and numerical (statistical) techniques

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Inferential statistics –

A

Examine patterns in the data and consider how much data we have You can then draw conclusions about a population based on the analysis of a sample. -> conceptual replication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Summarising

A

collecting and summarising data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Statistical inference

A

the ability to draw general conclusions from samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How many times does a particular score occur?

A

Percentages/Averages Scores for a particular variable (Frequency statement)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Do scores for one variable correlate with scores for the other variable?

A

Statement about association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How strong is the correlation or association between two variables?

A

Statement about association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Do I trust that there is a “genuine” association (relationship)?

A

Statement about relationship between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Frequency Distribution?

A

show scores in order and their frequency of appearance in the sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Negatively skewed

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Positively Skewed

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When not to describe the skew of data?

A

When we cannot put our scores in order , from lowest
to highest so when we are describing a categorical
variable with unordered categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Unimodal?

A

One major peak

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Bimodal

A

Two major peaks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q
A

Approximately symmetrical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How do outliers and the mean relate to each other?

A

Outliers are extreme values that differ from most values in the data set. Because all values are used in the calculation of the mean, an outlier can have a dramatic effect on the mean by pulling the mean away from the majority of the values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What happens to the mean, median and mode in a skewed distribution

A

in normal distributions, they all take on the same number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Why are histograms good?

A

effective visual summary of a variable’s central tendency and variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is a discrete, continuous, independent and dependent variable?

A

Discrete: variable that is limited (age, gender) Continuous: exists on a continuum basically infinite between highest/lowest IV: variable manipulated/changed to see whether it has an effect on the DV that might change because of the manipulation DV: variable that, though measured, is not being controlled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the role of measurement scales?

A

The numbers don’t necessarily say anything concrete about the objects measured <i>ex.: if I scored high on a test, but someone else scored lower, it’s not necessary because they remembered less even though the data might suggest it → we assume that they mean I remembered more</i>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the purpose of a frequency distribution?

A

Organising data into a meaningful order of how many times

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Which variable do I usually find on the X- and Y-Axis in histograms vs. line graphs?

A

histogram: dv-iv Line/Bar graph: iv-dv

24
Q

What is the mode, median, mean (+formulas)?

A

Mode: the highest point in the graph Median: 50th percentileMean: Sum of N/ N

25
If the mean is slightly larger what does it probably say about or distribution?
Positively skewed
26
When will the mean and the median be equal?
Symmetric distribution
27
The benefit of the mode is?
* Representing categorical data * More informative *But not very reflective of the remaining data set
28
The benefit of the median is?
Not affected by outliers Not stable in comparison and not useful to calculation
29
What does central tendency refer to?
is a statistical measure that identifies a single value as representative of an entire dataset, indicating where the center of the data lies.
30
What is the advantage of a bar chart?
* Comparing categories * Mirrors other visualisation techniques were the spread is along the X-axis and the frequency or percentage is along the Y-axis already hints at modality and skewness
31
What is an alternate name for the y-axis/x-axis?
ordinate/abscissa
32
Suppose you sell ice cream with three different flavours: chocolate, strawberry and yogurt. The ice cream flavours are measured on a ____________ level. You sell ice cream to children, adults and elderly people. These age groups are measured on a ____________ level.
nominal; ordinal
33
operational definition
defining a variable in terms of the set of steps or procedures that the researcher goes through in order to manipulate or measure the variable
34
right skewed
positively skewed
35
What does a negatively skewed distribution reveal?
A lot of people got close to the maximum score
36
What does central tendency mean?
average score
37
Age in months is an example of a variable with a ratio scale of measurement. Select one: True False
T
38
What are two ways to visually represent to measurement data variables?
1. scatter plots 2. contingency tables/crosstabulation
39
What is a way to visually represent a mix of categorical and measurement data?
compound histogram
40
What is a way to visually represent categorical data pairs?
crosstabulation
41
What are the groupings of scores in histograms called?
bins
42
Do these images show the same data?
Yes
43
Which visual representation should you choose if you want to show that variables vary simuntaneously?
scatter-plots
44
What does a boxplot do?
summarises the data while showing the range, interquartile range, as well as the min, max and the median
45
When is the mean most useful?
best for interval/ratio measurement data (categorical data can hardly be split into 2), needs equal spacing between adjacent values
46
What is the mode most useful for?
all but notably for nominal/ordinal categorical data because popular choice
47
Variables are
properties of objects that vary in the values that they take on
48
A score is
an individual value for a variable
49
Measurement data describes
scores on a numerical scale
50
Categorical data describes
scores not on a numerical scale
51
A Population describes
a complete set of scores that might be of interest
52
A Sample is
a sub-set of scores from a population which were obtained
53
A parameter is
a number that summarises the entire set of scores in a population
54
A statistic is
a number that summarises the scores in a sample
55
Descriptive Statistics...
summarise samples by presenting the main points in a simplified way
56
Inferential statistics...
examine patterns in the data and consider the amount of data
57
Ethnicity or political ideology are examples
nominal variables