CENTRAL TENDENCY VARIABILITY Flashcards

(36 cards)

1
Q

Do frequency distribution allow quantitative statements between distributions?

A

No, observations and counts are not enough to make measurable comparisons. Central tendencies (means) and dispersions of scores (variability) are best because most stats tests require them and they can be useful ont their own.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the definition of arithmetic mean?

A

The mean is a measure of central tendency. It is the sum of the scores divided by the number of scores.

It is the best single number for describing a group of scores; it gives the central tendency, or the typical value of a group of scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When can you calculate the mean of a population?

A

If you have access to all the scores of a population of interest, then the mean of that population can be represented by μ (mu).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Is the mean of a population set of scores a parameter?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the five properties of the mean?

A
  1. The mean is sensitive to the exact value of all scores in the distribution: you change one value, the mean changes.
  2. The sum of the deviations about the mean equals zero. Written algebraically this
    property becomes:
    ∑(Xi −MEAN(X barre en haut))=0

*Deviation means a quel point un score dévie/s’éloigne de la moyenne

That is why we say the mean is the balancing point in a distribution; values over and under mean cancel each other out.

  1. The mean is sensitive to extreme scores.

4.The sum of the squared deviations of all the scores about their mean is a minimum.
Stated algebraically,
∑(Xi − X)2 is a minimum.

5.Under most circumstances, out of all the measures used for central tendency (e.g. median), the mean is the least subject to sampling variation (differences between samples of a population). Dans le sens que les differences moyennes will not be THAT different from one another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a weighted mean?

A

The weighted mean is a type of average that gives different levels of importance (or “weight”) to different groups or data points. It ensures that larger or more important groups have a bigger influence on the overall average than smaller or less important ones.

two ways: when raw scores are available and when they aren’t (mean each group multiplied by number of scores in that group…)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

If you have 3 groups: group 1 has two scores, group 2, 3 scores and group 3, 5 scores. Can you sum up all their means and divide by the number of groups to give the overall mean?

A

No. You can only do this when the sample sizes (number of scores) are the same from group to group. When that is not the case you have to take into account the weight of each sample group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the median?

A

The scale value below which 50% of the scores fall. In other words, it is the
middle score of a distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the median in ungrouped odd and even scores?

A

In ungrouped scores (which is most often encountered) the median is the centermost score if the number of scores is odd. If the number of scores is even, the median is taken as the average of the two centermost scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is an important property of the median?

A

The median is not affected by extreme scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the mode ?

A

the most frequent score in the distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a unimodal distribution versus a bimodal distribution?

A

Unimodal: there is only one score that most frequently stands out.

Bimodal: distributions that has two modes (two common values)

keep in mind, possible for more….

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When is the mode most useful as a central tendency as a measure of central tendency?

A

a) there are relatively few values
b) the values are categories (ex: blue, green, blue, yellow…) - (not quantitative date - so you cat calculate the mean or find the median)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When is the mean = median = mode?

A

When the distribution is unimodal and symmetrical (bell-shaped curve)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Where can you find the median in a positive or negative skew?

A

Between the mode and mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

When is the mean never equal to the median?

A

When the distribution is skewed.

17
Q

What do measures of variability tell us?

A

How spread out the scores are in a distribution.

18
Q

What are 4 measures of variability (comment tes scores varient)?

A

Range, deviation score, variance, standard deviation.

19
Q

How do we calculate the range?

A

The difference between the highest and lowest score in the distribution

Range = (highest score – lowest score) + smallest measuring unit

example:
c. 1.2, 1.3, 1.5, 1.8, 2.3
Ranges = (2.3 – 1.2) + .1 = 1.2

20
Q

What does the deviation score tell us?

A

It tells how far away the raw score is from the mean of the distribution.

X-Mean. Remember,
sum (X-Mean) always = 0

21
Q

What is the variance?

A

The average of the squared distances of all the values from the mean.

Def des notes: an index that reflects the degree of variability in a group of scores. It does not have the limitations of the range

It is used only occasionally as a descriptive statistic since it is scaled in squared units.

22
Q

What is the difference between a sample variance and a population variance?

A

If you have access to all the scores of a population of interest, then the variance can be represented by o2 and not s2. The sum of squares (SS) is also divided by N rather than n-1 (Degrees of freedom (df)). This is because we are not estimating anymore, we are not leaving any room for the rest of the population.

23
Q

The sample mean estimates….

A

The population mean

24
Q

The sample variance estimates…

A

The population variance

25
le u = population mean. Hoe else can we call it?
The true mean
26
What is the pooled variance?
The average variance of several groups. Beware, just like the weighted mean, we can't sum up the variances form each group and then divide that number by the number of groups we have. We have to take into account the size of each group. We can only do that if the sizes of the groups are the same (le nombre de valeurs est le meme). Makes sense perch écoute si ta plus de valeurs dans un groupe, eh bien son average est surement plus representative, alors faut tenir ca en compte, porch il reste que le but est de estimate the population mean!!!!!!! In other words, making the pooled variance a more accurate reflection of the overall variability across all groups. Imagine you're comparing two groups: Group 1: The heights of 5 kids in one class. Group 2: The heights of 5 kids in another class. Now, you want to compare the variability (how spread out the heights are) in these two groups.
27
Can we calculate the pooled variance of a population?
No because pooled variances are used when we have many samples of one population. When you have a population (meaning you know the full set of data points), there is no need for pooling, because you already have all the data and can calculate the variance directly from the population itself.
28
Ca c'est juste un example pour que tu comprennes mieux les groupes writhin a population and the need of pooled variance:
Example 2: Comparing Treatment Effectiveness in Different Age Groups In another study, you might want to compare the effectiveness of a new depression treatment across two age groups: Group 1 (Young Adults): 20 young adults aged 18-30 who receive the treatment. Group 2 (Older Adults): 20 older adults aged 60-75 who receive the treatment. Again, both groups come from the same population (adults who need treatment for depression). You calculate the variance in how much depression scores improved in each group. Group 1 (Young Adults): Calculate the variance of depression score changes in this group. Group 2 (Older Adults): Calculate the variance of depression score changes in this group. Now, you want to get a pooled variance to estimate the overall variability of the treatment’s effect across both age groups, and this would give you a better, more combined picture of how the treatment works for people in both groups.
29
What is the standard deviation?
It is the square root of the variance
30
Why is the standard deviation a better measure of variability than variance?
Because it is in the original unit of measurement (not squared).
31
What are three ways the variance can be calculated?
Variance of the sample, variance of the population and variance of several groups (samples).
32
What does the coefficient of variation indicate? How do we calculate it?
The coefficient of variation (CV) is like a percentage that tells you how big the standard deviation is compared to the mean (average). Formula: variance/mean x 100.
33
What does the variance tell us? What does the standard deviation tell us?
Variance = 8: This tells us about the overall spread of the data, but it’s in squared units, which is harder to interpret. Standard Deviation ≈ 2.83: This is the average distance of each number from the mean (in the same units as the data, which makes it easier to understand). So, the data points (2, 4, 6, 8, 10) are, on average, about 2.83 units away from the mean (6).
34
What are the 6 properties of s2 (variance) and s (standard deviation)?
1.Sum of the squared deviations about the mean is less than about any other value (property of the mean remember!!) 2.Numbers are always positive because squaring of negative numbers results in positive values 3. A large deviation from the mean contributes disproportionately to the total (4 squared =16 but 8 squared= 64) 4. As the variability in the distribution increases, the statistical variance also increases. (plus loin du mean, makes sense) 5. If all scores are in distribution are identical, the variance will equal zero. (Identical to the mean makes sense) 6.Under certain conditions, the variance can be partitioned and its portions attributed to different sources: MEANING: This statement means that the total variance in a dataset can be broken down into parts (or "portions") that come from different causes or sources. Partitioning variance helps researchers and analysts understand what’s causing variability in their data. For example: In psychology, you might study how much variance in happiness is due to genetics vs. life circumstances.
35
The smaller the standard deviation is (and variance), the...
closer the numbers are from here eachother
36
For your example: Mean = 50 kg SD = 5 kg It tells us that most weights in the dataset are within 5 kg above or below the mean (so between 45 kg and 55 kg, roughly). It’s a good estimate of how far the weights typically vary from the mean. CV= 10% This means the standard deviation (5 kg) is 10% of the mean (50 kg). It gives a sense of how large the variation is relative to the average weight. This is useful when comparing datasets with different scales or units.
xxx