exam 2 Flashcards

1
Q

Why do we have reliability and variability?

A

In order to ensure that data collected IS WHAT WE THINK THER ARE, or what we use to measure IS IN FACT MEASUING DATA.
because INTERPRETATION OF DATA HAS CONSEQUENCES & All statistics IS MEANINGLESS unless we are confident that we know what we are looking at

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a measurement scale?

A

The assignment of values to outcomes following a set of rules. PARTICULAR LEVELS AT WHICH OUTCOMES ARE BEING MEASURED.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 4 different types of measurement scale?

A

nominal, ordinal, interval and ratio, always DEGREE OF ERROR, different PRECISENESS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Definition+example of a nominal mesurement scale

A

“names” (nominal- Latin). It is the LEAST PRECISE. DATA CAN ONLY BE CLASSIFIED.
MUTUALLY EXCLUSIVE CATEGORIES, outcome FIT IN ONE AND ONLY ONE.
Ex: Gender, Ethnicity, political affiliation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Definition+example of an ordinal mesurement scale

A

ORDER and the things being measured are ordered. DATA ARE RANKED Ex: your rank in class…. (We know that #1 is better than #2, but we don’t know by how much)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Definition+example of an interval mesurement scale

A

: MEANINGFUL DIFFERENCE BETWEEN VALUES. tool based on A CONTINUUM. Ex:temperature, dress size, metrics?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Definition+example of a ratio mesurement scale

A

MEANINGFUL 0 POINT & RATIO between VALUES.
EX: Number of boys versus 0 degrees
0 boys would be the ABSOLUTE VALUE thus the RATIO, since there are actually no boys present in the room

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the 4 general scales rules?

A
  • Any outcome can be assigned to one of 4 scales of measurement.
  • ORDER FROM LEAST TO MOST PRECISE ( nominal-ratio)
  • HIGHER SCALE ARE MORE INFORMATIVE AND PRECISE
  • THE MORE PRECISE, contains ALL qualities of scale BELOW
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is reliability?

A

test used as a measurement tool is measuring something CONSISTENTLY. we should be able to use the test TIME after TIME and get SIMILAR RESULTS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the two test score element;

A

*observed score (score obtained i.e. 94.5% in my stats first exam) versus
* true score (97%)
Observed score= true score+ error score (reason why test score vary from being 100% true)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the first form of reliability? example

A

1)TEST-RETEST: examine if test is reliable over TIME, helps examine changes over time.
if conditions are the same, results should be the same.
ex: same exam taken different results each time its taken=not reliable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the third form of reliability? example

A

3)Internal CONSISTENCY: extent to which a test or procedure is CONSISTENT WITHIN ITSELF i.e., questionnaire items or questions in an interview should all be measuring the same thing= REPRESENT ONE dimension or area of interest CRONBACH’S ALPHA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

To remember on Reliability

A

Ensure instructions are standardized and clear across all settings

  • Increase no of observations to increase the chance of the sample being reliable.
  • Delete all unclear items.
  • Moderate the easiness/difficulty of test
  • Minimize external effects
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is validity?

A

The tool MEASURES WHAT IT SAYS IT DOES; what it’s supposed to

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the first type of validity?

A

CONTENT VALIDITY:Good SAMPLE of the specific “UNIVERSE”

ex: Does the content of a test cover everything in the area of interest?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the second type of validity?

A

CRITERION VALIDITY:Systematically RELATED TO OTHER CRITERION
1 Concurrent: New measure test scores are CORRELATED with those FROM AN ESTABLISHED VALID test
ex: we have a high positive correlation between scores on the new and old tests. this test valid!

2 Predictive:futur
Ex:Can an intelligence test at age 3 predict academic performance at 21?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Not finding the validity evidence

A

means that your test is not doing what it should.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

when no criterion validity

A

to re-examine the nature of the items on the test and answer questions the way you expect the responses to be.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Not finding construct validity

A

means that you have to take a closer look at the theoretical rationale that underlies the test you have developed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Relationship between validity and reliability

A

A test can be reliable and not valid, but not the other way around. Because a test can do what it does over and over, but still not do what it is supposed to do.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

HYPOTHESIS

Definition

A

It is an EDUCATED GUESS
The “QUESTION/PROBLEM STATEMENT” we want to answer/address with research
TRANSLATES A PROBLEM IN A QUESTION that can be tested.
,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

How to formulate a good hypothesis

A
  • should TRANSLATE A statement/research question into a more amenable testing form.
  • Use the RESEARCH question as a GUIDE. Then the hypothesis will determine the techniques to use to create a good hypothesis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

rules about Samples and population

A

Samples should ACCURATELY, to allow a higher a degree of GENERALIZATION for the study results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Null Hypothesis def:

A

Statement that two or more things are EQUAL OR UNRELATED to each other
H0 : m1 = m2 or H0 : rm1m2 = 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Purpose of the NH
The NH acts as BENCHMARK & STARTING POINT against which the actual OUTCOMES of a study can be MEASURED (STATE OF AFFAIR accepted coz no other info)
26
Research hypothesis def
STATEMENT OF INEQUALITY posits that there is a RELATIONSHIP between variables.
27
RH two forms
•NON-DIRECTIONAL RH (ONE TAIL): UNSPECIFIED DIFFERENCE between groups, H1 : X1 > X2 more than/less than • DIRECTIONAL RH (2 TAILS): SPECIFIED DIFFERENCE between groups. H1 : X1 ≠ X2
28
Purpose of RH
DIRECTLY TESTED in RESEARCH. results compared with you expect by CHANCE ALONE & see what is a more attractive explanation for any differences observed between groups.
29
Difference between the NH and the RH
•RELATIONSHIP (NH-YES & RH- NO) •NH -POPULATION &the RH -SAMPLE. •NH INDIRECTLY tested & RH DIRECTLY ested •NH in GREEK symbols & RH in ROMAN symbols. *NH is IMPLIED, & RH is EXPLICIT- reason why NH is not used in research reports.
30
What makes a good hypothesis?
``` •DECLARATIVE form & NOT A QUESTION •POSITS A RELATIONSHIP between VARIABLES variables •REFLECT THEORYon which based •Should be BRIEF & TO THE POINT. -TESTABLE (unambiguous) ```
31
PROBALITY | Why?
*BASIS for the NORMAL CURVE & the FOUNDATION for INFERENTIAL statistics. • determining the degree of confidence we have in stating that a statement is true.DIDNT HAPPEN BY CHANCE
32
Probility: normal curve (bell-shaped curve) def:
it is a VISUAL REPRESENTATION of a distribution of scores
33
what are the probability normal curve 3 characteristics?
* NOT SKEWED (mean=median=mode) * SYMETRICAL-both HALVES r IDENTICAL. * ASYMPTOTIC: as they come CLOSER to HORIZONTAL AXIS, they NEVER TOUCH.
34
explain the reasons for the curve characteristics?
With large sets of data, & repeated samples of data from population, the VALUE IN CURVE APPROXIMATE THE SHAPE of a normal curve. EVENTS in the EXTREME tend to have a SMALLER PROBABILITY , and event in the MIDDLE have a HIGHER probability.
35
Probability standard scores def:
scores that COMPARABLE because they are STANDARDIZED, allow us to COMPARE scores with DIFFERENT MEANS They help decide THE PROBABILITY OF SOME EVENT OCCURING LIKELY, MORE LIKELY, LESS LIKELY
36
Probability | What do z scores represent?
RAW SCORES & a particular LOCATION on the x-axis | -the LARGER the Z score, the further awya from the mean
37
Hypothesis testing and z scores
*NH - no difference between groups with a chance of a 100% of that occurring. *RH shows that the likelihood of that event occurring is somewhat extreme; RH - better explanation than NH. *Z score will show the LIKEHOOD OF EVENT HAPPENING
38
Inferential statistics: significant
any DIFFERENCE between GROUPS is caused by an EXTERNAL FACTORS & not by CHANCEalone. allowing leeway on fact that difference in groups could be may be caused by uncontrollable factors.
39
Inferential Stats: what is Significance level?
is the DEGREE WE ALLOW FOR ERROR, the level of chance or risk we are willing to take that the RESULTS in an experiment are NOT DUE TO CHANCE ALONE.
40
What is the difference between significance and meaningfulness?
Significance(PROBABILITY) is not meaningful(CONTEXT) on its own (variability, difference of people, mean difference): it is influenced by the mean of the groups, variability and the difference in groups
41
Define inferential statistics
Tool used to infer result from sample to population
42
How does inference work?
* sample * test * significance * inference
43
Steps for significance
* NH * .01or .05 * Compute T * compare T to Bar * reject/accept NH
44
What is confidence interval?
The BEST ESTIMATE OF THE RANGE of population in a sample
45
What do we need Z test for?
To compare a SAMPLE MEAN to a POPULATION MEAN
46
Tea
Tool look up MEAN DIFFERENCES between of 1 or MORE VARIABLE between GROUPS that are INDEPENDENT of 1 ANOTHER (independents/dependents samples)
47
When to use tea?
With independent and/or dependent samples
48
Assumptions of T
Observations are INDEPENDENT 2 populations must be NORMAL 2 groups with EQUAL VARIANCE
49
Explain | t(58)=-.14>.05
t = test statistics 58 =degree of freedom -.14 = obtained value P>.05 =probability
50
What is degree of freedom?
EVERYTHING u need to KNOW before u KNOW the TEST Number of entities that are FREE TO VARY Df = n1-1+n2-1 in independent T2 Df=n-1 in dependent T1
51
Explain tails
* 2 tails= NON-DIRECTIONAL RH, results could go either way | * 1 tail= DIRECTIONAL RH, results could only go one way
52
What are effects size?
``` HOW BIG IS BIG? How different is different- magnitude Small 0.0-2.0 how similar r gps/overlap Medium 2.0-.50 Large .5 and above (Xbar1-Xbar2)/SD from either gp ```
53
SPSS T
Analyze, compare means, independents samples T-test/paired samples T test
54
Z scores observation
Scores below mean r negative Positive scores r right to the mean Z scores r comparable
55
Z scores formula
Z= x-x — Sd
56
what is the second form of reliability? example
2) PARRALEL FORM:To examine the EQUIVALENCE OR SIMILAR FORM of the same test. ex: Two versions of the same test should yield equivalent results =not reliable
57
what is the fourth form of reliability?
4)Interrater reliability: HOW MUCH 2 REATERS AGREE on their judgments and if they follow the same procedures.=use STANDARDISE CATEGORIES There SHOULD BE A HIGH POSITIVE CORRELATION between the scores of different observers
58
Chronbach’s Aplha
special measure of reliability INTERNAL CONSISTENCY:how closely related a set of items are as a group. The more consistently an item vary with the total score on the test, the higher the Chronbach’s Aplha value. (as the intercorrelations among test items increase) People who do well should do well on harder questions
59
if you can establish validity
- lower the error - use standardised instructions - increase no of observation - delete unclear items - moderate the test - eliminate external event
60
What is the importance of validity?
Often times we CAN'T “SEE” the CONCEPT we are measuring, i.e. “intelligence,” or “depression” – therefore establishing validity is important
61
what is internal validity
The tool is measuring WHAT IT IS INTENDING to measure
62
what is external validity
The findings can be GENERALIZED BEYOND THE CONTEXT of the research situation
63
What is the third type of validity?
CONSTRUCT VALIDITY: Related to an UNDERLYING IDEA | Ex: social status
64
Z-scores and SD
when comparing scores across distribution, Z scores and SD are EQUIVALENT
65
percentages in the curve
34,13; 15,59; 2,15;0,13
66
What is hypothesis testing?
procedure that DECIDES that the OUTCOME of a STUDY supports a THEORY at a POPULATION level
67
error type 1
rejecting a TRUE hypothesis. Claiming a difference when there is NO DIFERENCE FALSE POSITIVE greek alpha
68
error type 2
accepting a FALSE hypothesis.Claiming there is no difference while THERE IS A DIFFERENCE. FALSE NEGATIVE greek beta
69
p ≤ .05
On any one test of the NH there is a 5% chance you will reject it when the NH is actually true. The probability of observing this outcome in the “normal population” is less than .05. (the “Outcome” is rejecting the NH when it is true) There is a 5% probability that a score is that extreme if theNH is true
70
balancing errors
If you set your significance level at .000001 to control for a Type I error, you risk being too stringent to detect a real effect – committing a Type II error. Tradeoffs must be made
71
SIGNIFICANT
If p<.05 that means it is SIGNIFICANT. | If obtained value is MORE extreme, REJECT NH
72
NOT SIGNIFICANT
p>.05 means a result is not significant | If obtained value is NOT MORE extreme, ACCEPT NH
73
what is a Test Statistic (Obtained Value)
– RESULT of a specific STATISTICAL TEST done on a sample
74
independent samples
SEPARATE groups TESTED ONCE (males vs.females) only 2 groups total interested in DIFFERENCE BETWEEN GROUPS
75
NUMERATOR IN T STATISTIC
DIFFERENCE BETWEEN MEANS
76
Denominator in T Statistics
AMOUNT of VARIANCE WITHIN & BETWEEN groups