Exam 1 Flashcards

(148 cards)

1
Q

Statistics

A

Mathematical technique by which data are organized, treated and presented for interpretation and evaluation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Research

A

A systematic investigation which includes research, development, testing and evaluation designed to develop or contribute to generalized knowlege.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Historical Research

A

A search through record to establish a possible how and why explaining an event or phenomenon

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Observational Research

A

(descriptive research) involves describing events or conditions that the researcher does not actively manipulate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Experimental Research

A

involves manipulating and controlling events or variables to solve a problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Measurement

A

The process of comparing a value to a standard

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Data

A

the result of a measurement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Dimensions

A

physical quantities that can be measured (M,L,T,Te)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Units

A

A single quantity or individual isolated component that may be part of a more complex structure, function or system.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Validity

A

The soundness or the appropriateness of the test in measuring what it is designed to measure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Types of Validity

A

Internal and External

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Internal Validity

A

A measure to control within the experiment, (the study itself), results are due to the treatment that was applied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Threats to Internal Validity

A

Research Design
Extraneous Variables
Instrument Error
Investigator Error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

External Validity

A

The ability to generalize the results of the experiment to the population from which they were drawn.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Reliability

A

The consistency and reproducibility of data (test-retest value)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Variable

A

A characteristic of a person place or object that can assume more than one value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Intervening Variable

A

Extraneous Variable: Confouding variable that may reduce internal validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Continuous Variable

A

A variable that can assume any value (distance, force etc)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Discreet Variable

A

A variable that is limited to certain numbers, usually whole numbers or integers. (ticket sales, gymnast scores)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Constant

A

A characteristic that can only assume one value, never changes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Classificaiton of Data or Scales Acronymn

A

No Oil in Rivers
Nominal
Ordinal
Interval
Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Nominal Scale

A

Mutually exclusive categories, assigned number does not indicate an amount of something.

Numbers assigned to runners

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Ordinal Scale

A

Gives quantitiative order to the variables, but does not indicate how much better one score is than another, rank.

Rank order of winners

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Interval Scale

A

Has equal units, or intervals of measurement but has no absolute zero point (zero is arbitrary)

Performance rating on a scale of 1 to 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Ratio Scale
Quantitative, based on order has equal distance between scale points and uses zero to represent the absence of value, negatives are not possible. Time to finish`
26
Frequency Data
Data which indicates the number of occurences of a given value or event.
27
Hypothesis
An educated guess or logical assumption that is based on prior research or known facts and that can be tested.
28
Research Hypothesis
( H1 or Ha) The hypothesis that prompts the reserach, typically a statement, usually predicts a relationship or differences between or among groups.
29
Null Hypothesis
(Ho) Prediction that there is no relationship or no difference between groups
30
Independent Variable
In Experimental Studies: The variable that is manipulated or controlled by the researcher In observational Studies: The "predictor variable" this is measured only not controlled.
31
Dependent Variable
In experimental studies: The variable that is measured In observational studies: The criterion variable
32
Population
Any group of persons, places, or objects that ahve at least one common characteristic.
33
Sample
A subset of a population (Samples are representative of the population of intrest)
34
Statistical Inference
The process of generalizing from a sample to a population
35
Parameter
A characteristic of the population
36
Statistic
A characteristic of a sample that is used to estimate the value of the population parameter
37
Theory
A belief regarding a concept or a series of related concepts
38
Rank Order Distribution
ordered listing of the data in a single column, quick view of spread or variableity in the group, easily identifies outliers qualitatively
39
Range
Distance from HIghest to lowest value
40
Types of Frequency Distribution
Simple and Complex
41
Simple Frequency Distribution
Ordered listing of the variable being studied (x) with a frequency colun (f) that indicates the number of cases at each given (x) value.
42
Grouped Frequency Distribution
Ordered listing of, in one column, a variable (x) in groups and in a second column (f).
43
Frequency Histogram
Bar graph, displays a frquency distribution, frequency on y axis and groups on X
44
Frequency Polygon
Line Graph of scores plotted against frequency
45
Cumulative Frequency Graph
Curve: - Line Graph -Ordered scores on X axis - Number of subjects at or below that value on Y axis
46
Dot Plot
Scatter Plot: one categorical axis and one continuous axis. Each point represents the score of a dependent variable for each subject.
47
Normal Curve
Bell or Gaussian Curve: Assumes a symmetrical distribution of values abou thte center of the curve. Mean, Median, Mode are all located at the center point of the curve. ( z = 0)
48
Central Limit Theorem
A sum of random numbers becomes normall distributed as more and more of the random numbers are added together.
49
Bimodal Curve
Curve which contains multiple values occuring with the highest frequency
50
Skewed Curve
Nonsymmetrical curve, number of subjects or values shifted to one side.
51
Ceiling Effect
Limit on the upper value of scores causing high scores to be clustered around the limit.
52
Floor effect
Opposite of ceiling, when the lower limit causes cluster of values around it.
53
Percentile
Represents a fraction (in hundreths) of the ordered scores that are equal to or fall below a given raw score. Standard Score
54
Uses of Standard Scores
Evaluate Raw Score Compare two sets of scores that have different units
55
Standard Score
A score that is derived from raw data that has a known basis for comparison.
56
Quartile
Precentile Scores divided into four equal parts.
57
Q1
First quartile: 0-25%
58
Q2
Second Quartile: Q1 ato 50th percentile
59
Q3
Third Quartile: Q2 to 75th Percentile
60
Q4
Fourth Quartile: Q3 to 100th Percentile.
61
Quintile
Similar to quartile but uses 20% divisions
62
Steps to calculate percentile
Create a fraction: Number of values at or below value of interest / N
63
Steps to determine score from percentile
Convert percentile to decimal Multiply deceimal by number of score (N) Count that many scores from the bottom
64
Mean
Arithmetic average of the sample (Sum/n)
65
Median
The middle number, score associated with 50th percentile
66
Mode
The most frequent or common score in a distribution
67
Mode Disadvantages
Unstable: Varies depending on grouping methods Terminal: Not useful for other calculations Ignores extreme scores
68
Variability
Measure of the spread or dispersion of the data set
69
Indices of Dispersion
Describes the scatter of scores in a distribution Describes the spread of data
70
IQR
Interquartile Range: The difference between the 75th and 25th Percentile (3rd and 1st quartile)
71
SIQR
Semi Interquartile Range: Half of the IQR: (Q3-Q1)/2
72
Variance
The average of the squared deviations from the mean
73
Sum of Squares
Measures the deviation of data points away from the mean
74
Σ
Sigma : The sum of
75
𝛔
Population Standard Devation
76
V or 𝝈²
Population Variance
77
SD or s
Sample Standard Deviation
78
Sample Variance
79
Describe SD in relation to the normal curve
When N is large and the distribution is normal ther are typically 5 or 6 SDs within the range. Does not apply if range is small Does not apply in skewed data
80
Degrees of Freedom
The number of values in a data set that are free to vary
81
CoV
Coefficient of Variation: A normalized SD measure, normalized to mean, Percentage (SD/x) * 100
82
Sample Mean
83
𝝁
Population Mean
84
Z-Score
Number of standard deviations away from the mean (x-x̄)/s
85
T-score
Assigns the number 50 to a z-score of 0
86
What proportion of scores fall +/- 1 SD away from the mean?
68.26
87
What proportion of scores fall +/- 2 SD away from the mean?
95.44
88
What proportion of scores fall +/- 3 SD away from the mean?
99.75
89
Z score of 95% CI
1.960
90
Z score of 99% CI
2.576
91
Z score of 90% CI
1.645
92
Stanine
Standard 9: 9 point scale of scores can be used to assign a test score a single digit number, always positive from 0-9 e.g. 1,2,3 normal, 4,5,6 average, 7,8,9 above average
93
Probability
The liklihood of a particular outcome. 0 means it will definitely not happen 1 means it will definitely happen
94
Odds
p/(1-p) where p is the probability of the outcome
95
Skewness
Bilateral symmetry of the data
96
Positive Skew
long tail to right
97
Negative Skew
Longer tail to left
98
Kurtosis
The relative peakedness (magnitude) of the curve measure of heaviness of the tails
99
Leptokurtic
more peaked Kurtosis > 0
100
Platycurtic
less peaked, flatter Kurtosis < 0
101
Raw Skewness formula
(ΣZ³)/N
102
Raw Kurtosis formula
(ΣZ⁴/N)-3
103
Standardized Skewness Score
Z score of the raw skewness calclulated by taking the raw skewness dividing it by the standard error
104
Standardized Kurtosis Score
Z score of the raw kurtosis calculated by taking the raw skewness and dividing it by the standrard error
105
Standardized Skewness Formula
Zskew = ((ΣZ³)/N)/SQRT((6/N))
106
Standardized Kurtosis Formula
Zkurt = ((ΣZ⁴/N)-3)/SQRT(24/N)
107
Conditional Probability
Probability of the outcoem given that another outcome has occured.
108
Sampling Error
The amount of error in the estimate of a population parameter that is based on a sample statistic
109
Standard Error of the Mean
An estimate of the amount of Sampling Error
110
SEm formula
SEM = SD/ SQRT(N)
111
Sampling Distribution of the Mean
Frequency Distribution of Means
112
Level of Confidence
Degree to which it is believed that the IV has an effect on the DV. Opposite of the Probability of Error
113
𝝰
The probability of commiting a type 1 error (Alpha Level)
114
CI
Confidence Interval: Range of actual values associated with a level of confidence
115
CI formula
Mu =x̄ +/- Z(SEM)
116
Type 1 Error
𝝰 False Positive: Reject Null when Null is true
117
Type 2 Error
𝜷 False Negative: Fail to reject Null when Null is false
118
Region of Rejection
AUC either smaller than the lower critical value or larger than the higher critical value area where we reject alternate hypothesis and accept null.
119
Two tailed test
H0: Xbar1 - Xbar2 = 0 Rejection area divided between the two tails of the sampling value, upper and lower critical values
120
One tailed test
H1: Xbar1 > Xbar2 All rejection area in one tail of the sampling distribution
121
Correlation
Used to quantify the degree of relationship or association between variables.
122
r
Pearson Product Moment Correlation: Ranges from -1 to 1 , 1 = strong positive relationship -1 = strong negative relationship 0 = no relationship
123
Best fit line
Line of Best Fit: the best linear estimate of the relationship between the two variables.
124
Linear Relationship
Rate of change of variables is constant
125
Curvilinear Relationship
Rate of change of variables is not constant
126
Covariance
An index of how x and y vary together
127
Bivariate Regression
Predict a score on y if we know a score on x Simple linear regression regression = prediction
128
Residual
Error factor, the distance of the point from the line of best fit, difference between the predicted values of Y (DV) and the observed values of Y
129
Regression
Statistical technique that relates a DV to one or more IVs
130
"a" (correlation)
Y intercept: The value of y wehre the line crosses the y axis, the value of y when the value of x = 0
131
"b" (correlation)
Slope: the change ofy for 1 unit increase in x, rise/run, y2-y1 / x2-x1
132
Represents the proportion of teh variance for a DV that is explaine dby and IV or variables in a regression. Common variance Shared variance
133
How to find r squared
r = .8 , rsquared = .8*.8 = .64 = 36% unexplained
134
Homoscedasticity
Assumption that the residuals do not vary, there is no apparent relationship between the size of the residuals and the size of X,
135
Heteroscedasticity
Variance of the residuals is not constant Inflates tyle 1 errors
136
R
Multiple Correlation Coefficient: Ranges from 0 to 1 Cant be negative 0 no relationship 1 perfect relationship
137
R^2
Multiple correlation coefficient squared More useful than R Multivariate coefficient of determination
138
Partial Correlation
Teases out the correlation between a single x variable and the y variable in a multiple regression. i.e. Correlation of x1 and y with x2 removed (rsubYX2.X@) / correlation of x2 and y with x1 removed (rsubYX2.x1)
139
b (multiple regression model)
slope coefficient: coefficients that give weight to the independent variables according to their relative contributions to the prediction of Y.
140
k (multiple regression)
number of predictors, or IVs
141
Beta Weights
Regression coefficeints for standardized data
142
Multiple Regression Methods
Forward Selection Stepwise Backward Elimination Hierarchical Multiple Regression
143
Forward Selection Method Steps
1. Start with correlation matrix (pearson r table) 2. First X variable added has highest correlation with Y variable 3. Further additions of X variables are added in order of how much unique variance they can account for. **Variables that add uniqueness**
144
Stepwise Multiple Regression
Same process as forward selection Each step considered the Rsquared Each step the algorithm may remove a variabel that was previously added if it does not decrease the Rsquared This occurs if ta variable no longer accounts for a significant portion of unique variance in the model
145
Backward Elimination
All X variables forced into the model Removes variables by which effects Rsquared the least.
146
Hierarchical Method
Allows researchers to dictate the order in which X variables are added Used to examine a specific modelor hypothesis
147
Multicollinearity
When IVs are correlated with each other
148
Singularity
An extreme form of multicollinearity, perfect linear relationship between variables.