bivariate regression Flashcards

(19 cards)

1
Q

what do social scientists try to build

A

statistical models of real-world psych processes (e.g. mood), in an attempt to predict how these processes operate under diff conditions

2
Q

fit of the model

A

degree to which statistical model (predicted data) represents data collected (actual)

3
Q

the mean: simple stat model of actual data that…

A

summarizes data/DV scores; since it's hypothetical, it doesn't have to be a value in the observed/actual data. *looks like a horizontal line on a graph

4
Q

how to assess fit of model?

A

sum of squares (ss), variance (var), standard deviation (sd)

5
Q

what are deviance scores

A

diff bw observed and model-fitted data

total error = sum of deviances about the mean (+ and - deviances cancel out, so this sum is always 0 when the model is the mean)

6
Q

what is sum of squared errors

A

sum of the squared deviances: total amount of (squared) deviation of the data from the mean
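
A minimal sketch of the deviance and sum-of-squares cards, assuming a small made-up set of scores (the numbers are hypothetical, not from the deck). It shows that raw deviances about the mean cancel to zero, while squared deviances do not:

```python
# Hypothetical DV scores, made up for illustration.
scores = [2, 4, 4, 6, 9]

mean = sum(scores) / len(scores)          # the model: one predicted value for every case
deviances = [x - mean for x in scores]    # observed score minus model-fitted value
total_error = sum(deviances)              # + and - deviances cancel each other out
ss = sum(d ** 2 for d in deviances)       # squaring first stops the cancelling

print(mean, total_error, ss)
```

Because the raw total is always zero for the mean, it says nothing about fit; the squared version is the one that grows as the data spread out.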

7
Q

what is standard deviation and whats its utility

A

assess accuracy of model (predicted data) to observed/actual data

var = ss/df

SD = sqrt(ss/df)

where df = n-1

smaller sd means that, on avg, the deviation of observed data from the mean (our predictor) is smaller, so the mean is a more accurate / better predictor
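
The formulas above can be sketched in a few lines, assuming hypothetical scores (made up for illustration):

```python
import math

# Hypothetical DV scores, made up for illustration.
scores = [2, 4, 4, 6, 9]

n = len(scores)
mean = sum(scores) / n
ss = sum((x - mean) ** 2 for x in scores)   # sum of squared deviances from the mean
df = n - 1                                  # degrees of freedom
var = ss / df                               # variance: approx ss per data point
sd = math.sqrt(var)                         # back in the original (unsquared) units

print(ss, var, sd)
```

Here sd is the typical distance of a score from the mean, in the same units as the scores themselves.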

8
Q

why not just use ss to assess accuracy of predictive model to actual data?

A

bc ss grows w the number of data pts, so it can become too large and less representative of the deviation of each observed data pt; therefore divide ss by df to obtain variance, which is approx ss per data point.

however, var is in squared units, so it must be sqrt'd to obtain sd, bc the original units AREN'T SQRD

sd is essentially avg error ie avg deviation for observed data from the mean to give an idea of how well observed data fits model / predicted values

9
Q

how to use model to check accuracy

A

given a score on the IV (x axis), use the model to predict the score on the DV (y axis)

10
Q

what is regression and its types

A

way of predicting the value of 1 variable from another using a hypothetical model (we use linear) of the relationship bw 2 variables (IV & DV); therefore we describe the relationship using the equation of a straight line

simple regression: tries to predict DV/outcome from 1 IV/predictor

multiple regression: predicts DV from 2+ IVs

11
Q

how to find line of best fit?

A

ordinary least squares criterion (ols): picks the line that best fits the observed data ie the sum of sqrd estimated errors of prediction is minimized ie the total sqrd deviation of observed data from the model is the smallest it can be across all data pts (not nec. the smallest for each individual pt)
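
A small sketch of the least-squares idea, assuming hypothetical data and the standard closed-form estimates for slope and intercept (the helper name `sse` and all numbers are made up for illustration, not taken from the deck). It compares the sum of squared errors for the fitted line against a flat line at the mean and against a few nudged lines:

```python
def sse(intercept, slope, xs, ys):
    """Sum of squared errors of prediction for the line y = intercept + slope * x."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))

xs = [1, 2, 3, 4, 5]   # hypothetical IV scores
ys = [2, 4, 5, 4, 5]   # hypothetical DV scores

mean_x = sum(xs) / len(xs)
mean_y = sum(ys) / len(ys)

# Closed-form least-squares estimates (assumed standard formulas).
slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
         / sum((x - mean_x) ** 2 for x in xs))
intercept = mean_y - slope * mean_x

sse_ols = sse(intercept, slope, xs, ys)    # error of the least-squares line
sse_mean = sse(mean_y, 0.0, xs, ys)        # error of a flat line at the DV mean
# Nudge the fitted line in four directions; each nudge should increase the error.
sse_nudged = [sse(intercept + di, slope + ds, xs, ys)
              for di in (-0.5, 0.5) for ds in (-0.2, 0.2)]

print(sse_ols, sse_mean, sse_nudged)
```

Any other straight line through these points gives a larger total, even if it happens to sit closer to one particular point.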

12
Q

eqn to describe an unstandardized straight line

A

to predict DV (y axis) w IV (x axis) :

DV = y-int + rate of change * IV

y = b + ax

a or b1: regression coefficient for IV predictor / slope of regression line / direction n strength of relationship bw DV n IV / a rescaled r: accuracy of model depends on strength of relationship bw DV n IV (r closer to 1 or -1 means good predictor / correlation)

b or b0: y-int when x=0 / pt where regression line crosses y-axis (ordinate)

as x incr n y decr : - relationship

if x n y go in same direction: + relationship

13
Q

what to do in bivariate regression

A

want to find y-int (b) and slope (a) to build predictor model eqn to see how well regression model fits actual data

  1. estimate coefficients of regression eqn
  2. assess how well model predicts Y (DV)
14
Q

pearson’s r provides info bout…

multiple R or R provides info bout…

A

r : bout size of prediction errors, closer to -1 or +1 -> stronger relationship (smaller prediction errors)

R : absolute r, |r| (in bivariate regression) : correlation bw actual and predicted scores, provides info bout usefulness of predictors
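
A sketch of the r vs R distinction for the bivariate case, assuming hypothetical data with a negative relationship (the helper `pearson_r` and the numbers are made up for illustration). r keeps its sign, while R, the correlation bw actual and predicted DV scores, comes out as |r|:

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation computed from deviations about each mean."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

xs = [1, 2, 3, 4, 5]   # hypothetical IV scores
ys = [9, 7, 6, 4, 2]   # hypothetical DV scores (y drops as x rises)

mx = sum(xs) / len(xs)
my = sum(ys) / len(ys)
slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
         / sum((x - mx) ** 2 for x in xs))
intercept = my - slope * mx
predicted = [intercept + slope * x for x in xs]   # model-fitted DV scores

r = pearson_r(xs, ys)              # negative here: x up, y down
big_r = pearson_r(ys, predicted)   # correlation bw actual and predicted scores

print(r, big_r)
```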

15
Q

bivariate regression assumptions

A
  1. each X (IV) score is independent of the other X scores, and each Y (DV) score is indep of the other Y scores (sample from normal distr)
  2. Y (DV) scores are quantitative; X (IV) scores are quantitative or dichotomous

3.

17
Q

formula for slope

18
Q

constant ie y-int formula

A

y-int = mean of DV - slope * mean of IV

b = My - a*Mx
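
A worked sketch of the y-int formula above, assuming hypothetical data. The slope card in this deck has no answer recorded, so the slope here uses the standard bivariate formula a = r * (SDy / SDx) as an assumption; it is consistent with the "rescaled r" note on the straight-line card, but it is not taken from the deck:

```python
import math
import statistics

xs = [1, 2, 3, 4, 5]   # hypothetical IV scores
ys = [2, 4, 5, 4, 5]   # hypothetical DV scores

mean_x = statistics.mean(xs)   # Mx
mean_y = statistics.mean(ys)   # My
sd_x = statistics.stdev(xs)    # sample SD (df = n - 1)
sd_y = statistics.stdev(ys)

# Pearson r from deviations about each mean.
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
sxx = sum((x - mean_x) ** 2 for x in xs)
syy = sum((y - mean_y) ** 2 for y in ys)
r = sxy / math.sqrt(sxx * syy)

a = r * sd_y / sd_x        # slope (assumed standard formula, see note above)
b = mean_y - a * mean_x    # y-int = mean of DV - slope * mean of IV

print(a, b)
```

With these made-up scores the prediction eqn works out to roughly y = 2.2 + 0.6x.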

19
Q

what affects pearson r (decr accuracy of predictor)

A
  1. outliers - pull the mean towards them, resulting in a model that poorly represents the data as a whole
  2. range / variance restriction
  3. nonlinearity - if pearson's r = 0, doesn't mean variables aren't related, just means no observed LINEAR relationship, bc r only measures linear relationships bw 2 variables
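
A sketch of points 1 and 3 in the list above, assuming hypothetical data (the helper `pearson_r` and all numbers are made up for illustration). A perfectly curved relationship gives r = 0, and a single outlier drags a perfect linear r down:

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation computed from deviations about each mean."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Nonlinearity: y is fully determined by x (y = x squared), yet r is 0.
xs_curved = [-2, -1, 0, 1, 2]
ys_curved = [x ** 2 for x in xs_curved]
r_curved = pearson_r(xs_curved, ys_curved)

# Outlier: a perfect linear relationship, then the same data plus one extreme pt.
xs_clean = [1, 2, 3, 4, 5]
ys_clean = [1, 2, 3, 4, 5]
r_clean = pearson_r(xs_clean, ys_clean)
r_outlier = pearson_r(xs_clean + [6], ys_clean + [-10])

print(r_curved, r_clean, r_outlier)
```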