bivariate regression Flashcards

Question 1

Q

what do social scientists try to build

Answer

A

real-world psych processes (mood) in attempt to predict how these processes operate under diff conditions

Question 2

Q

fit of the model

Answer

A

degree to which statistical model (predicted data) represents data collected (actual)

Question 3

Q

the mean: simple stat model of actual data that…

Answer

A

summarizes data/dv scores- and since it’s hypothetical, it doesnt have to a a value in observed /actual data *looks like horizontal line on graph

Question 4

Q

how to assess fit of model?

Answer

A

sum of squares (ss) variance (var) standard deviations (sd)

Question 5

Q

what are deviance scores

Answer

A

diff bw observed and model fitted data total error = sum of deviances about the mean

Question 6

Q

what is sum of squared errors

Answer

A

total amount of deviation from mean of data

Question 7

Q

what is standard deviation and whats its utility

Answer

A

assess accuracy of model (predicted data) to observed/actual data

var = ss/df

SD = sqrt(ss/df)

where df = n-1

smaller sd means on avg, deviation of observed data from the mean (our predictor) is smaller and thus more accurate/ good predictor

Question 8

Q

why not just use ss to assess accuracy of predictive model to actual data?

Answer

A

bc ss can beome too large and beome less representative of unique deviations of each observed data pt, therefore divide ss by df to obtain variance which is approx ss per data point.

however, var must be sqrt’d to obtain sd bc original units AREN”T SQRD

sd is essentially avg error ie avg deviation for observed data from the mean to give an idea of how well observed data fits model / predicted values

Question 9

Q

how to use model to check accuracy

Answer

A

given IV (x axis), to predict scores on DV (y axis)

Question 10

Q

what is regression and its types

Answer

A

way of predicting value of 1 variable from another using hypothetical model (we use linear) of relationship bw 2 variables (IV & DV) therefore we describe the relationship using the equation of a straight line

simple regression: tries to predict DV/outcome from 1 IV/predictor

multiple regression : predicts DV from 2/+ IVs

Question 11

Q

how to find line of best fit?

Answer

A

ordinary least squares criterion(ols): model that best fits the observed data ie the sum of sqrd estimated errors of prediction is minimized ie deviation of observed data from model / mean is smallest it can be for each data point

Question 12

Q

eqn to describe an unstandardized straight line

Answer

A

to predict DV (y axis) w IV (x axis) :

DV = y-int + rate of change * IV

y = b + ax

a or b1: regression coefficient for IV predictor / accuracy of model dependent on strength of relationship bw DV n IVrescaled r (closer to 1 or -1 means good predictor/ correlation) / slope of regression line / direction n strength of relationship bw DV, IV

b or b0: y-int when x=0 / pt where regression line crosses y-axis (ordinate)

as x incr n y decr : - relationship

if x n y go in same direction: + relationship

Question 13

Q

what to do in bivariate regression

Answer

A

want to find y-int (b) and slope (a) to build predictor model eqn to see how well regression model fits actual data

estimate coefficients of regression eqn
asses how well model predicts Y (DV)

Question 14

Q

pearson’s r provides info bout…

multiple R or R provides info bout…

Answer

A

r : bout size of prediction errors, closer to -/1 -> stronger relationship

R : absolute r: |r| : correlation bw actual and predicted scores, provides info bout usefulness of predictors

Question 15

Q

bivariate regression assumptions

Answer

A

each X, IV score independent of other X scores while each y score indep of other y scores (sample from normal distr)
DV, Y scores are quantitative while IV, X scores quantitative or dichotomous

3.

Question 16

Q

Answer

Study These Flashcards

A

Question 17

Q

formula for slope

Answer

Study These Flashcards

A

Question 18

Q

constant ie y-int formula

Answer

Study These Flashcards

A

y-int = mean of DV - slope * mean of IV

b = My - a*Mx

Question 19

Q

what affects pearson r (decr accuracy of predictor)

Answer

Study These Flashcards

A

outliers - pull mean towards them, resulting in bad model representative of all data
range / variance restritction
nonlinearity - if pearsons r = 0, doesnt mean variables aren’t related, just means no observed LINEAR relationship - but r only relates 2 linearly related variables

bivariate regression Flashcards

(19 cards)