stata Flashcards

(18 cards)

1
Q

how can you have detailed information about a variable

A

by writing codebook [variable]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

when you type codebook, what word is the Increment between values; (often 1.000)

A

range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what should we look at after a codebook?

A

-what type of value (cat or num)
-unique values: Number of unique, non-missing values
-missing .: Number and percent of missing values (noted as .

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

how to do a boxplot; a bar chart; a line chart; histogram; density plot

A

write:
1. graph box [variable] (if you want a condition add: if…
-> For interval variables
2. graph bar, over ([variable])
-> For nominal, ordinal and interval variable
3. line chart: ONLY for numeric and continuous varaibles
4. histogram [variable]
-> for interval variables
5. kdensity [variable]
-> for interval variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

how to see skewness on a graph

A

PAR RAPPORT to the MEAN

Symmetrical (skew = 0) → balanced on both sides

Positive skew (right-skewed) → tail is longer on the right (more spread values)

Negative skew (left-skewed) → tail is longer on the left

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

how to make a cross-tabulation

A

write: tab [DV] [IV], column
-> so DV in the row and IV in the column

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

how to make a comparison table

A

write: tabulate [IV], summarize([DV])

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

visualize cross-tabulation relationships, we often use bar charts. The general syntax is:

A

graph bar [DV], over([IV])

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

to visualize mean comparisons, we often use box plots. the general syntax is:

A

graph box [DV], over([IV])

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

ewplain and write the commands: sort, and list

A

*sort:
-Organizes your dataset by the values of one or more variables
-sort [variable]
-ex: age: from youngest to oldest SO if write sort gender age: Sorts first by gender, then by age within each gender

*list:
-Displays selected variables and observations in the Results window.
-list [variable1] [variable2]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are X, Y and Z

A

X: IV(s)
Y: DV
Z: CV(s) controlled variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

how to recode when =/ categories, useful to say yes/no
how to recode missing values into true variables?

A

yes=1 and no=0
recode [variable] (1/3=0 “not a large threat”) (4=1 “large threat”) (miss=.), generate [variable_new]
…(miss=.)
don’t forget the coma after and generate (new name so that you don’t modify the true variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

how to have the CI

A

proportion [variable]
attention: read in line the 2 bornes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what does nofreq nokey means

A

have a better look at the cross-tab
get ride of the frequency in a table (only categories and percentage)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

how to make a mean comparison?

A

tabulate independent_variable, summarize(dependent_variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly