1.1 and 1.2 Overview and Descriptive Statistics Flashcards

(32 cards)

1
Q

Population

A

the entire group to be studied

you define the population, be precise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Parameter

A

numerical summary of entire population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Census

A

data collected from entire population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Anecdote

A

measure of a single individual (opposite of census, nothing can be learned from it)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Sample

A

subset of the population

* DO NOT give certainty (not 100% accurate)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Statistic

A

numerical summary of a sample

study of designing sampling methods to reduce bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Statistics (inferential statistics)

A

using info about sample to make inferences about population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Probability

A

knowing something about population (general rule / trend) and applying it to sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Error =

A

statistic - parameter
(error = difference)
error is NOT the same as mistakes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Statistic =

A

parameter + error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

sampling error =

A

chance error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

nonsampling error =

A

bias! ex: asking how many students exercise after coming out of the gym

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Types of Samples

A

Convenience, Quota, volunteer, Random (simple: equal likelihood, Systematic: every 14th person, Cluster: specific areas, Stratified: random quota), Multistage (combination of a few types) ex pick 5 random dorms, pick every 9th person from each floor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Convenience Samples

A

asking the people the researcher encounters

haphazard != random

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Quota Samples

A

convenience with stratification

Ex. 5 men and 5 women

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Volunteer Samples

A
voluntary responses
extreme responses (especially negative) think about people's motivation to respond
17
Q

Random Sample

A

any individual is as likely to be included as any other

  • remove bias
  • you don’t decide who’s in the sample (chance)
18
Q

Simple Random Sample (SRS)

A

using a randomizing device to select subjects

-dice, cards, random number table, technology

19
Q

Stratified Random Sample

A

simple random samples done on each set of partitions of a pop.

  • male vs female
  • bias if subpopulations are not equal
20
Q

Multi-stage sample

A

combination of several methods

-randomly select floors and knock on every other door

21
Q

Variables

A

characteristics of individuals

22
Q

Categorical or Qualitative Variables

A

they place individuals into distinct groups (categories)

23
Q

Quantitative Variables

A

numerical measures which ARITHMETIC MAKES SENSE

ex. can’t average zip codes, can average heights

24
Q

Discrete Variables

A

quantitative variables where possible values have JUMPS
ex. shoe size, can’t have size 4.443
can’t have 32.5 students

25
Continuous Variables
quantitative variables where values can be arbitrarily close ex. temp, distance, height
26
Frequency Distribution
a list of each category of data, and the number of occurrences for each category (frequency: how many things fall into each category)
27
Relative Frequency
percentage of observations in a certain category RF = frequency / total # of observations (what proportion of the whole is in each category)
28
Ways of displaying Categorical Data
Bar graph, pie chart
29
Ways to display Quantitative data
histograms, stem plots, dot plots
30
Histogram
displays data distribution divided into classes -Discrete data: classes = DIFF values (set) -Continuous data: classes = RANGE of values (only one value can be in each class) -classes turn quantitative data into categorical data
31
Stem Plot (stem and leaf plot)
represents quantitative data graphically "( )" = values in stem containing MEDIAN value #items stem #each item in stem ex: 5 1 55678 = 15, 15, 16, 17, 18
32
Dot plot shapes
``` Symmetric: bell curve Skewed Left: bump = right Skewed Right: bump = left Bimodal: 2 bumps Center, spread, outliers ```