preFinals Math In Modern World Flashcards

1
Q

are raw information or facts that become useful information when organized in a meaningful way. It could be of qualitative and quantitative nature.

A

Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

is concerned with “looking after” and processing data

A

Data Management

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data management involves the following: (there are 4 items)

A

• Looking after field data sheets
• Checking and correcting the raw data
• Preparing data for analysis
• Documenting and archiving the data and meta-data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Importance of Data Management
(there are 3 items)

A

Ensures that data for analysis are of high quality so that conclusions are correct
Good data management allows further use of the data in the future and enables efficient integration of
results with other studies.
Good data management leads to improved processing efficiency, improved data quality, and improved meaningfulness of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Methods of Data Collection
(4 items)

A

Census
Sample survey
Experiment
Observation study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

this is the procedure of systematically acquiring and recording information about all members of a given population. Researchers rarely survey the entire population for two (2) reasons: the cost is too high and the population is dynamic in that the individuals making up the population may change over time.

A

Census

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

sampling is a selection of a subset within a population, to yield some knowledge about
the population of concern. The three main advantages of sampling are that (i) the cost is lower, (ii) data
collection is faster, and (iii) since the data set is smaller, it is possible to improve the accuracy and quality
of the data.

A

Sample survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

this is performed when there are some controlled variables (like certain treatment in medicine) and the intention is to study their effect on other observed variables (like health of patients). One of the main requirements to experiments is the possibility of replication.

A

Experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

this is appropriate when there are no controlled variables and replication is impossible. This type of study typically uses a survey. An example is one that explores the correlation
between smoking and lung cancer. In this case, the researchers would collect observations of both smokers and non-smokers and then look for the number of cases of lung cancer in each group.

A

Observation study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Planning and Conducting Surveys
1. Characteristics of a well-designed and well-conducted survey
a. ?
b. ?
c. ?
d. ?

A
  1. A good survey must be representative of the population.
  2. To use the probabilistic results, it always incorporates a chance, such as a random number generator.
    Often we don’t have a complete listing of the population, so we have to be careful about exactly how
    we are applying “chance”. Even when the frame is correctly specified, the subjects may choose not to
    respond or may not be able to respond.
  3. The wording of the question must be neutral; subjects give different answers depending on the phrasing.
  4. Possible sources of errors and biases should be controlled. The population of concern as a whole may not be available for a survey. Its subset of items possible to measure is called a sampling frame (from
    which the sample will be selected). The plan of the survey should specify a sampling method, determine
    the sample size and steps for implementing the sampling plan, and sampling and data collecting.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

2 types of sampling method

A

Non-probability sampling
Probability sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

is any sampling method where some elements of the population have no
chance of selection or where the probability of selection can’t be accurately determined.

A

Non-probability sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

it is possible to both determine which sampling units belong to which sample and the probability that each sample will be selected.

A

Probability sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

One example of nonprobability sampling is** 1. ___________ **sampling (customers in a supermarket are
asked questions). Another is 2. _____ sampling, when judgment is used to select the subjects based on
specified proportions.

A
  1. convenience sampling
  2. quota sampling
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

The following sampling methods are example of probability sampling:

(There are 5 of them)

A

Simple Random Sampling (SRS)
Systematic sampling
Stratified sampling
Cluster sampling
Matched random sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

all samples of a given size have an equal probability of being
selected and selections are independent. The frame is not subdivided or partitioned. The sample variance is a good indicator of the population variance, which makes it relatively easy to estimate
the accuracy of results.

A

Simple Random Sampling (SRS)

17
Q

relies on dividing the target population into strata (subpopulations) of equal
size and then selecting randomly one element from the first stratum and corresponding elements
from all other strata.

A

Systematic sampling

18
Q

when the population embraces a number of distinct categories, the frame can be organized by these categories into separate “strata”.

A

Stratified sampling

19
Q

is an example
of two-stage random sampling: in the first stage a random sample of areas is chosen; in the second
stage a random sample of respondents within those areas is selected.

A

Cluster sampling

20
Q

in this method, there are two (2) samples in which the members are
clearly paired, or are matched explicitly by the researcher (for example, IQ measurements or pairs
of identical twins). Alternatively, the same attribute, or variable, may be measured twice on each
subject, under different circumstances (e.g. the milk yields of cows before and after being fed a
particular diet).

A

Matched random sampling

21
Q

C. Planning and conducting experiments:
1. Characteristics of a well-designed and well-conducted experiment

A good statistical experiment includes: (4 items)

A

a. Stating the purpose of research, including estimates regarding the size of treatment effects,
alternative hypotheses, and the estimated experimental variability. Experiments must compare the
new treatment with at least one (1) standard treatment, to allow an unbiased estimates of the
difference in treatment effects.
b. **Design of experiments, **using blocking (to reduce the influence of confounding variables) and
randomized assignment of treatments to subjects
c. **Examining the data set in secondary analyses, **to suggest new hypotheses for future study
d. Documenting and presenting the results of the study

22
Q
  1. Treatment, control groups, experimental units, random assignments and replication
    (3 items)
A

a. Control groups and experimental units
- To be able to compare effects and make inference about associations or predictions, one typically has to subject different groups to different conditions. Usually, an experimental unit is subjected to treatment and a control group is not.
b. Random Assignments
- The second fundamental design principle is randomization of allocation of (controlled variables)
treatments to units. The treatment effects, if present, will be similar within each group.
**c. Replication **
- All measurements, observations or data collected are subject to variation, as there are no
completely deterministic processes. To reduce variability, in the experiment the measurements
must be repeated. The experiment itself should allow for replication itself should allow for
replication, to be checked by other researchers.

23
Q

To be able to compare effects and make inference about associations or predictions, one typically
has to subject different groups to different conditions. Usually, an experimental unit is subjected to
treatment and a control group is not.

A

Control groups & experimental units

24
Q

The second fundamental design principle is randomization of allocation of (controlled variables)
treatments to units. The treatment effects, if present, will be similar within each group.
c. Replication

A

Random Assignments

25
Q

All measurements, observations or data collected are subject to variation, as there are no
completely deterministic processes. To reduce variability, in the experiment the measurements
must be repeated. The experiment itself should allow for replication itself should allow for
replication, to be checked by other researchers.

A

Replication

26
Q

A stratified sampling approach is most effective when three conditions are met:
(3 items)

A

a. Variability within strata are minimized
b. Variability between strata are maximized
c. The variables upon which the population is stratified are strongly correlated with the desired
dependent variable (beer consumption is strongly correlated with gender).

27
Q
  1. Sources of bias and confounding, including placebo effect and blinding
    (3 items)
A

Confounding
Placebo and blinding
Blocking

28
Q

is an extraneous variable in a statistical model that correlates
(positively or negatively) with both the dependent variable and the independent variable. The
methodologies of scientific studies therefore need to control for these factors to avoid a false positive
(Type I) error (an erroneous conclusion that the dependent variables are in a causal relationship with the independent variable).

A

Confounding

29
Q

is an imitation pill identical to the actual treatment pill, but without
the treatment ingredients.

A

Placebo and blinding

30
Q

is the arranging of experimental units in groups (blocks) that are similar to one another.

A

Blocking

31
Q
  1. Completely randomized design, randomized block design and matched pairs

_____ are for studying the effects of one primary factor without the
need to take other nuisance variables into account. The experiment compares the values of a
response variable (like health improvement) based on the different levels of that primary factor (e.g.,
different amounts of medication).

A

Completely randomized designs

32
Q
  1. Completely randomized design, randomized block design and matched pairs

____ is a collection of completely randomized experiments, each run within one of the blocks of the total experiment.

A

Randomized block design

33
Q

is used to determine whether there is significant difference between the expected value
frequencies and the observed frequencies in one or more categories.

A

Chi-square

34
Q

There are two (2) types of chi-square tests. Both use the chi-square statistic and distribution for different purposes:

A

chi-square goodness of fit test
chi-square test for independence

35
Q

determines if a sample data matches a population.

A

chi-square goodness of fit test

36
Q

compares two (2) variables in a contingency table to see if they
are related. It tests to see whether the distributions of categorical variables differ from each other.
- A very small chi-square test statistic means that your observed data fits your expected data well. In other words, there is a relationship.
- A very large chi-square test statistic means that the data does not fit very well. In other words, there
is no relationship.

A

chi-square test for independence

37
Q

Assumptions of the Chi-Square Test
The assumptions of the chi-square test are the same whether we are using the goodness-of-fit or the test-of independence. The standard assumptions are:
(3 items)

A

• Random sample
• Independent observations for the sample (one observation per subject)
• No expected counts less than five (5)

38
Q

is used to test whether a frequency distribution obtained experimentally fits an “expected” frequency distribution that is based on the theoretical or previously known probability of each outcome.

A

chi-square goodness-of-fit test

39
Q

is used to assess if two (2) factors are related. This test is often used in
social science research to determine if factors are independent of each other. For example, we would use this test to determine relationships between voting patterns and race, income and gender, and behavior and education.

A

chi-square test of independence