The Scientific Method & Statistical Analysis Flashcards

1
Q

What is the definition of the Scientific Method?

A

A systematic investigation towards increasing the sum of human knowledge

2
Q

What is a research hypothesis?

A

A research hypothesis is a statement that proposes a possible explanation for some phenomenon or event

3
Q

What is a characteristic of a good hypothesis?

A

A good hypothesis must be testable (falsifiable/refutable)

4
Q

Research question: What causes leaves to change colour?

Which of the following hypotheses is testable?

A. Temperature may cause leaves to change colour

B. Leaves change colour because little invisible elves want them to

A

A. Hypothesis B cannot be tested, and so can never be rejected, because there is no way to measure the activity of little invisible elves

5
Q

What do good hypotheses often contain?

A

Good hypotheses often contain at least two variables and their (causal) connection

6
Q

What are the two main types of research designs for collecting data?

A
  • experimental research
  • observational research
7
Q

What is experimental research?

A

Experimental research involves deliberately manipulating one or more independent (predictor/treatment) variables to observe the effect on the response (dependent) variable.

8
Q

What is observational research?

A

Observational research involves studying the response variable in situ, without any direct manipulation of the independent (predictor) variables

9
Q

What are the independent and dependent variables in this research question:

How does biodiversity change with decreasing latitude?

A

Dependent variable: Biodiversity (measured here as the number of butterfly species)
Independent variable: Latitude

10
Q

What are the independent and dependent variables in this research question:

Does dosage of an antibiotic lead to more resistance?

A

Dependent variable: Bacterial survival
Independent variable: Dose levels of antibiotic

11
Q

What are the independent and dependent variables in this research question:

Is a particular gene involved in eye development?

A

Dependent variable: Eyes present or absent
Independent variable: Gene switched on or off

12
Q

What is sampling in research design?

A

Sampling in research design is the process of selecting a subset of individuals or units from a larger population to study

13
Q

Why do we have to sample from a population?

A

We have to sample from a population because it is often impractical or impossible to collect data from every individual in the population

14
Q

Why does the sample have to be random?

A

The sample has to be random to avoid bias and ensure that every individual or unit in the population has an equal chance of being selected

15
Q

What are some different approaches to random sampling?

A

Different approaches to random sampling include:
- simple random sampling
- stratified random sampling
- cluster random sampling
- systematic random sampling
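
The four approaches listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the population of 100 numbered units, the two strata ("north" and "south"), the clusters of 10 and the sample size of 10 are all hypothetical.

```python
import random

# Hypothetical population: 100 units labelled 0..99, split into two strata.
population = list(range(100))
strata = {"north": population[:60], "south": population[60:]}

rng = random.Random(42)  # fixed seed so the sketch is repeatable

# Simple random sampling: every unit has the same chance of selection.
simple = rng.sample(population, 10)

# Stratified random sampling: sample each stratum in proportion to its size.
stratified = []
for name, units in strata.items():
    k = round(10 * len(units) / len(population))
    stratified.extend(rng.sample(units, k))

# Systematic random sampling: random start, then every step-th unit.
step = len(population) // 10
start = rng.randrange(step)
systematic = population[start::step]

# Cluster random sampling: pick whole clusters at random, keep all their units.
clusters = [population[i:i + 10] for i in range(0, 100, 10)]
chosen = rng.sample(clusters, 2)
cluster_sample = [unit for cluster in chosen for unit in cluster]

print(simple, stratified, systematic, cluster_sample, sep="\n")
```

Here the stratified sample always contains 6 "north" and 4 "south" units, while the simple random sample may over- or under-represent either stratum by chance.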

16
Q

Why is random assignment used?

A

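Random assignment is used in experimental studies so that each participant has an equal chance of being assigned to any treatment group. This makes the groups comparable at the outset, so differences in the response can be attributed to the treatment rather than to how the groups were formed.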
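
One way random assignment might be carried out is to shuffle the participants and deal them out to the groups in turn. The sketch below assumes 12 hypothetical participants and two groups; the function name `assign_groups` is invented for illustration.

```python
import random

def assign_groups(participants, groups, seed=None):
    """Shuffle participants, then deal them out to the groups in turn."""
    rng = random.Random(seed)
    shuffled = list(participants)
    rng.shuffle(shuffled)
    assignment = {g: [] for g in groups}
    for i, person in enumerate(shuffled):
        assignment[groups[i % len(groups)]].append(person)
    return assignment

# Hypothetical participants P1..P12 and two hypothetical groups.
participants = [f"P{i}" for i in range(1, 13)]
result = assign_groups(participants, ["control", "treatment"], seed=1)
for group, members in result.items():
    print(group, members)
```

Dealing out after a shuffle keeps the group sizes equal, which a coin flip per participant would not guarantee.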

17
Q

Why may random sampling not be useful in an experimental study?

A

Random sampling may not be useful in an experimental study if the sample is already limited to a specific population that has met certain inclusion criteria

18
Q

What is Random Sampling?

A

Random Sampling is a research design sampling method in which every element of the population has an equal chance of being selected

19
Q

What factors determine the choice of sampling method?

A

The choice of sampling method depends on:
- your population
- budgetary constraints

20
Q

What is the objective of using Random Sampling methods?

A

To help minimise errors caused by chance, bias, or confounding when making inferences

21
Q

What is the significance of variability in statistics?

A

Variability refers to the amount of dispersion or spread of the data around the mean (average). The degree of variability provides important information about the data and helps in drawing meaningful conclusions.
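
A small illustration of why spread matters: the two hypothetical data sets below have the same mean but very different variability, so the mean alone would hide the difference. The numbers are invented for the example.

```python
import statistics

# Two hypothetical samples with the same mean but different spread.
a = [9, 10, 10, 11, 10]
b = [2, 18, 10, 1, 19]

mean_a = statistics.mean(a)
mean_b = statistics.mean(b)

# Sample standard deviation (divides by n - 1).
sd_a = statistics.stdev(a)
sd_b = statistics.stdev(b)

print(f"a: mean={mean_a}, sd={sd_a:.2f}")
print(f"b: mean={mean_b}, sd={sd_b:.2f}")
```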

22
Q

Can the number of bricks eaten last week be used in statistical analysis?

A

No, as it is not a relevant or meaningful variable

23
Q

Can the number of grapes eaten last week be used in statistical analysis?

A

Yes, as it is a relevant and meaningful variable

24
Q

Which of the following are statistical questions?

How much does my pet grapefruit weigh?
What was the average score on the essay on the Practising Scientist last year?
How many teeth does my mother have?
How much time do members of a family spend on their phones (screen time) at dinner?
How many times have I watched Lord of the Rings?

A

Statistical questions:
- What was the average score on the essay on the Practising Scientist last year?
- How much time do members of a family spend on their phones (screen time) at dinner?

Not statistical questions:
- How much does my pet grapefruit weigh?
- How many teeth does my mother have?
- How many times have I watched Lord of the Rings?

25
Q

What is the difference between descriptive and inferential statistics?

A

Descriptive statistics: summarising and exploring a collection of data, either graphically or numerically.
Inferential statistics: estimating parameters and their confidence limits, as well as hypothesis testing.

26
Q

What are some examples of descriptive statistics?

A
  • measures of central tendency (e.g., mean, median, mode)
  • measures of variability (e.g., range, standard deviation)
  • graphical displays (e.g., histograms, box plots, scatter plots)
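
The numerical summaries above can be computed with Python's standard `statistics` module. The data values are hypothetical and chosen only to keep the arithmetic easy to check by hand.

```python
import statistics

# Hypothetical sample of six observations.
data = [2, 3, 3, 5, 7, 10]

# Measures of central tendency.
mean = statistics.mean(data)      # sum / count
median = statistics.median(data)  # middle value (average of the two middle values here)
mode = statistics.mode(data)      # most frequent value

# A simple measure of variability.
value_range = max(data) - min(data)

print(mean, median, mode, value_range)
```
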
27
Q

What are some examples of inferential statistics?

A
  • hypothesis testing (e.g., t-tests, ANOVA, regression analysis)
  • confidence interval estimation
  • model selection
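
As a rough sketch of confidence interval estimation, the code below computes an approximate 95% interval for a mean. It assumes a normal approximation (reasonable only for larger samples; a t-distribution would normally be used for small ones), and the 49 data values are generated purely for illustration.

```python
import math
import statistics

# Hypothetical sample: the values 10..16, each repeated seven times.
sample = [10 + (i % 7) for i in range(49)]

n = len(sample)
mean = statistics.mean(sample)
sd = statistics.stdev(sample)          # sample standard deviation
standard_error = sd / math.sqrt(n)

# Assumption: normal approximation, so the multiplier is about 1.96.
z = statistics.NormalDist().inv_cdf(0.975)
margin = z * standard_error
lower, upper = mean - margin, mean + margin

print(f"mean = {mean}, 95% CI approx. ({lower:.2f}, {upper:.2f})")
```
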
28
Q

What are the two types of variables in data?

A
  • numerical
  • categorical
29
Q

What are numerical variables?

A

Variables that
- take on numerical values
- can sensibly have arithmetic operations performed on them
- are quantitative

30
Q

What are some examples of numerical variables?

A
  • age
  • height
  • weight
  • income
  • temperature
31
Q

What are categorical variables?

A

Variables that
- take on a limited number of distinct categories
- cannot sensibly have arithmetic operations performed on them (although the categories may be coded with numbers)
- are qualitative
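
To see why arithmetic on coded categories is not sensible, consider a hypothetical coding of favourite colour as 1, 2 and 3. The mean of the codes can be computed, but it means nothing; counting each category is the appropriate summary. The coding scheme and responses below are invented for the example.

```python
from collections import Counter
import statistics

# Hypothetical coding: 1 = red, 2 = green, 3 = blue (labels only, no order).
codes = {1: "red", 2: "green", 3: "blue"}
favourite = [1, 3, 3, 2, 3, 1]

# The arithmetic runs, but the result does not describe any colour.
meaningless_mean = statistics.mean(favourite)

# Counting how often each category occurs is the sensible summary.
counts = Counter(codes[c] for c in favourite)
most_common = counts.most_common(1)[0][0]

print(meaningless_mean, dict(counts), most_common)
```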

32
Q

What are some examples of categorical variables?

A
  • gender
  • ethnicity
  • occupation
  • education level
  • favourite colour
33
Q

What are the subtypes of numerical variables?

A
  • continuous
  • discrete
34
Q

What is a continuous variable?

A

A numerical variable that can take on any value within a given range

35
Q

What is a discrete variable?

A

A numerical variable that can only take on specific, distinct values, often represented by integers

36
Q

What are the subtypes of categorical variables?

A
  • regular categorical
  • ordinal
  • binary
37
Q

What is a regular categorical variable?

A

A categorical variable (often called a nominal variable) that can take on one of several distinct categories, but these categories have no inherent order or ranking

38
Q

What is an ordinal variable?

A

A categorical variable in which the categories have an inherent ordering or ranking

39
Q

What is a binary variable?

A

A categorical variable that can take on only two categories, such as
- yes or no
- true or false
- male or female

40
Q

How can errors in numeric data be identified and corrected in Excel?

A
  • plot each numerical variable as a column chart or scatterplot and look for outliers
  • the row number is on the x-axis, so you can find the row and check any suspicious value
  • alternatively, sort each numerical column and inspect the extreme ends for potential errors
  • once errors are identified, correct them directly in Excel
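
The "sort and look at the extremes" check is not specific to Excel. As a sketch of the same idea in Python, the code below keeps each value's row number while sorting, so a suspicious value can be traced back to its row. The height values are hypothetical.

```python
# Hypothetical column of heights in cm; 1720.0 and 17.4 look like typing errors.
heights = [168.0, 172.5, 1720.0, 165.2, 180.1, 17.4, 175.0]

# Pair each value with its row number (1-based, as in a sheet without a header).
rows = list(enumerate(heights, start=1))

# Sort by value and inspect both extremes.
by_value = sorted(rows, key=lambda pair: pair[1])
smallest = by_value[:2]
largest = by_value[-2:]

print("smallest:", smallest)
print("largest:", largest)
```
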
41
Q

What is alphanumeric data?

A

Data that consists of both letters and numbers

42
Q

How can errors in alphanumeric data be identified and corrected in Excel?

A
  • sort and check the values, although this may miss subtle differences such as changes of case, missing letters, and extra spaces
  • use a PivotTable to list each distinct value with its count, so that inconsistent entries stand out and can be corrected
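
The counting step can be imitated outside Excel as well. The sketch below tallies each distinct text value, much as a PivotTable count would, and shows that trimming spaces and lower-casing removes some variants but not a missing letter. The species names are hypothetical.

```python
from collections import Counter

# Hypothetical text column with a case change, a trailing space and a missing letter.
species = ["Oak", "oak", "Oak ", "Birch", "Brch", "Birch", "Oak"]

# Count each distinct raw value; variants show up as separate entries.
raw_counts = Counter(species)

# Trimming spaces and lower-casing merges some variants automatically.
cleaned = [s.strip().lower() for s in species]
clean_counts = Counter(cleaned)

print(raw_counts)
print(clean_counts)  # "brch" remains and still needs a manual fix
```
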
43
Q

How are missing values represented in Excel?

A

By blank (empty) cells

44
Q

What code is used for missing values in R?

A

In R, the code NA is used for missing values and should not be used for anything else
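
For comparison, here is a minimal Python sketch of handling both conventions when reading exported data: a blank cell and the text NA are both treated as missing and stored as `None`. The CSV content and the helper name `parse` are assumptions made for the example.

```python
import csv
import io

# Hypothetical CSV export: a blank cell and NA both mean "missing".
text = "id,weight\n1,4.2\n2,\n3,NA\n4,5.0\n"

def parse(cell):
    """Return a float, or None when the cell is blank or NA."""
    cell = cell.strip()
    return None if cell in ("", "NA") else float(cell)

rows = list(csv.DictReader(io.StringIO(text)))
weights = [parse(row["weight"]) for row in rows]

# Summaries must skip the missing values explicitly.
observed = [w for w in weights if w is not None]
mean_weight = sum(observed) / len(observed)

print(weights, mean_weight)
```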

45
Q

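What are the $ sign rules for cell references in Excel?

A

$A1 = column fixed (always column A); the row changes when the formula is copied
A$1 = row fixed (always row 1); the column changes when the formula is copied
$A$1 = column and row both fixed (always the same cell)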
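
The effect of the $ sign can be imitated with a small Python function that shifts a reference the way copying a formula would. This is an illustrative sketch, not Excel's own logic: the function name `shift_reference` is invented, and it only handles single-letter columns without checking for overflow past column Z.

```python
import re

def shift_reference(ref, rows=0, cols=0):
    """Mimic how a reference changes when a formula is copied by (rows, cols)."""
    match = re.fullmatch(r"(\$?)([A-Z])(\$?)(\d+)", ref)
    if match is None:
        raise ValueError(f"unsupported reference: {ref}")
    col_lock, col, row_lock, row = match.groups()
    if not col_lock:               # no $ before the letter: column moves
        col = chr(ord(col) + cols)
    if not row_lock:               # no $ before the number: row moves
        row = str(int(row) + rows)
    return f"{col_lock}{col}{row_lock}{row}"

# Copy each reference two rows down and one column to the right.
for ref in ["A1", "$A1", "A$1", "$A$1"]:
    print(ref, "->", shift_reference(ref, rows=2, cols=1))
```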