Summarising Data Flashcards

- Understanding the importance of summarising data in neuroscience. - Clear demonstration of different types of data - How you can summarise data? - Get a clear idea of data variability & how you could Identify outliers in your data? - Demonstrate ways of managing my research data. - Apply statistical software (e.g., STATA) to carry out exploratory analysis and display a dataset (histogram, box plot, cumulative frequency) presentation.

1
Q

Why is it important to summarise data in neuroscience?

A

Using statistics to summarise data allows it to be presented and communicated, as well as quantifying the variation and uncertainty in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

CLARITY AND UNDERSTANDING

A

Distil large amounts of complex data into an understandable form. Allows easier identification of patterns and insights.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

HYPOTHESIS TESTING

A

Allows focus on the essential data to allow more accurate conclusions to be formed about whether hypotheses are supported or refuted.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

EFFICIENT COMMUNICATION

A

Summarised data can be clearly and concisely presented for sharing in journals, conferences etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

RESOURCE MANAGEMENT

A

Focussing on the most relevant data allows time, resources and funding to be focussed to ensure that research is impactful.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

META-ANALYSES AND GENERALISATION

A

Summarised data is needed to combine results from multiple studies to increase statistical power and generalisability of findings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

DATA INTEGRITY AND REPRODUCIBILITY

A

Clearly documented summarised data allows researchers to replicate experiments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

DEVELOPMENT OF THEORIES

A

Summarised data allows identification of consistent patterns, allowing theoretical models to be developed and refined.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are nominal data?

A

Categorical data without a specific order e.g., blood types.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are ordinal data?

A

Categorical data with a meaningful order but no consistent interval between categories e.g., stages of cancer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are discrete data?

A

Countable values (typically integers) e.g., number of hospital visits.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are continuous data?

A

Values can fall anywhere within a specified rang e.g., blood pressure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are interval data?

A

Numerical data with meaningful intervals between values but without a true zero point e.g., temperature.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are ratio data?

A

Numerical data with equal intervals and a true zero point which allows for the calculation of ratios e.g., height, weight.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

MEASURES OF CENTRAL TENDENCY

A

Mean

Median

Mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

MEASURES OF SPREAD

A

Range

Variance (average of squared differences from the mean)

Standard deviation (square root of variance)

IQR (range between 25th and 75th percentile)

17
Q

MEASURES OF SHAPE

A

Skewness (positive - tail on right, negative - tail on left)

Kurtosis (tailedness of data distribution)

18
Q

GRAPHICAL SUMMARIES

A

Histograms (frequency distributions)

Box plots (median, quartiles, potential outliers)

Scatter plots (relationship between 2 quantitative variables)

Bar charts (categorical data)

19
Q

SUMMARY TABLES

A

Frequency tables (number of occurrences in each category)

Contingency tables (frequency distribution of variables - shows relationship between them)

20
Q

CORRELATION AND ASSOCIATION

A

Correlation Coefficient (measures strength and direction of relationship between 2 variables)

Covariance (indicates direction of linear relationship between 2 variables)

21
Q

REGRESSION MODELS

A

Linear Regression (models relationship between dependent and independent variable(s) by fitting a linear equation to data - predicts value of dependent variable based on value(s) of independent variable(s))

Logistic Regression (models probability of event using binary outcome variables)

Poisson Regression (counts data and rates - for example the number of event occurrences within a fixed period)

22
Q

LONGITUDINAL DATA ANALYSIS

A

Mixed-Effects Models (accounts for fixed and random effects - useful for measurements taken on the same subjects over time)

Generalised Estimating Equations (estimates parameters of a generalised linear model with a possible unknown correlation between outcomes)

23
Q

SURVIVAL ANALYSIS

A

Kaplan-Meier Estimator (estimates survival function from lifetime data)

Cox Proportional Hazards Model (assesses effect of variables on survival time and estimates hazard ratios)

24
Q

MULTIVARIATE ANALYSIS

A

Principle Component Analysis (reduces dimensionality of data whilst retaining most of the variance - identifies patterns and simplifies datasets)

Factor Analysis (identifies underlying relationships between variables by grouping them into factors)

25
BAYESIAN METHODS
Bayesian Inference (updates probability of a hypothesis as more evidence becomes available) Markov Chain Monte Carlo (samples from a probability distribution to perform Bayesian inference)
26
ADVANCED VISUALISATION TECHNIQUES
Heatmaps (show intensity of data points) Network Analysis (visualises relationships between entities)
27
What type of outlier falls outside inner fences?
A minor outlier.
28
What type of outlier falls outside outer fences?
A major outlier.
29
How do you calculate inner fence boundaries?
(Q3-Q1) x 1.5 Add to Q3 Subtract from Q1
30
How do you calculate outer fence boundaries?
(Q3-Q1) x 3 Add to Q3 Subtract from Q1
31
Why would you transform data?
To transform the data into a different scale to allow interpretation and/or statistical analysis.
32
What reasons are there for transforming data?
1. To improve normality (to allow use of parametric tests) 2. To reduce skewness 3. To linearise the relationship between 2 variables 4. To make multiplicative relationships additive
33
What are some commonly used data transformations?
1. Natural logarithm transformations 2. Power transformations
34
When would you perform a log transformation?
If the data are positive values and positively skewed - log transformations stretch the scale at the lower end and compress the scale at the upper end.
35