Terms to study Flashcards by Damien lybrand

z-test

z-test A z-test is a statistical test to determine whether two population means are different when the variances are known and the sample size is large. A z-test is a hypothesis test in which the z-statistic follows a normal distribution. A z-statistic, or z-score, is a number representing the result from the z-test.

How well did you know this?

Not at all

Perfectly

Sample

Sample In statistics, a sample is an analytic subset of a larger population. The use of samples allows researchers to conduct their studies with more manageable data and in a timely manner. Randomly drawn samples do not have much bias if they are large enough, but achieving such a sample may be expensive and time-consuming.

How well did you know this?

Not at all

Perfectly

p-value

p-value The p-value, or probability value, tells you how likely it is that your data could have occurred under the null hypothesis. It does this by calculating the likelihood of your test statistic, which is the number calculated by a statistical test using your data.

How well did you know this?

Not at all

Perfectly

The symbol ‘μ’ represents the population mean.

How well did you know this?

Not at all

Perfectly

An F-value is the ratio of two variances, or technically, two mean squares. Mean squares are simply variances that account for the degrees of freedom (DF) used to estimate the variance. F-values are the test statistic for F-tests. Learn more about Test Statistics.

How well did you know this?

Not at all

Perfectly

One-sample t-test

One-sample t-test
A one sample test of means compares the mean of a sample to a pre-specified value and tests for a deviation from that value. For example we might know that the average birth weight for white babies in the US is 3,410 grams and wish to compare the average birth weight of a sample of black babies to this value.

How well did you know this?

Not at all

Perfectly

Population

Population
What Is Population? A population is the complete set group of individuals, whether that group comprises a nation or a group of people with a common characteristic. In statistics, a population is the pool of individuals from which a statistical sample is drawn for a study.

How well did you know this?

Not at all

Perfectly

Critical region

Critical region
A critical region, also known as the rejection region, is a set of values for the test statistic for which the null hypothesis is rejected. i.e. if the observed test statistic is in the critical region then we reject the null hypothesis and accept the alternative hypothesis.

How well did you know this?

Not at all

Perfectly

σ
The symbol ‘σ’ represents the population standard deviation. The term ‘sqrt’ used in this statistical formula denotes square root. The term ‘Σ ( Xi – μ )2^ used in the statistical formula represents the sum of the squared deviations of the scores from their population mean.

How well did you know this?

Not at all

Perfectly

Z
A z-score, or z-statistic, is a number representing how many standard deviations above or below the mean population the score derived from a z-test is. Essentially, it is a numerical measurement that describes a value’s relationship to the mean of a group of values.

How well did you know this?

Not at all

Perfectly

Two-sample independent t-test

The independent t-test, also called the two sample t-test, independent-samples t-test or student’s t-test, is an inferential statistical test that determines whether there is a statistically significant difference between the means in two unrelated groups.

How well did you know this?

Not at all

Perfectly

Alpha level

In statistical tests, statistical significance is determined by citing an alpha level, or the probability of rejecting the null hypothesis when the null hypothesis is true. For this example, alpha, or significance level, is set to 0.05 (5%).

How well did you know this?

Not at all

Perfectly

The sum of squares is a measure of deviation from the mean. In statistics, the mean is the average of a set of numbers and is the most commonly used measure of central tendency. The arithmetic mean is simply calculated by summing up the values in the data set and dividing by the number of values.

How well did you know this?

Not at all

Perfectly

The correlation coefficient (r) is a statistic that tells you the strength and direction of that relationship. It is expressed as a positive or negative number between -1 and 1. The value of the number indicates the strength of the relationship: r = 0 means there is no correlation.

How well did you know this?

Not at all

Perfectly

Repeated measures t-test

The t-test assesses whether the mean scores from two experimental conditions are statistically different from one another. A repeated-measures t-test (also known by other names such as the ‘paired samples’ or ‘related’ t-test) is what you should use in situations when your design is within participants.

How well did you know this?

Not at all

Perfectly

Sample size

Study These Flashcards

Sample size

Sample size refers to the number of participants or observations included in a study. This number is usually represented by n. The size of a sample influences two statistical properties: 1) the precision of our estimates and 2) the power of the study to draw conclusions.

Null hypothesis

Study These Flashcards

Null hypothesis

The null hypothesis is a typical statistical theory which suggests that no statistical relationship and significance exists in a set of given single observed variable, between two sets of observed data and measured phenomena.

Study These Flashcards

The sample standard deviation (s) is the square root of the sample variance and is also a measure of the spread from the expected values. In its simplest terms, it can be thought of as the average distance of the observed data from the expected values.

Study These Flashcards

One-way ANOVA

Study These Flashcards

One-way ANOVA

One-Way ANOVA (“analysis of variance”) compares the means of two or more independent groups in order to determine whether there is statistical evidence that the associated population means are significantly different. One-Way ANOVA is a parametric test.

Population parameter vs sample statistic

Study These Flashcards

Population parameter vs sample statistic

sample statistic. When you collect data from a population or a sample, there are various measurements and numbers you can calculate from the data. A parameter is a measure that describes the whole population. A statistic is a measure that describes the sample.

Alternate hypothesis

Study These Flashcards

Alternate hypothesis

The alternative hypothesis is a statement used in statistical inference experiment. It is contradictory to the null hypothesis and denoted by Ha or H1. We can also say that it is simply an alternative to the null. In hypothesis testing, an alternative theory is a statement which a researcher is testing.

x ̅

Study These Flashcards

x ̅

X-bar in statistics is a symbol for the sample mean. Given a sample of n observations of numbers, the sample mean is found by adding up all of the observations, then dividing by the total number of observations (n).

Study These Flashcards

The “k” in that formula is the number of cell means or groups/conditions.

(Pearson) Correlation

(Pearson) Correlation The Pearson correlation coefficient (r) is the most common way of measuring a linear correlation. It is a number between –1 and 1 that measures the strength and direction of the relationship between two variables. Pearson correlation coefficient (r) Correlation type.

Standard deviation and Standard Error

Standard deviation and Standard Error Standard deviation describes variability within a single sample, while standard error describes variability across multiple samples of a population. Standard deviation is a descriptive statistic that can be calculated from sample data, while standard error is an inferential statistic that can only be estimated.

Normal distribution

Normal distribution The normal distribution is a continuous probability distribution that is symmetrical around its mean, most of the observations cluster around the central peak, and the probabilities for values further away from the mean taper off equally in both directions.

df Degrees of freedom refers to the maximum number of logically independent values, which are values that have the freedom to vary, in the data sample. Degrees of freedom are commonly discussed in relation to various forms of hypothesis testing in statistics, such as a chi-square.

(Linear) Regression

(Linear) Regression Linear regression analysis is used to predict the value of a variable based on the value of another variable. The variable you want to predict is called the dependent variable. The variable you are using to predict the other variable's value is called the independent variable.

Statistical significance

Statistical significance Statistical significance is a determination made by an analyst that the results in the data are not explainable by chance alone. Statistical hypothesis testing is the method by which the analyst makes this determination.

Type 1 Error/Type 2 Error

Type 1 Error/Type 2 Error Simply put, type 1 errors are “false positives” – they happen when the tester validates a statistically significant difference even though there isn't one. Source. Type 1 errors have a probability of “α” correlated to the level of confidence that you set. A type II error produces a false negative, also known as an error of omission. For example, a test for a disease may report a negative result when the patient is infected. This is a type II error because we accept the conclusion of the test as negative, even though it is incorrect.

t The t-value measures the size of the difference relative to the variation in your sample data. Put another way, T is simply the calculated difference represented in units of standard error. The greater the magnitude of T, the greater the evidence against the null hypothesis.

Terms to study Flashcards

(32 cards)