Everything Flashcards

(252 cards)

1
Q

What is raw data?

A

Unprocessed data that has just been collected and needs to be ordered, grouped, rounded, or cleaned.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define qualitative data.

A

Non-numerical, descriptive data such as eye/hair colour or gender.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What type of data is easier to analyze, qualitative or quantitative?

A

Quantitative data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Give an example of quantitative data.

A

Height, weights, marks in an exam.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is discrete data?

A

Data that only takes particular values, such as shoe size or number of people.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is continuous data?

A

Data that can take any value, such as height or weight.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define categorical data.

A

Data that can be sorted into non-overlapping categories, such as gender.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is ordinal data?

A

Quantitative data that can be given an order or ranked on a rating scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does bivariate data involve?

A

Measuring two variables, which can be qualitative or quantitative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is multivariate data?

A

Data made up of more than two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the purpose of grouping data?

A

To make it easier to spot patterns and see how the data is distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

True or False: Discrete data can be grouped into overlapping classes.

A

False.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a primary data source?

A

Data that you have collected yourself or someone has collected on your behalf.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define secondary data.

A

Data that has already been collected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a population in statistics?

A

Everyone or everything that could be involved in the investigation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a census?

A

A survey of the entire population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Fill in the blank: A _______ is a smaller number from the population that you actually survey.

A

Sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a sampling frame?

A

A list of all the members of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is a biased sample?

A

A sample that does not represent the population fairly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Define random sampling.

A

Every item/person in the population has an equal chance of being selected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is stratified sampling?

A

Sampling where the size of each group in the sample is in proportion to the sizes of those groups in the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is systematic sampling?

A

Choosing items in the population at regular intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Define cluster sampling.

A

The population is divided into natural groups, and groups are chosen at random with every member sampled.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is quota sampling?

A

Population is grouped by characteristics and a fixed amount is sampled from every group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Fill in the blank: Opportunity sampling uses the people/items that are _______.
Available at the time.
26
What is judgment sampling?
When the researcher uses their own judgment to select a sample they think will represent the population.
27
What does the Petersen Capture-Recapture method estimate?
The size of large or moving populations.
28
What is an explanatory variable?
The variable that is changed in an experiment.
29
Define response variable.
The variable that is measured in an experiment.
30
What is a sample size?
Size large enough and representative of the population.
31
What is an experiment?
Used when a researcher examines how changes in one variable affect another.
32
Define Explanatory (Independent) Variable.
The variable that is changed.
33
Define Response (Dependent) Variable.
The variable that is measured.
34
What are Extraneous Variables?
Variables not of interest but that could affect the result of your experiment.
35
What characterizes Laboratory Experiments?
Researcher has full control over variables; conducted in a lab or similar environment.
36
Give an example of a Laboratory Experiment.
Measuring reaction times of people of different ages.
37
What is the Explanatory variable in the laboratory example?
Age.
38
What is the Response variable in the laboratory example?
Reaction time.
39
List some Extraneous variables in laboratory experiments.
* Gender * Health condition * Fitness level.
40
What are advantages of Laboratory Experiments?
* Easy to replicate * Extraneous variables can be controlled.
41
What is a disadvantage of Laboratory Experiments?
People may behave differently under test conditions than in real life.
42
What are Field Experiments?
Carried out in the everyday environment with some control over variables.
43
Give an example of a Field Experiment.
Testing new methods of revision.
44
What is the Explanatory variable in the field experiment example?
Method of revision.
45
What is the Response variable in the field experiment example?
Results in exam.
46
List some Extraneous variables in field experiments.
* Amount of revision pupils do * Ability of pupils.
47
What are the advantages of Field Experiments?
More accurate; reflects real life behaviour.
48
What is a disadvantage of Field Experiments?
Cannot control extraneous variables.
49
What are Natural Experiments?
Carried out in everyday environments with little control over variables.
50
Give an example of a Natural Experiment.
The effect of education on level of income.
51
What is the Explanatory variable in the natural experiment example?
Level of education.
52
What is the Response variable in the natural experiment example?
Income.
53
List some Extraneous variables in natural experiments.
* IQ * Other skills individuals may have * Personal circumstances.
54
What is an advantage of Natural Experiments?
Reflects real life behaviour.
55
What are disadvantages of Natural Experiments?
* Low validity * Difficult to replicate.
56
What is a Simulation?
A way to model random events using random numbers and previously collected data.
57
What are the steps in conducting a Simulation?
* Choose a suitable method for getting random numbers * Assign numbers to the data * Generate random numbers * Match random numbers to outcomes.
58
What is a Questionnaire?
A set of questions used to obtain data from the population/sample.
59
What types of questions can be included in a Questionnaire?
* Open questions * Closed questions.
60
What are features of a good questionnaire?
* Easy to understand * Uses simple language * Avoids leading questions.
61
What is a problem with Questionnaires?
Non-response when people do not respond to the questionnaire.
62
What is the Random Response Method?
Uses a random event to decide how to answer a question ensuring anonymity.
63
What is a Pilot Study?
A small-scale replica of the study to test the design and methods of the questionnaire.
64
What is an Interview?
Where you question each person individually, involving specific questions or topics.
65
What are Outliers?
Values that do not fit in with the pattern or trend of the data.
66
What does Cleaning Data involve?
* Identifying and correcting/removing incorrect data values or outliers * Putting all data in the same format.
67
What is a Control Group?
Used in an experiment to ensure that the treatment given is causing the experimental results.
68
What are Matched Pairs?
Two groups of equally matched people used to test the effect of a particular factor.
69
Define Hypothesis.
A statement that can be tested by collecting and analysing data.
70
What are the stages of an Investigation?
* Planning * Collecting Data * Processing and Representing data * Interpreting Results.
71
What is the first stage of an investigation?
Planning ## Footnote In this stage, you choose a hypothesis, decide what data to collect (variables), and determine how to record the data (data collection tables).
72
What does the collecting data stage involve?
Choosing data sources (primary/secondary), collection methods (questionnaire/interviews), and control factors. ## Footnote This stage is crucial for ensuring accurate and relevant data is gathered for analysis.
73
List the stages of an investigation.
* Planning * Collecting Data * Processing and Representing Data * Interpreting Results * Evaluating Methods
74
What are databases in the context of data representation?
Tables with a collection of data, often secondary data that is available online. ## Footnote These databases usually contain real-life statistics and are essential for interpreting data.
75
What is a common inconsistency found in data tables?
Percentages do not add up to 100% due to rounding errors. ## Footnote This is often encountered when individual percentages for columns/rows in tables have been rounded.
76
What type of data do two-way tables represent?
Bivariate data, which has information in two categories and two variables. ## Footnote They are useful for analyzing relationships between two different data sets.
77
What is a pictogram?
A representation using pictures or symbols to show a particular amount of data. ## Footnote It always includes a key to indicate the amount each symbol represents.
78
What are the key features of simple bar charts?
* Bars are equal width * Equal gaps between bars * Frequency on y-axis
79
What distinguishes multiple bar charts from simple bar charts?
They can compare two or more sets of data with more than one bar for each class represented by different colours. ## Footnote This allows for a clearer comparison between different data categories.
80
How are composite bar charts structured?
Single bars split into different sections for each category, used to compare different times/days/years. ## Footnote The frequency of each component is calculated by subtracting the upper frequency of that component from the lower frequency.
81
Define stem and leaf diagrams.
A method of organizing data that retains all original data while presenting it simply, showing the shape of the distribution. ## Footnote Each value is split into a 'stem' (first digits) and 'leaf' (last digit).
82
What is the purpose of pie charts?
To display data showing how something is shared or divided into categories, with each sector representing a proportion of the total data. ## Footnote The angles in a pie chart must add up to 360 degrees.
83
True or False: Comparative pie charts can be used to compare two sets of data of different sizes.
True
84
What do population pyramids show?
Distribution of ages in a population, either in numbers or proportions/percentages. ## Footnote They are used to compare two sets of data, usually genders or geographical areas.
85
What do choropleth maps represent?
Geographical areas split into different regions that are shaded based on frequency. ## Footnote The darker the shading, the higher the frequency for that area.
86
What is cumulative frequency?
A running total of frequencies. ## Footnote It helps in understanding the total number of occurrences up to a certain point in a dataset.
87
What is the formula for frequency density in histograms?
Frequency Density = Frequency / Class Width ## Footnote This reflects the concentration of values within each range of the dataset.
88
How do you estimate the median from cumulative frequency diagrams?
Divide total frequency by 2, find that value on the y-axis, draw a horizontal line to the curve, and read off the value from the x-axis.
89
What is the formula to calculate frequency density?
FD = F/CW ## Footnote FD stands for Frequency Density, F is Frequency, and CW is Class Width.
90
What are the steps to draw a histogram?
* Calculate class widths for each class interval * Calculate frequency density for each class interval * Draw a suitable scale on y-axis labelled frequency density * Draw bars using frequency density data ## Footnote Remember that the bars have no gaps in between.
91
What is the shape of a distribution?
It can be positive, negative, or symmetrical.
92
What is the difference between a histogram and a frequency polygon?
A histogram uses bars, while a frequency polygon uses mid-points of class intervals plotted and joined with straight lines.
93
True or False: To compare histograms, they need to have different class intervals.
False ## Footnote They need to have the same class intervals and frequency density scales.
94
What is the mode in a dataset?
The value that appears the most.
95
How do you find the median in discrete data?
* Put the numbers in order from smallest to largest * Find the (n + 1)th value * If the position is a decimal, average the two middle values. ## Footnote n is the total frequency.
96
What is the formula for the mean?
𝑥̅ = ∑𝑥/𝑛 ## Footnote Where 𝑥̅ is the mean, ∑𝑥 is the sum of data values, and 𝑛 is the number of data values.
97
What is a weighted mean?
Used to combine different sets of data where one set is more important than another.
98
Fill in the blank: The _______ is the class with the highest frequency.
Modal Class
99
What can cause diagrams to be misleading?
* Shape of the diagram * Axes and scales ## Footnote Examples include scales not starting at zero, missing values, or unevenly scaled axes.
100
How do you estimate the median for grouped continuous data?
Use ½ n to find the median position and calculate using cumulative frequency.
101
What is the geometric mean?
The nth root of the product of all the values.
102
What happens to the mean if you add a value greater than the mean?
The mean increases.
103
What is the first step to transforming data?
Take away the same large number from all the values.
104
True or False: The median will always change if a new value is added.
False ## Footnote The median may stay the same if the added value is equal to the median.
105
What is a common error when drawing frequency polygons?
Not using midpoints.
106
What should you do if the median position is a decimal?
Find the two values around that position and divide by 2.
107
Fill in the blank: The sum of all values divided by the number of values is called the _______.
Mean
108
What is the formula for estimating the median using linear interpolation?
Add the lower bound for the class interval to the result of multiplying the frequency for the median class.
109
What happens to the mean if a value greater than the mean is added?
The mean increases.
110
What happens to the mean if a value less than the mean is removed?
The mean increases.
111
What happens to the mean if a value less than the mean is added?
The mean decreases.
112
What happens to the mean if a value greater than the mean is removed?
The mean decreases.
113
What is the mode?
The value that appears most frequently in the data.
114
List advantages of using the mode.
* Easy to use * Always a value in the data * Unaffected by extreme values * Can be used with quantitative and qualitative data
115
List disadvantages of using the mode.
* There may not be a mode or may be more than one mode * Cannot be used to calculate measures of spread * Not always representative of the data
116
What is the median?
The middle value when the data is ordered.
117
List advantages of using the median.
* Easy to find when data is in order * Unaffected by outliers/extreme values * Best to use with skewed data
118
List disadvantages of using the median.
* May not be a data value * Not always representative of the data
119
What is the mean?
The average of all the data values.
120
List advantages of using the mean.
* Uses all the data * Can be used to calculate standard deviation and skew
121
List disadvantages of using the mean.
* May not be a data value * Always affected by extreme values or outliers
122
What does the range measure?
How spread out the data is.
123
How is the range calculated?
Range = Largest Value - Smallest Value.
124
What is the Interquartile Range (IQR)?
The middle 50% of the data when in order.
125
How is IQR calculated?
IQR = Upper Quartile - Lower Quartile.
126
What is the Lower Quartile (LQ)?
The value ¼ of the way through the data.
127
What is the Upper Quartile (UQ)?
The value ¾ of the way through the data.
128
How do you find LQ and UQ for discrete data?
LQ = ¼ (n+1)th value, UQ = ¾ (n+1)th value.
129
What is the Interpercentile Range (IPR)?
The difference between two percentiles.
130
What are deciles?
Values that divide the data into 10 equal parts.
131
What is the Interdecile Range?
The difference between the first and ninth deciles.
132
What does standard deviation (SD) measure?
How far all the values are from the mean value.
133
What is the formula for standard deviation?
σ = √(1/n ∑(x - x̅)²) or σ = √(∑x²/n - (∑x)²/n²).
134
What is a box plot?
A graphical representation of data that shows its distribution.
135
What are the five pieces of information included in a box plot?
* Minimum Value * Lower Quartile (LQ) * Median * Upper Quartile (UQ) * Maximum Value
136
What are outliers?
Values that are far from the rest of the data.
137
How are outliers identified?
Values that are more than 1.5 x IQR above UQ or below LQ.
138
What is skewness?
Describes the shape of the distribution and how the data is spread out.
139
True or False: The mean is always affected by extreme values.
True.
140
Fill in the blank: The _______ is the average of all the data values.
[mean]
141
Fill in the blank: The _______ is the value that appears most frequently in the data.
[mode]
142
Fill in the blank: The _______ is the middle value when the data is ordered.
[median]
143
Fill in the blank: The difference between the largest and smallest values is called the _______.
[range]
144
Fill in the blank: The middle 50% of the data is represented by the _______.
[IQR]
145
What is the IQR used for?
Measure of spread ## Footnote IQR stands for Interquartile Range, which measures the middle 50% of data.
146
What does skewness describe?
The shape of the distribution and how the data is spread out.
147
What indicates positive skewness?
Most values are at the beginning of the data set with few higher values.
148
In positive skewness, what is the relationship between mean, median, and mode?
Mean > Median > Mode
149
What indicates negative skewness?
Most values are at the end of the data set with few lower values.
150
In negative skewness, what is the relationship between mean, median, and mode?
Mean < Median < Mode
151
What signifies a symmetrical distribution?
Mean = Median = Mode
152
What does a normal distribution look like on a box plot?
Median is halfway between LQ and UQ.
153
How is skewness calculated using a formula?
Skewness = 3(mean - median) / standard deviation
154
What does a positive skewness value indicate?
Positive skew.
155
What does a negative skewness value indicate?
Negative skew.
156
When comparing data sets, what measures should be used?
Average (mean/median/mode) and spread (range/IQR/SD) or skewness.
157
What does a lower standard deviation indicate?
Values are closer to the mean and therefore similar.
158
What are scatter diagrams used for?
To show if there is a relationship between two variables.
159
What is the explanatory variable in a scatter diagram?
The independent variable plotted on the x-axis.
160
What is the response variable in a scatter diagram?
The dependent variable plotted on the y-axis.
161
What indicates a positive correlation?
As one variable increases, so does the other.
162
What indicates a negative correlation?
As one variable increases, the other decreases.
163
What is a causal relationship?
When one variable causes a change in another.
164
What does the line of best fit (LOBF) represent?
A straight line drawn through the middle of the points on a scatter diagram.
165
In the equation of LOBF, what does 'a' represent?
The gradient.
166
In the equation of LOBF, what does 'b' represent?
The y-intercept.
167
What is the purpose of interpolation?
To make predictions within the range of data given.
168
What is the purpose of extrapolation?
To predict values outside of the range of values given.
169
How is Spearman’s Rank Correlation Coefficient (SRCC) calculated?
SRCC = 1 - (6 * ∑d²) / (n(n² - 1))
170
What does a value of SRCC near 1 indicate?
Strong positive correlation.
171
What does Pearson’s Product Moment Correlation Coefficient (PMCC) measure?
The strength of linear correlation between two variables.
172
What is a time series graph used for?
To spot trends over time.
173
What is plotted on the x-axis of a time series graph?
Time.
174
What is a time series?
A set of data collected over a period of time at equal intervals.
175
What is the purpose of time series graphs?
To spot trends, usually going up, down, or fluctuating.
176
What does a trend line show?
The general trend of the data.
177
What are moving averages?
An average worked out for a given number of successive observations.
178
Why are moving averages used?
To smooth out fluctuations and make the trend line more accurate.
179
What are seasonal variations?
A pattern that repeats at a specific point every cycle.
180
How is seasonal variation calculated?
Seasonal Variation = Actual Value - Trend Value.
181
What is the Estimated Mean Seasonal Variation (EMSV)?
The average of all the seasonal variations for the same point in each cycle.
182
How can future values be predicted in time series?
Using the trend line and estimated mean seasonal variations.
183
What is simple probability?
A measure of how likely an event is to happen.
184
How can probabilities be expressed?
As fractions, decimals, or percentages.
185
What is an outcome in probability?
A possible result of an experiment or trial.
186
What does P(event) represent?
The number of successful outcomes divided by the total number of outcomes.
187
What is expected frequency?
The number of times you expect an event to happen.
188
How is experimental probability estimated?
Using results of previous trials to predict future probabilities.
189
What is risk in probability?
The likelihood of a negative event occurring.
190
What are the two types of risk?
* Absolute Risk * Relative Risk
191
What does a sample space represent?
A list of all the possible outcomes.
192
What is a sample space diagram?
A table used to represent the outcomes of two events.
193
What is a Venn diagram?
Uses overlapping circles to represent all outcomes of two or three events.
194
What are mutually exclusive events?
Events that cannot happen at the same time.
195
What is the addition law in probability?
Used for events that are not mutually exclusive and can happen together.
196
What are independent events?
Events where the outcome of one does not affect the outcome of the other.
197
Fill in the blank: The formula for the probability of two mutually exclusive events A and B is P(A or B) = P(A) + P(B). What is the additional component for non-mutually exclusive events? _______
− P(A and B)
198
What are independent events?
Unconnected events where the outcome of one does not affect the other ## Footnote Example: Flipping a coin and rolling a dice.
199
What is the Multiplication Law for independent events A and B?
P(A and B) = P(A) × P(B) ## Footnote For 3 independent events A, B, and C: P(A and B and C) = P(A) × P(B) × P(C)
200
How do you calculate P(at least 1)?
P(at least 1) = 1 - P(none) ## Footnote This formula helps determine the probability of at least one occurrence.
201
What do tree diagrams represent?
Each branch shows an outcome and probabilities on branches add up to 1 ## Footnote Multiply along the branches for end results and add probabilities down columns.
202
What happens to the denominator in a tree diagram with replacement?
The denominator stays the same for the second set of branches ## Footnote The question indicates if the item has been replaced.
203
What is conditional probability?
The probability of one event affecting the chances of another ## Footnote Example: Taking a white ball first changes the probability of the second draw.
204
What notation is used for conditional probability?
P(B | A) ## Footnote It represents the probability of B given that A has happened.
205
What is the formula for conditional probability?
P(B | A) = P(A and B) / P(A) ## Footnote This can also be used to test if two events are independent.
206
What are simple index numbers used for?
To compare price changes over time ## Footnote They compare the price change of an item with its base year price.
207
What does an index number greater than 100 indicate?
The value has increased ## Footnote An index number less than 100 indicates a decrease.
208
What does the Retail Price Index (RPI) measure?
The rate of change of prices of everyday goods ## Footnote RPI is calculated monthly by comparing prices to the same month of the previous year.
209
What is the Consumer Price Index (CPI)?
Official measure of inflation used by the UK Government ## Footnote It does not include mortgage payments and is weighted to reflect consumer spending.
210
What does Gross Domestic Product (GDP) represent?
The value of goods and services produced in a country in a given time ## Footnote A fall in GDP for two successive quarters indicates a recession.
211
What are weighted index numbers?
They take into account proportions similar to the weighted mean ## Footnote Weightings reflect the importance of different items.
212
What do chain base index numbers compare?
Prices from each year with that of the previous year ## Footnote They show how values change from year to year.
213
What are crude rates?
Rates that tell how many times a particular event occurs per 1000 of the population ## Footnote Examples include crude birth and death rates.
214
What is the formula for calculating crude rates?
Crude Rate = (number of births/deaths / total population) × 1000 ## Footnote Crude rates can be misleading when comparing different age distributions.
215
What is a standard population?
A hypothetical population of 1000 used to represent the whole population ## Footnote It takes into account age, gender, and income distributions.
216
What does a standardized rate allow you to do?
Compare the same age group in different populations ## Footnote It uses the standard population for realistic comparisons.
217
What is a probability distribution?
A list of all possible outcomes with their expected probabilities ## Footnote Example: Flipping a fair coin results in heads or tails.
218
What is a binomial distribution?
A type of probability distribution with only two possible outcomes ## Footnote Examples include flipping a coin (heads or tails) or rolling a six (success or failure).
219
What conditions must be met for a binomial distribution?
* Fixed number of trials (n) * Each trial has 2 outcomes (success or failure) * Trials are independent * Probability of success is constant ## Footnote If these conditions are met, the binomial distribution is applicable.
220
How do you find probabilities using the binomial distribution?
Use (p + q)^n and identify the outcomes and their probabilities ## Footnote Expand (p + q)^n where n is the number of trials.
221
What is Pascal’s Triangle used for?
To find coefficients of a binomial distribution ## Footnote The coefficients follow the pattern of Pascal's triangle.
222
What is the probability of landing on 6 three times?
10 × (X Heads) × (X Tails) ## Footnote P(x) = ½ for Heads and ½ for Tails
223
What pattern do the coefficients of a binomial distribution follow?
Pascal’s triangle
224
What is the first row of Pascal’s triangle?
1
225
How do you find the numbers in Pascal's triangle?
By adding the 2 numbers directly above
226
What is the expansion of (p + q)^4?
1p^4 + 4p^3q^1 + 6p^2q^2 + 4pq^3 + 1q^4
227
What does the nCr button on a calculator represent?
N=number of trials and r=number of successes
228
How do you calculate the coefficient for 5 trials with 3 successes using nCr?
Type '5', 'nCr', '3', '=' to get 10
229
To find a range of probabilities, what should you do?
Work out their individual probabilities and then add them up
230
How do you calculate the probability of 'at least 1 success'?
Work out the probability of 0 successes and subtract from 1
231
What is the mean (or expected value) of the binomial distribution B(n, p)?
np
232
For B(6, ½), what is the mean?
3
233
What shape does a normal distribution curve have?
Bell-shaped
234
What does a larger standard deviation result in for a normal distribution curve?
A lower curve
235
What is the notation for a normal distribution?
N(μ, σ²)
236
What do μ and σ² represent in the normal distribution notation?
μ = mean, σ² = variance
237
What are the conditions for normal distribution?
* Data is continuous * Distribution is symmetrical * Mode, median, and mean are approximately equal
238
Approximately what percentage of data values lie within 1 standard deviation of the mean?
68%
239
Approximately what percentage of data values lie within 2 standard deviations of the mean?
95%
240
Approximately what percentage of data values lie within 3 standard deviations of the mean?
99.8%
241
For a data set with mean=30 and SD=3, what is the range for 68% of the sample?
27-33
242
How do you sketch a normal distribution?
Draw a bell-shaped curve centered on the mean and ending at 3 SD from the mean
243
What is the formula to calculate the number of SDs from the mean?
(value - mean) / standard deviation
244
What is the purpose of standardised scores?
To compare how far above or below the average individual values are
245
What does a positive standardised score indicate?
The value is above the mean
246
What does a negative standardised score indicate?
The value is below the mean
247
What does a standardised score of zero indicate?
The value is equal to the mean
248
What does quality assurance involve?
Checking samples to ensure products are of the same quality and standard
249
What is a control chart?
A time series chart used for quality assurance
250
What are the lines on a control chart?
* Target Value (middle line) * Upper and Lower Warning Lines (inner 2 lines) * Upper and Lower Action Limits (outer 2 lines)
251
What happens if a sample average/range is above/below the warning line?
Another sample is taken and checked for problems
252
What happens if a sample average/range is outside the action limits?
Production is stopped immediately and machinery is reset