Chance and data Flashcards

Question

Time series 2. Describe seasonal variation

Answer 1

(If both graphs have similarly shaped peaks) For both graphs, the shape of yearly patterns have jagged peaks which shows the same pattern of increase and decrease. This is because, the number of (y axis value) peaks in (Q3) and dips in (Q1) each year. (If both graphs have differently shaped peaks) For (graph 1), the shape of the yearly patterns have flat peaks and are not jagged like the peaks for (graph 2) which shows a different pattern of increase and decrease. This is because, (graph 1) data peaks in (Q4) and (Q1), whereas (graph 2) data only peaks in (Q3) each year. This may have occurred because... (see data limitations)

Answer 2

1. State what the unusual feature is 2. Explain feature using specific numbers and statistics from the graph Eg. There is a sharp increase in the middle of the graph. This is because the number of people going to Australia for holidays rose sharply during 2004-2005 and then remained elevated. 3. This may have occurred because... (see data limitations)

Answer 3

To draw a trend line, project lines across the top and bottom of the data, then split it in half (following the shape of previous years data) • I do/do not feel very confident in my prediction because... (see data limitations)

Answer 4

Bivariate data is when there is two pieces of numeric data about every individual in a sample or population. Bivariate data is plotted on a scatter graph to see if there is a relationship between them

Answer 5

(Solve for gradient of line of best fit) • Gradient = rise / run = __ / __ • By drawing a line of best fit and calculating the gradient, we can see how the average (x, y value) is (___) (y value unit) per (x value unit). This is because the gradient represents how many (y value) changes for every (x value unit) increase in (x value). (Count total number of points) • There are (__) points in total, halfway between these points is (half of first value) points up, which is (__) on the (x/y axis value depending on question) for (x/y axis value depending on question). This indicates the approximate median of (___).

Answer 6

1. Identify the problem 2. Explain why there is a problem Use specific numbers and statistics from the graph 3. Discuss improvements and assumptions to data

Answer 7

Draw line of best fit 1. State the relationship 2. Describe the relationship • State the variation around the trend line (remains consistent, increases/decreases) and how close points are to line of best fit 3. Unusual points 4. Groups Limitations of data

Answer 8

(If gradient is positive) • There appears to be a positive linear relationship. As (x axis value) increases, (y axis value) increases and vice versa. (If there is no clear line of best fit) • There appears to be no relationship between the (x axis value) and the (y axis value). This is seen by the way that the dots do not follow any line, and the (y axis values) for each (x axis value) has a large range from (__) to (__).

Answer 9

(If points are close to the line of best fit) • Most points are concentrated close to the line of best fit, which means there is a strong relationship between the (x axis value) and the (y axis value). This means the linear model is appropriate for (graph 1), and this data would be useful for calculating and predicting the (y axis value) from the (x axis value). (If points are not close to the line of best fit) • Most points are spread away from the line of best fit, which means there is a weak relationship between the (x axis value) and the (y axis value). There is increased variation/significant variation in the amount of the (y axis value) as the (x axis value) increases. This means the linear model is not appropriate for (graph 1), and a non-linear model may better represent the relationship between (x axis value) and (y axis value). This also means the data would not be useful for calculating and predicting the (y axis value) from the (x axis value).

Answer 10

• Unusual points are some distance away from most other points and the trendline. Sometimes they are valid, otherwise, they may be a result of measurement error, recording error or reversed coordinates. 1. State what the unusual points are using coordinates and specific numbers from the graph. 2. Explain why they are outliers 3. This may have occurred because... (see data limitations) Eg. There are 3 unusual values (outliers), one at $10 500, one at $16 000 and one at $17 000. These 3 points are much higher than the others. This would need to be investigated, as they may be due to brand, mileage, wear and tear, etc.

Answer 11

1. State where points are clustered according to the (x axis value) and (y axis value) 2. Explain why these values may be common 3. This may have occurred because... (see data limitations)

Answer 12

• Consider what factors affect results of data. (maybe annual changes, seasonal changes due to weather, weekly changes due to weekends, maybe daily changes due to night/day light or enviromental conditons) * Consider the size of the data sample and whether the data accurately represents the entire population. * Consider the conditions of the 'test' for each sample and whether the test accurately represents real life. * Consider whether there is a bias of samples in data. * Consider outliers that may have been wrongfully considered. * Consider whether the trend is consistent

Answer 13

* Select equal numbers of males and females from results collected so far (if not equal numbers were used) * Could use percentages or relative frequency on the vertical scale

Chance and data Flashcards

(37 cards)