U2 + U3 - Processing and representing data Flashcards

1
Q

What can continuous data be represented by?

A
  • Histograms
  • Cumulative frequency curves
  • Line graphs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What can discrete data be represented by?

A
  • Bar charts

- CF step polygons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What can categorical data be represented by?

A
  • Frequency tables (normal freq tables, relative freq tables, cf tables)
  • Pie charts
  • Bar charts
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What can ordinal data be represented by?

A
  • Bar charts
  • Pie charts
  • Tables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What do you need to ensure when drawing a pictogram? (4)

A
  • Each picture is the same size
  • The picture can be divided easily to show different frequencies
  • The spacing between the pictures is the same in each row
  • You write a key to show what each symbol represents
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

2 properties of a bar chart

A
  • Bars are equal width with equal spaces between them

- The height of the bar represents the frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Differences (2) between histograms and bar charts

A
  • The bars of a bar chart don’t touch, but in a histogram, the bars touch because the data is continuous
  • You can draw on a frequency polygon using the midpoint of each class on a histogram
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why are composite bar charts good?

A

The total frequencies and the frequencies of each component can be easily compared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Compare a stem-and-leaf diagram with a bar chart

A

A stem and leaf diagram shows the shape of the data distribution in the same way as a bar chart, but you can still see the original data values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How to compare
i) total frequencies
ii) proportions
in comparative pie charts

A

i) Compare the areas

ii) Compare the individual angles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why would you use comparative pie charts over normal pie charts?

A

When two sets of data have different total frequencies, drawing two pie charts the same size to represent them would be misleading.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In a cumulative frequency graph, when would you use curved lines and when would you use straight step polygons?

A

Curved lines - continuous data

Straight lines - discrete data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a choropleth map?

A

A map that is used to classify regions of a geographical area. Regions are shaded with an increasing depth of colour. A key shows what each shade represents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is positive skew? What is negative skew?

A

POS - Most of the data values are at the lower end

NEG - Most of the data values are at the upper end

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Misleading things about graphs (7)

  • scales that do not start at zero
  • scales that do not increase uniformly
  • thick lines on a graph
  • axes without labels
  • graphs and charts without keys
  • colours on a graph
  • 3D diagrams
A
  • Scales that do not start at zero give a misleading impression of the heights of bars
  • Scales that do not increase uniformly distort the shape of anything plotted on them
  • Lines on a graph that are drawn too thick make it difficult to read information
  • Axes without labels prevent you from knowing what the data represents
  • Graphs and charts without keys may be impossible to interpret
  • Colours may make some parts of a graph or chart stand out more than others
  • 3D diagrams make comparisons difficult as data proportions appear distorted
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Advantages of bar charts and line graphs (2)

A
  • They show trends and patterns in data

- You can read values from the scale as long as it is not too small

17
Q

Advantage and disadvantage of pie charts

A

ADV:
- They show proportions

DISADV:
- They don’t show accurate data values

18
Q

Advantage and disadvantage of tables

A

ADV:
- They give exact data values for different categories

DISADV:
- They don’t show trends and patterns as clearly

19
Q

In pie charts, what does
i) the area of each sector
ii) the area of the whole pie chart
represent?

A

i) the area of each sector is proportional to the total frequency it represents
ii) the area of the whole pie chart is proportional to the total frequency

20
Q

Median in discrete data

Median in continuous data

A

Discrete: (n+1)/2
Continuous: 1/2

21
Q

What is standard deviation?

A

A measure of how much all the values deviate from the mean value (how spread out they are)

22
Q

Using calculation, what indicates

i) positive skew
ii) negative skew

A

i) mean > median > mode

ii) mode > median > mean

23
Q

Advantages of mode (4)

Disadvantages of mode (2)

A

ADV:

  • Easy to find
  • Can be used with any type of data
  • Unaffected by open-ended or extreme values
  • Mode is always a data value

DISADV:

  • May be no mode or sometimes more than one
  • Cannot be used to calculate a measure of spread
24
Q

Advantages of median (4)

Disadvantage of median (1)

A

ADV:

  • Easy to calculate
  • Unaffected by extreme values
  • Best to use when data is skewed
  • Can be used to help calculate quartiles, interquartile range and skew

DISADV:
- May not be a data value

25
Advantages of mean (2) | Disadvantages of mean (2)
ADV: - Uses all the data - Can be used to calculate standard deviation and skew DISADV: - Always affected by extreme values - Can be distorted by open-ended classes
26
What can you compare mode with?
Range for quantitative data
27
What can you compare median with?
Range, interquartile range
28
What can you compare mean with?
Range, standard deviation
29
In a distribution:
- 50% of the data is less than the median, and 50% is greater - 25% of the data is less than the lower quartile - 25% of the data is greater than the upper quartile - 50% of the data is between the lower and upper quartiles