Large Data Set Flashcards
(20 cards)
In order to clean the data, you would have to ___________ ?
Remove all car’s with mass “0”
How many registered makes of car are there?
5
Name the regions the owners live in
- London
- North West
- South West
2 years of manufacture that are considered
2002 and 2016 (emissions scandal happened between these years)
5 propulsion types
- Petrol
- Diesel
- Electric
- Gas/petrol
- Electric/petrol
5 keeper titles
- Male
- Female
- (Not used)
- Unknown (Rev, Dr etc)
- Company
Units for engine size
cubic cm
Units for mass
kg (includes 75kg mass of average driver)
Gas emissions units
g/km
5 emissions considered
- Carbon dioxide
- Carbon monoxide
- Oxides of nitrogen (NO and NO2)
- Particulates
- Hydrocarbons
Which cars have values for particulate emissions?
Only diesel vehicles
What is the most common body type of car?
5 door hatchback
What is the least common body type of car?
2 door saloon
What is the least common make of car?
Toyota
What is the most common make of car?
Ford (closely followed by Vauxhall)
2 limitations of the LDS for comparing emissions in 2002 to 2016 in England
- Not all makes of car are included in the database
- Not all English regions are included
Which emissions are known for every car in the data set?
None of them
5 makes of car
- BMW
- Ford
- Toyota
- Vauxhall
- Volkswagen
Which types (e.g. petrol) of car are the least common?
Electric and gas/petrol (only 1 of each)
Which keeper title ID isn’t used in the data set?
3