Summarising & Analysing Data Flashcards

Question 1

Q

Big Data

Answer

A

mass of data that society creates every year
extends beyond traditional data created by companies
social networking sites, internet search engines, mobile devices

Question 2

Q

What are the main characteristics of big data?

Answer

A

Volume
- created and stored due to advances
Velocity
- real time data, timeliness is key
Variety
- structured or unstructured
Value
- insights gained add value
Veracity
- truthfulness, careful of hidden biases

Question 3

Q

Structured Data

Answer

A

contained within a field or data record
easy to analyse, store, search
in standard format or in specific location within data
rows/columns
expiry date on card

Question 4

Q

Semi- Structured Data

Answer

A

doesn’t reside in fixed field but contains some properties that can be organised/analysed
email- content is unstructured but info stamps are structured

Question 5

Q

Unstructured Data

Answer

A

not easily contained within data fields
video, audio, images
difficult to analyse, manage, search

Question 6

Q

Data Analytics

Answer

A

process of collecting/examining data
to extract meaningful business insights
used to inform decision making

Question 7

Q

Descriptive analysis/analytics of data

Answer

A

summarises or describes what the data shows

Question 8

Q

Inferential Analysis of data

Answer

A

makes predictions about a population based on sample

Question 9

Q

What are the key effects of big data on decisions for businesses?

Answer

A

can be made quickly
respond earlier to environmental changes/ be more flexible
decisions based on current situations but still have element of future situations
based on hard evidence
outside the box decisions as using all factors

Question 10

Q

Frequencies of data

Answer

A

how often data occurs
can be grouped together into bands/classes if in large set
then shown in a frequency distribution or table but this means individual values are lost

Question 11

Q

Grouped Data

Answer

A

frequency is shown in terms of range

Question 12

Q

Ungrouped data

Answer

A

frequency shown in terms of specific measure/value

Question 13

Q

Arithmetic Mean

Answer

A

adding all observations and dividing by number of observations.
x bar

Advs
- most frequent used/understood
- uses all data

Disadvs
- value may not be in distribution
- can be distorted
- ignores dispersion

Question 14

Q

Question 15

Q

Mode

Answer

A

modal value
most frequently occurring value

advs
- not distorted by high/low
- actual value in distribution

disadvs
- ignores dispersion
- not use all data

Question 16

Q

Median

Answer

Study These Flashcards

A

value of middle member of array
use n+1/2 to find middle item when data arranged in order
if even amount will have to find mean of two middle numbers

advs
- not distorted by low/high
- corresponds to actual value in distribution

disadvs
- ignores dispersion
- limited use

Question 17

Q

Standard Deviation

Answer

Study These Flashcards

A

measure of dispersion/ spread of data
measures spread of data around the mean

= v (sum of values x)^2/sum of frequency - mean^2

= square root of variance
advs
- uses all data
- gives weight to values far away from mean

Question 18

Q

Variance

Answer

Study These Flashcards

A

variance is square of standard deviation

Question 19

Q

Coefficient of Variance =

Answer

Study These Flashcards

A

= standard deviation/ mean

the bigger = the wider the spread

Question 20

Q

The Normal Distribution Properties

Answer

Study These Flashcards

A

probability distribution
arises frequently in real life
majority of items lie near to average
bell-shaped curve on graph
the mean is mew and each side represents 50% so symmetrical
at certain points of standard deviation from the mean the area under the curve represents same % of population

Question 21

Q

z score

Answer

Study These Flashcards

A

distance from mean in normal distribution measured by number of standard deviations

= value of variable - mean / standard deviation

can then be looked up in tables to find proportion

Question 22

Q

Expected Value

Answer

Study These Flashcards

A

weighted average value of different possible outcomes from decision
weightings are based on probability of each possible outcome

= sum of probability x outcome/results

Question 23

Q

What are the limitations of expected value?

Answer

Study These Flashcards

A

limitations
- long run average result and so not appropriate for one off decisions
- heavily dependent on probability distribution
- ignores risk

Summarising & Analysing Data Flashcards

(23 cards)