Week 1 Flashcards

1
Q

What are the three Vs used to describe big data?

A

Volume - the sheer size of the data set

Velocity - the speed at which data is generated and must be processed

Variety - the different types of data in the data set

2
Q

What factors contribute to the increasing volume of data?

A

Transaction-based data that has been stored in relational databases for years makes up part of the volume

Unstructured data streamed from social media also plays a role

Sensor and machine-to-machine generated data is increasing over time

Storage was an issue in the past; however, storage costs have decreased

3
Q

Velocity: what are examples of high-speed data?

A

Radio-frequency identification (RFID) tags, sensors and smart metering generate large amounts of data within a short period. Reacting fast enough to deal with high-velocity data is one of the challenges.

The speed of data can be inconsistent and can have peaks. This is especially true of social media when something trends.

Daily, seasonal and event-triggered peaks in data loads can be difficult to manage, especially when unstructured data is involved.

4
Q

Variety: what are the different types of data?

A

Structured data - traditional relational databases and file systems

Unstructured data - text documents, email, video, audio, log files etc.

Data comes from various sources. The challenge lies in managing, merging and governing the different varieties of data

5
Q

What is data mining?

A

Data mining is the process of analysing large data sets to identify patterns and trends. The results can give better insight into customer behaviour, which can then be used to drive down costs and increase revenue.

6
Q

What use does data mining have in business?

A

Data mining could transform business in the future. Businesses could use the data to analyse the buying patterns of customers, investigate any anomalies that were not predicted, and forecast future possibilities. They could also use data mining to run more direct marketing campaigns.
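As a rough illustration of the "investigate any anomalies" idea, the sketch below flags days whose sales differ sharply from the average. It is only one simple approach (a z-score check), not a method named in the course material. The function name `flag_anomalies`, the threshold of 2.0 and the sales figures are all made up for the example.

```python
from statistics import mean, stdev


def flag_anomalies(daily_sales, threshold=2.0):
    """Return (index, value) pairs whose z-score exceeds the threshold.

    The threshold of 2.0 is an assumed, illustrative cut-off.
    """
    mu = mean(daily_sales)
    sigma = stdev(daily_sales)
    if sigma == 0:
        # Every value is identical, so nothing stands out.
        return []
    return [
        (i, x)
        for i, x in enumerate(daily_sales)
        if abs(x - mu) / sigma > threshold
    ]


# Hypothetical daily sales figures with one unusual day.
sales = [100, 102, 98, 101, 99, 103, 97, 250, 100, 101]
anomalies = flag_anomalies(sales)
print(anomalies)
```

A flagged day is only a prompt for investigation; the check says nothing about why the figure is unusual.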

7
Q

How do auditors use data mining?

A

Auditors have been using data mining techniques to analyse large sets of data in full, rather than relying on the traditional sampling techniques used to gain assurance over large balances.

8
Q

How would management accountants use data mining?

A

Management accountants may be required to produce forecasts, and the ability to analyse both financial and non-financial data can improve their understanding of cost drivers.

9
Q

Tax data analytics

A

One example of how data analytics might be used in tax is predicting the tax consequences of a potential merger or acquisition (M&A)

10
Q

Benefits of big data

A

The sheer amount of data that can be collected means that sampling errors and sampling bias can be avoided, as you are reviewing all of the data.

Data quality might be less important when analysing larger data sets; however, you still need to consider how reliable the data is and how that affects the quality of the conclusions.
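The point about avoiding sampling error can be sketched in a few lines. The example below assumes a made-up population of 1,000 invoices with a single misstated amount. Reviewing every invoice always finds it, whereas a random sample of 30 may or may not include it. The invoice values, the limit of 5,000 and the name `find_misstated` are illustrative assumptions only.

```python
import random


def find_misstated(invoices, limit):
    """Return the invoices whose amount exceeds the given limit."""
    return [inv for inv in invoices if inv["amount"] > limit]


random.seed(1)  # fixed seed so the illustration is repeatable

# Hypothetical population: 1,000 invoices of 100 each, one misstated.
population = [{"id": i, "amount": 100} for i in range(1000)]
population[537]["amount"] = 10_000

# Traditional approach: test a random sample of 30 invoices.
sample = random.sample(population, 30)
found_in_sample = find_misstated(sample, 5000)

# Data-analytics approach: test the whole population.
found_in_population = find_misstated(population, 5000)

print(len(found_in_sample), len(found_in_population))
```

The full review is only as good as the underlying records, which is why reliability of the data still matters.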

11
Q

Issues with big data

A

Are we drawing the wrong conclusions from the patterns? For example, investors who tried to deduce sales trends at Walmart from satellite photos of its car parks found that many motorists were visiting rival stores.

It is important to distinguish between correlation and causation. A famous example is the correlation between ice cream sales and crime: both tend to rise in hot weather, so neither one causes the other.
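The ice cream example can be reproduced with a small sketch. In the made-up data below, ice cream sales and crime are each built only from temperature plus a little noise, yet they still correlate strongly with each other. All figures and the helper name `pearson` are illustrative assumptions, not real statistics.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical temperatures (the hidden common cause).
temps = [10, 15, 20, 25, 30, 35]

# Each series depends on temperature only, plus small fixed noise.
ice_cream = [3 * t + n for t, n in zip(temps, [2, -1, 1, -2, 1, -1])]
crime = [0.5 * t + n for t, n in zip(temps, [-1, 1, 0, 1, -1, 0])]

r = pearson(ice_cream, crime)
print(round(r, 2))
```

Neither series was used to construct the other, so the high correlation here comes entirely from the shared dependence on temperature.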
