Introduction - Data Analysis Flashcards

(26 cards)

1
Q

What is Business Intelligence (BI)?

A

Business Intelligence is the use of data and software tools to analyze historical data for actionable insights, mainly through descriptive analytics like dashboards and reports.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are relational databases and why are they important?

A

Relational databases organize data into tables and allow querying large datasets. They became popular in the 1990s and enable integrated enterprise systems for strategic decisions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How is Data Analytics different from Business Intelligence?

A

Data Analytics includes a broader range of techniques, from basic descriptive stats to predictive models, and focuses on answering questions and solving problems through data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Data Science?

A

Data Science integrates data manipulation, visualization, statistics, and machine learning to build models that predict outcomes and automate decisions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Data Mining?

A

Data Mining is a subset of data science that discovers patterns, associations, and anomalies in large datasets using statistical and logical methods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the four analytics classifications?

A

Descriptive (what happened?), Diagnostic (why did it happen?), Predictive (what is likely to happen?), Prescriptive (what should we do about it?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is CRISP-DM?

A

A process for data mining that includes: Business understanding, Data understanding, Data preparation, Modeling, Evaluation & Communication, and Deployment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an example of predictive analytics?

A

Predicting future sales based on historical customer data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the goal of prescriptive analytics?

A

To recommend actions based on predictive insights, often identifying the best option under constraints.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why has analytics become important in modern business?

A

Due to hyper-competition and rapid tech change, companies use analytics to adapt, compete, and make data-driven decisions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Descriptive Analytics?

A

Summarizes historical data to identify patterns and trends. Answers: What happened?

Example: A company reviews monthly sales data to identify its top-performing products.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Diagnostic Analytics?

A

Explores data to find causes behind certain outcomes. Answers: Why did it happen?

Example: A business finds that sales dropped due to a competitor launching a major promotion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Predictive Analytics?

A

Uses statistical models and historical data to forecast future trends. Answers: What is likely to happen?

Example: A retailer forecasts next month’s demand based on holiday trends and past sales.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Prescriptive Analytics?

A

Recommends actions based on predictions and constraints. Answers: What should we do about it?

Example: Walmart’s system recommends restocking high-demand products at specific stores to avoid stockouts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the first step in the CRISP-DM process?

A

Business Understanding: Define the problem clearly and understand the business goals.

Ask: What decision or issue are we trying to improve with data?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the second step in the CRISP-DM process?

A

Data Understanding: Gather initial data and explore it.

Identify missing values, inconsistencies, and get familiar with its structure.

17
Q

What is the third step in the CRISP-DM process?

A

Data Preparation: Clean the data (remove duplicates, handle missing values).

Transform it into a format suitable for analysis (e.g., filtering, merging).

18
Q

What is the fourth step in the CRISP-DM process?

A

Modeling: Select appropriate models (e.g., regression, decision trees, clustering).

Apply tools like Excel, Tableau, or R depending on the problem and data type.

19
Q

What is the fifth step in the CRISP-DM process?

A

Evaluation & Communication: Check how well your model performs.

Interpret and validate results in the context of the business question. Communicate findings using visualizations, reports, or dashboards.

20
Q

What is the sixth step in the CRISP-DM process?

A

Deployment: Implement the model’s insights into real business processes.

Share results with stakeholders and use insights for decision-making.

21
Q

A hypothetical scenario of all four analytics classifications: Weather Example

A
  1. Descriptive

Over the past 30 days, there has been an increase in rainfall amount over 25%

2. Diagnostic We must investigate why there has been a sudden increase in rainfall, and whether it can be related to the geographic area or seasonality storm patterns

3. Predictive  Our model shows that there is a 65% chance of heavy rain over the next 8 days after taking into consideration the current atmospheric data

4. Prescriptive 

It is essential that is information be reported to cities as they might need to close certain roads in case of floodings and also ensure that any important events be rescheduled given a high increase in rainfall

22
Q

A hypothetical scenario of all four analytics classifications: Declining sales in coffee shop

A
  1. Descriptive

Analyzed sales data and noticed a 10% drop over the past 3 months

2. Diagnostic

We have determined the decline in coffee sales and it is due to several customers complaints due to the very slow service during busy hours

3. Predictive

We need to forecast whether this issue is likely to continue into the issue. If not, the sales will continue to fall another 4% until the next quarter

  1. Prescriptive

Recommend hiring more baristas to prepare the coffees during rush hour or using a delivery service app to forecast the coffee orders in advance for quick service. This will lead to more customer satisfaction in the long run

23
Q

A hypothetical scenario of all four analytics classifications: Inventory restocking in advance

A
  1. Descriptive

Based on Walmart’s last month of sales, there has been a high demand for snacks, and bottled water during the week

2. Diagnostic

This increase in sales during the week is because school has started and there are more students in the area

3. Predictive 

Our system predicts that there will be a 45% increase in snack sales over the coming weeks for increased student convenience

4. Prescriptive 

It is recommended that Walmart restock bottled water and snacks early to ensure property inventory management, increase revenue, and avoid a shortage of snacks

24
Q

What potential ethical issues do you anticipate using data analytics for hiring practices?

A

○ Significant bias in data

Given that data analytics uses algorithms, it is very likely that it favours certain individuals that come from a particular demographic. This leads to bias based on historical hiring rather than making impartial decisions

○ It is also difficult to understand why certain clients were rejected as the process is not transparent given it is fully automated

○ By relying heavily on automation, it can lead to unfair decisions as it does not always capture soft skills such as teamwork, responsibility or adaptability that a interviewer would easily see observe during the interview. This is because this qualitative information is not captured in the database that is being analyzed

25
What are some common issues you might find in a prescriptive analytics practice?
Since prescriptive models are responsible for predicting what we should do about certain business problem next, it does so on incomplete or biased data, which can lead to uninformed recommendations There might be budget or ethical guidelines that are not considered by the model resulting in conflicts with the recommended action Lack of transparency in the model Implementation can be difficult due to lack of infrastructure or organizational resistance
26
Given a retail company like Walmart, what is an example of prescriptive analytics?
How much inventory is required for back to school to restock? Prescriptive analytics can help recommend optimal inventory levels for each store. It will achieve this based on historical data, such as past sales, seasonality and/or promotions The model can suggest how much of each product to restock, and when to ship it to avoid over or under stocking inventory and to minimize costs