What is data analysis?
Data Analysis is a process of collecting, transforming, cleaning, and modeling data with the goal of discovering the required information.
The idea is to do a separation of a whole into its component parts. For example in criminal data analysis:
What are data limitation issues?
What are the four steps of data analysis?
What are the major tasks of data pre-processing?
What are the four types of central tendency?
What is the five number summary?
The five-number summary is a set of descriptive statistics that provide information about a dataset. It consists of the five most important sample percentiles:
min, max, med, Q1 (first quartile), Q3 (third quartile)
The five-number summary can be used to examine the credit card summary and its distribution and detect outliers as fraud values.
What is a data warehouse?
A data warehouse, also known as an enterprise data warehouse, is a system used for reporting and data analysis, and is considered a core component of business intelligence. DWs are central repositories of integrated data from one or more disparate sources.
What is Behaviour profiling?
Finding of suspicious behaviour by analyzing databases. Capability to recognize patterns of criminal activity. Predict when and where crimes are likely to take place.
What is data mining and what is the ultimate goal?
Data mining (knowledge discovery in databases). Extraction of interesting (hidden) information or patterns from data in large databases.
The ultimate goal of data mining is the prediction of human behaviour.
What are potential applications for data mining?
How can data mining help to detect fraud?
Data mining can help to detect fraud by using the historical data to build models of fraudulent behavior and then to use data mining techniques to identify similar instances.
Name a few different data mining techniques?
What is clustering?
Clustering is a technique to group a set of data objects into clusters.
What are the three main components of link analysis?
Name a few data mining tools?
What is the data mining process?