MIS 301 Final Flashcards
(58 cards)
decision tree analysis
a technique used to create hierarchical if/then rules that use several variables to predict the class of a record, for example using data such as income, gender, and homeownership to predict good/bad risk
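A minimal sketch of what "hierarchical if/then rules" can look like for the good/bad risk example. The function name `predict_risk`, the 50,000 income threshold, and the sample records are all made up for illustration; a real decision tree would learn its splits from data rather than have them written by hand.

```python
# Hypothetical hand-written rules; thresholds and records are invented for illustration.
def predict_risk(income: float, homeowner: bool) -> str:
    """Classify a record as 'good' or 'bad' risk using nested if/then rules."""
    if income >= 50_000:   # first split: income
        return "good"
    if homeowner:          # second split, only reached for lower incomes
        return "good"
    return "bad"


records = [
    {"income": 72_000, "homeowner": False},
    {"income": 30_000, "homeowner": True},
    {"income": 30_000, "homeowner": False},
]
predictions = [predict_risk(r["income"], r["homeowner"]) for r in records]
print(predictions)
```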
cluster analysis
a technique used to divide an information set into mutually exclusive groups such that the members of each group are as close together as possible and the different groups are as far apart as possible, ex: data-driven profiling using many variables to generate clusters of records
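One common way to form such groups is k-means, sketched here in one dimension as an assumption (the card does not name a specific algorithm). The function `kmeans_1d`, the income values, and the starting centers are hypothetical.

```python
def kmeans_1d(values, centers, iterations=10):
    """Tiny 1-D k-means: assign each value to its nearest center, then re-average."""
    centers = list(centers)
    groups = [[] for _ in centers]
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # Move each center to the mean of its group (keep it if the group is empty).
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return groups, centers


incomes = [21, 25, 24, 80, 85, 90]  # made-up values, in thousands
groups, centers = kmeans_1d(incomes, centers=[20, 100])
print(groups)
```

Each value ends up in exactly one group (mutually exclusive), and members of a group sit closer to their own center than to the other one.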
association detection
reveals the degree to which variables are related, and the nature and frequency of these relationships in the information
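A small sketch of how the frequency and strength of a relationship might be measured, using market-basket style support and confidence. These two measures, the `confidence` helper, and the basket data are assumptions chosen for illustration; the card itself does not prescribe them.

```python
from collections import Counter
from itertools import combinations

# Made-up transactions: each set is one customer's basket.
baskets = [
    {"bread", "butter"},
    {"bread", "butter", "jam"},
    {"bread", "milk"},
    {"milk", "jam"},
]

item_counts = Counter(item for b in baskets for item in b)
pair_counts = Counter(pair for b in baskets for pair in combinations(sorted(b), 2))


def confidence(antecedent, consequent):
    """Share of baskets containing `antecedent` that also contain `consequent`."""
    pair = tuple(sorted((antecedent, consequent)))
    return pair_counts[pair] / item_counts[antecedent]


# Support: how often the two items appear together across all baskets.
support_bread_butter = pair_counts[("bread", "butter")] / len(baskets)
print(support_bread_butter, confidence("butter", "bread"))
```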
statistical analysis
performs such functions as correlations, distributions, calculations, variance analysis, and regression
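A brief sketch of three of these functions (variance, correlation, and a simple linear regression) on made-up numbers. The helper names `pearson` and `slope_intercept` and the data are hypothetical; only `mean` and `pvariance` come from Python's standard `statistics` module.

```python
from statistics import mean, pvariance

ad_spend = [1, 2, 3, 4, 5]   # invented data
sales = [2, 4, 6, 8, 10]


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def slope_intercept(xs, ys):
    """Least-squares line y = slope * x + intercept."""
    mx, my = mean(xs), mean(ys)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    return slope, my - slope * mx


sales_variance = pvariance(sales)
correlation = pearson(ad_spend, sales)
slope, intercept = slope_intercept(ad_spend, sales)
print(sales_variance, correlation, slope, intercept)
```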
classification
assign records to one of a predefined set of classes
estimation
determine values for an unknown continuous variable, such as a behaviour or an estimated future value
affinity grouping
determine which things go together
clustering
segment a heterogeneous population of records into a number of more homogeneous subgroups
Richard Hackathorn’s graphic on value of BI
[order is important] data latency, analysis latency, decision latency, action taken. together these latencies are the time that passes between a business event and the action taken in response; business intelligence is useful because it shortens that time
quantifiable benefits
working time saved in producing reports, selling info to suppliers, etc. [ex. Moet and Chandon reducing from .30 to .15 per bottle]
indirectly quantifiable benefits
can be evaluated through indirect evidence - improved customer service means new business from the same customer, and differentiated service brings new customers. [ex: Owens and Minor cited extranet access to the data warehouse as the primary reason behind 44m in new business]
unpredictable benefits
are the result of discoveries made by creative users [ex. Volkswagen discovering that Audi customers and VW customers behave differently based on socioeconomic profiles]
intangible benefits
include improved communication throughout the enterprise, improved job satisfaction of empowered users, and improved knowledge sharing
forecast
predictions made based on time series information
time series info
time stamp information collected at a particular frequency
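A minimal sketch of a forecast built from time series information, using a simple moving average. The moving-average method, the function name `moving_average_forecast`, and the monthly sales figures are assumptions for illustration; the cards do not specify a forecasting method.

```python
def moving_average_forecast(series, window=3):
    """Forecast the next value as the mean of the last `window` observations."""
    if len(series) < window:
        raise ValueError("need at least `window` observations")
    return sum(series[-window:]) / window


# Made-up sales collected at a fixed (monthly) frequency.
monthly_sales = [100, 110, 120, 130, 140, 150]
next_month = moving_average_forecast(monthly_sales)
print(next_month)
```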
statistical analysis
performs such functions as information correlations, distributions, calculations, and variance analysis [ex regression]; includes forecast and time series information
OLAP
online analytical processing is the manipulation of information to create business intelligence in support of strategic decision making (executive end)
OLTP
online transaction processing is the capture of transaction and event information using technology to process the information according to designed business rules, store the info, and update existing info
ETL
extraction, transformation, loading / a process that extracts information from internal and external databases, transforms the info using a common set of enterprise definitions, and loads the information into a data warehouse
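The three ETL steps can be sketched roughly as follows. The source rows, field names, and the `sales` table are hypothetical, and an in-memory SQLite database stands in for the data warehouse purely for illustration.

```python
import sqlite3

# Extract: rows from two hypothetical sources with different field names and units.
internal_rows = [{"cust": "A1", "sales_usd": 120.0}]
external_rows = [{"customer_id": "B7", "sales_cents": 4550}]


def transform(internal, external):
    """Map both sources onto one shared definition: (customer_id, sales_usd)."""
    unified = [(r["cust"], r["sales_usd"]) for r in internal]
    unified += [(r["customer_id"], r["sales_cents"] / 100) for r in external]
    return unified


def load(rows):
    """Load the unified rows into a stand-in warehouse table."""
    warehouse = sqlite3.connect(":memory:")
    warehouse.execute("CREATE TABLE sales (customer_id TEXT, sales_usd REAL)")
    warehouse.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    warehouse.commit()
    return warehouse


warehouse = load(transform(internal_rows, external_rows))
loaded = warehouse.execute(
    "SELECT customer_id, sales_usd FROM sales ORDER BY customer_id"
).fetchall()
print(loaded)
```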
dimensions
the who, what, where, when and how criteria of a query [ex: store 123 product abc promo 123]
facts
the sales number in each cell at the intersection of dimension values
multidimensional analysis
a pivot table is an example of multidimensional analysis: the column fields, row fields, and page fields are dimensions, and the data in the center is a fact
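A rough sketch of dimensions and facts as a data structure: each key is one combination of dimension values and each value is the fact stored at that intersection. The `roll_up` helper and all of the sales numbers are made up for illustration.

```python
from collections import defaultdict

# Keys are (store, product, promo) dimension values; values are the sales fact.
facts = {
    ("store 123", "product abc", "promo 123"): 40,
    ("store 123", "product xyz", "promo 123"): 25,
    ("store 456", "product abc", "promo 123"): 10,
}


def roll_up(cells, dimension_index):
    """Total the fact along one dimension, as a pivot table row or column would."""
    totals = defaultdict(int)
    for key, sales in cells.items():
        totals[key[dimension_index]] += sales
    return dict(totals)


sales_by_store = roll_up(facts, 0)
sales_by_product = roll_up(facts, 1)
print(sales_by_store, sales_by_product)
```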
info security
a broad term encompassing the protection of information from accidental or intentional misuse by persons inside or outside an organization [ex: 38% of security incidents originate within the org]
phishing
a technique to gain personal information, such as bank account details, for the purpose of identity theft