Week 01 Flashcards
(54 cards)
Explain Data in Actuarial Science
Data was once scarce. Now it is plentiful, but it is often used for purposes beyond the reason it was originally collected, and it may have limitations for those extended applications.
- Long established use of quant techniques
– Techniques understood theoretically
- Limited data of known provenance
– Additional data expensive to collect
- Data curated by specialists
- Long term planning
– Consequences of decision not immediately clear
- Regulatory control
Explain Data Science and its implications
Data science mainly originates in computer science. Since the focus comes from the technical side it is often a set of tools looking for an application.
The focus is often on identifying new patterns and relationships that were not obvious without powerful computer technology.
This suggests correlation but without a clear path to causation.
Implications:
* Cheap computation and data storage
* Vast amounts of data collected and stored
– Collected for operational processing
– Analysis is a by-product and so data is cheap
* Data will provide insight into business
* Pragmatic rather than theoretical approaches
– "Does it work?" rather than "Do I understand it?"
What are the 4 types of Business Analytics?
Descriptive
Diagnostic
Prescriptive
Predictive
What should one be aware of when using data that has come from people remote from those who collected it?
What has happened to the data since it was collected. You need to know this to judge whether the data is suitable for your purpose and your analysis.
How do computers help us? Name the four functions and when they were realised.
Computation in the 1950s
Storage in the 1960s (meant databases were introduced)
Graphics in the 1970s
Networks in the 1980s
Explain Computer Circuits
A circuit is closed for true (1) and open for false (0). This is a binary system.
Define a byte
Cluster of 8 bits
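A minimal sketch of what 8 bits can hold, for revision purposes. The function names `byte_to_bits` and `bits_to_byte` are made up for this illustration.

```python
# A byte is 8 bits, so it can represent 2**8 distinct values (0 to 255).
BITS_PER_BYTE = 8


def byte_to_bits(value: int) -> str:
    """Return the 8-digit binary string for an integer in the range 0..255."""
    if not 0 <= value <= 255:
        raise ValueError("a single byte holds values from 0 to 255")
    return format(value, "08b")


def bits_to_byte(bits: str) -> int:
    """Convert a string of exactly 8 binary digits back to an integer."""
    if len(bits) != BITS_PER_BYTE or set(bits) - {"0", "1"}:
        raise ValueError("expected exactly 8 binary digits")
    return int(bits, 2)


distinct_values = 2 ** BITS_PER_BYTE
print(distinct_values)
print(byte_to_bits(65))
print(bits_to_byte("11111111"))
```

Each digit in the string corresponds to one circuit being closed (1) or open (0).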
Define a microprocessor
A computer processor in which the data processing logic and control are included on a single integrated circuit. The microprocessor contains the arithmetic, logic, and control circuitry required to perform the functions of a computer’s central processing unit.
First produced by Intel in the early 1970s (the 4004 in 1971, followed by the 8008 in 1972).
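Explain the meaning of Moore's law
Computing power doubles roughly every 2 to 3 years and gets progressively cheaper. (Strictly, Moore's observation was that the number of transistors on a chip doubles about every two years.)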
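To get a feel for what repeated doubling implies, here is a small sketch. The function name `growth_factor` is hypothetical, and the doubling periods are the rough figures from the card, not measured values.

```python
def growth_factor(years: float, doubling_period_years: float) -> float:
    """Factor by which a quantity grows if it doubles every `doubling_period_years`."""
    if doubling_period_years <= 0:
        raise ValueError("doubling period must be positive")
    return 2 ** (years / doubling_period_years)


# Compare the two ends of the "every 2 to 3 years" range over 30 years.
for period in (2, 3):
    print(period, growth_factor(30, period))
```

Over 30 years, a 2-year doubling period gives a factor of 32,768, while a 3-year period gives 1,024, so the exact period matters a great deal over long horizons.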
Using a comparison with the first personal computer, describe the developments in storage for computers
1983: IBM made the first personal computer with an internal hard drive, which held 10 MB. Price of hard disk was €2500.
1996: PC hard disks could hold 1.66 gigabytes. Price of hard disk was €200.
2024: 14000 gigabytes (14 TB) for €250.
Storage used to be very costly. It is now almost free and comes by the terabyte.
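The comparison is easier to see as a cost per gigabyte. The sketch below uses only the capacities and prices quoted on this card; taking 1 GB = 1000 MB is an assumption, and prices are not adjusted for inflation.

```python
MB_PER_GB = 1000  # assumption: decimal convention (1 GB = 1000 MB)

# (year, capacity in GB, price in EUR), figures as quoted on the card
disks = [
    (1983, 10 / MB_PER_GB, 2500.0),
    (1996, 1.66, 200.0),
    (2024, 14000.0, 250.0),
]


def cost_per_gb(capacity_gb: float, price_eur: float) -> float:
    """Price of one gigabyte of storage in euro."""
    return price_eur / capacity_gb


costs = {year: cost_per_gb(capacity, price) for year, capacity, price in disks}
for year, cost in costs.items():
    print(f"{year}: EUR {cost:,.4f} per GB")
```

This works out to roughly €250,000 per GB in 1983, about €120 per GB in 1996, and under €0.02 per GB in 2024.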
What is data?
The starting point for a process that allows better decisions. It includes raw measurements and is considered to have little or no value until it has been processed and transformed.
Explain the meaning of noise
Unrelated data items
What is information?
– data that have been processed so that they are meaningful
– data that have been processed for a purpose
– data that have been interpreted and understood by the recipient
What is significant about data processing that can affect its interpretation?
Data can be processed in different ways to provide different forms of information
What is the general process to organise data into information (i.e. data transformation)?
Classification
Rearranging / sorting
Aggregating
Performing calculations
Selection
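The five steps above can be sketched on a tiny data set. Everything here is hypothetical: the transactions, the names, and the €100 threshold used for classification are invented purely for illustration.

```python
from collections import defaultdict

# Hypothetical raw data: (customer, region, amount in EUR)
transactions = [
    ("Ann", "North", 120.0),
    ("Bob", "South", 80.0),
    ("Cat", "North", 200.0),
    ("Dan", "South", 40.0),
    ("Eve", "North", 60.0),
]

# Classification: label each record "large" or "small" (threshold is an assumption).
classified = [
    (name, region, amount, "large" if amount >= 100 else "small")
    for name, region, amount in transactions
]

# Rearranging / sorting: order records by amount, largest first.
sorted_records = sorted(classified, key=lambda record: record[2], reverse=True)

# Aggregating: total amount per region.
totals = defaultdict(float)
for _name, region, amount, _label in classified:
    totals[region] += amount

# Performing calculations: average amount per transaction.
average = sum(record[2] for record in classified) / len(classified)

# Selection: keep only the records classified as large.
large_only = [record for record in sorted_records if record[3] == "large"]

print(dict(totals), average, [record[0] for record in large_only])
```

The same raw records yield different information (regional totals, an average, a shortlist of large transactions) depending on which steps are applied.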
Give examples of how we summarise information for decision making using a statistical method, a visual method and a textual method.
Stats - central tendency
Visual - charts
Textual - sentiment analysis
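For the statistical case, central tendency is usually summarised by the mean, median and mode. A minimal sketch using Python's standard `statistics` module follows; the sample values are invented for illustration.

```python
import statistics

# Hypothetical sample, e.g. number of claims per month
values = [2, 3, 3, 5, 7, 10]

mean_value = statistics.mean(values)      # arithmetic average
median_value = statistics.median(values)  # middle value (average of the two middle values here)
mode_value = statistics.mode(values)      # most frequent value

print(mean_value, median_value, mode_value)
```

The three measures differ here (5, 4 and 3), which is a reminder that a single summary figure can hide the shape of the data.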
Give examples of how we subset information for decision making?
Database - selection and projection
Case-Based Reasoning - relevant examples
Full text search
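In database terms, selection keeps the rows that meet a condition and projection keeps only chosen columns. The sketch below mimics both on a list of dictionaries. The `policies` table and the helper names `select` and `project` are hypothetical and do not come from any particular database library.

```python
# Hypothetical table of insurance policies, one dictionary per row
policies = [
    {"id": 1, "holder": "Ann", "type": "motor", "premium": 400},
    {"id": 2, "holder": "Bob", "type": "home", "premium": 250},
    {"id": 3, "holder": "Cat", "type": "motor", "premium": 520},
]


def select(rows, predicate):
    """Selection: keep only the rows for which the predicate is true."""
    return [row for row in rows if predicate(row)]


def project(rows, columns):
    """Projection: keep only the named columns of each row."""
    return [{column: row[column] for column in columns} for row in rows]


motor_policies = select(policies, lambda row: row["type"] == "motor")
result = project(motor_policies, ["holder", "premium"])
print(result)
```

In SQL the same subset would typically be written as a `SELECT holder, premium ... WHERE type = 'motor'` query, where the `WHERE` clause does the selection and the column list does the projection.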
Give examples of how we interpret information for decision making using a statistical method, a visual method, a rule-based method or machine learning.
Stats - confidence interval
Rule based - expert knowledge
Machine learning - anomaly detection
Visual - dashboards
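As an illustration of the statistical case, the sketch below computes an approximate 95% confidence interval for a mean. It assumes a normal approximation, and for a sample this small a t-based interval would normally be wider and more appropriate. The sample values and the function name `mean_confidence_interval` are invented for this example.

```python
import math
import statistics

# Hypothetical sample of measurements
sample = [10, 12, 9, 11, 13, 10, 12, 11]


def mean_confidence_interval(values, confidence=0.95):
    """Approximate confidence interval for the mean (normal approximation)."""
    n = len(values)
    mean = statistics.mean(values)
    standard_error = statistics.stdev(values) / math.sqrt(n)
    # Two-sided critical value of the standard normal, about 1.96 for 95%
    z = statistics.NormalDist().inv_cdf(0.5 + confidence / 2)
    margin = z * standard_error
    return mean - margin, mean + margin


low, high = mean_confidence_interval(sample)
print(f"mean = {statistics.mean(sample)}, interval = ({low:.2f}, {high:.2f})")
```

The interval turns a single estimate into a range, which helps a decision maker judge how much weight the figure can bear.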
What is the purpose of operational systems and what do they do to data?
These systems process data into standard forms.
Ex: statements and invoices
The main reason we use IT for operations is to save money.
What’s the difference between operational systems and information systems?
Operational systems aim to save money doing things that need to be done. Information systems aim to provide information for better management decisions
Explain what web analytics does
Turns data collected from a website into information about customers, e.g. why people do not proceed with a purchase, who visits the site, etc.
What does business analytics do?
Processes quant data into information
Explain what financial reporting is doing in terms of data
Processes financial data into standard reports for managing the business
What are three lower level information systems, in order, and what are they used for?
Transaction processing systems - output is used for business operations rather than for decision making
Office automation systems - transfer of information and coordination of work
Management information systems - summary reports and systematic organisation of info. Produces simple models for business activity.