18. Data Flashcards

1
Q

Define ‘personal data’

A

Data where an individual could be identified when combined with other data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What allows the collection & storage of vast amounts of data

A

Improved technology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How can data legislation vary by jurisdication?

A

Objectives & expected behaviour are similar but legislation varies. The US has much less stringent laws

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why must extra care be taken when transferring data between countries?

A

Data legislation can vary between jurisdiction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Name the 8 categories of ‘sensitive personal data’ (RREPPCST)

A
  • Race
  • Religion
  • Ethnicity
  • Political opinion
  • Physical/mental condition
  • Convictions
  • Sex life
  • Trade union membership
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Name the 3 qualities by which big data can be categorised

A
  • Very large datasets
  • Brought together from many sources
  • Can be analysed quickly
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How can big data be altered to provide data protection?

A

Anonymisation can remove any personal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the theory of data minimisation?

A

That big data is excessive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Complexity of big data is not an excuse for…

A

failure to comply

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define ‘data governance’ SIAU

A

The term used to describe overall management of availability, usability, integrity & security of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a data governance policy?

A

A documented set of guidelines for data management, detailing how data is captured, analysed & processed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What 6 things does a data governance policy detail? RUSCCM

A
  • Use for data
  • Roles/responsibilities of individuals
  • How data is captured, analysed & processed
  • Security/privacy issues
  • Details of controls to meet standards
  • How adequacy of controls is monitored
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Name 3 risks of poor data governance

A
  • Fines
  • Reputational damage
  • Inability to rely on data for use
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What should the data governance policy detail regarding a merger?

A

The risk of aggregating data & data systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Give an advantage of combining data & data systems in a merger

A

Adv: overhead savings
Disadv: cost of converting systems is high

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Name 5 risks around data & its suitability for use

A
  1. Errors/omissions
  2. Insufficient data
  3. Credibility
  4. Not reflective of future experience
  5. Form - not in required form for purpose
17
Q

Give 5 reasons why data may not be a reflection of future experience

A
  1. Random fluctuation
  2. Abnormal events in data
  3. Changes to data recording
  4. Change in balance of homogenous grouping
  5. Socio-economic change
18
Q

What is the result of a lack of confidence in data

A

A lack of confidence in conclusions

19
Q

What is the issue with very small homogenous groups

A

They can be too small to draw credible conclusions & if merged to form sufficiently sized groups, it may reduce homogeneity

20
Q

What is algorithmic decision making?

A

Automated trading to capitalise on price discrepancies across markets

21
Q

List the benefits of algorithmic decision making

A
  • Quicker, more consistent decision making
  • Lower dealing cost
22
Q

How can advancements in big data aid algorithmic decision making?

A

Allows for greater accuracy in setting parameters

23
Q

Name 6 risks of algorithmic decision making

A
  • Algorithm error
  • Data error
  • Creating instability in markets (plunge & rebound)
  • Turbulent conditions can cause market suspension
  • May not operate in turbulent markets
24
Q

Name 3 reasons why data for all tasks should be controlled through a single system

A
  • Audit trail
  • Easier access
  • Lower chance of data corruption
25
Why might competitor data be limited in its usefullness?
- Different benefits offered - Difference in target market - Difference in approach to valuation (prudence in CBE)
26
Why does it take a long time to accumulate good data?
Data takes many years to accumulate so must have good systems in place
27
What kind of questions are used on the proposal form & why?
- Tick boxes to be easily entered - Unambiguous for accurate information - Rating factors used to translate qualitative to quantitative
28
Who provides the data used in employee benefit schemes?
Sponsor (employer)
29
Why can data be a particularly prevalent issue in employee benefit schemes?
Provided by sponsor (employer) who may not have sufficiently detailed or reliable data
30
Name 3 good checks on data
- Checks against data from past valuation date - Checks against accounting data - Assertations
31
Name the 3 things to be attested to
- Appropriate valuation date - Complete - Assets/liabilities exist on given date
32
How can data be checked if it's not possible to check an entire dataset?
Random spot checks
33
What is important to note when using summarised data (summarised due to insufficient volume or detail)
Recognise the reliability of results will be impacted
34
Give an example of an industry-wide data collection scheme
IFoA PPO working party
35
Give 2 benefits of using industry-wide data collection scheme
- Can compare experience with the industry - Can compare homogenous groupings
36
Name 6 potential causes of heterogeneity distortion in industry-wide data collection schemes
- Different policies sold - Different sales methods - Different underwriting process - Different risk factors - Different data systems - Different socio-economic conditions
37
Give 3 other distortions in industry-wide data collection schemes
- out-of-date - less detailed - not all firms participate so not fully market representative