Change Tracking Flashcards
(34 cards)
True or False: Any change to patient data in Clarity will generate a new row in PatientDim after the next extract.
False. Assuming the data is extracted to Caboodle, only changes to snapshot columns would generate a new row in PatientDim.
What is a durable key?
Snapshot tables have a durable key column which stores the same value for all rows that correspond to the same entity. This allows for accurate grouping and summarizing.
Consider the following query:
SELECT DurableKey FROM <TableName>
Which of the following statements are true?
A. This query will only run if <TableName> is a snapshot table
B. This query will run regardless of whether <TableName> has change tracking
C. The value in the DurableKey column could repeat in multiple rows of the results
D. There is no table in Caboodle for which this query will run</TableName></TableName></TableName>
a. This query will only run if <TableName> is a snapshot table
c. The value in the DurableKey column could repeat in multiple rows of the results</TableName>
refers to the method used to record changes to data over time.
Change tracking
Tracks changes to snapshot data, creating new rows for each change.
Snapshot Change Tracking (Type 2)
No change tracking; the table reflects the current content of the source data at the time of the most recent extract.
None (Type 1)
a table that stores both current and historical data. It captures changes to specific columns (snapshot columns) by creating new rows for each change, allowing the table to maintain a history of data changes over time.
snapshot table
are columns in a snapshot table that retain previous values which have been extracted to Caboodle and have since changed in the source. Changes to these columns result in new rows being added to the snapshot table.
Snapshot columns
a special surrogate key used in snapshot tables to identify unique entities
durable key column
in a snapshot table stores the date when the snapshot data for a row became valid. It represents the beginning of the date range during which the data was current.
StartDate column
column in a snapshot table stores the date when the snapshot data for a row ceased to be valid. It represents the end of the date range during which the data was current.
EndDate column
is a flag that indicates whether a row holds the most current information
IsCurrent
True or False: Durable keys are used only in tables that store snapshot (type 2) data.
True
True or False: foreign key columns never contain null or unmatched values.
True
True or False: Snapshot change tracking is also known as Type 1 change tracking.
False. Explanation: Snapshot change tracking is known as Type 2, while Type 1 refers to tables with no change tracking.
True or False: Non-snapshot tables reflect the current content of the source data at the time of the most recent extract.
True. Explanation: Non-snapshot tables do not track historical changes and only show the latest data from the source.
True or False: The DurableKey column in a snapshot table uniquely identifies each row in the table.
False. Explanation: The DurableKey identifies unique entities across multiple rows, not individual rows themselves.
True or False: The IsCurrent column in a snapshot table indicates if the row holds the most current information.
True. Explanation: The IsCurrent column is a flag that shows whether a row contains the latest data (1 for current, 0 otherwise).
True or False: Any change to a patient’s data in Clarity will generate a new row in PatientDim after the next extract.
False. Explanation: Only changes to snapshot columns will result in a new row being added to a snapshot table.
True or False: Snapshot data tracks the exact date and time when a change occurred in the source system.
False. Explanation: Snapshot data only tracks when changes were captured in Caboodle ETL, not the exact time of the change in the source system.
True or False: The StartDate and EndDate columns in a snapshot table represent the date range for the snapshot data.
True. Explanation: These columns indicate the period during which the snapshot data was valid in Caboodle.
True or False: The condition IsCurrent = 1 should be used when joining to snapshot tables to return only the most current data.
True. Explanation: Including IsCurrent = 1 ensures that only the latest data is retrieved from snapshot tables.
True or False: In Caboodle, tables with no change tracking are labeled as Type 2.
False. Explanation: Tables with no change tracking are labeled as Type 1, while Type 2 refers to snapshot change tracking.
True or False: The DurableKey column can have the same value in multiple rows of a snapshot table.
True. Explanation: The DurableKey can repeat across multiple rows for the same entity, representing different snapshots over time.