The Basics of Caboodle Flashcards
(45 cards)
True or False: A column can either be a primary key or a surrogate key, but not both
False. A column can be both a primary key (the column(s) used to uniquely identify a row) and a surrogate key (a key created during ETL that does not exist in the source database). In Caboodle, primary key columns are also surrogate keys.
True or False: Naming conventions are enforced in Caboodle.
True. Naming conventions, such as <name>Fact, are enforced in Caboodle.</name>
It is a process used to extract data from various sources, transform it into a suitable format, and load it into a target database or data warehouse.
ETL
is a temporary storage area where data is transformed, checked for integrity issues, and cleaned before being moved to the reporting database.
staging database
the final destination for data that has been processed and cleaned. It is used by report writers to create and run reports.
reporting database
a set of instructions used to move and transform data during the ETL process. It can extract data from various sources and load it into a target database.
SSIS Package
is a default schema in SQL Server databases. In Caboodle, it is used as the data source for SlicerDicer, Epic’s self-service reporting tool.
dbo schema
contains all the data available in the dbo schema and additional columns. It is the default schema for report writers and provides more detailed data.
FullAccess schema
is similar to the FullAccess schema but includes user-specific filtering based on security settings. It is commonly used with Community Connect organizations.
FilteredAccess schema
a unique identifier for a row in a table that is created during the ETL process. It does not exist in the source database and is used to maintain data integrity in the target database.
surrogate key
a unique identifier from the source database that is used to track and validate data during the ETL process. In Caboodle, these are often stored in columns ending in -EpicId or -EpicCsn.
source identifier
is a unique identifier for a row in a table. It ensures that each row can be uniquely identified and is often used to establish relationships between tables.
primary key
a column in one table that references a row in another table. It is used to establish relationships between tables and typically ends in -Key in Caboodle.
lookup column
What kind of tables make up a DMC (Data Model Component)?
facts, dimensions, bridges, data marts, raw data, or informational tables.
These are the primary tables that contain the many rows of data created in the source system, such as individual encounters, orders, and transactions. They hold al the measures in Caboodle
Fact Tables
a table in Caboodle that contains attributes describing one or more facts
Dimension
a table that stores information where many-to-many relationships are common.
Bridge
a collection of Caboodle data on a topic. can do direct key lookup to retrieve data, which means they can link to other tables using a single Key row instead of two rows for the ID and ID type
Data Mart
where one central fact table will join to many associated lookup or dimension tables.
Star schema
What schema is the data source for SlicerDicer?
dbo schema.
How to see what schema is being used for reports?
select schema_name()
True or False: Caboodle can only store data from Epic sources.
False: Caboodle can store data from both Epic and non-Epic sources.
True or False: The ETL process in Caboodle is simpler than the ETL process from Chronicles to Clarity.
False. The ETL process in Caboodle is more intense than the ETL process from Chronicles to Clarity.
True or False: The FullAccess schema contains more detailed data than the dbo schema.
True: The FullAccess schema contains more detailed data than the dbo schema.