Instructor Created Flashcards

1
Q

What is the purpose of affinity grouping?

A

To evaluate relationships or associations between data elements that demonstrate some kind of affinity between objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is estimation defined in data analysis?

A

A process of assigning some continuously valued numeric value to an object

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a key benefit of the estimation process?

A

Results can be ranked by score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does classification involve?

A

Organizing data into predefined classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the goal of the classification process?

A

To build a model that can accurately classify new records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is clustering in data mining?

A

The task of dividing a large collection of entities into smaller groups based on similarity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the main process in data mining?

A

Assemble information, prepare it for mining, apply algorithms, and analyze results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does data mining rely on?

A

Using one set of data for training and another for testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is data type conversion?

A

Parsing strings representing values and transforming them into the proper form for the target machine

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is data cleansing?

A

Correcting known data errors and automating corrections

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the purpose of integration in data processing?

A

To represent linkage between different tables and maintain metadata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does referential integrity checking ensure?

A

That referential integrity constraints are not violated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are derivations in data processing?

A

Transformations based on business rules applied during data movement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the difference between denormalization and renormalization?

A

Denormalization breaks data into a simpler form, while renormalization restores a structured form

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is aggregation in data processing?

A

Populating summaries or cube dimensions in the staging area

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the purpose of audit information?

A

To provide a reference for integrity checking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is null conversion?

A

Transforming different forms of nulls from disparate systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the two key questions in the extraction phase of ETL?

A
  1. What data should be extracted?
  2. How should that data be extracted?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is a data mart?

A

A subject-oriented data repository for decision support and BI needs of a specific department

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does the ETL process stand for?

A

Extract, Transform, Load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are the main steps in the ETL process?

A
  • Get data from the source location
  • Map data into a suitable model
  • Validate and clean data
  • Apply transformations
  • Move data to the repository
  • Load data into the warehouse
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the financial value associated with increased profitability?

A

Derived from lowered costs or increased revenues

Financial value is crucial for evaluating business performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What does productivity value refer to?

A

Decreased workloads and high-quality outcomes

It emphasizes efficiency in processes like manufacturing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is trust value in a business context?

A

Greater customer, employee, or supplier satisfaction and confidence in forecasting

Trust value also includes better management reports and decision-making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What does risk value encompass?
Improved visibility into credit exposure and confidence in capital investments ## Footnote It also involves *maintaining auditable records*.
26
Define a data warehouse.
A centralized repository of information ## Footnote [📖](https://share.evernote.com/note/42449ea4-7e25-1fdd-e23b-3c69d5b5308d) It is essential for data analysis and reporting.
27
How is a data warehouse organized?
Around relevant subject areas important to the organization ## Footnote This organization supports targeted data analysis.
28
What is the primary use of a data warehouse?
For analysis and not for transaction processing ## Footnote This distinction is critical for understanding its role in data management.
29
What characteristic does the data in a data warehouse have?
Nonvolatile ## Footnote Nonvolatility means that data remains stable and is not frequently changed.
30
What is the target location *for integrating data* from multiple sources?
A data warehouse ## Footnote It consolidates data from both internal and external sources.
31
Fill in the blank: A data warehouse provides a platform for different _______ to submit queries about enterprise information.
[consumers] ## Footnote Consumers can be both human and automated.
32
What is the purpose of MySQL Workbench?
A visual database design and management tool for MySQL databases.
33
What are the features of MySQL Workbench?
Provides a user-friendly interface for: 1. database modeling 2. SQL development 3. server configuration 4. user administration 5. backupand data recovery
34
In which use case is MySQL Workbench ideal?
Managing MySQL databases graphically, *creating visual representations* of database schemas, and *managing complex SQL statements*.
35
What is the purpose of the MySQL Command-Line Client?
A command-line interface for interacting with MySQL databases.
36
What are the features of the MySQL *Command-Line Client*?
Allows users to execute SQL queries, manage databases, and perform administrative tasks directly through the terminal.
37
Who is the **target user** for *MySQL Command-Line Client*?
Users who prefer command-line operations or are working on servers without a graphical interface.
38
What is SQL Server Management Studio (SSMS) used for?
A management tool for Microsoft SQL Server databases.
39
What are the features of SQL Server Management Studio (SSMS)?
Offers a **graphical interface** for SQL Server database: *development, management, and administration*, including querying and performance monitoring.
40
What is phpMyAdmin?
A web-based application to manage MySQL databases.
41
What are the features of phpMyAdmin?
Offers a browser interface for executing SQL queries, managing tables, and handling database administration tasks.
42
What is the key difference in database compatibility between MySQL Workbench, phpMyAdmin, and SQL Server Management Studio?
1. MySQL Workbench and phpMyAdmin are specifically for MySQL databases 2. SQL Server Management Studio is exclusively for Microsoft SQL Server.
43
What is a primary key?
A unique identifier for each record in a table.
44
What are the characteristics of a primary key?
* Cannot contain null values * Must contain unique values
45
What type of relationship does a primary key establish between tables?
Establishes a one-to-one or one-to-many relationship with foreign keys in other tables.
46
What is a foreign key?
A column or a set of columns in one table that references the primary key in another table.
47
What are the characteristics of a foreign key?
* Can contain null values * Can have duplicate values
48
What type of relationship does a foreign key establish between tables?
Establishes a many-to-one relationship with the table containing the primary key.
49
What is a composite key?
A combination of two or more columns in a table that together uniquely identify a record.
50
What does a **composite key** allow; what does it consist of?
* May allow null values in some columns * Always consists of multiple columns
51
What type of relationship does a composite key establish between tables?
Establishes relationships where each part of the composite key must be *considered in conjunction* with others.
52
What is a candidate key?
A column or set of columns that can qualify as a potential primary key for a table.
53
What are the characteristics of a candidate key?
* Can contain null values * Must contain unique values
54
What SQL statement is used to order the result set?
ORDER BY
55
What is the purpose of the WHERE clause in SQL?
**Filters records** based on a specified condition.
56
What SQL statement is used to group rows that have the same values in specified columns?
GROUP BY
57
What is the purpose of the LIMIT clause in SQL?
Restricts the number of rows returned by the query.
58
What does the IN operator do in SQL?
Checks if a *column’s value* exists within a list of specified values.
59
What does the BETWEEN operator do in SQL?
Checks if a value falls within a specified inclusive range.
60
What is the purpose of the `EXISTS` operator in SQL?
Evaluates whether _a subquery_ returns *any* rows.
61
What does the ALL operator do in SQL?
Checks if all values in a subquery *meet a specific condition*.
62
What is the syntax of the UPDATE statement in SQL?
``` UPDATE table_name SET column1 = value1, column2 = value2, ... WHERE condition; ``` `UPDATE` table_name `SET` ... is the core. However, always use `WHERE` condition to specify which rows to update
63
What is the function of the `COMMIT` command in SQL?
Saves all changes made in the current transaction to the database permanently.
64
What does the ROLLBACK command do?
Undoes all changes made in the current transaction.
65
What is the function of the SAVEPOINT command?
Creates a savepoint within a transaction allowing for partial rollbacks.
66
What is the UNDO command in SQL?
Not a standard SQL command; behavior is generally covered by the ROLLBACK command.
67
What is the purpose of the BETWEEN operator?
Filters results within a specific range (inclusive).
68
What is the LIKE operator used for in SQL?
Used for pattern matching in SQL.
69
What is the function of the BETWEEN operator in SQL?
The BETWEEN operator is used to filter results within a specific range (inclusive). ## Footnote It checks if a column’s value is within a specified lower and upper limit.
70
What types of values can the BETWEEN operator be used with?
Numeric, date, or text values.
71
Provide an example of a query using the BETWEEN operator.
SELECT * FROM products WHERE price BETWEEN 50 AND 100;
72
What is the function of the LIKE operator in SQL?
The LIKE operator is used for pattern matching in SQL.
73
What wildcard character represents zero or more characters in the LIKE operator?
%
74
What wildcard character represents a single character in the LIKE operator?
_
75
Provide an example of a query using the LIKE operator.
SELECT * FROM employees WHERE last_name LIKE 'Smi%';
76
What is the function of the IN operator in SQL?
The IN operator is used to specify multiple values in a WHERE clause.
77
What types of values can the IN operator be used with?
Numeric, character, or date values.
78
Provide an example of a query using the IN operator.
SELECT * FROM customers WHERE country IN ('USA', 'Canada', 'Mexico');
79
What is the function of the EXISTS operator in SQL?
The EXISTS operator is used to check for the existence of any records in a subquery.
80
When is the EXISTS operator often used?
Frequently used with subqueries to filter records based on conditions in related tables.
81
Provide an example of a query using the EXISTS operator.
SELECT * FROM products p WHERE EXISTS (SELECT * FROM orders o WHERE o.product_id = p.product_id);
82
What does the EXISTS operator return if the subquery returns one or more records?
True
83
What does the BETWEEN operator do in SQL?
Filters data within a specified inclusive range.
84
What operators can achieve the same effect as the BETWEEN operator?
>= and <= (Greater Than or Equal To, Less Than or Equal To)
85
Provide an example of using >= and <= instead of BETWEEN.
SELECT * FROM employees WHERE salary >= 50000 AND salary <= 100000;
86
What does the > and < operators do in relation to the BETWEEN operator?
Similar to BETWEEN, but does not include the boundaries.
87
Provide an example of using > and < instead of BETWEEN.
SELECT * FROM employees WHERE salary > 50000 AND salary < 100000;
88
When should you use the BETWEEN operator?
When you want to specify a clear range with inclusive boundaries.
89
What is the role of the GROUP BY keyword in SQL?
Used to group rows that have the same values in specified columns into aggregate data.
90
Which SQL functions are commonly used with GROUP BY?
COUNT, SUM, AVG
91
Provide an example of a query using GROUP BY.
SELECT department, COUNT(*) FROM employees GROUP BY department;
92
What is the function of the ORDER BY keyword in SQL?
Used to sort the result set of a query by one or more columns.
93
What options can you specify when using ORDER BY?
Ascending (ASC) or descending (DESC) order.
94
Provide an example of a query using ORDER BY.
SELECT name, salary FROM employees ORDER BY salary DESC;
95
Is SORT BY a defined keyword in standard SQL?
No
96
What should you use instead of SORT BY in standard SQL?
ORDER BY
97
What is FILTER BY used for in SQL?
Typically used in contexts involving window functions.
98
Can FILTER BY be used as a standalone keyword in standard SQL?
No
99
Provide an example of a query using FILTER BY.
SELECT department, COUNT(*) FILTER (WHERE salary > 50000) FROM employees GROUP BY department;
100
What is the key difference between GROUP BY and ORDER BY?
GROUP BY aggregates data based on one or more columns, while ORDER BY sorts the results of the query.