f Flashcards

Question

Execution plan:

Answer 1

Called explain plan or query plan. Tells you how the optimizer will execute your SQL

Answer 2

Minimize lock duration using pessimistic and optimistic locking strategies

Answer 3

Histogram allows the optimizer to understand how the data is being distributed and make the best decision

Answer 4

Avoid unnecessary network round trips.

Answer 5

works best if you think no one else will “grab” the row

Answer 6

merge tables to avoid joins, create “materialized views: to avoid big grouping or filtering

Answer 7

optimizer determines execution plan

Answer 8

Concatenated index has more than one column

Answer 9

Use the “Array” interface in your program code Use the stored procedures for complex interactions with the database

Answer 10

Do not create an index on every column

Answer 11

In denormalization you violate 3NF to improve performance

Answer 12

use memory to avoid I/O, operations to read from memory, operations to share memory

Answer 13

IT Service Management (ITSM) can be used to identify and maintain service levels

Answer 14

Do not use NULL if you will search for NULLS use something that can be indexed (N/A)

Answer 15

avoid contention for locks and latches

Answer 16

avoid bottlenecks in initial design and later monitor performance

Answer 17

Create indexes on groups of columns that are queried together Indexes speed up queries but make DML slower

Answer 18

Using nulls has significant performance implications (nulls do not take any space, nulls cannot be indexed)

Answer 19

Monitor, estimate, plan capacity, analyze, reorganize, optimize, cache, compress, sort

Answer 20

Tune the application Reduce contention Configure Memory Tune I/O

Answer 21

choose the best data model, reduce the load on the database, tune the SQL statements

Answer 22

1. Structure the tables in a way that the database would work better 2. Tune the application code (Java, C++ etc.) 3. Tune the SQL statements

Answer 23

Do not put much load on the database | When the application puts too much load on the database, and the performance decrease, it is not the database fault

Answer 24

Subtypes can occur when you are modelling things that are almost the same

Answer 25

use fast disks, use RAID (o + 1), use SSDs

Answer 26

* Oracle performance method – eliminating bottlenecks and developing efficient SQL statements * Database self-monitoring – sends alerts to notify of impending problem using expected values for comparison * AWR (Automatic Workload Repository) – performance history * AWR baseline-statistics with DB performing well at peak load * Adaptive threshold-warning and critical alert thresholds * ADDM uses AWR statistics to diagnose performance * OEM (Oracle Enterprise Manager) – GUI for maintenance

Answer 27

Starting point is 3NF, it takes all redundancy in the data and determines the PKs. Every column in a table should be identified by: the PK, all of the PK, nothing but the PK.

Answer 28

usually locking problems are due to application locks, sometimes internal locks can cause problems

Answer 29

Database buffers cache table and index data to avoid reading data from the disk

Answer 30

Prevent 2 sessions from changing table data at the same time – this avoids “lost updates”

Answer 31

* SQL contains an ORDER BY or GROUP BY * The two tables are being joined without an index, both tables are sorted and the results merged (called a SORT-MERGE join)

Answer 32

Soft areas (Called PGA in Oracle) allow sorting and hash structures to be maintained in memory; otherwise they would be written to a temporary

Answer 33

Latches are very light weight locks that protect memory instead of tables

Answer 34

When asked for data, the database first looks for it in the memory buffers. If the data is found in memory it is called a “hit”, otherwise the data must be read from the disk (Called a “miss”)

Answer 35

Memory itself can become a problem; no one can get the memory they want.

Answer 36

If there is not enough memory to do the hash table or a sort in memory, then the database will read and write data to a temporary file group

Answer 37

Database buffers improve performance by caching data in memory

Answer 38

Hash joins are more efficient alternative to SORT-MERGE. A hash table is built on one of the tables and acts like an “on the fly” index

Answer 39

When buffers are modified they are called dirty; these have to be written to disk

Answer 40

The ratio (hit/(hits + misses)) is called the ‘hit rate’

Answer 41

Latches are like locks, but instead of protecting table rows, they protect memory (buffers)

Answer 42

Disk IO is the slowest part of the database system, so it is critical to performance

Answer 43

Most locking problems are caused by application code, optimistic locking strategy is often the solution There are system locks that can cause problems, these are rare and database specific

Answer 44

Latency is the time taken to perform a single IO

Answer 45

When all the blocks are dirty then sessions have to wait for the buffers to be written before new data can be read

Answer 46

Throughput is the number of operations over time (IO/second)

Answer 47

for a latch: If two sessions try to access the same area of memory, then one will wait Instead of “sleeping” (like a lock) the waiting session will “spin” on the CPU for a very short time

Answer 48

High latency means you are overloading the disk

Answer 49

throughput, disk fill

Answer 50

To avoid overloading disks, we combine multiple disks into an “array”. The array can then support higher amounts of disk IO

Answer 51

sparsely populate, under only moderate load

Answer 52

The array can also protect the disk data loss by storing multiple copies of data

Answer 53

When the disk is overloaded, latency goes up and throughput stalls (called the “hockey stick” curve)

Answer 54

“RAID” levels describe the type of array. RAID levels 1,0 and 5 are the most frequently used

Answer 55

Distributes data across disks like RAID 0 Creates a “parity” block for every data block that can be used to recover data if the disk fails

Answer 56

But they are a poor choice for data that does not get accessed very often

Answer 57

RAID 5 requires less disks that RAID 0+1 so it’s cheaper (but also much slower when writing)

Answer 58

very high IP rates

Answer 59

Stripping and mirroring together Best performance Protection against data loss More expensive (more disks) than RAID 5 Best solution for database files

Answer 60

Also called striping

Answer 61

use of information as directive rather than indicator of potential problem

Answer 62

Data is spread across multiple disks to distribute IO evenly Good performance but no protection against data loss

Answer 63

Also called mirroring

Answer 64

Data is duplicated on two or more disks Protects against data loss, but does not spread the IO across disks

Answer 65

Data governance program oversees the management of the quality, maintainability, availability, usability, integrity, scalability and security of enterprise data

Answer 66

* DBMS software – migrations, procedures * Hardware configuration * Logical and physical design * Applications * Physical database structures

Answer 67

Impact Prosecution Cost Durability

Answer 68

upper-level management is keenly aware of the need to comply

Answer 69

DBA does not request change (programmers, application owners, business owners do)

Answer 70

DBA carries out most database changes

Answer 71

* Proactivity * Intelligence * Analysis * Automation * Standardization of procedure * Reliable and predictable process * Availability * Quick and efficient delivery

Answer 72

can result in huge fines and imprisonment

Answer 73

can be significant but so can the cost of non-compliance

Answer 74

increasing regulation – increasing time, effort and capital will be spent on compliance

Answer 75

Business legal IT

Answer 76

removes the sensitive data by deleting it

Answer 77

must understand the legal requirements imposed on their data and systems as dictated in regulations

Answer 78

scrambles the data algorithmically. Thi5s technique will not produce realistic looking data and can make the data larger

Answer 79

* Metadata management and data quality * Database and data access auditing * Data masking and obfuscation * Long-term data retention and database archiving * Closer tracking of traditional DBA tasks

Answer 80

varies the existing values in a specified range in order to obfuscate them

Answer 81

1. Adding columns to tables – not a good idea 2. DBMS traces – ISV offering is better 3. Log based - missing read activity 4. Network sniffing – missing server requests 5. Capture requests at the server

Answer 82

uses the existing data and moves the values between rows in such a way that the no values are present in their original rows

Answer 83

must be involved to interpret the legal language of the regulations and ensure that the business is taking proper steps to protect itself

Answer 84

replaces existing data with random values from a pre-prepared data set

Answer 85

Data masking is the process of protecting sensitive and personally identifiable information (PII) in non-production databases from inappropriate visibility

Answer 86

must be involved to implement the policies and procedures to enact the technology to support the regulatory mandates

Answer 87

masks data assuring that the results are referentially intact.

Answer 88

``` Substitution Shuffling Number and data variance Encryption Nulling out Table-to-table synchronization ```

Answer 89

online analytical processing

Answer 90

a collection of integrated, non-volatile, time-variant, subject oriented databases designed to support the DSS function.

Answer 91

differs from transactional operational data in timespan, granularity and dimension

Answer 92

data distrubution system

Answer 93

very large databases

Answer 94

Data warehouses are designed for analytical processing.

Answer 95

online transaction processing

Answer 96

* Create * Operational (completing business transactions) * Reference (reporting or queries) * Archive (compliance and business protection) * discard

Answer 97

The data warehouse contains atomic data and lightly summarized data.

Answer 98

Comprehensive, cohesive, integrated tools and processes.

Answer 99

* Drilling up/down hierarchies * Comparing aggregate values * Parallel execution

f Flashcards

(123 cards)