Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
10 Cards in this Set
- Front
- Back
Activities |
Define Architecture, capacity planing, select storage servers Integrate server, storage, tools Define physical warehouse organization Connect sources Design and implement script(ETL, refresh) Populate Design and implement app Roll out warehouse and application |
|
Implementing a Warehouse(4) |
Monitoring Integrating Processing Managing |
|
Monitoring (Define, 1) techniques 4 |
Detect changes and propagate changes to integrator
-Extractor - extraction from standard interfaces -Techniques - Triggers, Update to log, programs for legacy, polling |
|
Integrating (Define, 2) |
Receive from monitoring, clean and integrate Clean Loading |
|
Data Cleaning (Why?, Techniques) |
Why? Inconsistent field lenghts, descriptions, value assignment, missing entries, violation of integrity Techniques Migration, simple transform (ex-gender) Scrubbing, with domain knowledge Auditing, outliers |
|
Data Loading Processing tasks(3) Issues(2) Data Refresh When? (2) How? (2) Derived data, When to update |
Processing tasks: integrity constraints, sorting Summarizing... Issues Large volumes Long time(checkpoints) Data Refresh when?(periodically, immediately) how? (Data/Transaction shipping) Derived data Materialized Views Indexes Aggregates |
|
Processing (3) |
Index Structures What to Materialize Algorithms |
|
Index Structures(4) |
Inverted Trees Bitmap indexes Join indexes Text Indexes |
|
What to materialize Maintenance (Update how?) |
?? |
|
Managing (3) |
Metadata Repository Admin metadata, Business, Operational |