54 Cards in this Set

  • Front
  • Back

Business Intelligence combines


architectures, tools, databases, analytical tools, applications, and methodologies

Business Intelligence Means

different things to different people

Business Intelligence is based on

the transformation of data into information, then decisions, and finally actions

Business Intelligence includes

artificial intelligence capabilities and powerful analytical capabilities

data warehouse is the

source of data

user interface is the

dashboard

business analytics is the

collection of tools for manipulating, mining, and analyzing the data in the data warehouse

business performance management is the

component for monitoring and analyzing performance

A data warehouse is a repository of

data (current and historical data of potential interest) that typically comes from a source transactional system and is produced to support decision making


MAJOR COMPONENTS OF DATA WAREHOUSING PROCESS

Data Sources: OLTP, ERP, independent “legacy” systems (US Census), Web
Data Extraction and Transformation: using custom-written or commercial software called ETL
Data Loading: data loaded to a staging area, transformed, cleansed, and made ready for the DW
Comprehensive Database: EDW to support all decision analysis by providing relevant summarized and detailed information originating from many different sources
Metadata: assessed by IT; easy to index and search, especially with Web tools
Middleware Tools: enable access to the data warehouse (write SQL queries, interact with data stored in data repositories)


OLAP

online analytical processing

Data Characteristics

Subject Oriented (organized by detailed subject)
Integrated (reconciles name conflicts and units of measure)
Time Variant (time series: multiple time points that reveal trends, deviations, and long-term relationships)
Nonvolatile (after upload, no changes to data; changes = new data)
Web Based
Relational/multidimensional
Client/Server (structure)
Real Time
Includes Metadata

Data Integration comprises three major processes

o Data Access: ability to access/extract data
o Data Federation: integration of business views across multiple data stores
o Change Capture: based on identification, capture, and delivery of changes made to enterprise data sources
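
As a rough illustration of the change-capture idea on this card, the sketch below compares two snapshots of a source table and lists what was inserted, updated, or deleted. The function name `capture_changes`, the snapshot layout (dicts keyed by record id), and the sample records are all assumptions made for this example; real change-capture tools usually read database logs rather than diffing snapshots.

```python
def capture_changes(previous, current):
    """Compare two snapshots (dicts keyed by record id) and list the changes."""
    changes = []
    # Records that are new or differ from the earlier snapshot.
    for key, row in current.items():
        if key not in previous:
            changes.append(("insert", key, row))
        elif previous[key] != row:
            changes.append(("update", key, row))
    # Records that disappeared from the source.
    for key, row in previous.items():
        if key not in current:
            changes.append(("delete", key, row))
    return changes


# Hypothetical snapshots of a customer table taken at two points in time.
old_snapshot = {1: {"name": "Ann", "city": "Austin"}, 2: {"name": "Bob", "city": "Boston"}}
new_snapshot = {1: {"name": "Ann", "city": "Dallas"}, 3: {"name": "Cy", "city": "Chicago"}}

changes = capture_changes(old_snapshot, new_snapshot)
for change in changes:
    print(change)
```

Only the detected changes would then be delivered downstream, instead of reloading the whole table.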

Difference between OLTP (online transaction processing) and a data warehouse

OLTP systems handle a company’s routine ongoing business. In contrast, a data warehouse is typically a distinct system that provides storage for data that will be used in analysis.
o DW analysis gives management the ability to scour for information
o DW provides tactical and operational support and is used for online analytical processing systems
o OLTP is transaction processing, such as ATM withdrawals and cash register scans, and is constantly involved in handling updates to operational databases
o OLTP is primarily responsible for capturing and storing data related to day-to-day business functions

Descriptive/Reporting Analytics refers to what?

What we can do with the data.
Refers to knowing what is happening in the organization and understanding some underlying trends and causes of such occurrences.
From this data infrastructure we can develop appropriate reports, queries, alerts, and trends using various reporting tools and techniques.


Prescriptive Analytics is used to do what?

to recognize what is going on, forecast, and make decisions to achieve the best performance possible.
The goal is to provide a decision or a specific action; also termed decision/normative analysis.

What is big data?

data that cannot be stored in a single storage unit.
Refers to data that is coming in many different forms: structured, unstructured, in a stream, etc.


Actionable intelligence is?

information that can be acted upon, with the further implication that actions should be taken


Big Data= body (3Vs)

Volume, Variety, and Velocity

Intelligence Creation:

starts by identifying and prioritizing specific BI projects

Intelligence


how modern companies ethically and legally organize themselves to glean as much as they can from their customers, business environment, stakeholders, and other sources of potentially valuable information


BI Governance:

Each project must be examined through costing associated with the general process phases, as well as the costs of maintaining the application for the business user and the end user.

Subject Oriented:

data organized by detailed subject, such as sales, products, or customers, containing only information relevant for decision support. Enables users to determine not only how their business is performing, but why.


Product Oriented

Most operational databases are product oriented and are tuned to handle transactions that update the database. An ODS supports short-term decisions and stores only very recent information.

Nonvolatile

After data are entered into a DW, users cannot change or update them. Obsolete data are discarded, and changes are recorded as new data.
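
One way to picture "changes are recorded as new data" is an append-only table, sketched below. The class `AppendOnlyTable`, its method names, and the sample prices are hypothetical and exist only for this illustration; a real warehouse would enforce this at the database level.

```python
from datetime import date


class AppendOnlyTable:
    """Toy nonvolatile store: rows are only ever added, never edited in place."""

    def __init__(self):
        self._rows = []

    def record(self, key, value, as_of):
        # A "change" is stored as a brand-new row with its own date.
        self._rows.append({"key": key, "value": value, "as_of": as_of})

    def history(self, key):
        # Every version ever recorded for this key, in insertion order.
        return [row for row in self._rows if row["key"] == key]

    def latest(self, key):
        rows = self.history(key)
        return max(rows, key=lambda row: row["as_of"]) if rows else None


prices = AppendOnlyTable()
prices.record("widget", 9.99, date(2024, 1, 1))
prices.record("widget", 10.49, date(2024, 6, 1))  # price change = new row

print(prices.latest("widget")["value"])
print(len(prices.history("widget")))
```

Because the earlier row is kept, the same structure also supports the time-variant analysis described on the Data Characteristics card.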

Volatile

Current data, updated almost in real time, liable to change rapidly

Data Mart

is a subset of a data warehouse, consisting of a single subject area. Can be dependent or independent.

Dependent Data Mart:


is a subset that is created directly from the data warehouse. It has the advantages of using a consistent data model and providing quality data, which ensures that the end user is viewing the same version of the data that is accessed by all other data warehouse users.

Independent Data Mart:

a small warehouse designed for a strategic business unit or department; a lower-cost, scaled-down version of a data warehouse whose source is not an Enterprise Data Warehouse

Metadata

data about data that describes the structure of and some meaning about data

Parallel Processing

enables multiple CPUs to process data warehouse query requests simultaneously and provides scalability

ETL Processing


Consists of extraction (reading data from one or more databases), transformation (converting the extracted data from its previous form into the form it needs to be in so that it can be placed into a data warehouse or simply another database), and load (putting the data into the data warehouse).
Transformation occurs by using rules or lookup tables or by combining the data with other data.
The purpose is to load the warehouse with integrated and cleansed data.
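
The three ETL steps can be sketched in a few lines of Python using only the standard library. Everything specific here is an assumption for illustration: the inline CSV stands in for a source system, `REGION_LOOKUP` is a made-up lookup table, and an in-memory SQLite database plays the role of the warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical source data; the third row is missing its amount.
SOURCE_CSV = """order_id,region_code,amount
1,N,100.50
2,S,20.00
3,N,
"""

# Lookup table used during transformation (illustrative values).
REGION_LOOKUP = {"N": "North", "S": "South"}


def extract(text):
    """Extraction: read raw rows from the source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transformation: cleanse rows and convert codes via the lookup table."""
    cleaned = []
    for row in rows:
        if not row["amount"]:  # rule: drop rows with no amount
            continue
        cleaned.append(
            (int(row["order_id"]), REGION_LOOKUP[row["region_code"]], float(row["amount"]))
        )
    return cleaned


def load(conn, rows):
    """Load: put the cleansed rows into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (order_id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(extract(SOURCE_CSV)))
loaded = warehouse.execute(
    "SELECT order_id, region, amount FROM sales ORDER BY order_id"
).fetchall()
print(loaded)
```

The incomplete third order never reaches the warehouse, which is the "integrated and cleansed" outcome the card describes.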


Benefits of Data Warehouse

1) End users can perform extensive analysis in numerous ways.
2) A consolidated view of corporate data is possible.
3) Better and timelier information is possible.
4) Enhanced system performance can result.
5) Data access is simplified.


Fact Table:

set of relational table objects that are related by a set of foreign-to-primary-key relationships; requires a collection of measurable data that is extracted from a source transactional system (data is stored in table objects)


Dimension Table

stores descriptive information about the measures collected by a transactional system (dates, times, counts, metrics, currency, etc.)

Schema

the relational shape of the diagram and its use of foreign and primary keys
Star Schema consists of a central fact table surrounded by and connected to several dimension tables
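
A small star schema can be built with Python's built-in `sqlite3` module to show how the fact table's foreign keys point at the dimension tables. The table names (`fact_sales`, `dim_date`, `dim_product`) and the sample rows are invented for this sketch.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension tables hold descriptive attributes.
CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, year INTEGER);
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT);

-- The central fact table holds measures plus a foreign key per dimension.
CREATE TABLE fact_sales (
    date_id INTEGER REFERENCES dim_date(date_id),
    product_id INTEGER REFERENCES dim_product(product_id),
    amount REAL
);

INSERT INTO dim_date VALUES (1, 2023), (2, 2024);
INSERT INTO dim_product VALUES (10, 'Widget'), (20, 'Gadget');
INSERT INTO fact_sales VALUES (1, 10, 5.0), (2, 10, 7.5), (2, 20, 3.0);
""")

# A typical analytical query: join the fact table to its dimensions and aggregate.
totals = conn.execute("""
SELECT d.year, p.name, SUM(f.amount)
FROM fact_sales AS f
JOIN dim_date AS d ON d.date_id = f.date_id
JOIN dim_product AS p ON p.product_id = f.product_id
GROUP BY d.year, p.name
ORDER BY d.year, p.name
""").fetchall()

for row in totals:
    print(row)
```

Each dimension joins directly to the fact table, which is what gives the diagram its star shape.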

Snowflake Schema:

has hierarchy; is represented by centralized fact tables that are connected to multiple dimensions

Data Warehouse Administrator

should be familiar with high-performance software, hardware, and networking technologies.
The DWA should possess business insight and should be familiar with the decision-making processes so as to suitably design and maintain the data warehouse.
Keeps existing requirements and capabilities of the DW stable while simultaneously providing flexibility for rapid improvement.
Needs communication skills.

Report


is any communication artifact prepared with the specific intention of conveying information in a presentable form to whoever needs it, whenever or wherever they may need it

Bar Chart

are effective when you have nominal data or numerical data that splits nicely into different categories so you can quickly see comparative results and trends within your data. Can be stacked, horizontal, or vertical.

Pie Chart

Visually appealing; should only be used to show the relative proportions of a specific measure. If the number of categories to show is more than a few (4), consider using a bar chart.

Scatter Plot

often used to explore the relationship between two or three variables. Having more than three variables is not easily translated. Scatter plots are an effective way to explore the existence of trends, concentrations, and outliers.


Bubble Chart

enhanced versions of scatter plots. Should be viewed as a technique to enrich data illustrated in scatter plots. By varying sizes/colors of circles, one can add data dimensions, offering more enriched meaning.


Histogram

looks like a bar chart; the difference is the way information is portrayed. Histograms are used to show the frequency distribution of a variable or several variables. They show the distributional shape of the data.
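
The frequency distribution behind a histogram can be computed by counting how many values fall into each equal-width bin. The helper `frequency_distribution`, the bin width of 10, and the sample ages below are assumptions chosen for this sketch.

```python
from collections import Counter


def frequency_distribution(values, bin_width):
    """Count how many values fall into each bin, keyed by the bin's lower edge."""
    counts = Counter((value // bin_width) * bin_width for value in values)
    return dict(sorted(counts.items()))


# Hypothetical sample: customer ages.
ages = [21, 25, 29, 33, 34, 38, 41, 47, 52]
width = 10
distribution = frequency_distribution(ages, width)

# Text rendering: one row per bin, one '#' per observation.
for start, count in distribution.items():
    print(f"{start}-{start + width - 1}: {'#' * count}")
```

Unlike a bar chart, the categories here are numeric ranges derived from the data, so the rows taken together show the shape of the distribution.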

Gantt Chart

special case of horizontal bar charts that are used to portray project timelines, project task/activity durations, and overlap among the tasks/activities. A good aid for management and control of projects.

PERT Charts - AKA Network Diagrams

show precedence relationships among project activities/tasks; developed primarily to simplify the planning and scheduling of large and complex projects. Composed of nodes (rectangles/circles) and edges (directed arrows).
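
The precedence relationships in a network diagram form a directed graph, so a valid task order can be found with a topological sort. The sketch below uses `graphlib.TopologicalSorter` from the standard library (Python 3.9 or later) to compute the earliest finish time of each task; the task names and durations are made up for the example.

```python
from graphlib import TopologicalSorter

# Hypothetical project: task durations in days and each task's predecessors.
durations = {"design": 3, "build": 5, "test": 2, "docs": 1}
predecessors = {
    "design": set(),
    "build": {"design"},
    "docs": {"design"},
    "test": {"build"},
}


def earliest_finish_times(durations, predecessors):
    """Walk tasks in precedence order; a task starts when all predecessors finish."""
    finish = {}
    for task in TopologicalSorter(predecessors).static_order():
        start = max((finish[p] for p in predecessors[task]), default=0)
        finish[task] = start + durations[task]
    return finish


finish = earliest_finish_times(durations, predecessors)
project_length = max(finish.values())
print(finish)
print(project_length)
```

The nodes of the diagram correspond to the keys of `predecessors`, and each predecessor entry is one directed edge.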

Data Visualization (Information Visualization)

the use of visual representations to explore, make sense of, and communicate data.

Managerial Reports

business performance measured through outcome-oriented metrics, such as KPIs or service-level agreements (SLAs). Tracked over time; used with TQM or Six Sigma.

Dashboard Type

present a range of different performance indicators on one page. It is common to have color-coded traffic lights defined for performance.

Balanced Scorecards

present an integrated view of success in an organization. In addition to financial performance, they include customer, business process, and learning and growth perspectives.

Assurance

quality service offered to users by a good business reporting system. Includes determining if and when the right information is to be delivered to the right people in the right format/way.

Publication

the system that builds the various reports and hosts them or disseminates them. Also provides notification, annotation, collaboration, etc.

Data Supply

a system that takes recorded events/transactions and delivers them reliably to the reporting system.
The data access can be push or pull, depending on whether or not it is responsible for initiating the delivery process.
It can also be polled if the data are transferred periodically, or triggered if data are transferred in case of a specific event.

Data Storage

the storage area for data and metadata. It could be a flat file or spreadsheet, but is usually an RDBMS set up as a data mart, data warehouse, or operational data store; often employs OLAP (online analytical processing).

Business Logic

steps for how the recorded transactions/events are to be converted into metrics, scorecards, and dashboards