[关闭]
@ChuckIsReady 2018-11-16T03:43:42.000000Z 字数 4556 阅读 625

BI test

未分类


Better decisions: balance between accuracy and speed

model: A model is a simplified representation of reality.

The Benefits of Models

Problem: Problem is the difference between what people desire (or expect) and what is actually occurring

Potential issues in data/information collection and estimation

Outcome of intelligence phase: A Formal Problem Statement

Descriptive models

Benefit: time saving
bounded rationality
- Simulation - most common descriptive modeling method

The Choice Phase

Includes the search, evaluation, and recommendation of an appropriate solution to the model

DSS: A DSS is typically built to support the solution of a certain problem

Components of DSS

Data Management Subsystem

Model Management Subsystem

Knowledgebase Management Subsystem

User Interface Subsystem

Phases of Decision-Making Process

1) Intelligence
2) Design
3) Choice
4) Implementation
5) Monitoring

Data warehouse: A physical repository where relational data are specially organized to provide enterprise-wide, cleansed data in a standardized format

A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of data in support of management’s decision

-

Characteristics of DW
- Subject oriented
- Integrated
- Time-variant (time series)
- Nonvolatile
- Summarized
- Not normalized
- Metadata
- Web based, relational/multi-dimensional
- Client/server
- Real-time and/or right-time (active)

Definitions

  • Operational data stores (ODS)
    A type of database often used as an interim area for a data warehouse
  • Oper marts
    An operational data mart.
  • Enterprise data warehouse (EDW)
    A data warehouse for the enterprise.
  • Metadata
    Data about data. In a data warehouse, metadata describe the contents of a data warehouse and the manner of its acquisition and use

Two Approaches:

OLTP: On Line Transaction Processing
– Describes processing at operational sites
OLAP: On Line Analytical Processing
– Describes processing at warehouse

OLTP
- Mostly updates
- Many small transactions
- Mb-Tb of data
- Raw data
- Clerical users
- Up-to-date data
- Consistency, recoverability critical

OLAP
- Mostly reads
- Queries long, complex
- Gb-Tb of data
- Summarized, consolidated data
- Decision-makers, analysts as users

Types of OLAP Servers

  • Relational OLAP (ROLAP)
    – Use relational or extended-relational DBMS to store and manage warehouse data and OLAP middle ware to support missing pieces
    – Include optimization of DBMS backend, implementation of aggregation navigation logic, and additional tools and services
    – greater scalability

  • Multidimensional OLAP (MOLAP)
    – Array-based multidimensional storage engine (sparse matrix techniques)
    – fast indexing to pre-computed summarized data

  • Hybrid OLAP (HOLAP)
    – User flexibility, e.g., low level: relational, high-level: array
  • Specialized SQL servers
    – specialized support for SQL queries over star/snowflake schemas

ROLAP

MOLAP

Data Integration and the Extraction, Transformation, and Load (ETL) Process

Integration :

  • Extraction -- reading data from a database
  • Transformation -- converting the extracted data
    from its previous form into the form in which it
    needs to be so that it can be placed into a data
    warehouse or simply another database
  • Load -- putting the data into the data warehouse
添加新批注
在作者公开此批注前,只有你和作者可见。
回复批注