• No results found

OLAP (online analytical processing) is computer processing that enables a user to easily and selectively extract the data from different points of view.

N/A
N/A
Protected

Academic year: 2022

Share "OLAP (online analytical processing) is computer processing that enables a user to easily and selectively extract the data from different points of view. "

Copied!
30
0
0

Loading.... (view fulltext now)

Full text

(1)

OLAP (Online Analytical Processing)

UNIT-III Part 2

(2)

UNIT-III

Overview of Data warehouse and OLAP technology, Types of OLAP Servers, Multi-dimensional data model, data warehouse architectures, Data Warehouse Implementation, concept of data warehousing to data mining.

(3)

OVERVIEW

INTRODUCTION

OLAP CUBE

HISTORY OF OLAP

OLAP OPERATIONS

DATAWAREHOUSE ARCHITECHTURE

DIFFERENCE BETWEEN OLAP &

OLTP

TYPES OF OLAP

APPLICATIONS OF OLAP

(4)

INTRODUCTION TO OLAP

OLAP (online analytical processing) is computer processing that enables a user to easily and selectively extract the data from different points of view.

OLAP allows users to analyze database information from multiple database systems at one time.

OLAP data is stored in multidimensional

databases.

(5)

AN EXAMPLE…

(6)

Some popular OLAP server software programs include:

Oracle Express Server

Hyperion Solutions Essbase

OLAP processing is often used for data mining.

OLAP products are typically designed for multiple-user environments, with the cost of the software based on the number of users.

(7)
(8)

THE OLAP CUBE

An OLAP cube is a multi-dimensional array of data OR simply a data structure.

Online analytical processing (OLAP) is a

computer-based technique of analyzing data to look for insights.

The term cube here refers to a multi-dimensional dataset, which is also sometimes called a hypercube if the number of dimensions is greater than 3.

(9)

For example, a company might wish to summarize financial data by product, by time-period, and by city to compare actual and budget expenses. Product, time, city and scenario (actual and budget) are the data's dimensions

(10)

OLAP CUBE

(11)

HISTORY OF OLAP

The term OLAP was created as a slight modification of the traditional database term OLTP (Online Transaction Processing).

Databases configured for OLAP employ a multidimensional data model, allowing for complex analytical and ad-hoc queries with a rapid execution time.

Nigel Pendse has suggested that an alternative and perhaps more descriptive term to describe the concept of OLAP is Fast Analysis of Shared Multidimensional Information (FASMI).

(12)

OLAP OPERATIONS

There are five basic analytical operations that can be performed on an OLAP cube:

1.Drill down 2.Roll up

3.Dice 4.Slice 5.Pivot

(13)

Drill Down operation

In drill-down operation, the less detailed data is converted into highly detailed data. It can be done by:

Moving down in the concept hierarchy

Adding a new dimension

In the cube given in overview section, the drill down operation is performed by moving down in the concept hierarchy of Time dimension (Quarter -> Month).

(14)
(15)

Roll Up operation

It is just opposite of the drill-down operation. It performs aggregation on the OLAP cube. It can be done by:

Climbing up in the concept hierarchy

Reducing the dimensions

In the cube given in the overview section, the roll-up operation is performed by climbing up in the concept hierarchy of Location dimension (City -> Country).

(16)
(17)

Dice operation

It selects a sub-cube from the OLAP cube by selecting two or more dimensions. In the cube given in the overview section, a sub-cube is selected by selecting following dimensions with criteria:Location = “Delhi” or “Kolkata”

Time = “Q1” or “Q2”

Item = “Car” or “Bus”

(18)

Slice operation

It selects a single dimension from the OLAP cube which results in a new sub-cube creation. In the cube given in the overview section, Slice is performed on the dimension Time

= “Q1”.

(19)

Pivot operation

It is also known as rotation operation as it rotates the current view to get a new view of the representation. In the sub-cube obtained after the slice operation, performing pivot operation gives a new view of it.

(20)

TYPES OF OLAP

Relational OLAP(ROLAP):

To store and manage warehouse data, ROLAP uses relational or extended-relational DBMS(DBMS).

Multidimensional OLAP(MOLAP): MOLAP uses array-based multidimensional storage engines for multidimensional views of data. With multidimensional data stores, the storage utilization may be low if the data set is sparse.

(21)

Hybrid Online Analytical Processing (HOLAP)

Hybrid OLAP is a combination of both ROLAP and MOLAP. It offers higher scalability of ROLAP and faster computation of MOLAP. HOLAP servers allows to store the large data volumes of detailed information

(22)

Relational OLAP

Provides functionality by using relational databases and relational query tools to store and analyze multidimensional data.

Build on existing relational technologies and represent extension to all those companies who already used RDBMS.

Multidimensional data schema support within the RDBMS.

Data access language and query performance are optimized for multidimensional data.

Support for very large databases.

(23)

Multidimensional OLAP

MOLAP extends OLAP functionality to MDBMS.

Best suited to manage, store and analyze multidimensional data.

Proprietary techniques used in MDBMS.

MDBMS and users visualize the stored data as a 3- Dimensional Cube i.e Data Cube.

MOLAP Databases are known to be much faster than the ROLAP counter parts.

(24)

ROLAP v/s MOLAP

Characteristics ROLAP MOLAP

SCHEMA User star Schema

Additional

dimensions can be added dynamically.

User Data cubes

Addition dimensions require recreation of data cube.

Database Size Medium to large Small to medium

Architecture Client/Server Client/Server

Access Support ad-hoc

requests

Limited to pre-defined dimensions

(25)

Characteristics ROLAP MOLAP

Resources HIGH VERY HIGH

Flexibility HIGH LOW

Scalability HIGH LOW

Speed Good with small data sets.

Average for medium to large data set.

Faster for small to medium data sets.

Average for large data sets.

(26)

Implementation of OLAP servers

ROLAP:

Data is stored in tables in relational database or extended relational databases.

They use an RDBMS to manage the warehouse data and aggregations using often a star schema.

Advantage:

Scalable

Disadvantage:

Direct access to cells.

(27)

MOLAP:

Implements the multidimensional view by storing data in special multidimensional data structures.

Advantages:

Fast indexing

Only values are stored.

Disadvantage:

Not very Scalable

(28)

APPLICATIONS OF OLAP

OLE DB for OLAP

OLE DB for OLAP (abbreviated ODBO) is

a Microsoft published specification and an industry standard for multi-dimensional data processing.

ODBO was specifically designed for Online Analytical Processing (OLAP) systems by

Microsoft as an extension to Object Linking and Embedding Database (OLE DB).

(29)

/Contd…

Marketing and sales analysis

Consumer goods industries

Financial services industry (insurance, banks etc)

Database Marketing

(30)

One main benefit of OLAP is consistency of information and calculations.

It allows a manager to pull down data from an OLAP database in broad or specific terms.

OLAP creates a single platform for all the information and business needs, planning,

budgeting, forecasting, reporting and analysis.

BENEFITS OF OLAP

References

Related documents

 Low-cost storage and active data archive.  Staging area for a data warehouse and analytics

Testing of Hypothesis: Test of Significance, Chi-square test, t-test, ANOVA, F-test...

The significant components of data mining systems are a data source, data mining engine, data warehouse server, the pattern evaluation module, graphical user interface,

• Hierarchical database model structures data as a tree of records, with each record having one parent record and many children while, the network model allows each record

• Hierarchical database model structures data as a tree of records, with each record having one parent record and many children while, the network model allows each record

The paper aims to understand the challenges faced by graduate students and their perspectives in data-intensive research at RRI regarding data types; collection methods;

NEW DELHI DATA PROCESSING, SOFTWARE DEVELOPMENT & COMPUTER CONSULTANCY SERVICES.. 48.94

Software development for the Computer (Processing Unit): To receive data via serial communication from the ATmega32 and smoothens the data and calculate