Скачать презентацию Virtual University of Pakistan Data Warehousing Lecture-23 Total Скачать презентацию Virtual University of Pakistan Data Warehousing Lecture-23 Total

04409de49d7b4d6f52a3c133785a3d54.ppt

  • Количество слайдов: 12

Virtual University of Pakistan Data Warehousing Lecture-23 Total DQM Ahsan Abdullah Assoc. Prof. & Virtual University of Pakistan Data Warehousing Lecture-23 Total DQM Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research www. nu. edu. pk/cairindex. asp National University of Computers & Emerging Sciences, Islamabad Email: ahsan 101@yahoo. com DWH-Ahsan Abdullah 1

Data Quality Management Process Establish TDQM Environment Evaluate Data Quality Management Methods Scope Data Data Quality Management Process Establish TDQM Environment Evaluate Data Quality Management Methods Scope Data Quality Projects & Develop Implementation Plans Implement Data Quality Projects (Define, Measure, Analyze, Improve) 2

Data Quality Management Process 1. Establish Data Quality Management Sub-bullets will not go to Data Quality Management Process 1. Establish Data Quality Management Sub-bullets will not go to graphics Environment • IS project managers • Development professionals. • Functional users of legacy information systems with domain knowledge • IS developers know solutions but don’t know how and where to modify 3

Data Quality Management Process yellow will go to graphics 2. Scope Data Quality Projects Data Quality Management Process yellow will go to graphics 2. Scope Data Quality Projects & Develop Implementation Plans • Task Summary: Project goals, scope, and potential benefits • Task Description: Describe data quality analysis tasks • Project Approach: Summarize tasks and tools used to provide a baseline of existing data quality • Schedule: Identify task start, completion dates, and project milestones • Resources: Include costs connected with tools acquisition, labor hours (by labor category), training, travel, and other direct and indirect costs 4

Data Quality Management Process 3. Implement Data Quality Projects (Define, Measure, Analyze, Improve) Yellow Data Quality Management Process 3. Implement Data Quality Projects (Define, Measure, Analyze, Improve) Yellow will go to graphics • Define: Identify functional user DQ requirements and establish DQ metrics • Measure: conformance to current business rules and develop exception reports • Analyze: Verify, validate, and assess poor DQ causes. Define improvement opportunities • Improve: Select/prioritize DQ improvement opportunities i. e. data entry procedures, updating data validation rules, and/or company data standards. 5

Data Quality Management Process 4. Evaluate Data Quality Management Methods • modifying or rejuvenating Data Quality Management Process 4. Evaluate Data Quality Management Methods • modifying or rejuvenating existing methods of DQ management Sub-bullets will not go to graphics • determining if DQ projects have helped to achieve demonstrable goals and benefits. Evaluating and assessing DQ work as, it is not a program, but a new way of doing business. 6

The House of Quality Matrix Technical Correlation Matrix Customer Requirements Technical Design Requirements Interrelationship The House of Quality Matrix Technical Correlation Matrix Customer Requirements Technical Design Requirements Interrelationship Matrix DWH-Ahsan Abdullah 7

How to improve Data Quality? The four categories of Data Quality Improvement § Process How to improve Data Quality? The four categories of Data Quality Improvement § Process § System § Policy & Procedure § Data Design 8

Quality Management Maturity Grid CMM Level-1 Uncertainty CMM Level-2 Awakening CMM Level-3 Enlightenment CMM Quality Management Maturity Grid CMM Level-1 Uncertainty CMM Level-2 Awakening CMM Level-3 Enlightenment CMM Level-4 Wisdom CMM Level-5 Certainity 9

Misconceptions on Data Quality § You Can Fix Data § Problem NOT in data, Misconceptions on Data Quality § You Can Fix Data § Problem NOT in data, but how it was used. § It is NOT a one time process. § Buying a cleansing tool is NOT the solution. § Some live with the problem, cant afford the tool. Sub-bullets will not go to graphics § Data Quality is an IT Problem § It is the company problem. § Define the metrics of quality. § Business has to strike a balance between quality and ROI. § Joint business and IT effort. 10

Misconceptions on Data Quality § (All) Problem is in the Data Sources or Data Misconceptions on Data Quality § (All) Problem is in the Data Sources or Data Entry § NOT the only problem. § Systems could be responsible, but actually it is the metrics. § Two divisions using different codes for same entity. § Need to track, trace, check data from creation to usage. § The Data Warehouse will provide a single source of truth § In ideal world it is indeed true. § In real world maybe multiple data warehouses, data marts, external source i. e. silos of data resulting in multiple sources of “truth”. § Even with single source of truth, if transformations and interpretations are different, an issue. 11 Sub-bullets will not go to graphics

GIGO 12 GIGO 12