In the never ending quest to determine exactly why so many BI projects and programs still fail, regardless of everything we've learned over the years, another trend has hit my top 10 list. It appears that in the quest to reduce cost and time on BI projects, there seems to be more cases where data is being consolidated rather than integrated. What's the difference? Data consolidation is the process of bringing together entities and attributes into a data warehouse, each retaining their form, value and technical characteristics as they exist in the source. Data integration is the process of combining entities and attributes such that they have common form, meaning and technical characteristics. (See graphic)

Why does data consolidation present a challenge in the data warehouse, and/or data mart? It reduces the ability of users to understand and analyze the data. Users are left to try and determine which attributes and values mean the same thing, and often attach different meaning than their colleagues. It adds time to every analysis and often reduces the value of the result. I usually find that companies try and compensate by building the complex integration rules into the semantic layers of the front end tools. Aside from the additional time required to create the model, it can significantly slow down response time, especially if the rules are being processed by the BI tool instead of the database.
Why do companies end up with data consolidations rather than data integration? In my experience, there are a number of reasons. One cause is an inexperienced DW data modeler. Often data modelers have more experience in the OLTP world than the DW world. They reverse engineer the sources and try to preserve all the attributes that meet reporting requirements. Another reason is that it takes more time to integrate data. It requires data profiling, data analysis and interaction with the subject matter experts. There can also be political factors involved with reaching common meaning. It's also difficult to integrate data. Fortuntely, we've learned techniques over the years for handling different situations, like preserving source values while still providing integrated data. Lastly, a DW may start out with only a single source and the data model is not created to accomodate integrated data.
If your BI program isn't delivering the value that was promised, maybe the issue is with what your delivering.
You can also follow me on Twitter @steve_dine
Posted March 30, 2010 8:46 AM
Permalink | No Comments |



