Business Intelligence Network Business Intelligence Resources

Blog: Dan E. Linstedt

« Demystifying the MDM definition | Main | A tribute to my mentors »

Understanding MDM: Rough Seas and Muddy Water

I recently blogged on this subject, and then I asked my friends in the industry what they thought MDM meant to them. Not surprisingly, I received a number of different answers. I'll post their answers in this entry, and then discuss what I believe MDM to be. David Loshin recently wrote a new article on MDM and the basics of MDM, some of which I agree with, other parts I disagree with - I'll post here what my thoughts are on this subject as well. In the next blog entry we'll dive a little deeper and work at defining specifics for Master Data.

Bill Inmon on MDM:

I have always thought of MDM as reference tables. You can extend the definition a little and include other kinds of metadata I suppose. But I think people are stretching MDM way out of proportion.

Stephen Brobst:

I would extend MDM beyond just dimensional data.

I like to think of data in three categories:

* Data that keeps track of events (e.g., sales detail, deposit/withdrawal txns, CDRs, etc.)
* Data that keeps track of the “state” of an entity (e.g., customer, account, product, etc.)
* Data that keeps track of “relationships” (e.g., the relationship between products and categories, persons and households, etc.)

MDM generally encompasses the second category and the third category as it applies to the second category. In other words, MDM keeps track of “state” entities and relationships between “state” entities. But not events or relationships to events.

Claudia Imhoff:
Claudia and I had a discussion about MDM, I will have a quote for the next entry I make, and she and my good friend Colin White are currently writing a research study on MDM, due out soon. I expect this to be extremely helpful. However, in paraphrasing what Claudia and I discussed, here is the general gist:

Master Data Management should be thought of as a process, a business process, not as a tool set or something that a vendor sells. Master Data exists as reference tables throughout the enterprise, be-it on source systems, data marts, data warehouses, staging areas, or operational data stores. It doesn't matter where the tables reside, just that they do reside within the organization as a centralized master list of information containing both the KEY and the DESCRIPTIVE components of what that key represent today.

David Loshin:

Master data objects are those core business objects that are used in the different applications across the organization, along with their associated metadata, attributes, definitions, roles, connections and taxonomies. Master data objects are those "things" that we care about - the things that are logged in our transaction systems, measured and reported on in our reporting systems, and analyzed in our analytical systems. Common examples of master data include:

* Customers
* Suppliers
* Parts
* Products
* Locations
* Contact mechanisms

Ok, I can see where all the confusion comes from. The father of data warehousing and my good friend Bill Inmon has said quite plainly that MDM is nothing more than REFERENCE TABLES, which can include references to anything you want. Centralized information, centralized metadata, and centralized definitions - all quality checked, integrated and providing singularity for data attributes by their corresponding key.

MDM is NOT:
* A data warehouse
* A data Mart
* An Operational Data Store
* A single source system
* A Tool, or technology per-say

MDM IS:
* A governance process, a business driven process, a consolidation of localized information used for reference by the business to better understand their business processes.

Yes, MDM includes Governance, Data Administration, and parts of Data Management. It also includes the data itself as the "master list" of reference material. We've had these concepts for years, especially back on the mainframes. I remember 12 to 15 years ago working with "master parts lists, master contracts lists, master work-orders lists, master employees lists", and of course "master systems" that are supposed to dictate the creation and management of the data set, dispersing it to "child systems" - of course this is all part of what lead to integration mass confusion (nightmarish over-night synchronization processes that were supposed to keep the master lists in synch across the different systems).

I just read a Knightsbridge paper that stated that "localized copies of master data are OK." I beg to differ. Today's networks are fast enough and global definitions (metadata synchronization) is better done these days, good enough to create and locate a SINGLE MDM data source for reference information. The point is to avoid the future "fights" over who's right and who's wrong with their "copies" of the data sets. Integration NOT DIS-INTEGRATION is what we are trying to achieve with MDM.

If you need to warehouse your Master Lists, then do so, but fight the good fight to consolidate the metadata definitions, and consolidate the data sets across the same semantically defined layers (see Data Vault Data Modeling). At the end of your MDM initiative you should be able to identify true MASTER data stores that are not localized, that can be utilized horizontally and vertically across your organization without fear. Quality data and single copies of information are what we want. Utilizing the Data Vault Data Modeling architectures, you can easily encompass the MDM definition of all four individuals above.

In my next blog entry I'll go through a few things like creating a single definition for MDM,

How are you implementing MDM? Which of these definitions matches your business?

Come to TDWI in August, San Diego, share your MDM experience in my MDM night course.

Thanks,
Dan L

  Posted by Dan Linstedt on April 3, 2006 5:15 AM |

Comments

Dan, every so often someone will resuscitate and old practice calling it by a new name. I have been in the business of data stewardship for over 40 years and I remember when we had files as stores, master files, disk storage, data bases, etc. All of a sudden, we exploded into directories, dictionaries; later, data warehouses, business rules, etc. At one point, our industry hit rock bottom and these 'new' methods came to be because we needed to generate revenues; many people were starving because the technology had reached its peak and couldn't keep up with methodology. We were centralized then but we decentralized; this was the catalyst needed for the Immons, Ross' would start their race on their own track. Only few have remained focused (J. Zachman, William G. Smith and others)in what really MDM should be; but they don't call it MDM but Data Resource Management, Data Stewardship (of which data management is a subset),proclaiming that centralization is where the data-business driven systems should be.

We all are in state of confusion not knowing what to do with the ever and overflowing problem we have with the data resource; oh!, let's create anothe DW but make sure that the rules apply. Trying to solve the 'silo' problem, we are creating more and bigger ones. In old times, everything was self contained and it served us well but the technology was faulty; now we have plenty of technology to go about but we have bigger problems, we want more, the more we get the further we distance ourselves from the solution.

MDM is what you say it is but let's not forget that data is what business functions run on; let's define the business functions, build the data that supports these business, utilize technology to link these functions. Not just build isolated mammoth systems just because we have powerful technology but bring these systems centrally.

Hi S. Ramos,

Thank-you so much for the kind and thoughtful comments, I really appreciate your thoughts on the matter - and am in whole-hearted agreement. I feel the same way about Governance, and Compliance - these were "metrics" and Lean-Initiatives and "Cycle Time Reduction" in the old days... Lest we forget about Business Process Re-Engineering.

Centralization is a great thing, now that the technology has caught up and can do it. Decentralized views of data can be good too - but IF AND ONLY IF they are the single source or master list of the data in existence, and if it's managed in an enterprise fashion by a single metadata view.

Ahh could this be the wakings of EII coming to the fore-ground to "manage master metadata" and assist with the tracking and location of "master data sets"??? Intermingle that with metadata management behind the scenes and we have one powerful combination for business.

Your comment about the data - and the fact that it's what the business runs on is appropriate. It's one of those things that I commented on in an earlier blog posting: data has value, data and business and technology all need to work together. There's a funny line (gray mostly) between the three components:
1. We need to better understand the data to make better business decisions.
2. We need to understand what technology CAN do (new and different) in order to use it to enable better business decisions.
3. We need to think outside the box, and ASK for technology and data to do things that it doesn't currently do.

No matter what we call it, it can be "old is new again, but with different twists."

Thank-you kindly for your feedback,
Dan L

Eric, not sure that I got this right. No local copies ? So, I have this automotive planning application in a retail corporation. One of the entry screens validates against a list of automotive parts which is a UNION of 5 sets of parts, each selected through a complex SQL from a master list of over 300,000 products. There are 100 concurrent users, and each enters 50 parts in one cycle. Response time should be sub-second. Multiply this by the hundreds of applications, and thousands of users existing in this retail chain, across supply chain, marketing, advertising, finance. Are you saying that all these should hit the MDM ?
If there is a glitch on the network, or the MDM is down for some reason, are you saying that the whole corporation should come to a standstill ?

Post a comment