When I was privileged to write Data Warehouse Architecture: The Great Debate between Bill Inmon and Ralph Kimball, I had to do a lot of research on both of their philosophies and methods.
One of the things that stayed with me was that, if you follow either of their methods to the letter, you will have a superior result.
One of the requirements for Ralph Kimball’s data marts has troubled me, however. This is his requirement for conformed dimensions. Conformed dimensions usually require that everyone who will use the data will refer to the data by the same name. This actually leads to the concept of a metadata repository to effectively utilize the data warehouse.
This article will relate my experiences as general counsel in a Fortune 100, and several larger multinational corporations, dealing with corporate data systems and their impact on litigation.
In the first instance, unless the corporation is just starting its data warehouse effort, or unless it has gone through some sort of overall corporate trauma, my experience has been that the capturing and storing of the data that the business needs is done by each department.
One would think that this would be the ideal for the data mart approach, except, it would suggest that consistent naming conventions must be used.
The corporations I worked for were heavy-manufacturing and most specifically water and wastewater treatment manufacturers. When data systems were set up, it was mostly department by department. Each manager had, or wanted, his own system.
As an example, if we sold a clarifier; which is a large tank with complex internal machinery to initially start the water purification process:
Compound this naming challenge with the fact that we used independent distribution and sales representatives who also used different naming conventions in their own business such as sales person, county, vendor, order number etc. One could argue that all of these entities should be required to use the same naming convention. But, we all know that if marketing is using its budget for creating its system, it doesn’t want to subsidize accounting, manufacturing etc. (and vice versa). Also, with regard to the independent entities, our sales representatives and distributors, the corporation had no control over what they chose to use.
When you have one event, the sale of a clarifier, that is classified under so many different names, as long as everyone is used to the process, there is no problem and business proceeds as usual. However, as I said, I managed everything related to legal matters, and almost every one of my requirements was an exception to normal business practices.
Actual Litigation Experience
When my company was served with a summons and complaint, alleging product malfunction or personal injury, the key nomenclature in the summons and complaint was the name of the plaintiff. If it was an individual because of a complaint for personal injury it would never coincide with any of the data names the company used for the product. If it was for malfunction or improper performance, it could be named by any one of our departments' nomenclature.
In a legal situation such as this, because of very stringent time deadlines to answer, or otherwise plead, there are many critical assessments and communications to be made within the corporation as soon as the summons and complaint is served. These are required to preserve evidence of the matter and include:
Is there a possible metadata repository or “universal translator” that every department would use? Would one department volunteer to pay for it? Probably not, because this really needs to be an enterprise-driven initiative. What does the rule 34 (b) deponent do when he finds that because we haven’t correctly specified the documents that a major part of them have been destroyed? An entire company can be lost through “shredding.” We remember Arthur Andersen, don’t we? These are problems that every corporation must address. The vendor community can help businesses develop the metadata repositories so critically needed.
Recent articles by Katherine Drewek
Katherine has more than 30 years of experience in the editorial and corporate law environment. She has been responsible for the content review, editing and formatting of international newsletters focusing on business intelligence and data warehousing. She is a frequent lecturer and panelist for the American Bar Association, the National Association of Credit Managers and the Corporate Practice Institute. She has been a mentor for the Women's Leadership Conference. Katherine serves as Managing Editor for the Business Intelligence Network. Katherine can be reached at kdrewek@b-eye-network.com.