Blog: Dan E. Linstedt« Part 6: Secrets of the Masters | Main | Unstructured Information Pushes Dynamic Restructuring » Dynamic Data Models - Automorphic ChangesIt seems people have taken the term "Dynamic Data Warehousing" and abused it. They've made it out to be about "Dynamic Data" and completely ignored "Dynamic Modeling", or dynamic restructuring as the case may be. Automorphic means self-changing, self-adapting. In this entry we'll talk about different capabilities of Dynamic Data Warehousing and the changes to data models as they grow. First, let's define what we mean by Dynamic Data Warehousing: My definition of DDW has come to mean: Now this may all sound really interesting, and extremely future based - but I can assure you - it's not. I am currently working on solutions in my lab which entail the execution of portions of these elements. The hardest one (you might think) is the dynamic modeling, or dynamic restructuring of the database... well, let me tell you - nope! That’s not the hardest piece (when the Data Vault modeling architecture is used)... Keep in mind that the DDW is one or two steps beyond the Operational Data Warehouse, which I've just begun writing about. Also remember that the term: DDW retains all the responsibilities of the "Data Warehouse", as in: time-variant, non-volatile, granular, etc... That said, the question for Dynamic Data Modeling then becomes: how do you keep history on massive volumes of information without losing value, and without "reorganizing" or altering existing structures? The answer is to come later, but with the Data Vault modeling methodology it CAN be done... So how about 3rd normal form? "Danger Will Robinson!" (quote from a U.S. T.V. show ... lost in space, from the 1960's) http://en.wikipedia.org/wiki/Danger,_Will_Robinson Ok, so what about Star Schema's? Have you ever tried to change the structure of a conformed dimension? Does it indeed get harder as the system grows, and/or the more conformity it has? Does it slow down your development efforts? Yes to all of these (at least from my personal experience). Does that make Star Schema bad? NO! Star Schemas are AWESOME, WONDERFUL, and the ONLY solution to work for OLAP, and Drill Down... Do they have a place in the DDW? YES! ABSOLUTELY! well then, where? They have a place as adaptable cubes. Something funny happens when Star Schemas are used as SINGULAR STARS to LOGICALLY define VIRTUAL marts. They work extremely well, and as long as they are logical (not physically implemented), then dynamic memory cubes can become a reality. That's right! IN-MEMORY CUBING, it's happening already in certain DB engines, but it's not yet dynamic. However, as a DDW foundational structure, we need something else. The Data Vault Model seems to be (today) the only other choice available that is actually capable of executing on this dream. We'll talk more about this in my Data Vault blog on http://www.BetterDataModel.com Cheers for now, |
Comments
I think you are correct that most people ignore the 'dynamic modeling' part of dynamic data warehousing, although one major exception to that has been Kalido (which has been a very vocal advocate of that approach for years).
To reinforce this dedication, Kalido recently added a visual modeling component to their offering that allows Kalido customers to automatically drive their warehouse from a business focused conceptual layer diagram that is automatically translated to metadata to drive warehouse processing. This includes everything from staging table creation to the storage mechanism in the relational database, to the generation of physical reporting structures, to the population of BI semantic layers like Business Objects Universes and Cognos Framework Manager models.
The underlying mechanism for this is similar (though not exact) to your Data Vault approach (e.g. Hub, Link, and Satellite -> Kalido Adaptive Services Core, Transaction Datasets, and Mapping Tables). This contributed to Kalido being the first software package being certified compliant with DW 2.0 by Bill Inmon.
This is not to say Kalido addresses everything on your list--yet. But the footprint is certainly expanding. This leads me to a question, though. There are a class of tools in the market that I will call “Usage and Query Monitoring”. Examples include Teleran and BEZ. When deployed, these tools gather a lot of information about who is using what data, allowing you to manually make decisions about things you mention (tuning queries, adding indexes, building summaries) based on actual data points. Do you envision a place for these types of applications in addressing dynamic data warehousing (e.g. automatically acting on information gathered), or are you thinking about a new class of tool here? Thanks.
Posted by: Stephen Pace | April 28, 2008 2:51 PM