Blog: Dan E. Linstedt« Dynamic Data Warehousing and IBM (a rant) | Main | DDW - Detecting and Securing Changes » Dynamic Data Warehousing... A new definitionI've been writing (scantily) about DDW in the past, in this entry we will take a look at what the definition appears to be in the industry, and then I will offer my opinion on what I think the definition _should_ be for DDW. If vendors believe that they have a DDW, or a DDW solution, then I open heartedly invite them to contact the COBICC board members, and give us all a demonstration, along with definitions of what they've produced. Dynamic Data warehousing, what does it mean to you? The first link is to a student’s research project regarding what their view of DDW is: DWs are dynamic entities that evolve continuously over time. As time passes, new queries need to be answered by them. Some of the new queries can be answered by the views already materialized in the DW. Other new queries, in order to be answered by the DW, necessitate the materialization of new views. In any case, in order for a query to be answerable by the DW, there must exist a complete rewriting 5 of it over the old and new materialized views. Ok, this is an interesting look - but certainly not the complete picture of what I see DDW to be. To give them credit, they are attacking a difficult problem: how to answer a new query that doesn't have the appropriate data set available - by building new materialized views. The concept is decent, but the words "materialized view" make the approach locked in to Oracle, as other databases do not have the notion of a materialized view. They go on to discuss how to create new views that are needed, and they do a good job of expressing the mathematics behind the desire. Again, this is only one piece of Dynamic Data Warehousing. Here's another project: Here's another interesting look, they start out sounding very promising, but when it comes to brass tacks they are merely discussing Dynamic View Generation - still a worthy cause, but not quite a DDW (as they originally claim). IBM has been at it a while, and in this definition - they are defining (you guessed it) an appliance, with bundled software, but in their press release I blogged on yesterday they said DDW is not a tool, a product, or a service... yet again they contradict themselves. Besides that - what they really have is an ADW, not a DDW.... read on... http://www.intelligententerprise.com/showArticle.jhtml;jsessionid=MHPVP5URAXGSAQSNDLRSKH0CJUNN2JVN?articleID=198000675 And another post by Doug Henschen agrees with me. My friend Lou Agosta does a decent job of discussing some of the background pieces involved in Dynamic Data Warehousing. I think what's missing here is the definition of what Dynamic really means... Should Dynamic mean the data warehouse is dynamic with near-real time data? should it mean it is dynamic with query changes? Dynamic with unstructured data? what does it mean? Here's another vendor (Axiom Software Labs) that claim to have a DDW, they are probably closer to the mark, but again all of these solutions say they have dynamic abilities, but none of them talk about HOW these abilities work, nor do they disclose what true DDW needs to be. Oh yes, a new acronym is emerging (unfortunately) DyDa - what? Ok, here are my thoughts on what is required in order to be "the next level" or to be a DDW. We require that all of the following be recognized as dynamic: And so on. Dynamic is a very versatile word, and DDW (in my mind) encompasses a whole lot more than just one piece of the pie (Dynamic Data or Dynamic Views). While these are noble efforts and steps in the right direction, they are _not_ qualified to be called a DDW environment, because they are only pieces of a larger puzzle. I welcome your comments as always, do you have a definition of DDW that you can share? What is it in your mind? Thank-you, |
Comments
So, I am trying to get my mind wrapped around this. I can see you are trying to get to something beyond ADW and real-time DW. However, I need one clarification (for today) - when you say "dynamically" (as in: "BI Reports and dashboards are dynamically altered to include the new elements" ), do you mean "automatically" (as in without human intervention)?
Thanks for the post.
Posted by: Kent Graziano | June 7, 2007 12:22 PM
Hi Kent,
Yes, what I am discussing is similar to temperature of data, but rather temperature of change to the structures. If the change appears to be measured as "harmless" then it may be colored green, and be applied automatically.
If the change is more than expected, but not a major shift (like new structures, new releationships, etc..) then it's colored yellow, and an email is sent stating that a change occurred.
If the change is quite abrupt, or "large" by some scale, it is colored red hot. The email goes out, and the change is put on hold until human interventention can figure it out.
Hope this helps,
Dan L
Posted by: Dan Linstedt | June 7, 2007 9:37 PM
Yes that helps and clarifies it. So dynamic is an automatic reaction to a change. The type of reaction is determined by the type or scope of the change (and by defined business rules).
Posted by: Kent Graziano | June 8, 2007 7:39 AM
Dan,
I honestly believe that DDW is just a theory and will remain as theory for the next decade.
Take for example OLTP systems which have been in the market for atleast two decades and still they are not dynamic. I'm not sure if there is an acronym called dynamic OLTP databases OR DTB(Dynamaic Transaction base) etc.
You just can't make these databases(OLTP,DW,ODS etc) as dynamic. The case you described above with RED, YELLOW and GREEN sounds good in theory, but in practice, all the scenarios will fall into RED category (according to your definition), as impact analysis need to be done in every case and requires human eye to evelaute the results.
Enterprises are struggling today to get to a basic datawarehouse. For DDW to become real and practical, metadata management layer in a given enterprise has to be very strong, robust and proven. Most of the enterprises today are at level 2 or level 3 (10 being best) in managing their metadata.
So I would say DDW is a good academic subject to write articles upon, but its real life applicability is at a distant future
Chandra Kapireddy
Chief BI Architect, D2I3 Inc
Posted by: Chandra Kapireddy | June 16, 2007 10:33 PM
Very interesting viewpoint. The same was said (that it was all hypothetical) to manage terabytes of data from a traditional RDBMS. The same was said about Strategic Analysis being Theoretical when Bill first discussed the need of Data Warehousing. And the same was said about Artifical Intelligence, learning systems and neural networks. All of these things have come to pass, and in fact AI, DW, and neural networking are all a part of daily life today.
I think DDW is _NOT_ theoretical, I think there are parts of it in existence today, things are being done at the semantic level of integration that were previously unnatainable. However, DDW is some years off (in reality), like anything else, it too will come to pass because there are those of us developing the parts and pieces of the architecture today.
Thank-you for your thoughtful comments. People said the same thing about quantum physics and nano technology...
Dan Linstedt
DanL@RapidACE.com
Posted by: Dan Linstedt | June 19, 2007 7:11 AM