Business Intelligence Network
business intelligence resources

Blog: William McKnight

« Famous Last Words | Main | Wednesday What: Data Modeling, the Enigma »

Wednesday What: Prove you did it

Time for the Wednesday (or thereabouts) “What” (what I have learned…). OK, I seem to be endlessly prompted in my client work with these learnings so there’s no shortage of them, but sometimes I don’t have an elegant preamble to a blog entry. So, I’ll just say it.

You’ve got to tie that warehouse data back to source or users will cry foul. It doesn’t matter how dirty the source data is. If you want to change the data en route to the warehouse to clean it, fine, change it, but bring the original data as well in a different set of columns in order to prove your tie-out.

Tie-out should make you more comfortable with your ETL as well. It sometimes involves adding pre-extract queries to the source data and post-load queries to the warehouse data. It sometimes involves ‘spot’ query checks, which can get tricky. I.e., the method used to pick your spot data can come under scrutiny. It also gets tricky when the ETL is run intra-day or real-time, when ETL cycles are at an absolute premium. However, you still need to do it IMO. These tie-out results go in your operational metadata.

Tie-out is part of weaning users from their old ways to the new way (the data warehouse way). It’s part of the bottoms-up approach to a successful data warehouse rollout. Ask key users what they will use to deem the warehouse effort successful – and do that and more. Remember, users are from Missouri - the show-me state - and IT is from Mars (according to many users I have dealt with.) And if they don’t ask about tie-out, do it anyway!

Technorati tags: data warehouse, ETL

  Posted by William McKnight on September 20, 2007 7:32 AM |

Post a comment