Blog: Dan E. Linstedt« Dynamic Nature of Systems | Main | Welcome to Thought Experiments » Appliances for Business Intelligence and Data WarehousingThere's a movement afoot in the appliance world. Appliances are growing up. They are getting faster, smaller, cheaper, and yes: more specialized. I had it in my mind that the appliance market would combine on a single platform, and provide common plug & play hardware interfaces, well that just doesn't seem to be the case (maybe in the future, but then again maybe not). In this entry we will explore the different classes of appliances that are available, and what they do. We will also take a look at where they may go within the next 12 to 18 months. There are classes of appliances for DW and BI these days, and there's still some debate about what "appliances" really are. But all of that aside, if we take a general definition of the "Appliance" for BI / EDW space - then it might look something like this: A basic plug and play black box with a programmable interface, and an embedded data management engine (might be a database, might be an indexing engine, or something else). In any case, it manages "data in, and data out" at high rates of speed, mostly through network traffic, and a listener of some sort. Most of these appliance boxes come as autonomous network aware components with self-contained hardware. The "magic" is usually buried somewhere in the firmware algorithms, the high speed data stores, the caching mechanisms, and the internal data placement. Some would go so far as to say: if you could predict the context of a query, and match it with the context of the data before executing the query, you'd have the world’s fastest data retrieval and data placement engine. The problem is: the context changes when the query / question changes. The other problem is: data by itself is not deterministic of context; therefore logical context groupings do not make sense in the storage patterns. This is the age-old problem of Random Access File Systems on Physical Disk (Hard drives). Some of this can be solved with RAM disk, USB Flash Drives, and so on - but I ramble. So let's see what we can find about appliances. I used a web search with the following terms to dig up information: Term: "BI Appliance" Term: "DW Appliance" And let's not forget: "Data Appliance" Ok, so what does all this mean? This brings me to the following conclusion: A data appliance simply manages the data access and retrieval, regardless of the type of data, the source or target of the data and the functionality. Within "Data Appliances" their might be specific needs (like backup / restore), high speed text / unstructured access, or database "file" access. The question here is: what then, differentiates a "Data Appliance" from something like an EMC, Hitatchi, or Fujitsu Smart SAN/NASD array? At first glance, not much. However, when we pull back the covers we begin to see some levels of specialized functions: like managing database files, or managing unstructured text documents, or specifically backup/restore as if it were a hot-swap drive within an existing RAID array. This is a hard market to compete in, producing differentiators will be critical to the success of specific vendors like Dataupia. They are already making a splash in the BI world, but they'll have to go a few steps further (which they are already investigating according to their partner’s page). The BI appliance can mean all kinds of things, but one thing (in my mind) it certainly DOESN'T mean is: "I don't need a data warehouse anymore." That couldn't be further from the truth. There are some vendors out there touting Business Intelligence without a Data Warehouse. That may be, and yes, you can get Operational Business Intelligence (a phrase coined by Claudia Imhoff and Colin White) without a Data Warehouse, but Claudia and I agree: to get the analytics, historical trends and patterns, and provide true data mining capabilities, a data warehouse must be part of the picture. So, where does that leave the BI APPLIANCE? The BI Appliance can become the Data Mart Appliance, or act as an Operational BI Appliance, possibly incorporating the physical ODS (operational data store) within it's hardware. The BI Appliance can speed up queries, understand data access patterns and focus on sharing data across web services (transactional data that is). But this is an Operational BI Appliance. What are you talking about? So what, why should I care? How do they lower the cost? Ok, so you get the picture - there's a whole lot more to this than I've discussed, but it is clear that this really is an optimal functionality. Who wants or needs to maintain "separate hardware, separate databases, separate functionality" anymore? Why not have it bundled and working together, pre-packaged and already performance tuned? What does the future look like? For now, there will be an appliance for reporting, one for databases, one for OLAP cubes, one for Web Services, one for ETL, one for Quality/Cleansing, hopefully one for Metadata and so on. The race goes on, and we will continue to see different vendors enter and leave this space. I also think this space is ripe for acquisitions and consolidations. Cheers, |
Comments
Another good article. I really don't envy the people who need to make purchasing decisions around DW and BI.
You used to be able to just compare the major DW vendors, but now appliances and EII and enterprise search vendors are offering easier implementations and shortcuts. The big vendors are countering with mega suites and pseudo appliances. Open source vendors have moved to suites instead of stand alone products. You also have SaaS offerings and web 2.0 offerings to go with it and OEM products.
All these offerings require different ways to calculate the total cost of ownership - a complex equation of skills, re-use, flexibility, ease of implementation etc.
No wonder so many software makers are forced to spend more money on marketing then software development! One challenge for specialised appliance makers is that once a mega vendor gets one product into a company it is becoming easier for the customer to keep choosing products from that vendor that are compatible to avoid this product selection confusion - with the risk that you are not getting best of breed.
Posted by: Vincent McBurney | August 14, 2007 6:58 PM
Nice overview, Dan. Anyone in the BI marketplace has to consider these products, as they offer some real advantages to their big name counterparts. However, appliances don't fit in every scenario. All the more reason to understand our business requirements before making hardware/software decisions.
Posted by: Stuart Mullins | August 24, 2007 11:23 AM