The question is "do we need SQL" to use a database or can we live in a world of NoSQL. The answer in this case, is probably minimal to no SQL will be the future EDW world. The reason for this being technologies like Hadoop and MapReduce, which are reducing workload complexity from Applications. These applications are being increasingly built and delivered on the cloud and mobile platforms, which require a very light front-end footprint and heavy back-end processing power.
Another driving trend is the increasing adoption to semantic technologies. The semantic technologies propel another trend "in-memory analytics", whereby SQL overheads are minimized on query performance. Backend systems will be SQL intensive and will use a database
A third trend is to integrate "unstructured" or"semi-structured" data and query that result set, which is largely semantic driven.
In conclusion, we will use the DW as a backend, number crunching platform, and slowly move away from "SQL" dependency on the front end, for building out Analytical and BI application.
Virtualization and Cloud will definitely be drivers, but I do not see EDW's or even large DW's being run on pureplay Cloud platforms.
Posted December 25, 2010 3:21 PM
Permalink | 1 Comment |




I believe the real question as what is considered to be a database today ?
Before Hadoop and MapReduce, a database was a packaged product, it had several features to persist and retrieve data, but it had clear boundaries.
Files or source-data were still termed as files or in other words they were outside the territory of a 'database'. While a database is still a set of files, the collective operation through few control structures is what we've been commonly referring to as a database.
With the need to store and analyze more information, in the fastest time possible the definition of a 'database' has been stretched and its going through transformation of its own.
Expectation of a database 'setup' to be able to give a collective image of both database structures and file structures (as what you mentioned unstructured data) is the need of the day.
Other thing that has changed in the old database world is how it was setup. Underlying infrastructure for a database were mostly local and remote access and collective access was a "feature" in that world.
That has totally shifted today - while the necessacity to see them as a one (big) database has remained the same (and most important than others), infrastructure costs and constraints around that, paved ways for cloud computing. This shift has been so strong that we look back to our needs of the day and tier them up - spring a database only when you need.
On SQL, again I think we got some clues from SAS, on the possibility of interaction with database structures without a free form query layer as SQL.