Last week I had an interesting discussion regarding technical aspects of data cleansing, particularly in the context of acquired data. The challenge posed was that the organization needed to collect data sets from numerous sources with no ability to introduce any types of data controls or dat avalidations. In other words, the data they got was what it was, and if they wanted to use it, they'd have to clean it up themselves.
So the discussion led to talk about tools for cleansing, and I mentioned that most products today provide some means of parsing and standardization as aprelude to entity resolution, matching, and consolidation. In fact, I will be continuing this discussion at a web seminar next week on Parsing and Standardization, and I hope you can attend!
Posted July 22, 2010 1:51 PM
Permalink | No Comments |




Leave a comment