Blog: David Loshin« July 2006 | Main | September 2006 » August 10, 2006Anonymization and IdentifiabilityAOL admits their goof in publishing huge amounts of search data that is questionably anonymized. The New York Times describes some details of the person identified through analysis of the released search data, as was reported in Martin McKeay's blog entry. I always have a dual reaction to the uproar over the privacy issues associated with the release of this kind of data. First, I am amused that a big company like AOL doesn't have the governance controls in place to assess the public's reaction to the publication of what might be considered sensitive data. The second is surprise that "The Public" is concerned over the exposure of what they suddenly consider to be private information, when in fact the privacy policy states that the data may be presented to others in a nonidentifiable way ("(others) ...receive aggregate data about groups of AOL Network users, but do not receive information that personally identifies you" Of course, AOL thought that the released data was presented in a way that did not personally identify anyone. The fact that others are able to extract identifiable information from presumably anonymized data should be a wake up call to AOL to review how their governance practices are deployed to ensure they are abiding by their own policies. August 4, 2006Now You Can Listen to my Articles Also!I am a firm believer in providing multiple channels for conveying information about data quality, data integration, master data management, business intelligence, so I have undertaken a program to begin recording the text of my articles as mp3 files so that you can download and listen to them at your convenience. The first is my July, 2006 Business Intelligence Network article, Identity Resolution and Data Integration. Let me know what you think by dropping me an email (loshin@knowledge-integrity.com)! |