<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
<channel>
<title>Blog: David Loshin</title>
<link>http://www.b-eye-network.com/blogs/loshin/</link>
<description>Welcome to my BeyeNETWORK Blog. This is going to be the place for us to exchange thoughts, ideas and opinions on all aspects of the information quality and data integration world. I intend this to be a forum for discussing changes in the industry, as well as how external forces influence the way we treat our information asset. The value of the blog will be greatly enhanced by your participation! I intend to introduce controversial topics here, and I fully expect that reader input will &quot;spice it up.&quot; Here we will share ideas, vendor and client updates, problems, questions and, most importantly, your reactions. So keep coming back each week to see what is new on our Blog!</description>
<language>en</language>
<copyright>Copyright 2008</copyright>
<lastBuildDate>Wed, 02 Jul 2008 10:37:14 -0700</lastBuildDate>
<generator>http://www.movabletype.org/?v=3.33</generator>
<docs>http://blogs.law.harvard.edu/tech/rss</docs> 

<item>
<title>IAP Day 3: Governance by Lumigent</title>
<description><![CDATA[<p>This presentation by Roger Hodskins from Lumigent focused on their AppGRC product, and Roger discussed their value proposition of monitoring database access logs to observe compliance with specified policies. The Lumigent product seems to be a good hook to include with ERP applications, and might be an interesting acquisition target for an Oracle- or IBM-style company.</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/07/iap_day_3_gover.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/07/iap_day_3_gover.php</guid>
<category></category>
<pubDate>Wed, 02 Jul 2008 10:37:14 -0700</pubDate>
</item>
<item>
<title>IAP Day 2: Informatica</title>
<description><![CDATA[<p>Informatica's IAP presentation focused on the evolution of the data quality technology plus the capabilities obtained from the Itemfield acquisition into an "extra-enterprise" data governance and process orchestration offering.</p>

<p>A welcome and interesting trend is the introduction of business process modeling into the data management operations silo.</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/07/iap_day_2_infor.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/07/iap_day_2_infor.php</guid>
<category>Data Quality</category>
<pubDate>Wed, 02 Jul 2008 09:41:33 -0700</pubDate>
</item>
<item>
<title>Beer and Diapers</title>
<description><![CDATA[<p>First of all, the canonical example of the power of data mining and predictive analytics, the correlation of purchasing beer and diapers, is widely misused. The notion is that through analysis, one data miner discovered that males typically buy diapers along with beer. This is typically followed by an explanation of why males buy beer with diapers, and <strong>then</strong> by the claim that putting beer and diapers together will increase overall sales of both items.</p>

<p>Anyone familiar with urban legends would immediately be dubious, and so I did a quick search and found a <a href="http://www.dssresources.com/newsletters/66.php">good analysis</a> of the history (and relevance) of diapers and beer.</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/07/beer_and_diaper.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/07/beer_and_diaper.php</guid>
<category>Business Intelligence</category>
<pubDate>Tue, 01 Jul 2008 13:35:46 -0700</pubDate>
</item>
<item>
<title>BPEL - People who need BPEL - Are the Luckiest People!</title>
<description><![CDATA[<p>Yet another quickie from the Independent Analyst Platform in Phoenix. I am listening to the Informatica presentation by Karen Hsu, and she is discussing the introduction of business process orchestration and workstreaming into Informatica's support platform. I could see an interesting workflow integration impact for extra-enterprise information quality management and information integration.</p>

<p>Something to explore a little more: <a href="http://en.wikipedia.org/wiki/Business_Process_Execution_Language">Business Process Execution Language </a>(BPEL).</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/07/bpel_people_who.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/07/bpel_people_who.php</guid>
<category>Business Intelligence</category>
<pubDate>Tue, 01 Jul 2008 09:41:54 -0700</pubDate>
</item>
<item>
<title>Composite Discovery and Structured Search</title>
<description><![CDATA[<p>Next up at IAP: Composite Software, introducing a combination of a search capability and the use of a relatively sophisticated approach to profiling across federated data in order to present a portal for searching through collections of data and prioritizing views that can be materialized in real time. Noted expert Clive Finkelstein commented on the similarity with what used to be the Axio product from Evoke (now part of Informatica), but the interesting part is their use of the relationship discovery purely for searching.</p>

<p>Also: the product is an "appliance," meaning that it is packaged software on top of hardware. No details of the hardware were presented, but it probably uses a number of multi-core CPUs with a lot of memory (how else could they do the analysis?).</p>

<p>Seems like an extremely interesting product, especially in the context of supporting <a href="http://en.wikipedia.org/wiki/E-discovery">e-discovery.</a></p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/06/composite_disco.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/06/composite_disco.php</guid>
<category>Business Intelligence</category>
<pubDate>Mon, 30 Jun 2008 17:02:14 -0700</pubDate>
</item>
<item>
<title>My Way or the iWay? More from the IAP</title>
<description><![CDATA[<p>The second set of presentations at the Independent Analyst Platform was by Kevin Quinn and Vincent Lam, representing Information Builders and iWay Software, a subsidiary of Information Builders. Kevin's presentation screamed through the extremely versatile organization and presentation of reporting, analysis, and some of the ways that Information Builders' product landscape feeds into an organizational business productivity and improvement activity. Reliance on data integration that has evolved over 30+ years from within lends a degree of credibility to the claims of pervasiveness and scalability.</p>

<p>Vincent's presentation on iWay spanned the spectrum of data integration capabilities. One interesting note: many other BI vendors recognized the need for a data integration (or ETL) capability, went out and bought a vendor or two to fill that void, and then wriggled and writhed through the process of making the purchased tools work together. Information Builders has grown its own internal data integration suite, which obviates the need to make things work together, and that is an extremely appealing notion.</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/06/my_way_or_the_i.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/06/my_way_or_the_i.php</guid>
<category>Business Intelligence</category>
<pubDate>Mon, 30 Jun 2008 14:01:01 -0700</pubDate>
</item>
<item>
<title>Business Objects and SAP - Easing Toward Integration?</title>
<description><![CDATA[<p>I am at the IAP, Independent Analyst Platform, in sunny (and hot) Phoenix AZ, and the first set of presentations came from folks at Business Objects, an SAP company. I mention it this way since it seemed pretty clear that the presenters were coming from the BOBJ point of view, looking at where Business Objects fits into the SAP strategy.</p>

<p>One thing that looks clear: the objective in maintaining Business Objects as a separate division focusing on being "open and agnostic" as to the platforms to be supported (aside from SAP) may allay fears of existing customers of being forced into the newly created "mega-stack."</p>

<p>I do wonder, though, about the extent to which there is cross-pollination between the product suites of SAP and Business Objects; there may be some nice chunks in NetWeaver that could be siphoned off and offered as part of a Business Objects Enterprise Information Management suite.</p>

<p>Another interesting point with respect to MDM: Aaron Mahimainathan, who is the Senior Director, Platform Marketing SAP, did admit the challenges in synchronizing master data between the BI environment and the operational SAP applications.</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/06/business_object.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/06/business_object.php</guid>
<category></category>
<pubDate>Mon, 30 Jun 2008 13:52:23 -0700</pubDate>
</item>
<item>
<title>Microsoft to Purchase Zoomix?</title>
<description><![CDATA[<p>Just heard through the rumor mill that Microsoft is <a href="http://www.globes.co.il/serveen/globes/DocView.asp?did=1000349510&fid=1725">"thinking about acquiring Zoomix."</a> It would make sense that Microsoft might consider bulking up in its capability to support a potential MDM offering (note last year's <a href="http://www.informationweek.com/news/windows/showArticle.jhtml?articleID=199902354">acquisition of Stratature</a>). I would look forward to seeing something official, though...</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/06/microsoft_to_pu.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/06/microsoft_to_pu.php</guid>
<category>Master Data Management</category>
<pubDate>Fri, 06 Jun 2008 15:26:20 -0700</pubDate>
</item>
<item>
<title>Metadata for Really Unstructured Stuff</title>
<description><![CDATA[<p>I have been tinkering with some of the blogging tools out there (so far I like <a href="http://www.wordpress.org">WordPress</a> a lot). One nice aspect of the blogging framework is the expectation of meta-tagging of your content that helps in organization and presentation, which is quite nice because the system does some of the work that I have always been loath to do (that is, "organizing things").</p>

<p>One way to do this is by categorizing your entries as well as adding tags. I was pondering this recently, thinking that it should now be possible to use text mining tools to scan your content and pull out the "statistically improbable" phrases (as our friends at Amazon like to say) to be used as tags.</p>

<p>But what about non-text content? I can think of three commonly used content types that are growing in popularity yet require some extra thought for assigning meta-tags: pictures, voice recordings, and video recordings. As more of this unstructured stuff comes down the pike, we metadata folks should think hard about how to assess and capture semantics associated with these objects for the purposes of organization.</p>

<p>A few years back my friend Greg Elin put together a system for selectively annotating pictures. Check out his <a href="http://www.fotonotes.net/">fotonotes</a> web site. Perhaps there is some future in this for video?</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/05/metadata_for_re.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/05/metadata_for_re.php</guid>
<category>Metadata Musings</category>
<pubDate>Wed, 28 May 2008 08:03:36 -0700</pubDate>
</item>
<item>
<title>Appliances are Getting Hot!</title>
<description><![CDATA[<p>Strolling around the exhibit floor at the TDWI conference in Chicago the past few days provided an interesting look into a rapidly evolving trend in data warehousing appliances. Of the 30 or so vendors exhibiting, I counted at least 7 that would be considered appliance vendors:</p>

<p>DATAllegro<br />
Dataupia<br />
Kognitio<br />
Netezza<br />
ParAccel<br />
Sybase<br />
Teradata</p>

<p>I might throw Oracle, HP, and Sand Technology in there as well, but I think you see my point - there seems to be the perception that there is a market for high performance "plug-in" systems to deploy data warehouses. What is perhaps even more interesting is that half of these vendor offerings are not specifically hardware appliances, but rather software database systems that can be deployed on top of different hardware systems - in other words, they are "software appliances" (!?)</p>

<p>In essence, many of these approaches, along with some from other vendors as well (Vertica was notably absent from this crowd, but showed up at the previous Las Vegas TDWI), focus on structural optimizations (such as column-oriented databases) that are very well-suited for loading into core memory and providing very fast read access, making them especially nice for query/reporting clients. The realization that the database system can be optimized and parallelized in a way that is decoupled from the hardware makes these software-only approaches look very cost-effective, especially when considering sizing a warehouse to meet current needs while considering future growth. Not only that, these systems are finely tuned for performance (see <a href="http://www.intelligententerprise.com/blog/archives/2007/12/paraccel_lowers.html">Mark Madsen's comments about ParAccel's TPC-H benchmark scores</a>).</p>

<p>The common theme with the software appliance crowd is lowering the barrier to entry for small and medium businesses seeking to jump on the BI bandwagon. With a variety of operational modes that span full-blown deployments (with hardware purchase and integration) down to a service-based hosted model, this platform enables data warehousing at a fraction of the cost. This concept in its own right is worth some more exploration, and I think I may try to address that in an upcoming column.</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/05/applicances_are.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/05/applicances_are.php</guid>
<category>Cutting Edge</category>
<pubDate>Thu, 15 May 2008 11:40:08 -0700</pubDate>
</item>
<item>
<title>The Blog as Content Manager</title>
<description><![CDATA[<p>We are currently updating our company web site, and I am extremely impressed with the ways that emerging blogging tools are able to solve certain "challenges" associated with managing a web site's content. I am planning to put together a new web site to accompany my MDM book and I am also thinking that blogging is the way to go. </p>

<p>Check out <a href="http://www.movabletype.org">Movable Type</a> and <a href="http://www.wordpress.org">WordPress</a> - pretty impressive. The software is pretty cool, provides all sorts of widgets and plug-ins, and makes life a lot easier for keeping a web site fresh.</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/05/the_blog_as_con.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/05/the_blog_as_con.php</guid>
<category>Cutting Edge</category>
<pubDate>Mon, 12 May 2008 08:07:39 -0700</pubDate>
</item>
<item>
<title>Whew...Wrapping up my MDM Book</title>
<description><![CDATA[<p>Hey, sorry it has been a while since my last blog entry. I have been focused on finishing up my book on master data management (MDM), which thankfully is now finished. Some interesting thoughts gelled over the past 6 months in which I have been furiously assembling material for the book, which is now due to be published in the fall by Elsevier:</p>

<p>- MDM is more of a means than an end, and it is more likely to be justified in the context of other enterprise activities such as CRM or ERP.</p>

<p>- I have started to bristle at the phrase "golden copy." I now think that MDM is more about providing universal transparent access to a single representation of uniquely identifiable entity data, but that does not mean that entity data has to sit in its own silo.</p>

<p>- Comprehensive master metadata should include more than just data dictionary information.</p>

<p>Stay tuned for more information on the book...</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/04/whewwrapping_up.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/04/whewwrapping_up.php</guid>
<category>Master Data Management</category>
<pubDate>Fri, 25 Apr 2008 12:18:17 -0700</pubDate>
</item>
<item>
<title>Pre-Conference Session at Informatica World - June 2 Las Vegas</title>
<description><![CDATA[<p>For anyone interested in learning about how to engineer data quality into the system development life cycle, sign up for my <a href="http://www.informatica.com/events/customer_conference/sessions/preconference.htm#edq">pre-conference session at Informatica World</a> in Las Vegas on June 2, 2008. Contact me directly for more information!</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/04/preconference_s.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/04/preconference_s.php</guid>
<category></category>
<pubDate>Mon, 14 Apr 2008 08:04:23 -0700</pubDate>
</item>
<item>
<title>Separated at Birth?</title>
<description><![CDATA[<p>I couldn't resist: Is disgraced super advocate governor Eliot Spitzer somehow related to super sailor Popeye?<br />
<img alt="spitzer.jpg" src="http://www.b-eye-network.com/blogs/loshin/spitzer.jpg" width="182" height="156" /><br />
<img alt="popeye2.jpg" src="http://www.b-eye-network.com/blogs/loshin/popeye2.jpg" width="200" height="200" /></p>

<p>Two interesting aspects of the Spitzer situation. First, his tactics in using information to track down targets for prosecution as NY State Attorney General are prime examples of exploiting business intelligence to identify patterns of misbehavior. Second, one would think that, knowing the tactics to be used to seek out suspicious activity, he would have hesitated to expose himself to discovery via the <a href="http://seattletimes.nwsource.com/html/politics/2004276458_spitzermoney12.html">same tactics.</a></p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/03/separated_at_bi.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/03/separated_at_bi.php</guid>
<category>Just for fun</category>
<pubDate>Tue, 11 Mar 2008 14:33:14 -0700</pubDate>
</item>
<item>
<title>My Business Intelligence (or is it Intelligent Business) Library</title>
<description><![CDATA[<p>For some reason, I have acquired a habit of buying books at the airport. It could be that due to some lingering guilt about limitations on my personal productivity as I spend time getting from one place to another, I feel compelled to buy books that have some business relevance to read at the gate while waiting for all the business class and premier travelers to board the airplane.</p>

<p>I am finding, though, that I am building up an interesting set of books that provide value to the way I look at the use of information, so I thought I'd share a list of books that I have recently read, am currently reading, or plan to read some time in the near future. Each one deals with aspects of how we can learn from what we know, learn from what we don't know, then exploit what we can learn:</p>

<p>"<a href="http://www.amazon.com/Wisdom-Crowds-James-Surowiecki/dp/0385721706">The Wisdom of Crowds</a>," by James Surowiecki<br />
"<a href="http://www.amazon.com/Freakonomics-Revised-Expanded-Economist-Everything/dp/0061234001/ref=pd_bbs_1?ie=UTF8&s=books&qid=1204919118&sr=1-1">Freakonomics</a>," by Steven Levitt and Stephen Dubner<br />
<a href="http://www.amazon.com/Tipping-Point-Little-Things-Difference/dp/0316346624/ref=pd_bbs_2?ie=UTF8&s=books&qid=1204919118&sr=1-2">"The Tipping Point," </a>by Malcolm Gladwell<br />
"<a href="http://www.amazon.com/Blink-Power-Thinking-Without/dp/0316010669/ref=pd_sim_b_img_1">Blink</a>," by Malcolm Gladwell<br />
"<a href="http://www.amazon.com/Black-Swan-Impact-Highly-Improbable/dp/1400063515/ref=pd_bbs_sr_1?ie=UTF8&s=books&qid=1204919286&sr=1-1">The Black Swan</a>," by Nassim Nicholas Taleb<br />
"<a href="http://www.amazon.com/Fooled-Randomness-Hidden-Chance-Markets/dp/0812975219/ref=pd_sim_b_img_1">Fooled by Randomness</a>," by Nassim Nicholas Taleb<br />
"<a href="http://www.amazon.com/Long-Tail-Future-Business-Selling/dp/1401302378/ref=sr_1_1?ie=UTF8&s=books&qid=1204919362&sr=1-1">The Long Tail</a>," by Chris Anderson<br />
"<a href="http://www.amazon.com/Long-Tail-Future-Business-Selling/dp/1401302378/ref=sr_1_1?ie=UTF8&s=books&qid=1204919362&sr=1-1">Fortune's Formula</a>,"  by William Poundstone<br />
"<a href="http://www.amazon.com/Linked-Everything-Connected-Else-Means/dp/0452284392/ref=pd_bbs_sr_1?ie=UTF8&s=books&qid=1204919575&sr=1-1">Linked</a>," by  Albert-Laszlo Barabasi<br />
"<a href="http://www.amazon.com/World-Flat-3-0-History-Twenty-first/dp/0312425074/ref=pd_sim_b_img_8">The World is Flat</a>," by Thomas Friedman<br />
"<a href="http://www.amazon.com/Collapse-Societies-Choose-Fail-Succeed/dp/B000IJ7Q32/ref=pd_sim_b_img_32">Collapse</a>," by Jared Diamond</p>]]></description>
<link>http://www.b-eye-network.com/blogs/loshin/archives/2008/03/my_business_int.php?ua=</link>
<guid>http://www.b-eye-network.com/blogs/loshin/archives/2008/03/my_business_int.php</guid>
<category>Reflections</category>
<pubDate>Fri, 07 Mar 2008 12:37:47 -0700</pubDate>
</item>


</channel>
</rss>