We use cookies and other similar technologies (Cookies) to enhance your experience and to provide you with relevant content and ads. By using our website, you are agreeing to the use of Cookies. You can change your settings at any time. Cookie Policy.


University at Buffalo, SUNY, Leads Data-Intensive Discovery Initiative Via an Academic, Government and Industry Collaboration

Originally published May 7, 2009

Netezza Corporation, a provider in data warehouse and analytic appliances, recently announced that the academic, government and industry consortium the Data Intensive Computing Initiative (Di2) has emerged and has entered its second phase by deploying Netezza’s largest appliance in open science. The system was deployed this past March 16 and was operational the next day, delivering its first query response against a 1.2 billion–row test database in two seconds, proving fast implementation and performance as a data–intensive computing platform.

“The University at Buffalo is excited to be leading this collaborative effort,” said Bruce A. Holm, Ph.D., senior vice-provost and executive director of the New York State Center of Excellence for Bioinformatics and Life Sciences (COE). “The University’s commitment to knowledge discovery through computational methods is longstanding. Virtually all science and engineering domains are facing data tsunamis. The latest micro-array, sensor technologies and computational simulations are creating data sets at an unprecedented scale. Exploiting the data-intensive computing capabilities of the Netezza platform with our Di2 partners will help shorten the time to discovery in the science and engineering domains.”

“The Di2 has been a key partner in extending the Netezza data warehouse appliance into the high-performance, data-intensive computing realm,” said Jim Baum, president and CEO of Netezza. “As founding members of the Netezza Developer Network (NDN), the Di2 has helped extend our capabilities in the science and engineering domains. We look forward to continuing to support their efforts to increase the pace of data-intensive discovery.”

“During our testing we performed analytics of micro-array data similar to the real discovery problems the biological sciences are facing. We obtained more than two orders of magnitude performance speedup as compared to traditional high-performance computing (HPC) clusters,” said Vipin Chaudhary, director of the Di2 and associate professor of Computer Science and Engineering in the School of Engineering and Applied Sciences. “The mapping of our algorithm onto Netezza took one day to achieve that performance while the same effort took weeks on the HPC cluster. Data-intensive science and engineering requires multiple HPC architectures and platforms,” Dr. Chaudhary continued. “The Netezza system is a massively parallel device that combines a storage device, FPGA, memory and CPU on each of more than one hundred blades in a standard cabinet. The architecture delivers the query to the data and executes it in parallel across the hundreds of blades simultaneously. The result is incredibly fast response times against data sets containing many terabytes of data.”

"A key element of the Di2 is its industry collaboration with NDN partners and the end users of data-intensive applications,” said Michael Upchurch, COO of Fuzzy Logix, a Di2 and NDN partner. “Our pattern matching and predictive analytics algorithms are historically used by the consumer goods, financial services and web analytics markets. Working with the Di2 we have identified new science and engineering markets for our high performance library of over one hundred algorithms. The Di2 provides access to domain experts who understand the 'language' and discovery space of specific science and engineering applications. We are now involved with the Di2’s teams for data-intensive aerospace, genomic and healthcare applications.”

To read full press release, click here.

This BeyeNETWORK news item contains information from a recent press release by the company mentioned.