EMC (NYSE: EMC) says the company has a new strategy for distributing, integrating and supporting the Apache Hadoop open-source software that is emerging as the preferred solution for Big Data analytics across unstructured data in enterprises.
Announced at EMC World 2011 in Las Vegas, the EMC Greenplum HD Data Computing Appliance is a purpose-built data co-processing Hadoop appliance that integrates Hadoop with the EMC Greenplum Database.
EMC acquired Greenplum's data warehousing and business analytics technology last year, and the acquisition formed the foundation of EMC's data computing division and has since become part of the company's cloud strategy.
"EMC has a responsibility to help our customers realize all that's possible with Big Data, both structured and unstructured," said Bill Cook, president and general manager of EMC's Data Computing Division. "There's a time and a place for the value that relational databases add to structured data, and there's a time and a place for the value Hadoop can give to unstructured data. Many of our enterprise customers need both, and with the help of our partners, we're able to provide them both."
According to EMC, the Greenplum HD Data Computing Appliance allows for co-processing of both structured and unstructured data within a single solution. Apache Hadoop-based batch processing of unstructured and structured data at massive scale using commodity hardware has led to a profound change in analytics. By extracting the knowledge wrapped within unstructured machine-generated data, organizations can make better decisions that drive revenue, improve service and reduce costs, the data storage giant says.
The EMC Greenplum HD product family includes two editions: the Hadoop-based EMC Greenplum HD Community Edition, and EMC Greenplum HD Enterprise Edition software. The company previously released the free community edition of the EMC Greenplum Database, along with analytic algorithms and data mining tools.
EMC said its appliances are combined with product certification by a dozen leading partners to enable technology innovations such as real-time data interaction, offer greater reliability, and make Hadoop much easier to deploy and use for enterprise customers.
For more on Big Data analytics, see Analytics and In-Memory Databases Are Changing Data Centers