Product and service reviews are conducted independently by our editorial team, but we sometimes make money when you click on links. Learn more.

Cloudera Takes Hadoop To Another Level

By - Source: Toms IT Pro

Today at the Strata Conference + Hadoop World 2013, Apache Hadoop-based enterprise analytics software maker, Cloudera, announced a new version of its big data platform, Cloudera Enterprise 5. Currently available as a public beta, Cloudera Enterprise 5 offers some unique features and improvements.

According to Mike Olson, chairman and chief strategy officer at Cloudera, enterprise customers have been asking for "a unified data management platform built on top of Hadoop that streamlines their existing architectures to accelerate time to insight." And Cloudera aims to deliver.

The company gained immense popularity after releasing Impala, a real-time queries engine for Apache Hadoop, last year. Impala is a query system that uses a subset of HiveQL (a SQL-like query language) to provide a SQL-like experience with Hadoop integration. What's more, Impala is under an Apache license and is 100% open source. It works directly with Hadoop Distributed File System (HDFS) to provide increased performance and speed. Impala is also an integral part in Cloudera's enterprise solution.

[LEARN MORE: Cloudera Impala: SQL for Hadoop at Last?]

This year the Palo Alto company shifted focus to its enterprise offering, a subscription-based solution that focuses on data management in Apache Hadoop environments. "Built for the demanding requirements of enterprise customers, Cloudera Enterprise enables companies to store, process and analyze unlimited amounts of data and applications from a single system," the company stated. According to today's press release, the 5th generation of Cloudera Enterprise offers these 3 key advancements:

  • Accelerated time-to-value;
  • Improved efficiency;
  • Comprehensive data management.

The new release, which includes Apache Hadoop 2, incorporates "the latest open source advancements into its industry leading Hadoop distribution, CDH (Cloudera’s Distribution Including Apache Hadoop)" as stated in the press release. The company claims its newest solution "delivers the industry's first and only truly unified platform for enterprise data management." Dubbed as the "enterprise data hub," Cloudera Enterprise allows you to:

  • Retain queryable access during automatic archiving of enterprise data (for example, to meet compliance requirements);
  • Get faster ETL/ELT data processing at scale;
  • Support business intelligence initiatives on more and different kinds of data;
  • Get enterprise search on data and documents in-place;
  • Accelerate a wide range of advanced analytics solutions (for example, recommendations, image processing).

For more information and to take part in the public beta program, visit:

This week Cloudera also announced an expansion of its partner ecosystem, including Savvis, IBM’s SoftLayer, T-Systems and Verizon Cloud, in pursuit of the company's mission is to bring Hadoop to the cloud. 

About the Author:

Kasia Lorenc is the Managing Editor for Tom's IT Pro. Connect with her on TwitterGoogle+  & LinkedIn.

More from Kasia Lorenc