SOURCE: Cloudera, Inc.

April 24, 2012 08:00 ET

Cloudera Delivers CDH4 Beta

Cloudera Leads the Industry in Open Source Big Data Solutions, Delivering the First Integrated Distribution With Hadoop High Availability, Enhanced Security, Greater Extensibility and Increased Performance

PALO ALTO, CA--(Marketwire - Apr 24, 2012) - Cloudera, the leading provider of Apache Hadoop-based data management software, services and training, today announced that version four of Cloudera's Distribution Including Apache Hadoop (CDH4) is now available in public beta. Integrating feedback from enterprise customers and partners with the contributions of Cloudera's engineering team and the larger Apache open source community, the new release marks a major advancement in the evolution of the Hadoop platform. With robust new features and expanded functionality that deliver high availability (HA), increased security and improved extensibility, CDH4 offers a stable, integrated, enterprise system for Big Data management.

"Cloudera has worked hard to add important security, high availability and usability features to its distribution that are essential to the adoption of Hadoop in the enterprise," said Jo Maitland, Research Director for Infrastructure at GigaOM Pro.

CDH is the industry's most widely adopted Apache Hadoop distribution, a 100% open source platform composed of Apache Hadoop and more than a dozen additional open source components integrated into a single enterprise-ready data management system. CDH4 integrates optimizations for stability, usability, security and performance. CDH4 incorporates landmark new features, including:

  • High Availability: a highly available HDFS NameNode improves usability for mission-critical applications.
  • Extensibility: HBase co-processors and a new open resource management model enable developers to create real-time big data applications.
  • Performance: improvements in HBase, HDFS, MapReduce, Flume and system-wide compression performance set a new standard for Big Data management systems.
  • Usability: broader BI support and expanded API access ensure seamless integration and ease-of-use.
  • Security: HBase table and column permissions and Fair Scheduler ACLs facilitate multi-tenancy and increase users' ability to store sensitive data in Hadoop.

CDH4 also offers significant component upgrades to Apache Flume, Apache Sqoop, Hue, Apache Oozie and Apache Whirr, as well as support for new versions of Red Hat, CentOS, SUSE, Ubuntu and Debian.

"Data is imperative to our business and Cloudera's Distribution Including Apache Hadoop is at the center of our analytics ecosystem," said Amy O'Connor, Senior Director, Big Data at Nokia. "The new release of CDH reinforces why we selected Cloudera and continue to partner with them: an open system that has been integrated with enterprise functionality and is delivered with robust support."

"We are pleased to deliver CDH4 into public beta and proud of the significant ground we and the extended Apache community have been able to cover in terms of enhancements," said Charles Zedlewski, VP of Product at Cloudera. "CDH4 represents a significant leap in the evolution of the CDH platform. It is the result of rigorous testing and feedback from our enterprise customer and partner organizations. CDH integrates the fine work of the Apache open source community where our own dedicated engineers invest heavily. Our team continues to work tirelessly on CDH4 and we invite anyone who is interested in deploying a Hadoop cluster, or already has a cluster in production, to download CDH4 for free and contribute feedback. We believe CDH4 is the strongest, highest performing Big Data management platform yet."

To learn more about CDH4, read the company blog post or download the beta for free.

About Cloudera
Cloudera, the leader in Apache Hadoop-based software, services and training, enables data-driven enterprises to easily derive business value from all their structured and unstructured data. Cloudera's Distribution Including Apache Hadoop (CDH) is available to download for free and is the most comprehensive, tested, stable and widely deployed distribution of Hadoop in commercial and non-commercial environments. For the fastest path to reliably using this completely open source technology in production for Big Data analytics and answering previously un-addressable big questions, organizations can subscribe to Cloudera Enterprise, which comprises Cloudera Manager software and Cloudera Support. Cloudera also offers training and certification on Apache technologies, as well as consulting services. As the top contributor to the Apache open source community and with tens of thousands of nodes under management across customers in financial services, government, telecommunications, media, web, advertising, retail, energy, bioinformatics, pharma/healthcare, university research, oil and gas and gaming, Cloudera's depth of experience and commitment to sharing expertise are unrivaled.
