SOURCE: Cloudera


October 28, 2013 07:00 ET

New Cloudera Partner Program Harnesses Power of Innovative Startups

Databricks, the Inaugural Partner of Cloudera Connect: Innovators, Teams With Cloudera for High-Speed Data Analytics

PALO ALTO, CA and NEW YORK, NY--(Marketwired - Oct 28, 2013) - At the Strata Conference + Hadoop World 2013, Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop™, today announced its new Cloudera Connect: Innovators program to help customers harness the latest innovations from projects and companies, while getting the reliability and production support they've come to expect from Cloudera.

The program's charter partner, Databricks, which was spun out of AMPLab at the University of California (UC), Berkeley, is the company behind the popular Apache Spark framework. Concurrently, Cloudera also announced direct support for Apache Spark with CDH, Cloudera's market-leading distribution including Apache Hadoop. With Apache Spark, Cloudera users can now perform rapid, resilient processing of in-memory datasets stored in Hadoop, as well as general data processing.

Together with Cloudera Impala for interactive SQL workloads, Cloudera Search for interactive full text search, and Cloudera Enterprise 5 -- which includes the latest innovations from Hadoop 2, as well as new high performance, efficiency, and data management enhancements -- Cloudera is now further enabling customers to choose the right tool for any big data workload, and helping them succeed in using data to solve business problems as quickly and effectively as possible.

"Cloudera is a leader in enterprise analytic data management and the company's new Innovators program is a great opportunity for us to benefit from their world-class expertise in the enterprise market to bring Spark to new users," said Ion Stoica, chief executive officer, Databricks. "We look forward to collaborating with them on Spark and helping our mutual customers do more with their data."

Cloudera + Apache Spark: Lightning-Fast Data Processing for Hadoop
Originally developed by the AMPLab at the UC Berkeley, Spark is a cluster computing system and execution engine that supports high-speed data analytics. Spark provides a strong complement to Hadoop and is well suited to perform high-speed data processing with the ability to run complex computations up to 100x faster than MapReduce. Spark gets this speed and power from its generic execution model, which optimizes arbitrary operator graphs, and from its ability to process data in-memory rather than always reading it from disk. In addition, Spark provides powerful and easy to use APIs in Scala, Java and Python, which dramatically reduces development time. Combined with facilities to provide fault-tolerance, Spark is an exciting alternative framework for Hadoop.

"No one company by itself can develop all the innovation that enterprises require. The Cloudera Connect: Innovators program will allow our customers to tap into the best innovations the open source community has to offer, whether that innovation was developed by Cloudera or not," said Charles Zedlewski, vice president, products at Cloudera. "Apache Spark is a prime example. It provides excellent data processing functionality and performance and has a vibrant developer and user community."

Customers can download a beta release of the Spark service from

About Cloudera
Founded in 2008, Cloudera pioneered the business case for Hadoop with CDH, the world's most comprehensive, thoroughly tested and widely deployed 100% open source distribution of Apache Hadoop in both commercial and non-commercial environments. Now, the company is redefining data management with its Platform for Big Data, Cloudera Enterprise, empowering enterprises to Ask Bigger Questions™ and gain rich, actionable insights from all their data, to quickly and easily derive real business value that translates into competitive advantage. As the top contributor to the Apache open source community and leading educator of data professionals with the broadest array of Hadoop training and certification programs, Cloudera also offers comprehensive consulting services. Over 700 partners across hardware, software and services have teamed with Cloudera to help meet organizations' big data goals. With tens of thousands of nodes under management and hundreds of customers across diverse markets, Cloudera is the category leader that has set the standard for Hadoop in the enterprise.

Connect with Cloudera
Read our blog:
Follow us on Twitter:
Visit us on Facebook:

Contact Information