SOURCE: Cloudera, Inc.

May 31, 2012 12:05 ET

Cloudera and DataSift Partner to Deliver Big Data Insights From Social Data

Cloudera Powers DataSift's Hadoop Clusters and Crunches Through the Vast Volumes of Historical Data to Help Deliver Social Media Insights to DataSift Customers

PALO ALTO, CA--(Marketwire - May 31, 2012) - Cloudera Inc., the leading provider of Apache Hadoop-based data management software, services and training, and DataSift Inc., the social data platform company, today announced that the two companies are working together to deliver business insights from one of the largest public sources of social data on the planet. DataSift is powering its Hadoop clusters with Cloudera's Distribution Including Apache Hadoop (CDH), which performs the Big Data heavy lifting to help deliver DataSift's Historics, a cloud-computing platform that enables entrepreneurs and enterprises to extract business insights from historical public tweets.

With the help of Cloudera's technology, DataSift enables companies to extract insights from billions of public social interactions. DataSift's powerful platform evaluates each social interaction along multiple dimensions, applying natural language processing to turn unstructured data into structured, digestible information ready for analysis of sentiment, topics, web links, location and social media influence. These capabilities give companies an unprecedented ability to filter social data, extract meaning and create actionable insights. As the preferred and most widely deployed Hadoop platform on the market, CDH supports over half a petabyte of relevant data for DataSift.

"DataSift was founded with the vision that companies are increasingly looking to social media to provide answers to questions about their business and brand, whether that's for social media monitoring, business intelligence, tracking breaking news or making stock-trading decisions," said Nick Halstead, Founder and CTO at DataSift. "The integration of Cloudera's technology into DataSift provides us with a robust, enterprise platform that enables our customers to ask and answer these questions in minutes, whether they are analyzing data from last week or last year."

"As data continues to accumulate at an unprecedented rate, enterprises need to be able to analyze and act at an ever increasing velocity," said Charles Zedlewski, VP of Product at Cloudera. "In partnering with DataSift, Cloudera is excited to help unlock these insights into what is rapidly becoming one of the largest bodies of social data on the planet."

About DataSift
DataSift Inc. is a social data platform company, enabling enterprises and entrepreneurs to aggregate, filter and extract insights from the billions of public social conversations on Twitter, leading social networks and millions of other sources. Through a licensing agreement with Twitter, DataSift provides companies with both real-time and historical Tweets to filter and uncover insights and trends that relate to brands, businesses, financial markets, news and public opinion. DataSift is an on-demand platform with a flexible pricing scale that makes enterprise-level data accessible to companies of any size. DataSift has offices in San Francisco and Reading, U.K. It has received investment from IA Ventures (Roger Ehrenberg), a fund that is focused exclusively on Big Data, and from GRP Partners (Mark Suster). For more information, follow us on Twitter @datasift.

About Cloudera
Cloudera, the leader in Apache Hadoop-based software, services and training, enables data-driven enterprises to easily derive business value from all their structured and unstructured data. Cloudera's Distribution Including Apache Hadoop (CDH), available to download for free, is the most comprehensive, tested, stable and widely deployed distribution of Hadoop in commercial and non-commercial environments. For the fastest path to reliably using this completely open source technology in production for Big Data analytics and answering previously unaddressable big questions, organizations can subscribe to Cloudera Enterprise, which comprises Cloudera Manager software and Cloudera Support. Cloudera also offers training and certification on Apache technologies, as well as consulting services. As the top contributor to the Apache open source community and with tens of thousands of nodes under management across customers in financial services, government, telecommunications, media, web, advertising, retail, energy, bioinformatics, pharma/healthcare, university research, oil and gas and gaming, Cloudera's depth of experience and commitment to sharing expertise are unrivaled.