SOURCE: Melissa Data

Since 1985, Melissa Data specializes in full spectrum global data quality tools and services.

April 25, 2016 08:00 ET

Melissa Data and Pentaho Make Hadoop Data Quality Achievable

Close Integration Streamlines Data Quality Operations in the Data Center; Melissa Data Webinar Demonstrates Scalability to Billions of Records Across the Hadoop Cluster

RANCHO SANTA MARGARITA, CA--(Marketwired - April 25, 2016) - Melissa Data, a leading provider of global data quality solutions, today introduced flexible, scalable data quality to the Hadoop framework for storing and processing Big Data in a distributed environment. Fueled by partnership with Pentaho, a Hitachi Group Company, and close integration with Pentaho's Big Data Integration and Analytics platform, Melissa Data's global data quality tools and services can be scaled across the Hadoop cluster to cleanse and verify billions of data center records. This creates a significant advantage for enterprise IT and data managers, now better equipped to leverage the distributed computing power of Hadoop to handle rapidly expanding data volumes feeding master data management systems.

Pentaho Data Integration offers intuitive drag-and-drop data integration coupled with data agnostic connectivity, and is designed to deliver accurate, analytics-ready data to business users from any source. Coupled with Melissa Data's integrated data quality tools available via API or local web service, users are able to eliminate the complex and time-consuming coding and programming requirements traditionally required to achieve Hadoop data quality. Processes can be automated through the chosen Melissa Data component for enhancing, verifying, correcting, standardizing or deduplicating customer records -- options include full spectrum data quality that supports the entire Big Data lifecycle. These operations can also leverage Hadoop data processing frameworks, further maximizing the investment in Hadoop infrastructures. 

"Consistently excellent data quality is essential to protect and maximize the long-term value of analytics, yet cleansing the vast number of records on a Hadoop cluster is not an inherently simple task," said Bud Walker, vice president enterprise sales and strategy, Melissa Data. "By pairing data quality with Pentaho Data Integration, users can quickly automate sophisticated data quality initiatives -- capitalizing on the potentially massive scope of a Hadoop system to optimize business intelligence and reporting."

Melissa Data will host a free webinar demonstrating quick and easy integration and analysis of large data sets, leveraging Pentaho Business Analytics for Hadoop deployments. Attendees will learn orchestration and automation techniques that build on Hadoop capabilities to transform data into clean, reliable assets. Click here to register for this live online event on Tuesday, May 3, 2016 at 1:00 p.m. Eastern. For more information, visit or call 1-800-MELISSA (635-4772).

About Melissa Data

Since 1985, Melissa Data has specialized in contact data quality and address management tools with a global perspective. The company's solutions help organizations capture and maintain international customer contact data at the point of entry, ensuring accurate customer information enterprise-wide. More than 10,000 clients worldwide in arenas such as retail, education, healthcare, insurance, finance, and government, rely on Melissa Data for full spectrum data validation software and services. For more information or free product trials, visit or call 1-800-MELISSA (635-4772). Follow Melissa Data on Twitter, Facebook, LinkedIn and YouTube.

About Pentaho, a Hitachi Group Company

Pentaho, a Hitachi Group company, is a leading data integration and business analytics company with an enterprise-class, open source-based platform for diverse big data deployments. Pentaho's unified data integration and analytics platform is comprehensive, completely embeddable and delivers governed data to power any analytics in any environment. Pentaho's mission is to help organizations across multiple industries harness the value from all their data, including big data and IoT, enabling them to find new revenue streams, operate more efficiently, deliver outstanding service and minimize risk. Pentaho has over 15,000 product deployments and 1,500 commercial customers today including ABN-AMRO Clearing, BT, EMC, Landmark Halliburton, Moody's, NASDAQ and Staples. For more information visit

Contact Information