SOURCE: LexisNexis HPCC Systems

LexisNexis HPCC Systems

February 23, 2012 12:01 ET

HPCC Systems From LexisNexis to Participate at O'Reilly Strata to Address the Complete Big Data Value Chain and What Enterprise Organizations Need to Know to Manage and Glean Insight From Big Data

The Complete Big Data Value Chain Includes Tasks That Cover Big Data Discovery, Integration and Analytics

ATLANTA, GA--(Marketwire - Feb 23, 2012) - HPCC Systems™ from LexisNexis® Risk Solutions will participate at the O'Reilly Strata conference to address the Complete Big Data Value Chain, which is a series of steps needed for enterprise organizations to effectively manage and glean pertinent insight from Big Data. The Complete Big Data Value Chain includes data collection, ingestion, discovery, cleansing, integration, analysis and delivery. Representatives from HPCC Systems will offer insight to understanding the Complete Big Data Value Chain and what enterprises should know at each point in the chain at the 2012 O'Reilly Strata Conference, February 28 to March 1, at the Santa Clara Convention Center in Santa Clara, Calif.

HPCC Systems is an open source, enterprise-proven Big Data analytics platform. HPCC Systems grew out of the need for LexisNexis Risk Solutions to manage, sort, link, join and analyze billions of records within seconds. Designed by data scientists, HPCC Systems is a data intensive supercomputer that has evolved for more than a decade, with enterprise customers who need to process large volumes of data in critical 24/7 environments.

"For enterprises with a Big Data challenge, it's important to consider having an integrated solution throughout the chain, instead of a lot of little point solutions that don't integrate well," said Armando Escalante, Chief Technology Officer of LexisNexis Risk Solutions and head of HPCC Systems. "When you have an integrated and holistic solution, the end result is significant development and operational savings. You are not wasting time on integrating a plethora of unrelated tools. Having an integrated solution frees up organizations to extract insights from Big Data on customer purchasing decisions, or to discover the next big fraud trend, or to find faster ways to bring products to market."

The Complete Big Data Value Chain includes:

  • Collection -- collecting structured, unstructured and semi-structured data
  • Ingestion -- consuming vast amounts of data efficiently
  • Data Discovery and Data Cleansing -- clean up, formatting and statistical analysis of the data
  • Integration -- linking, indexing and data fusion
  • Analysis -- extraction of information, non-obvious relationships and machine learning
  • Delivery -- including querying, visualization, and redundancy, enterprise-class availability

"Enterprises can sometimes get caught up in data storage. There's no point in only storing data -- that's where the value is, in the data. You need that data available for analysis and decision-making. It's like having money stuck in your attic. You need to put the money into circulation to make more money," said Mr. Escalante. "We've been doing the Complete Big Data Value Chain for 12 years now and we look forward to sharing our tips with colleagues at Strata."

Big Data analytics experts from HPCC Systems will speak on the following sessions at Strata:

Tuesday, February 28: TUTORIAL: Entity Extraction in Unstructured Text.
A hands-on use of an emerging data-centric programming language. Tutorial will focus on entity extraction in both semi-structured and free-form text data. Click for more information:

Wednesday, February 29, KEYNOTE: Machine Learning and Big Data: Sustainable Value or Hype?
Will machine learning take over the world this time? What exactly does it mean to be fully parallel? Do I care? Will I be any better if I get it right? Click for more information:

Wednesday, February 29, SESSION: Data Ingest, Linking, and Data Integration via Automatic Code Generation
How to simplify the data integration process and save a significant amount of development time by automatically generating code for processes (data profiling, data cleansing, and record linkage). A case study will show a complex, Big Data linking application on insurance data. Click for more information:

Thursday, March 1, SESSION: Solving Big Data Analytics with an Emerging Data Centric Language
Learn a more simplified way to solve Big Data with an emerging data centric language. A demonstration will be provided on various queries. Click for more information:

Attendees can also visit the HPCC Systems booth (#706) to speak with experts about the Complete Big Data Value Chain.

About HPCC Systems™
HPCC Systems™ from LexisNexis® Risk Solutions offers a proven, data-intensive supercomputing platform designed for the enterprise to process and solve Big Data analytical problems. As a superior alternative to Hadoop and legacy technology, HPCC Systems offers a consistent data-centric programming language, two processing platforms and a single, complete end-to-end architecture for efficient processing. For more information, visit HPCC Systems at

Connect with us on Twitter (@hpccsystems), Facebook ( and LinkedIn (

About LexisNexis Risk Solutions
LexisNexis® Risk Solutions ( is a leader in providing essential information that helps customers across all industries and government predict, assess and manage risk. Combining cutting-edge technology, unique data and advanced scoring analytics, we provide products and services that address evolving client needs in the risk sector while upholding the highest standards of security and privacy. LexisNexis Risk Solutions is part of Reed Elsevier, a leading publisher and information provider that serves customers in more than 100 countries with more than 30,000 employees worldwide.

Contact Information

  • Media Contact
    Kristina Grammatico
    Email Contact