SOURCE: Trifacta


May 18, 2017 03:01 ET

New 'Principles of Data Wrangling' Book Provides First How-To Guide for Extracting Value from Data with Data Wrangling

Newest O'Reilly Media eBook defines how data wrangling can help organizations innovate and accelerate analytics processes and gain a competitive edge

SAN FRANCISCO, CA--(Marketwired - May 18, 2017) - Trifacta, the global leader in data wrangling, today announced the availability of "Principles of Data Wrangling: Practical Techniques for Data Preparation," a new book published by O'Reilly Media and written to help business leaders, data engineers and data architects extract more value from their data. The book is available online, with hardcover copies available May 22, 2017 at Strata Data Conference London 2017.

The book was authored by Connor Carreras, Jeffrey Heer, Joe Hellerstein, Sean Kandel and Tye Rattenbury. Jeffrey Heer, Joe Hellerstein, and Sean Kandel are the co-founders of Trifacta, the first self-service-data preparation solution. This is the first book that provides a how-to guide on data wrangling to help the wide range of people responsible for managing the analysis and application of data within their organizations. The book also outlines how organizations innovate and accelerate their analytics process and gain a competitive edge through effective data wrangling.

"Data-driven organizations are easy to spot: they are full of creative people constantly finding new ways to translate data into value. That ability to aggressively leverage data requires new approaches to data and computing across the organization--from the traditional tasks of IT departments to the latest in agile data analytics," said Joe Hellerstein, co-founder and chief strategy officer at Trifacta. "Talking to data professionals, you hear over and over that Data Wrangling takes up the lion's share of the time they spend converting data into value through analytics, statistics and machine learning. This book lays out the lessons we have learned working with data-driven organizations across the globe, with a specific focus on adopting modern agile analytic processes."

The book outlines how improving your data wrangling efforts can increase the near-term and long-term value of data. The first chapters define a workflow framework that links activities focused on value, and explains how data wrangling factors into those activities and the overall workflow. It then dives into a collection of techniques analysts can use when moving through the stages of the workflow. The later stages of the book focus on roles and responsibilities in data wrangling projects and explore a variety of different data wrangling tools. Throughout the book, the authors ground the discussion in real-world example data, transformations of that data and various visual and statistical views of that data.

The book is available online now. One of the authors, Sean Kandel, CTO of Trifacta will also be presenting at the upcoming Strata Data Conference in London 2017, which will take place May 22-25, 2017 at ExCeL London. Stop by and visit the Trifacta booth #102 on the exhibition show floor. Sean will be signing hardcopies of the book at the conference.

Additional Resources:

About Trifacta
Trifacta, the global leader in data wrangling software, significantly enhances the value of an enterprise's big data by enabling users to easily transform and enrich raw, complex data into clean and structured formats for analysis. Leveraging decades of innovative work in human-computer interaction, scalable data management and machine learning, Trifacta's unique technology creates a partnership between user and machine, with each side learning from the other and becoming smarter with experience. Trifacta is backed by Accel Partners, Cathay Innovation, Greylock Partners and Ignition Partners.

Contact Information

  • Media Contacts:
    Nolan Necoechea for Trifacta
    Email Contact