Start Free

Speed up your data preparation with Trifacta

Free Sign Up
Trifacta Ranked #1 in Data Preparation Market Study

Dresner Advisory Services study reviews and ranks 24 vendors

Get the Report
Schedule a Demo

July ‘19 Wrangler Release — Macros and Enhancements to Transform by Example and Cluster Clean

July 30, 2019

Trifacta’s July ‘19 Wrangler release includes Macros–a new way to create repeatable bulk actions in Trifacta–and enhancements to Transform by Example, which was released last month. Macros allow you to bundle multiple steps in a Trifacta recipe as a single object and to create parameters for it, which simplifies the use of repetitive or complex recipe tasks.

Macros

Macros provide a repeatable way to accomplish repetitive or common tasks in Trifacta. In the example shown below, we use three steps to create a Macro to remove outliers. Here are the steps bundled up into the Macro:

  1. Create a column of the standard deviations, 
  2. Create a column of the mean, and 
  3. Create a formula to flag outliers based on whether or not the value falls more than 3.5 standard deviations from the mean. 

In the video below, we create a macro out of these three steps, with the original column as a parameter that can be changed from recipe to recipe. Rather than create these three steps from scratch, or have to locate them in a separate recipe and copy and paste the work into the current recipe, we can instead find the macro in our library of macros directly from the transformer page to reduce the busy work. 

As needed, you can inspect a macro to see the underlying steps to verify the correct behavior. You can also parameterize more than just columns, including numbers, strings, patterns, booleans, and more, to really customize macros. If you need to modify any step in a macro to tweak it slightly, you can also convert the macro back to the original set of discreet steps and modify.

Reusing a Macro is as easy as selecting it and entering the needed parameters.

Transform by Example Enhancements

The July ‘19 Wrangler release includes enhancements to Transform by Example, which allows you to validate that each distinct pattern present in a column is resolved as intended. With the July release of Wrangler, you can now validate on each pattern present, as seen below

Cluster Clean Enhancements

The July ‘19 release also comes with enhancements to Cluster Clean, allowing for auto standardization. Auto standardization will standardize values in which an algorithm can determine a clear primary value in a cluster. 

For the full July ‘19 release notes as well as past months notes, click here

Related Posts

Bringing a Whole New World of Connectivity to Wrangler Pro & Enterprise

Over the next few weeks, we’ll be highlighting some of the latest features we’ve added to our Wrangler... more

  |  November 13, 2017

User Empowerment in Trifacta v3

Imagine: what if non-technical analysts could more easily transform data in a way that doesn’t feel like... more

  |  September 23, 2015

December ’19 Wrangler Release – Rapid Target Fuzzy Match, UI Improvements, Downloadable Profiles

The December ‘19 Wrangler release brings several new and exciting features to Trifacta’s free product.... more

  |  December 18, 2019