December ’19 Wrangler Release – Rapid Target Fuzzy Match, UI Improvements, Downloadable Profiles

December 18, 2019

The December ’19 Wrangler release brings several new and exciting features to Trifacta’s free product. These features include:

  • Improved Fuzzy Matching on Rapid Target
  • Improved Date/Time Interface
  • Downloadable Profiles
  • New Statistical Functions

Let’s take a look at each of these new features. 

Improved Fuzzy Matching on Rapid Target

Rapid Target allows you to insert a table or dataset as a target schema for your work. Say, for instance, you have a dashboard that expects a specific set of column names, a specific column order, and a specific number of columns, and you need to match that layout before the dashboard can use different or new data. Rapid Target allows you to insert the dataset used for that dashboard and helps guide you in transforming your data to that schema. With the December ’19 release of Trifacta, we have added new fuzzy matching algorithms that improve this functionality by auto-aligning your columns to the target columns they need to match.
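
To make the idea of auto-aligning columns concrete, here is a minimal sketch in Python. It is not Trifacta's algorithm, which this post does not describe. It assumes a simple name-similarity score from the standard library's `difflib.SequenceMatcher`, and the function names `normalize` and `align_columns` and the `threshold` parameter are made up for illustration.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase a column name and keep only letters and digits."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


def align_columns(source_cols, target_cols, threshold=0.6):
    """Map each target column to its most similar unused source column.

    Returns {target: source or None}. None means no unused source column
    scored at or above the (assumed) similarity threshold.
    """
    # Score every (target, source) pair once.
    scored = []
    for t in target_cols:
        for s in source_cols:
            score = SequenceMatcher(None, normalize(t), normalize(s)).ratio()
            scored.append((score, t, s))

    # Greedy assignment: best-scoring pairs first, each column used at most once.
    scored.sort(key=lambda item: item[0], reverse=True)
    mapping = {t: None for t in target_cols}
    used_sources = set()
    for score, t, s in scored:
        if score < threshold:
            break
        if mapping[t] is None and s not in used_sources:
            mapping[t] = s
            used_sources.add(s)
    return mapping


# Hypothetical column names, for illustration only.
mapping = align_columns(
    source_cols=["order_dt", "cust_id", "total_amt", "notes"],
    target_cols=["Customer ID", "Order Date", "Total Amount"],
)
```

A greedy pass like this is easy to follow but can make a poor choice when two source columns resemble the same target. A production matcher would likely also compare data types and sample values, not just names.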

Improved Date/Time Interface

We have streamlined various menus in Trifacta over the last year, most notably the date/time data type menu. The old interface has been replaced by a navigable, searchable menu, which makes it far easier to find the exact date/time format you are looking for.

Downloadable Profiles

Job results now include downloadable profiles, a much-requested feature from our community. After a job completes, you can download the profile results as a .json file and use that file for tracking purposes.

You can even wrangle these in Trifacta 🙂
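
As one way to use downloaded profiles for tracking, the sketch below compares two profile files and reports which values changed between runs. The profile schema is not described in this post, so the code makes no assumptions about field names: it flattens any nested JSON into dotted paths. The helper names `flatten` and `diff_profiles` and the sample JSON are hypothetical.

```python
import json


def flatten(node, prefix=""):
    """Flatten nested dicts and lists into {dotted.path: leaf_value}."""
    flat = {}
    if isinstance(node, dict):
        for key, value in node.items():
            flat.update(flatten(value, f"{prefix}{key}."))
    elif isinstance(node, list):
        for index, value in enumerate(node):
            flat.update(flatten(value, f"{prefix}{index}."))
    else:
        flat[prefix.rstrip(".")] = node
    return flat


def diff_profiles(old, new):
    """Return {path: (old_value, new_value)} for every leaf that differs."""
    old_flat, new_flat = flatten(old), flatten(new)
    changed = {}
    for path in sorted(set(old_flat) | set(new_flat)):
        before, after = old_flat.get(path), new_flat.get(path)
        if before != after:
            changed[path] = (before, after)
    return changed


# Made-up profile snippets; a real download will have its own structure.
old = json.loads('{"columns": [{"name": "price", "missing": 3}]}')
new = json.loads('{"columns": [{"name": "price", "missing": 5}]}')
changes = diff_profiles(old, new)
```

In practice you would replace the two `json.loads` calls with `json.load` on the files downloaded from two job runs.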

New Statistical Functions

There are several new statistical functions in the December release of Wrangler, including:

 

Function Name | Description
CORREL Function | Computes the correlation coefficient between two columns. Source values can be of Integer or Decimal type.
COVAR Function | Computes the covariance between two columns using the population method. Source values can be of Integer or Decimal type.
COVARSAMP Function | Computes the covariance between two columns using the sample method. Source values can be of Integer or Decimal type.
PERCENTILE Function | Computes a specified percentile across all row values in a column or group. Input column can be of Integer or Decimal type.
QUARTILE Function | Computes a specified quartile across all row values in a column or group. Input column can be of Integer or Decimal type.

 

These functions can be used in single-row and multi-row formulas, group-bys, pivots, window functions, and more.
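
For readers who want to see what these statistics compute, here is a small Python sketch of the underlying math. It is not Wrangle syntax and not Trifacta's implementation. It assumes the correlation coefficient is Pearson's and that percentiles use linear interpolation between ranks, and the product may handle either detail differently. The lowercase function names are illustrative stand-ins.

```python
from math import sqrt


def covar(xs, ys, sample=False):
    """Covariance of two equal-length sequences.

    sample=False divides by n (population method, as COVAR is described).
    sample=True divides by n - 1 (sample method, as COVARSAMP is described).
    """
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    total = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return total / (n - 1 if sample else n)


def correl(xs, ys):
    """Pearson correlation coefficient (assumed meaning of CORREL)."""
    return covar(xs, ys) / sqrt(covar(xs, xs) * covar(ys, ys))


def percentile(values, p):
    """p-th percentile (0 to 100), linearly interpolated between ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * p / 100
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def quartile(values, q):
    """q-th quartile (1, 2, or 3), expressed as a percentile."""
    return percentile(values, 25 * q)


xs = [1, 2, 3, 4, 5]
ys = [2, 4, 6, 8, 10]
population_cov = covar(xs, ys)           # divides by n
sample_cov = covar(xs, ys, sample=True)  # divides by n - 1
```

The only difference between the population and sample methods is the divisor, so the sample covariance is always larger in magnitude for the same data.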

That’s it for the December ’19 release of Trifacta Wrangler. If you are interested in trying these new features for yourself, sign up for free today!