Data School

Learn More

2019: a Year of Innovation with Cloud Dataprep on the Google Cloud

January 18, 2020

As we get into the new year, it’s a good time to look back in 2019 and reflect on a productive year of innovation bringing exciting new features to Cloud Dataprep, our solution jointly developed with Google. We’ve improved many different parts of the product, from adding new transformations and functions, to improving various user experience elements. Let’s take a look at the top features of 2019! 

1. Macros: Macros provide customizable, reusable sets of steps to build a shared library of recipes that you can leverage to more consistently and accurately prepare data to solve your analytics challenges.

2. Transform by Example: Transform by Example allows you to provide examples of how you’d like your data to be and Cloud Dataprep will figure out the steps needed to get there. This is magic!

3. Smart Cleaning: Smart Cleaning brings a new approach to quickly and intuitively resolve data quality issues such as standardization of values and patterns.

4. Active Profiling: blending visual guidance, user interaction, and machine intelligence into an intuitive experience is what Active Profiling offers. It enhances the ability to assess data quality issues to better clean and transform data.

5. Feature Engineering Transformations: feature engineering for AI/ML initiatives needs specific transformations such as One Hot Encoding, Binning, and Scaling.

6. Join Enhancements: simplifies the join experience by providing more visibility on the data throughout the join process and keep you in the transformer window.

7. Recipe Interaction Improvements: perform recipe operations on multiple steps such as multi-step disabling, moving, copying, and duplicating. 

Tip: You can also copy and paste multiple steps from one recipe to another, which enables the reuse of recipe parts in multiple flows.

Have you leveraged these new features yet to wrangle your data? Access Cloud Dataprep now and try them out!   

Happy Wrangling!

Related Posts

A New Approach to Data Quality for the Era of Cloud & AI

Data quality has been going through a renaissance recently. As a growing number of organizations ramp up... more

  |  March 19, 2019

From Exploration to Production – Unboxing the Spring ’17 Release of Wrangler Enterprise

With the latest release of Wrangler Enterprise, our team drew upon our experience working with large-scale... more

  |  May 24, 2017

Tutorial: Trifacta String Manipulation

Guest Contributor: Curtis Seare cohosts the Data Crunch podcast, edits the AI & ML Biweekly Beat... more

  |  May 16, 2018