Trifacta wins the Best Data-Driven SaaS Product award at the 2021 Annual Cloud & SaaS Awards

Start Free

Speed up your data preparation with Trifacta

Free Sign Up
Summer of SQL

A Q&A Series with Joe Hellerstein

See why SQL is Back
All Templates

Filter Required Data Based on Values from Another Dataset

Conditional filtering of data Flow The flow view of this template

array functions (arrayintersect, arraylen), extractlist, list, join, filtering (keep)

This template shows how you can filter your data based on values found in another reference dataset. It makes use of array functions such as arrayintersect after extracting all the values to look for in the reference dataset into an array via the list function.

To customize this template for your own use case, supply your own reference dataset in place of categories.txt and modify the Find values and filter recipe to accommodate the number of columns in your source data for the filtering.


New user?

Use the buttons above and start your 30-day free trial. If your data is mostly on Google Cloud Platform, please use Dataprep. Otherwise, choose Trifacta.

Learn more about Dataprep