Filter Required Data Based on Values from Another Dataset

[Figure: Conditional filtering of data. The flow view of this template.]

array functions (arrayintersect, arraylen), extractlist, list, join, filtering (keep)

This template shows how to filter your data based on values found in a separate reference dataset. It first uses the list function to collect all the values to look for from the reference dataset into an array, then uses array functions such as arrayintersect to check the source data against that array.
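For readers who want to see the idea outside Trifacta, here is a minimal Python sketch of the same pattern. It is not the template's recipe: the function name `filter_by_reference`, the column names, and the sample data are all hypothetical, and the intersection-then-length check only roughly mirrors what arrayintersect and arraylen do before the keep step.

```python
from typing import Iterable


def filter_by_reference(
    rows: list[dict[str, str]],
    reference_values: Iterable[str],
    columns: list[str],
) -> list[dict[str, str]]:
    """Keep rows where at least one of `columns` holds a reference value."""
    # Collect the reference values once (analogous to building one array with list).
    wanted = set(reference_values)
    kept = []
    for row in rows:
        # Gather this row's candidate values from the columns being checked.
        candidates = [row[col] for col in columns if col in row]
        # Intersect with the reference values, then test the length of the result.
        matches = [value for value in candidates if value in wanted]
        if len(matches) > 0:
            kept.append(row)
    return kept


# Hypothetical sample data standing in for categories.txt and the source dataset.
reference = ["books", "toys"]
source = [
    {"id": "1", "primary": "books", "secondary": "garden"},
    {"id": "2", "primary": "garden", "secondary": "tools"},
    {"id": "3", "primary": "music", "secondary": "toys"},
]

kept_rows = filter_by_reference(source, reference, ["primary", "secondary"])
print([row["id"] for row in kept_rows])  # ['1', '3']
```

Passing the list of columns as a parameter is one way to handle source data with a different number of columns to check; in the template itself that adjustment is made in the recipe.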

To customize this template for your own use case, supply your own reference dataset in place of categories.txt, then modify the "Find values and filter" recipe to match the number of columns in your source data that the filter should check.

New to Trifacta?

Sign up below for our free 30-day trial to use this template.


Already have an account?

Download template (Trifacta version) and import it on the Flows page.

Is your data on Google Cloud?

  1. Download template (Dataprep version)
  2. Launch Dataprep on Google Cloud
  3. Import it on the Flows page

Learn more about Dataprep

How to Import