Trifacta wins the Best Data-Driven SaaS Product award at the 2021 Annual Cloud & SaaS Awards

Start Free

Speed up your data preparation with Trifacta

Free Sign Up

Extract Valuable Information by Parsing PDF Files

Figure: Wrangling PDF Flow, the flow view of this template

Better understand the data in your PDF files by parsing it, extracting it, and importing it into individual datasets.

Data Sources:
PDF files

This template shows how you can wrangle and parse PDF files. Right-click the data source in this template to see the available parsing options. You can extract all the tables in the PDF document into a single dataset, or import each table as a separate dataset. Once you understand these options, feel free to replace the example data with your own PDF files.
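To make the difference between the two import options concrete, here is a minimal Python sketch. It is not Trifacta's implementation, and it does not parse PDFs itself: it assumes the tables have already been extracted as lists of rows with a header in the first row. The function names `import_separately` and `import_combined`, the `source_table` column, and the sample data are all hypothetical, chosen only for illustration. How Trifacta actually aligns columns when combining tables is not described here, so the union-of-columns approach below is an assumption.

```python
from typing import Dict, List

# A table is a list of rows; the first row is assumed to be the header.
Table = List[List[str]]


def import_separately(tables: List[Table], base_name: str = "table") -> Dict[str, Table]:
    """Keep each extracted table as its own named dataset."""
    return {f"{base_name}_{i}": t for i, t in enumerate(tables, start=1)}


def import_combined(tables: List[Table]) -> Table:
    """Stack all tables into one dataset (assumed strategy: union of columns).

    A hypothetical 'source_table' column records which table each row came
    from; cells for columns a table lacks are left empty.
    """
    # Collect column names in first-seen order across all tables.
    columns: List[str] = []
    for t in tables:
        if not t:
            continue
        for name in t[0]:
            if name not in columns:
                columns.append(name)

    combined: Table = [["source_table"] + columns]
    for i, t in enumerate(tables, start=1):
        if not t:
            continue
        header, rows = t[0], t[1:]
        for row in rows:
            record = dict(zip(header, row))
            combined.append([str(i)] + [record.get(c, "") for c in columns])
    return combined


# Hypothetical tables, standing in for what a PDF parser might return.
tables = [
    [["item", "qty"], ["apple", "3"], ["pear", "5"]],
    [["item", "price"], ["apple", "0.50"]],
]

separate = import_separately(tables)   # two datasets: table_1, table_2
combined = import_combined(tables)     # one dataset with a source_table column
```

Separate datasets keep each table's own columns intact, which suits PDFs whose tables are unrelated. A combined dataset is easier to work with when the tables share a layout, such as one table continued across several pages.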

For more information, please read this detailed documentation guide.


New user?

Use the buttons above to start your 30-day free trial. If most of your data is on Google Cloud Platform, please use Dataprep. Otherwise, choose Trifacta.

Learn more about Dataprep