See How Data Engineering Gets Done on Our Do-It-Yourself Data Webcast Series

Start Free

Speed up your data preparation with Trifacta

Free Sign Up
Summer of SQL

A Q&A Series with Joe Hellerstein

See why SQL is Back


Get the latest insights on data engineering

Posts about Product

What’s New in Trifacta 8.7 Release

We are back with a brand new release from Trifacta that delivers more capabilities towards better data engineering. Here are the highlights from the latest 8.7 release from Trifacta. Data Pipelines and Slack Channels We now facilitate two powerhouses to work with each other. You can create tasks from Trifacta Plans, the command center for […]

Shyam Srinivasan  |  September 16, 2021

What’s New in Trifacta 8.6 Release

The summer of 2021 is speeding away, and our journey of innovation continues here at Trifacta. It’s now time to share the highlights from the latest 8.6 release from Trifacta.  Flexible Schedule Management Collaboration with flexibility is one of the key aspects of data engineering. You can now experience seamless collaboration with the ability to […]

Shyam Srinivasan  |  August 25, 2021

Leveraging Data Analytics to Enhance Athletic Performance: Part 3

Content-Based Recommendation Engine I worked for a fitness-training startup that offered personalized recommendations to help people reach their fitness goals. The company wanted to know if applying data analytics and artificial intelligence/machine learning (AI/ML) techniques could answer some of their business questions and enhance trainees’ performance. This is the third of a 3-part blog series […]

Angel Aponte  |  August 18, 2021

Leveraging Data Analytics to Enhance Athletic Performance: Part 2

Weighted Association Rules Mining and Graph Analysis I worked for a fitness-training startup that offered personalized recommendations to help people reach their fitness goals. The company wanted to know if applying data analytics and artificial intelligence/machine learning (AI/ML) techniques could answer some of their business questions and enhance trainees’ performance. This is the second of […]

Angel Aponte  |  August 12, 2021

Leveraging Data Analytics to Enhance Athletic Performance: Part 1

Gathering Data and Identifying Key Variables If you’ve been watching the 2020 Olympics Games from Tokyo, you’ve likely come to appreciate how much effort is expended in achieving athletic excellence. Athletes are closely coached, monitored, and receive guidance on each performance. Their progress is tracked and recorded, and data plays a big part in the […]

Angel Aponte  |  August 6, 2021

Follow Trifacta on Facebook, LinkedIn and Twitter.

Enabling the Modern Data Stack with the Unified Data Warehouse / Data Lake Architecture

Data continues to be the most relevant entity for any organization. This trend continues to gain steam with recent concepts such as “data is the new oil of the digital economy” and “data is the new software.” Companies are moving towards modern data architectures, especially with the cloud being the catalyst to enable scale and […]

Shyam Srinivasan  |  August 3, 2021

BigQuery Optimization from Trifacta
Behind The Scenes

This is the second blog post in a series on how Trifacta integrates with BigQuery to achieve performance gains with data transformation using SQL queries from Trifacta recipes. In the first part of this series, we discussed how BigQuery Pushdown from Trifacta leverages the scale and efficiency of cloud data warehouses to process data quickly, […]

Shyam Srinivasan  |  
Himanshu Shekhar  |  July 29, 2021

From Trifacta Recipes to SQL Scripts with Google BigQuery

As a leader in data engineering, one of the key focus areas for Trifacta is to play a key role in the modern data stack. Cloud data warehouses are moving from traditional ETL to modern ELT architectures and Trifacta plays an important role by enabling the “T” in ELT with advanced data transformation, at the […]

Shyam Srinivasan  |  
Himanshu Shekhar  |  July 28, 2021

What’s New in Trifacta 8.5 Release

Time flies faster when you are having fun. It’s already a month since our last update on what’s new from Trifacta. Our innovation journey continues and we are happy to announce the latest 8.5 release from Trifacta. Below are the highlights. Efficient Data Management  One of the key traits of the modern cloud data warehouse […]

Shyam Srinivasan  |  July 26, 2021

Analyze YouTube Channel Data With Dataprep and BigQuery

You have launched your YouTube channel with great fanfare! You have posted your first few videos, and the likes and comments are starting to flow in – hooray! Now what? Obviously you have invested a lot of effort in launching your channel. You want to monitor how your videos are doing, and how the audience […]

Vijay Balasubramaniam  |  July 20, 2021

Be a part of our internationally growing team.

Join The Team

Faster, Easier Data Engineering with Ready-to-use Templates

I work with a lot of data (who doesn’t?) and I need to make it usable and actionable to deliver advanced insights for my applications. I wish I could look at some ready-made examples, especially for common tasks and use cases and follow along. If you are thinking like me, look no further as I […]

Shyam Srinivasan  |  June 10, 2021

Blazingly fast execution with BigQuery Pushdown for Google Cloud Dataprep

In the previous blog of this series, we discussed how Trifacta intelligently “pushes-down” the transformation logic to the source database, thereby reducing the overall data load ingested into the application, and significantly improving performance. Now let’s take it to the next level, and talk about what Trifacta can do if the source and destination for […]

Mohit Gulati  |  April 7, 2021

Trifacta Partners with Databricks to Deliver Faster ROI on Data Lakehouses

Today, we’re excited to announce that Trifacta now natively integrates with the Databricks Lakehouse Platform. As a quintessential data lakehouse, the Databricks Lakehouse Platform is streamlined, open, and capable of supporting a wide range of analytics workloads and data types. For those unfamiliar with the term, a “lakehouse” combines elements of both a data lake […]

Ash Vijayakanthan  |  April 7, 2021

Which edition of Google Cloud Dataprep by Trifacta would you use?

We recently announced a new pricing model with multiple editions to enable any organization, be it companies with mature data engineering teams accustomed to managing modern data pipelines or companies with relatively new data engineering teams with little to no experience. We will now dive deep into each edition of the pricing model to help […]

Bertrand Cariou  |  April 7, 2021

Connect your Data from Anywhere for Data Engineering

From the millions of records created each day by enterprises in business applications, to the billions of hourly visits to social media sites, to the trillions of data points collected by IoT sensors and monitoring applications, data is being collected everywhere and at an astonishing pace. It is imperative that we enable connectivity to this […]

Mohit Gulati  |  April 6, 2021

Blueprint to Implement a Survey Data Warehouse Solution on Google Cloud

Designing a new cloud data warehouse on Google Cloud can feel like choice overload. There’s a variety of analytic services to consider that will help build your data pipelines, not to mention the many reporting solutions available for selection. And even after those choices are resolved, how do you integrate all these services? Or design […]

Bertrand Cariou  |  April 2, 2021

Pushing Data Engineering Tasks to Go Faster with New Pushdown Optimization

In the modern data stack, the hard work of data transformation—the “T” in “ELT”—is pushed into powerful cloud data warehouses. But because raw data is everywhere—in files, on-premises relational databases, NoSql Databases, and SaaS applications—it’s not always possible to push the full data transformation workload into your cloud data warehouse.  You need flexibility that doesn’t […]

Mohit Gulati  |  March 23, 2021

Introducing Trifacta Photon Job Execution on All Editions

Agile data transformation requires fast iteration: Assess the current state of your data, transform it to move closer to your desired end state, repeat. Accelerating that interactive loop dramatically accelerates the overall time it takes to build data pipelines. Executing transformations is a major factor in the latency of this loop. Transforming data requires computation, […]

Sean Kandel  |  February 9, 2021