See How Data Engineering Gets Done on Our Do-It-Yourself Data Webcast Series

Start Free

Speed up your data preparation with Trifacta

Free Sign Up
Summer of SQL

A Q&A Series with Joe Hellerstein

See why SQL is Back


Get the latest insights on data engineering

  • English
  • French
  • German
  • English
  • Introducing Trifacta’s integration with dbt Core on Google BigQuery

    One of Trifacta's most popular capabilities is the generation of rich data profile reports from datasets that help with data validation. These reports enable our users to troubleshoot mismatches or other issues with the datasets, validate and correct them, to ensure high-quality data is always delivered through their data pipelines.

    Nate Vaziri  |  October 26, 2021

    What’s New in Trifacta 8.8 Release

    Greetings! We’re back with a new release from Trifacta. Here are the highlights from our latest 8.8 release.  Pushdown Optimization Now Supports Additional Transformations With the 8.8 release, we now support more than 80 data types, functions, and transformations for Pushdown Optimization. This is part of our continued efforts to enable the modern data stack […]

    Shyam Srinivasan  |  October 21, 2021

    Google Next ‘21 Top 3 Data Cloud Innovations

    Google Next ‘21 brought big changes and ideas to the data analytics world. Google Cloud’s annual conference always promises to share the latest and greatest about the Google Cloud Platform, and this year the conference delivered with BigQuery Omni GA, Dataplex GA and Trifacta’s Google Cloud Dataprep success.  For the second year in a row, […]

    Bertrand Cariou  |  October 18, 2021

    Making the Most of Your BigQuery Investments for Scalable Data Engineering Pipeline

    When we released BigQuery Pushdown for Dataprep on Google Cloud back in April, we knew that it was a highly anticipated ELT (Extract Load & Transform) feature that would help both design time and processing time. However, we did not expect it to be adopted so quickly. Our internal benchmark of 20x job acceleration was […]

    Bertrand Cariou  |  October 11, 2021

    Data Engineering, Science, Analytics? Data Jobs Explained

    Every week, there’s a new hot jobs list. Data engineering, data science, or data analytics roles tend to appear near the top. Amid the “Great Resignation,” people across industries and career paths are considering what’s next: a new company, a new career, back to school to learn new skills, or maybe a different option. Constant […]

    Mark Sarbiewski  |  October 8, 2021

    Follow Trifacta on Facebook, LinkedIn and Twitter.

    Trifacta Monthly Roundup: September 2021

    Hello and welcome to our first edition of the monthly roundup of activities at Trifacta. The beautiful fall season is upon us and what a wonderful month it has been. It has truly been a September to Remember at Trifacta meeting customers at events, winning industry awards, launching new capabilities, and publishing new data engineering […]

    Shyam Srinivasan  |  October 1, 2021

    Hiring in High Tech: Navigating the Workforce Shift

    The workforce shift is undeniable: from career changes to return to office, from remote work to global, flexible teams, the trends we see today will permanently alter worker expectations. The tech industry has long led the way for hybrid approaches to work. All eyes are on these companies of all sizes as the “return to […]

    Katie Murphy  |  September 30, 2021

    View from the Summit: It Takes a Village to Raise a Dataset at Eli Lilly

    As the saying goes, it takes a village—of family members, teachers, neighbors, and a greater community—to raise a child into a healthy, productive member of society. Similarly, it takes a village to raise a productive dataset, according to Randy Santiano, associate technical consultant at global healthcare leader Eli Lilly. Randy presented his work at Wrangle […]

    Mark Sarbiewski  |  September 24, 2021

    What’s New in Trifacta 8.7 Release

    We are back with a brand new release from Trifacta that delivers more capabilities towards better data engineering. Here are the highlights from the latest 8.7 release from Trifacta. Data Pipelines and Slack Channels We now facilitate two powerhouses to work with each other. You can create tasks from Trifacta Plans, the command center for […]

    Shyam Srinivasan  |  September 16, 2021

    3 Reasons to Attend Big Data London

    There’s a tradeshow on my calendar next week. And there’s not a Zoom link in sight. Virtual conferences have made a huge impact in the past year, and I’m glad to see them continue, but right now, I’m thrilled to see the return of an in-person event: Big Data London. In the words of the […]

    Andy Bromley  |  September 15, 2021

    Be a part of our internationally growing team.

    Join The Team

    Introducing the Trifacta Python SDK

    Background In recent years, Python has become one of the most popular object-oriented programming languages. Whether you are a beginner or an experienced programmer, Python’s simple, easy-to-learn syntax enables quick readability and integration with heterogeneous systems. This simple method of programming makes Python very attractive for scripting as well as connecting different components of software […]

    Shyam Srinivasan  |  September 14, 2021

    View from the Summit: Data vs. Delta (and Other Infectious Diseases)

    Here in the United States, it looks like the Delta variant of COVID-19 may drive us back into our pandemic pods instead of back to school. But I have reasons to be optimistic. Fighting infectious diseases is, of course, a health problem, but the key to enabling every stakeholder – from the scientists to the […]

    Mark Sarbiewski  |  September 8, 2021

    Back to SQL: Data Engineering

    As part of growing our massive new Data Science program at Berkeley, it became clear that we needed to target a class specifically for Data Engineering. The goals of Data Engineering are different than Software Engineering. So it was interesting to think through this curriculum and how we would teach it differently than our established database classes.

    In this new approach, we ended up emphasizing four steps to SQL for Data Engineering that are atypical of a traditional databases class: data quality, data reshaping, “spreadsheet tasks,” and data pipeline testing.

    Joe Hellerstein  |  September 7, 2021

    Good Stuff You Can Learn From Bad Data

    Let’s say you’re remodeling your kitchen. You want to replace the old linoleum with beautiful new hardwood floors. But as you rip up the old flooring, you realize the subfloor next to your kitchen sink is rotted through.   What do you do? At a minimum, you need to patch the hole. You may need to […]

    Shyam Srinivasan  |  September 2, 2021

    Transformation: Next Level SQL

    When we use SQL for Transformation—the “T” in ELT—the focus changes. In this case, we’re taking many messy and disparate tables and manipulating them into a more usable or common form. To take our example from before, we may be extracting and loading sales data from 17 electronics chains that sold the phones, and our job in SQL is to write transformation queries that integrate that data together.

    Joe Hellerstein  |  August 30, 2021

    What’s New in Trifacta 8.6 Release

    The summer of 2021 is speeding away, and our journey of innovation continues here at Trifacta. It’s now time to share the highlights from the latest 8.6 release from Trifacta.  Flexible Schedule Management Collaboration with flexibility is one of the key aspects of data engineering. You can now experience seamless collaboration with the ability to […]

    Shyam Srinivasan  |  August 25, 2021

    SQL Pipelines and ELT

    ELT is increasingly attractive these days. Modern data warehouses are flexible and increasingly cost-effective, allowing us to store large volumes of data—even messy data that includes volumes of text and images. In this environment, transformations occur in the data warehouse, where the native language is SQL. 

    Joe Hellerstein  |  August 23, 2021

    Leveraging Data Analytics to Enhance Athletic Performance: Part 3

    Content-Based Recommendation Engine I worked for a fitness-training startup that offered personalized recommendations to help people reach their fitness goals. The company wanted to know if applying data analytics and artificial intelligence/machine learning (AI/ML) techniques could answer some of their business questions and enhance trainees’ performance. This is the third of a 3-part blog series […]

    Angel Aponte  |  August 18, 2021