Start Free

Speed up your data preparation with Trifacta

Free Sign Up
Summer of SQL

A Q&A Series with Joe Hellerstein

See why SQL is Back
 

Blog

Get the latest insights on data engineering

Language
  • English
  • French
  • German
  • English
  • What’s New in Trifacta 8.7 Release

    We are back with a brand new release from Trifacta that delivers more capabilities towards better data engineering. Here are the highlights from the latest 8.7 release from Trifacta. Data Pipelines and Slack Channels We now facilitate two powerhouses to work with each other. You can create tasks from Trifacta Plans, the command center for […]

    Shyam Srinivasan  |  September 16, 2021

    3 Reasons to Attend Big Data London

    There’s a tradeshow on my calendar next week. And there’s not a Zoom link in sight. Virtual conferences have made a huge impact in the past year, and I’m glad to see them continue, but right now, I’m thrilled to see the return of an in-person event: Big Data London. In the words of the […]

    Andy Bromley  |  September 15, 2021

    Introducing the Trifacta Python SDK

    Background In recent years, Python has become one of the most popular object-oriented programming languages. Whether you are a beginner or an experienced programmer, Python’s simple, easy-to-learn syntax enables quick readability and integration with heterogeneous systems. This simple method of programming makes Python very attractive for scripting as well as connecting different components of software […]

    Shyam Srinivasan  |  September 14, 2021

    View from the Summit: Data vs. Delta (and Other Infectious Diseases)

    Here in the United States, it looks like the Delta variant of COVID-19 may drive us back into our pandemic pods instead of back to school. But I have reasons to be optimistic. Fighting infectious diseases is, of course, a health problem, but the key to enabling every stakeholder – from the scientists to the […]

    Mark Sarbiewski  |  September 8, 2021

    Back to SQL: Data Engineering

    As part of growing our massive new Data Science program at Berkeley, it became clear that we needed to target a class specifically for Data Engineering. The goals of Data Engineering are different than Software Engineering. So it was interesting to think through this curriculum and how we would teach it differently than our established database classes.

    In this new approach, we ended up emphasizing four steps to SQL for Data Engineering that are atypical of a traditional databases class: data quality, data reshaping, “spreadsheet tasks,” and data pipeline testing.

    Joe Hellerstein  |  September 7, 2021

    Follow Trifacta on Facebook, LinkedIn and Twitter.


    Good Stuff You Can Learn From Bad Data

    Let’s say you’re remodeling your kitchen. You want to replace the old linoleum with beautiful new hardwood floors. But as you rip up the old flooring, you realize the subfloor next to your kitchen sink is rotted through.   What do you do? At a minimum, you need to patch the hole. You may need to […]

    Shyam Srinivasan  |  September 2, 2021

    Transformation: Next Level SQL

    When we use SQL for Transformation—the “T” in ELT—the focus changes. In this case, we’re taking many messy and disparate tables and manipulating them into a more usable or common form. To take our example from before, we may be extracting and loading sales data from 17 electronics chains that sold the phones, and our job in SQL is to write transformation queries that integrate that data together.

    Joe Hellerstein  |  August 30, 2021

    What’s New in Trifacta 8.6 Release

    The summer of 2021 is speeding away, and our journey of innovation continues here at Trifacta. It’s now time to share the highlights from the latest 8.6 release from Trifacta.  Flexible Schedule Management Collaboration with flexibility is one of the key aspects of data engineering. You can now experience seamless collaboration with the ability to […]

    Shyam Srinivasan  |  August 25, 2021

    SQL Pipelines and ELT

    ELT is increasingly attractive these days. Modern data warehouses are flexible and increasingly cost-effective, allowing us to store large volumes of data—even messy data that includes volumes of text and images. In this environment, transformations occur in the data warehouse, where the native language is SQL. 

    Joe Hellerstein  |  August 23, 2021

    Leveraging Data Analytics to Enhance Athletic Performance: Part 3

    Content-Based Recommendation Engine I worked for a fitness-training startup that offered personalized recommendations to help people reach their fitness goals. The company wanted to know if applying data analytics and artificial intelligence/machine learning (AI/ML) techniques could answer some of their business questions and enhance trainees’ performance. This is the third of a 3-part blog series […]

    Angel Aponte  |  August 18, 2021

    Be a part of our internationally growing team.


    Join The Team

    Summer of SQL: Why It’s Back

    For the first decades of the Millenium, it seemed like the Java-centric approach was the "hot new thing," but SQL has been roaring back. Today, SQL seems to be the focus of every data engineering conversation and popping back up on billboards in Silicon Valley. 

    The comparison of the two "shops" inevitably leads to the question: which is better? There are pros and cons to emphasizing one or the other. 

    Joe Hellerstein  |  August 16, 2021

    Leveraging Data Analytics to Enhance Athletic Performance: Part 2

    Weighted Association Rules Mining and Graph Analysis I worked for a fitness-training startup that offered personalized recommendations to help people reach their fitness goals. The company wanted to know if applying data analytics and artificial intelligence/machine learning (AI/ML) techniques could answer some of their business questions and enhance trainees’ performance. This is the second of […]

    Angel Aponte  |  August 12, 2021

    Trifacta at HIMSS21
    Building a Healthier World with Clean Data

    If we were to pause and reflect on the past 20+ months, one major takeaway would be ‘Health Is Wealth’. As we strive to get back to a normal world, healthcare innovation must expand and accelerate on all fronts for a better tomorrow. At Trifacta, we are privileged to work with healthcare organizations around the […]

    Mark Sarbiewski  |  August 9, 2021

    Leveraging Data Analytics to Enhance Athletic Performance: Part 1

    Gathering Data and Identifying Key Variables If you’ve been watching the 2020 Olympics Games from Tokyo, you’ve likely come to appreciate how much effort is expended in achieving athletic excellence. Athletes are closely coached, monitored, and receive guidance on each performance. Their progress is tracked and recorded, and data plays a big part in the […]

    Angel Aponte  |  August 6, 2021

    Enabling the Modern Data Stack with the Unified Data Warehouse / Data Lake Architecture

    Data continues to be the most relevant entity for any organization. This trend continues to gain steam with recent concepts such as “data is the new oil of the digital economy” and “data is the new software.” Companies are moving towards modern data architectures, especially with the cloud being the catalyst to enable scale and […]

    Shyam Srinivasan  |  August 3, 2021

    BigQuery Optimization from Trifacta
    Behind The Scenes

    This is the second blog post in a series on how Trifacta integrates with BigQuery to achieve performance gains with data transformation using SQL queries from Trifacta recipes. In the first part of this series, we discussed how BigQuery Pushdown from Trifacta leverages the scale and efficiency of cloud data warehouses to process data quickly, […]

    Shyam Srinivasan  |  
     
    Himanshu Shekhar  |  July 29, 2021

    From Trifacta Recipes to SQL Scripts with Google BigQuery

    As a leader in data engineering, one of the key focus areas for Trifacta is to play a key role in the modern data stack. Cloud data warehouses are moving from traditional ETL to modern ELT architectures and Trifacta plays an important role by enabling the “T” in ELT with advanced data transformation, at the […]

    Shyam Srinivasan  |  
     
    Himanshu Shekhar  |  July 28, 2021

    What’s New in Trifacta 8.5 Release

    Time flies faster when you are having fun. It’s already a month since our last update on what’s new from Trifacta. Our innovation journey continues and we are happy to announce the latest 8.5 release from Trifacta. Below are the highlights. Efficient Data Management  One of the key traits of the modern cloud data warehouse […]

    Shyam Srinivasan  |  July 26, 2021