The Different Approaches to “T” in ELT and What’s Required to Drive Mass Adoption
Much has been written about the shift from ETL to ELT and how ELT enables superior speed and agility for modern analytics. One important move to support this speed and agility is creating a workflow that enables data transformation to be exploratory and iterative. Defining an analysis requires an iterative loop of forming and testing […]
3 Data-led Companies, 3 Data Warehouses: Why They All Chose Trifacta for Data Preparation
It’s safe to say that 2020 showed us that data warehouses have not only found a new home in the cloud but have cemented their position as the foundation of every organization’s data strategy moving forward. By 2022, Gartner predicts the overwhelming majority of all databases (75%) will be deployed or migrated to a cloud […]
Wrangle Summit Sneak Peek – The First Industry Event Focused on Data Engineering
Exponential growth, coupled with a whirlwind of change – this is how I would describe the past five years of the data and analytics industry. At the platform level, it seems like only yesterday Big Data was at its peak and we were watching many of the major platform providers go public. Now, it’s undeniable […]
Trifacta Legend December 2020: Alex Hardman
Alex is an avid Trifacta user and is championing Trifacta at Rezco Asset Management which was established in 1981 in South Africa with a deliberate focus on preserving capital and creating wealth.
Setting Up Data Quality Monitoring For Cloud Dataprep Pipelines
Build a simple, flexible, yet comprehensive Data Quality monitoring solution for your Google Cloud Dataprep by Trifacta pipelines with Cloud Functions, BigQuery and Data Studio Building a Data Quality Dashboard Building a modern data stack to manage analytic pipelines—such as Google Cloud and a BigQuery data warehouse or data lake—has many benefits. One such benefit […]
Monitoring Data Quality Trends with Cloud Dataprep and Data Studio
Automatic data quality assessment is a Trifacta user favorite. Who wouldn’t want to give their eyes a rest from combing through data while Trifacta automatically points out possible data flaws? The feature is particularly useful when onboarding or integrating unfamiliar data. With unfamiliar data, it’s not only difficult to tell what errors might be lurking […]
What Is a Data Stack and How Does It Impact Analytics?
We hear a lot about organizations undergoing “data modernization” in order to become more data-driven. Essentially what that means is that these organizations have recognized that legacy data tools aren’t very good at solving modern data problems. They’re in the process of moving data out of legacy mainframe databases and, at the same time, replacing […]
Google Sheets: Data Validation Tips & Tricks
Google Sheets is one of the most widely-used spreadsheet tools. Still, many of its best features go undiscovered. Let’s take a closer look at how to do data validation in Google Sheets, which is commonly used to build drop-down lists. Why data validation matters Data validation is like the analytic version of copyediting. As much […]
Orchestrate Your Data Pipelines on Trifacta Using Plans
Why create a plan? The short answer is to operationalize and automate your data pipelines on Trifacta.
Easily Publish to Data Warehouses with New Rename Functions in Trifacta
Chances are you’re having to work with several different databases and data warehouses in your analytics stack. It just is what it is today. In order to get an accurate picture in your reporting you have to use everything. However, working with these different database can be like, well this: When publishing tables in different […]
Be a part of our internationally growing team.
Join The Team
How to Automatically Deploy a Google Cloud Dataprep Pipeline Between Workspaces
This article explains how to use Cloud Composer to automate Cloud Dataprep flow migration between two workspaces. This process can be leveraged for your Cloud Data Warehouse project to move from development, test, and production following what is known as Continuous Integration and Continuous Delivery (CI/CD) pipeline in agile development. At a high level, this […]
November Legend: Angel Aponte
Through advanced machine learning (ML) techniques and the use of key technologies like Trifacta, Ángel has built a COVID-19 Diagnosis Tool that helps healthcare professionals better respond to COVID-19 cases.
Why ESADE MIBA Students Learn to Use Trifacta and AWS
The MIBA programme at ESADE focuses on how data-driven technologies are reshaping the world. We approach this transformation from three different perspectives: Business, Data Science, and Engineering. In the real world, the engineering part related to data sciences is progressing quickly thanks to the democratization tools and platforms that allow companies to streamline and industrialize […]
Data Preparation Best Practices for Snowflake Data Warehouses
Snowflake is a platform known for their separation of storage and compute, which makes scaling data more efficient. However, to get the most value from your investment in Snowflake’s Cloud Data Warehouse, your organization must break through the biggest bottleneck to analytics and AI: data preparation. Here are five data preparation best practices your organization […]
Advanced Analytics vs. Business Intelligence: What’s The Difference?
Advanced analytics and business intelligence (BI) have more or less the same objective—use data to drive insights that inform business strategy. So what’s the difference? What is business intelligence? Business intelligence is an umbrella term for software and services that provide comprehensive yet straightforward insights about an organization’s current state. Think routine reporting or dashboarding, […]
How to Change Date Format in Excel
When you enter a date into Microsoft Excel, the program will format it according to the default date settings. For example, if you want to enter the date February 6, 2020, the date could appear as 6-Feb, February 6, 2020, 6 February, or 02/06/2020, all depending on your settings. You may find that if you […]
Publishing Data to Snowflake Using Trifacta Data Quality Rules
When publishing data to cloud data warehouse Snowflake for analytic use, data quality is of the utmost importance. Improperly curated data threatens the validity of the end analysis. Data Quality Rules in Trifacta accelerates the process of ensuring data quality by automatically generating a list of data quality rules for users to select from and […]
How to Use Trifacta and Snowflake to Prepare Data for Home Price & Rental Analysis
If you are using Snowflake as your cloud analytics platform, Trifacta can help accelerate the process of data preparation and cleaning. In this demo, we will demonstrate how to use Trifacta to accelerate the process of preparing data before publishing the results to cloud data warehouse Snowflake. Specifically, we will showcase finding the price-to-rent ratio […]