Trifacta for Google Cloud

Trifacta Powers Google Cloud’s Native Dataprep Service

Google Cloud Dataprep by Trifacta is a native Google Cloud service jointly developed and supported by the two companies. Cloud Dataprep combines Trifacta’s award-winning, interactive data wrangling experience with the elastic scale of Google Cloud storage and processing. It enables data engineers and analysts to prepare diverse data and configure data pipelines that feed downstream analytics and machine learning initiatives on Google Cloud. Cloud Dataprep is available in the GCP console and adheres to the same consumption, invoicing, and security principles as the rest of Google Cloud.

Learn about Cloud Dataprep Capabilities, Plans and Pricing
Cloud Dataprep Data Privacy and Security Frequently Asked Questions

Augment Marketing Analytics
Advance Cloud Data Warehouse Adoption

Enhance the quality and value of data available in BigQuery with Cloud Dataprep’s automated data cleaning and transformation capabilities.

Refine Your Cloud Data Lake

Empower analysts, data scientists and engineers to access and refine raw data in Google Cloud Storage in a centrally governed platform.

Supported Services

  • Google Cloud Storage
  • Cloud Dataflow
  • Google BigQuery
  • BigQuery ML
  • Google Data Studio
  • Looker
  • Google AutoML
  • Cloud ML Engine

With over a thousand stores and hundreds of thousands of employees, Woolworths Australia requires careful planning and optimization of our facilities to maximize returns. Every step in producing useful data insights, from data collection to advanced analytics, significantly influences the company’s strategy. With Cloud Dataprep’s new capabilities, such as orchestration, new connectors, and enterprise operationalization, we’ll be able to deploy Cloud Dataprep more broadly and guarantee repeatable and trustworthy data outcomes to inform our business.

The biggest bottleneck we see in organizations transitioning to the cloud for their analytics is data ingestion, preparation, and cleaning. Initially, organizations need to move data to a common cloud data lake or warehouse using an ETL platform such as Cloud Data Fusion. Once the data is there, it needs to be exposed to end users in a self-service manner for the specific use case they’re trying to execute. This is exactly why we partnered with Trifacta on Google Cloud Dataprep: to provide our customers with the best cloud solution for data integration and data wrangling.

Cloud Dataprep by Trifacta has enabled several of our analysts and data stewards to automate complex data preparation routines, build large analytical data models, and work with files too large for Excel or Access. This allowed us to avoid piling more work on top of our heavily backlogged data engineering teams and more than doubled our development velocity. By empowering data analysts, stewards, and scientists to perform tasks that would otherwise require a full development team, Cloud Dataprep has the potential to help us substantially improve the speed with which we deliver value.

Cloud Dataprep allows us to quickly explore new datasets, and its flexibility supports all our data transformation needs. Data preparation work at Merkle is now completed in minutes, not hours or days, reducing our data preparation time by 90%.