Alteryx Announces Acquisition of Trifacta

Start Free

Speed up your data preparation with Trifacta

Free Sign Up

Hosted by

Joe Hellerstein
Joe Hellerstein

Professor of Computer Science,
UC Berkeley
Co-Founder, Trifacta

Jeffrey Heer
Jeffrey Heer

Professor of Computer Science,
U. of Washington
Co-Founder, Trifacta

How to subscribe and listen

You can listen to THE DATA WRANGLERS in many places - right here on our site and on your favorite podcatcher. You can also subscribe so that you never miss an episode.

Episodes

Tidy Data with Hadley Wickham

Tidy Data with Hadley Wickham

By

What is the Tidyverse and why is it important? Hadley Wickham is a leading data scientist and advocate for improving data science with tidy data and data hygiene. He’s the Chief Scientist at RStudio and an Adjunct Professor of Statistics at the University of Oakland, Stanford University and Rice University. Join the Tidyverse discussion with The Data Wranglers Joe Hellerstein and Jeffrey Heer. #TheDataWranglers

Data Trends and Fragmentation with Bill Hostmann

Data Trends and Fragmentation with Bill Hostmann

By

How will the pendulum swing in data engineering? Join Bill Hostmann, senior research fellow at Dresner Advisory Services, and Data Wranglers Joe Hellerstein and Jeffrey Heer as they talk about fragmentation in the data industry and how to improve the performance of distributed database design.
#TheDataWranglers

2021 The Year in Data

2021 The Year in Data

By

What’s above the cloud? Learn what’s hot and what’s not, as The Data Wranglers Joe Hellerstein and Jeffrey Heer look back at the year in data. Facebook went down, SQL is back, and Hadoop is dead. What’s up with Spark, and are streaming databases finally real? The cloud continues to rise, there are issues in data ethics, and data is helping to combat Covid-19. And then there’s the kerfuffle between Snowflake and Databricks. #TheDataWranglers

Data with a Purpose with Moritz Stefaner

Data with a Purpose with Moritz Stefaner

By

Meet Moritz Stefaner, a data designer who uses data for storytelling and who helped design the official German Covid-19 vaccine data dashboard. Moritz tells The Data Wranglers — Jeffrey Heer and Adam Wilson — how he creates a character from a dataset to give it emotional meaning and talks about the Covid vaccine clock he created. And, he dives into his data visualizations for train traffic on a German railroad network, the promises and pitfalls of using machine learning for data design, and what it took to visualize 175 years of text from Scientific American. Moritz hosts the popular podcast, Data Stories. #TheDataWranglers

D3 and Data Visualization Insights with Mike Bostock

D3 and Data Visualization Insights with Mike Bostock

By

What’s the secret for D3’s long-time success? Mike Bostock, the creator of D3 shares the reasons for his data visualization tool’s longevity, and why it won the 10-year Test-of-Time award from the IEEE. Mike goes deep on D3 and Observable, which he also founded, and talks about all things visualization with The Data Wranglers Joe Hellerstein and Jeffrey Heer, including when it’s OK to use a bar-chart for getting quick data insights and the applications of time zone wrangling. #TheDataWranglers

Best Use Cases of Apache Kafka with Jun Rao

Best Use Cases of Apache Kafka with Jun Rao

By

Can Kafka be my database? Jun Rao, a co-founder at Confluent, the rocket-ship startup behind Apache Kafka®, answers this question and more in a round of database bingo with The Data Wranglers Joe Hellerstein and Jeffrey Heer. Rao discusses best use cases for Kafka, both traditional and newer applications, along with how to use SQL and variations for data transformation. Rao, who goes deep in both open-source and the roots of the database industry, is the co-author of more than 20 reference research papers and the co-inventor of more than a dozen U.S. software patents. #TheDataWranglers

The Inside Story of Apache Airflow with Steven Hillion

The Inside Story of Apache Airflow with Steven Hillion

By

What data orchestration platform is downloaded more than 10,000 times a day? Data scientist Steven Hillion joins The Data Wranglers Joe Hellerstein and Jeffrey Heer to give the inside story on Apache Airflow, used by data scientists and data engineers around the world. Apache Airflow is managed commercially by Astronomer.io, where Hillion is Head of Data and in his spare time, is writing a book of poems from mathematic formulas. #TheDataWranglers

Redefining Metadata and Data Science with Shirshanka Das

Redefining Metadata and Data Science with Shirshanka Das

By

What does metadata really mean? Data scientist Shirshanka Das joins The Data Wranglers, Joe Hellerstein, Jeffrey Heer and Adam Wilson, to re-define metadata. Das discusses his innovative work in data, including a decade at LinkedIn where he was part of a now-legendary data cabal that coined the term “data science” and built the open-source engineering tools Kafka, Pinot and DataHub. Recently, Shirshanka co-founded a new company, Acryl Data, to support the DataHub open-source project. #TheDataWranglers

Introducing: The Data Wranglers

Introducing: The Data Wranglers

By

Meet The Data Wranglers, with co-hosts Joe Hellerstein, Jeff Heer and their data wrangling expert guests. On Thursdays, The Data Wranglers will discuss and riff on data engineering, analytics, data science and all things modern data management. Don’t be surprised if Adam Wilson joins from time to time with insights on all things data. A Trifacta Podcast. #TheDataWranglers Visit www.Trifacta.com/podcast for more info.

About the Data Wranglers

The Data Wranglers is a biweekly podcast featuring experts in data engineering, analytics, data science, and all things modern data management, hosted by Joe Hellerstein and Jeff Heer.

Joe Hellerstein
Joe Hellerstein
Professor of Computer Science,
UC Berkeley
Co-Founder, Trifacta

Joe is Trifacta’s Chief Strategy Officer, Co-founder and Jim Gray Chair of Computer Science at UC Berkeley. His career in research and industry has focused on data-centric systems and the way they drive computing. In 2010, Fortune Magazine included him in their list of 50 smartest people in technology, and MIT Technology Review magazine included his Bloom language for cloud computing on their TR10 list of the 10 technologies “most likely to change our world”.

Jeffrey Heer
Jeffrey Heer
Professor of Computer Science,
U. of Washington
Co-Founder, Trifacta

Jeff Heer is Trifacta’s Chief Experience Officer, Co-founder and a Professor of Computer Science at the University of Washington, where he directs the Interactive Data Lab. Jeff’s passion is the design of novel user interfaces for exploring, managing and communicating data. The data visualization tools developed by his lab and collaborators (D3.js, Vega/Vega-Lite, Protovis, Prefuse) are used by thousands of data enthusiasts around the world. In 2009, Jeff was named to MIT Technology Review’s list of “Top Innovators under 35”.

s