Wrangle Summit 2021
The first industry event focused on data engineering
If you STANDARDIZECLEANTRANSFORMBLEND data this is your event.
Overview
The Best People, Ideas and Technology in Data Engineering, All in One Place
Data engineering is hot. Job listing site Dice shows demand for data engineers was up 50% in 2020. If you move, structure, clean, or pipeline data to drive critical enterprise decisions, you’re doing the work of data engineering.
It’s time you got your own conference.
Join hosts Trifacta and Google Cloud, along with other industry leaders, at Wrangle Summit 2021, the first conference dedicated exclusively to data engineering.
Join Us To:
- Learn, share and grow the discipline of data engineering and network with some of the world’s leading minds who are shaping its future
- Promote data engineers as the hero in data-driven projects, the ones who preserve the credibility of analytics and protect ML/AI models
- See the latest tools and techniques for accelerating the data analysis process
- Get training and certifications in new data engineering skills
Speakers
Debanjan Saha
VP/GM, Data Analytics, Google | IEEE Fellow
Debanjan Saha is the GM of Data Analytics at Google where he leads the strategy and execution of analytics services in GCP. Prior to joining Google, Debanjan was VP of Amazon Aurora and RDS at Amazon Web Services. Earlier in his career Debanjan held multiple executive and technical leadership positions at IBM and Tellium. Debanjan is a Fellow of the IEEE and a Distinguished Scientist of the ACM. He has co-authored a book, 50+ patent applications, and numerous technical articles including award winning papers and Internet standards. He received MS and PhD degrees from the University of Maryland, and a B.Tech from IIT, all in Computer Science. In 2019, Business Insider named him as one of the top 10 technology executives transforming business.
Benoit Dageville
Co-Founder, President of Products, Snowflake
Benoit co-founded Snowflake and currently serves as President of the Product division. Benoit is a leading expert in parallel execution and self-tuning database systems. Prior to founding Snowflake, Benoit was with Oracle for over 10 years as a lead architect for parallel execution in Oracle RAC and a key architect in the SQL Manageability group. Prior to Oracle, Benoit worked at Bull Information Systems. He helped define the architecture and lead database performance efforts for Bull’s parallel systems. Benoit has a PhD in Computer Science with a focus in Parallel Database Systems and is a named inventor on more than 80 patents.
Matei Zaharia
Co-Founder & Chief Technologist, Databricks | Assistant Professor at Stanford
Matei Zaharia is an Assistant Professor of Computer Science at Stanford University and Chief Technologist at Databricks. He started the Apache Spark open source project in 2009 as well as the MLflow open source machine learning platform, and helped design other widely used open source data and ML infrastructure including Spark SQL and Delta Lake. Matei’s academic research work was recognized through the 2014 ACM Doctoral Dissertation Award for the best PhD dissertation in computer science, an NSF CAREER Award, and the US Presidential Early Career Award for Scientists and Engineers (PECASE).
Tristan Handy
Founder & CEO, Fishtown Analytics
Tristan Handy is the CEO and Founder of Fishtown Analytics, a Philadelphia startup pioneering the practice of modern analytics engineering. Fishtown’s product, dbt, is used by over 3,000 companies to organize, catalog, and distill knowledge from the data in their data warehouses, including companies like JetBlue, HubSpot, GitLab, and the ACLU.
Tristan has been working in data for two decades in both in-house and consulting roles with both large enterprises and small startups.
Barr Moses
Co-Founder & CEO, Monte Carlo
Barr Moses is CEO & Co-Founder of Monte Carlo, a data reliability company backed by Accel, GGV, Redpoint, and other top Silicon Valley investors. Previously, she was VP Customer Operations at customer success company Gainsight, where she helped scale the company 10x in revenue and among other functions, built the data/analytics team. Prior to that, she was a management consultant at Bain & Company and a research assistant at the Statistics Department at Stanford. She also served in the Israeli Air Force as a commander of an intelligence data analyst unit. Barr graduated from Stanford with a B.Sc. in Mathematical and Computational Science.
Rob Woollen
CTO / Co-founder, Sigma Computing
Rob Woollen is the Chief Technology Officer and Co-founder at Sigma Computing where he empowers business leaders and domain experts to ask any question of their data without writing a single line of SQL. He has more than two decades of experience building distributed and cloud systems. Prior to founding Sigma, Woollen was the Entrepreneur-in-Residence at Sutter Hill Ventures and he worked at Salesforce.com for more than six, serving in several roles including, Chief Technology Officer for the platform and Work.com. Previously, he was at BEA Systems and HP, and is the inventor on numerous patents. Woollen earned a Bachelor of Science degree in Computer Science from Princeton University.
Adam Wilson
CEO, Trifacta
Adam is CEO of Trifacta and has over 25 years of experience in leadership roles focused on data integration and analytics. Under his leadership, Trifacta has become the global leader in data wrangling, serving thousands of companies worldwide. Prior to Trifacta, Adam was GM of Informatica’s ILM division. He also served as SVP of Product Management for Informatica’s flagship data integration products. Adam co-founded Zimba, an analytics company. He holds an MBA from the Kellogg School of Management and an engineering degree from Northwestern University.
Joe Hellerstein
Co-Founder & CSO, Trifacta | Professor at University of California, Berkeley
Joe is Trifacta’s Chief Strategy Officer, Co-founder and Jim Gray Chair of Computer Science at UC Berkeley. His career in research and industry has focused on data-centric systems and the way they drive computing. Fortune Magazine included him in their list of 50 smartest people in technology , and MIT’s Technology Review magazine included his work on their TR10 list of the 10 technologies “most likely to change our world”.
Jeffrey Heer
Co-Founder & CXO, Trifacta | Computer Science Professor at University of Washington
Jeff is Trifacta’s Chief Experience Officer, Co-founder and a Professor of Computer Science at the University of Washington, where he directs the Interactive Data Lab. Jeff’s passion is the design of novel user interfaces for exploring, managing and communicating data. The data visualization tools developed by his lab (D3.js, Protovis, Prefuse) are used by thousands of data enthusiasts around the world. In 2009, Jeff was named to MIT Technology Review’s list of “Top Innovators under 35”.
Salahhuddin Khawaja
Managing Director of Automation / Global Risk at Bank of America
Salah Khawaja is the Managing Director of Automation / Global Risk at Bank of America. He’s responsible for overseeing the transition from a highly manual testing program to a fully automated process, resulting in reduced operational and compliance risk for the Bank. Prior to joining Bank of America, he spent 15 years at Deloitte and JP Morgan running large scale technology transformation projects across various disciplines. Salah has a Bachelors in Computer Science from the Lahore University of Management Sciences (LUMS) and Masters in Telecommunication and Network Management from Syracuse University. He is certified as an Agile Scrum Master and Project Management Professional.
Eileen M. Vidrine
Air Force Chief Data Officer, U.S. Air Force
Eileen M. Vidrine is the Air Force Chief Data Officer, Headquarters, United States Air Force, Washington, D.C. She develops and implements strategies for enterprise data management, analytics, and digital transformation to optimize performance and drive innovation in and across all missions and operations. Ms. Vidrine began her government career in 1986 as an enlisted member of the U.S. Army and was commissioned in 1987 through the U.S. Army Officer Candidate School Program as a Transportation Officer. Later in her Army career, she was selected and integrated into the U.S. Army Acquisition Corps. She began her civilian career as a senior faculty member at the Joint Military Intelligence College and led the college’s technology transformation as the first Director for the Center for Educational Technologies. From 2006 to 2012, Ms. Vidrine served in various positions of leadership at the Office of the Director of National Intelligence (DNI) culminating as the Chief of Staff for the Assistant DNI for Human Capital. In 2012, she was selected to serve as the DoD Intelligence Community Enterprise Architect assigned to the Office of the Under Secretary of Defense for Intelligence, Human Capital Management Office (HCMO), and in 2014 assumed HCMO chief of staff responsibilities. In 2016, Ms. Vidrine was selected as a White House Leadership Fellow, supporting the Office of Management and Budget and the Office of Personnel Management. After her fellowship she was detailed to the Executive Office of the President Office of Administration. Ms. Vidrine was appointed to the Senior Executive Service in June 2018.
Randy Santiano
Associate Technical Consultant, Eli Lilly
Randy Santiano has 20+ years of working within the Life Science and Pharmaceutical Industry. His experiences range from laboratory sciences as a Chemist to developing software applications as a Technical Specialist. He enjoys exploring new and emerging technologies and prefers to collaborate with diverse groups to solve problems. Randy currently works at Eli Lilly as an Associate Consultant within the Clinical Design, Delivery, and Analytics (CDDA) organization.
Brad Parkel
Director of Marketing, Singlewire Software
Brad Parkel is Director of Marketing at Singlewire Software and has led marketing efforts with the company since its founding in 2009. Experienced and accomplished in all facets of B2B marketing; from video, graphics and multimedia production, to data analysis, wrangling, and email communications, Brad aligns business goals and objectives with tactical marketing campaigns and execution. Brad attended the University of Wisconsin-Platteville, and graduated from the University of Iowa with a BA in Communication Studies-Radio/TV/Film emphasis. Brad volunteers his time advising local public schools on business, marketing and IT curriculum.
Zack Pike
CIO, Callahan
Zack Pike is the CIO at Callahan and is an analytics mastermind, scouring mounds of information to uncover key insights that keep our clients ahead of the curve. His expertise in designing margin-generating strategic opportunities applies to companies in many different verticals. Previously, Zack was director at Alight Analytics, where he led a cross-functional consulting team providing marketing strategy and analytics to B2B and B2C executives. When he’s not digging into big data for clients, Zack moonlights as a youth b-ball and t-ball coach, analyzing plays for that winning edge.
James Wilcox
Co-Founder & Managing Partner, PlusUp
James started PlusUp in 2016 with a business partner after managing more than $50 million on paid social advertising for numerous Fortune 500 clients including Target, Allstate, AT&T and more. Over the past five years, he has led a team that has managed over $250MM+ in media spend while also developing PlusUp analytics practice to help automated all reporting and analysis for their clients.
Ells Campbell
Computational Biologist, Centers for Disease Control and Prevention
Ells Campbell is a computational biologist at CDC. Residing in the Laboratory Branch of the Division of HIV/AIDS Prevention, he leads development of an outbreak response tool called MicrobeTrace. To support COVID-19 response, Ells assists epidemiologists around the country in cleaning, visualizing, and exploring contact network data collected during contact tracing interviews.
Kalynn Kennon
Head of Data Engineering, Infectious Diseases Data Observatory, University of Oxford
Kalynn Kennon joined IDDO in August 2016. As Head of Data Engineering, she is responsible for the development of data standards, the robustness of the curation process, assuring the quality of the data, and the efficiency of the data curation team.
Andrew Coe
Data Wrangler, Genomics England
Andrew is a data solutions professional with over 10 years’ experience with data engineering, business intelligence, solution architecture and management. He has worked extensively in the healthcare sector and is currently focused on data pipeline engineering for the Covid-19 Genomic Research Environment.
Bekkie Brown
Supervisor of Data Engineering & Analytics Technology, Amway
Bekkie Brown, CSPO Supervisor – Data Engineering & Analytics Technology Bekkie Brown is currently the Supervisor of Data Engineering & Analytics Technology at Amway. She has been with Amway for 14 years and held a number of roles both within IT and the Business. Bekkie has led the team through the implementation of Google Cloud while promoting self-service for their Analytics Community. This is the first self-service initiative for transforming raw data into curated business friendly tables that has been executed outside of IT. Aside from leading the Data Engineers, she also leads a team of Operations Analysts who are supporting the platform, production pipelines and Analysts.
Dharshini Manoharan Bhuvaneswari
Business Data Analyst, SPAR International
Dharshini currently works with SPAR International, Netherlands as a Business Data Analyst and is collaborating with Partners across the globe. She works in end to end data project starting from data gathering, wrangling to data analysis. She has a Masters’ degree in Data Science from Monash University, Melbourne, Australia. She started her career at Capgemini, India as an Analyst where she worked with world’s leading retail clients to bridge the gap between strategy, business and numerical facts. After moving to Australia, she joined Honeywell as a Data Scientist( Intern). At Honeywell, she was involved in a groundbreaking Machine learning project (Cognitive Building Project) followed by which she continued to work in ML Projects at Monash University (Buildings and Property division).
Kevin Schaefer
Senior Data Engineer – Global Data and Analytics, Amway Corporation
Kevin is part of a core team of data engineers/scientists/analysts who have been tasked to design and implement digital transformation at Amway. Supported by nearly 20 years of experience in data analysis, visualization, and architecture, his current engineering role involves building capability infrastructure and data solutions on Google Cloud Platform.
Vijay Balasubramaniam
Director, Partner Solutions Architect, Trifacta
Vijay Balasubramaniam leverages his expertise in data management to help partners and customers be successful in large-scale analytics initiatives. He has over 18 years of experience helping large organizations manage their data sets and produce insights. He specializes in best-in-class data preparation workflows and developing end-to-end solutions on the AWS platform. Outside of work, he enjoys biking, tennis, music and spending time with family.
Ryan Rolf
Channel Program Leader, Sr. Business Development Manager, AWS Data Exchange
Ryan Rolf is the Channel Program Leader and Sr. Business Development Manager on AWS Data Exchange at Amazon Web Services (AWS). In this role, Ryan drives strategic collaborations and programs for AWS Data Exchange, which is a service that makes it easy to find, subscribe to, and use third-party data in the cloud. Prior to AWS, Ryan has held multiple other startup technology leadership roles.
Giuseppe Tortorici
Business Intelligence & Visualisation Manager, AON
Giuseppe Tortorici manages the BI and Visualization team in the Aon Centre for Innovation and Analytics (ACIA) in Dublin and Krakow. The team establishes best practices and guidelines for the Business Intelligence processes across the centers and for the broader AON. Giuseppe is passionate in data wrangling and data visualization and has over 15 years of experience as Engineer and Business Intelligence Professional in leading Telecommunication, Banking and Risk Management Companies.
Connor Carreras
Director SaaS Adoption & Enablement, Trifacta
Connor Carreras leads SaaS adoption and enablement efforts at Trifacta, where she leverages Trifacta’s centralized data warehouse for insights about customer behavior. She can write SQL and code (but finds both to be a pain), and as such, avoids calling herself a data engineer. Connor previously worked at Informatica and Microsoft.
Benefits
Up Your Game
We’ll have some of the world’s leading minds in data unveiling the latest tools and techniques for accelerating the process of getting data ready for analytics and machine learning.
Up Your Team's Game
Trainings, certifications, and more so that you and your team can learn new skills and ensure everyone is on the same page.
Up Your Networking Game
Yes, it is possible to network and meet the individuals that will change the trajectory of your career at a virtual conference. It won’t be weird…trust us.
Who is the Wrangle Summit for?
Any and all organizations are welcome at the Wrangle Summit:
- Businesses of any type and size
- Government agencies or departments
- Consultants in financial services, healthcare, energy and utilities, telecommunications, manufacturing, transportation and beyond
Any and all data workers are welcome at the Wrangle Summit:
- Data analysts and scientists
- Data engineers and architects
- Analytics executives
- IT
- Data integration/ETL developers or team leads
- Managers of data governance & stewardship
Schedule
- April 7, 2021
- April 8, 2021
- April 9, 2021
Wrangle Summit Day 1
9:00 AM - 9:05 AM
Welcome to Wrangle Summit 2021
9:05 AM - 9:45 AM PDT
Keynote, Fireside Chat
9:50 AM - 10:30 AM PDT
Keynote
10:30 AM - 10:40 AM PDT
Sponsor Showcase Break
10:40 AM - 11:10 AM PDT
Session
11:15 AM - 11:45 AM PDT
Session
11:50 AM - 12:20 PM PDT
Session
12:20 PM - 12:30 PM PDT
Sponsor Showcase Break
12:35 PM - 1:05 PM PDT
Lightning Talks
1:10 PM - 1:50 PM PDT
Keynote, Panel
1:50 PM - 2:00 PM PDT
Wrap-up Day 1
Wrangle Summit Day 2
9:00 AM - 9:40 AM PDT
Keynote, Fireside Chat
9:45 AM - 10:15 AM PDT
Session
10:15 AM - 10:25 AM PDT
Sponsor Showcase Break
10:25 AM - 10:55 AM PDT
Panel
11:00 AM - 11:30 AM PDT
Session
11:35 AM - 12:05 PM PDT
Lightning Talks
12:05 PM - 12:15 PM PDT
Sponsor Showcase Break
12:50 PM - 1:30 PM PDT
Keynote, Founders Panel
1:35 PM - 1:45 PM PDT
Wrap-up Day 2
Wrangle Summit Day 3
8:20 AM - 10:30 AM PDT
Training Sessions
Session details coming soon.
The Role of Data Engineering in Analytics Modernization
Every business – both large-scale Fortune 500 organizations & fast-growing upstarts – are extremely focused on modernizing their company’s approach to analytics in order to stay ahead of their competition. At a high level, there are two consistent goals across nearly every organization’s analytics modernization efforts – utilize more data AND derive value faster. Businesses need to be able to incorporate more and more data into their analytics processes regardless of the origin, shape and size of the data. Then, turn that expanding variety of data into something that is valuable for their business faster.
Join Trifacta CEO, Adam Wilson and Google Cloud Vice President & GM, Debanjan Saha, as they dive into the evolution of analytics in the cloud over the past few years and why data engineering has emerged as the critical component for organizational success.
Debanjan Saha
General Manager & Vice President of Data Analytics, Google
Debanjan Saha is the GM of Data Analytics at Google where he leads the strategy and execution of analytics services in GCP. Prior to joining Google, Debanjan was VP of Amazon Aurora and RDS at Amazon Web Services. Earlier in his career Debanjan held multiple executive and technical leadership positions at IBM and Tellium. Debanjan is a Fellow of the IEEE and a Distinguished Scientist of the ACM. He has co-authored a book, 50+ patent applications, and numerous technical articles including award winning papers and Internet standards. He received MS and PhD degrees from the University of Maryland, and a B.Tech from IIT, all in Computer Science. In 2019, Business Insider named him as one of the top 10 technology executives transforming business.
Adam Wilson
CEO, Trifacta
Adam is CEO of Trifacta and has over 25 years of experience in leadership roles focused on data integration and analytics. Under his leadership, Trifacta has become the global leader in data wrangling, serving thousands of companies worldwide. Prior to Trifacta, Adam was GM of Informatica’s ILM division. He also served as SVP of Product Management for Informatica’s flagship data integration products. Adam co-founded Zimba, an analytics company. He holds an MBA from the Kellogg School of Management and an engineering degree from Northwestern University.
Lakehouse: A New Generation of Open Platforms for Data Warehousing and AI
Enterprise data architectures usually contain many systems—data lakes, message queues, and data warehouses—that data must pass through before it can be analyzed. Each transfer step between systems adds a delay and a potential source of errors, reducing the quality and freshness of data downstream. What if we could eliminate most of these steps? In recent years, cloud storage and new open source systems have enabled a radically new architecture: the lakehouse, an ACID transactional layer over cloud storage that can provide streaming, data versioning, and high-performance SQL access similar to a data warehouse, as well as direct access from non-SQL workloads such as AI applications. Thousands of organizations including the largest web companies are now using lakehouse technology to replace separate data lake, warehouse, and streaming systems and deliver high-quality data faster internally. I’ll discuss the key trends and recent advances in this area based on experience with Delta Lake, the most widely used open source lakehouse platform, which was developed at Databricks.
Matei Zaharia
Chief Technologist, Databricks | Assistant Professor, Stanford University
Matei Zaharia is an Assistant Professor of Computer Science at Stanford University and Chief Technologist at Databricks. He started the Apache Spark open source project in 2009 as well as the MLflow open source machine learning platform, and helped design other widely used open source data and ML infrastructure including Spark SQL and Delta Lake. Matei’s academic research work was recognized through the 2014 ACM Doctoral Dissertation Award for the best PhD dissertation in computer science, an NSF CAREER Award, and the US Presidential Early Career Award for Scientists and Engineers (PECASE).
Inspirations of the Future, Data, and AI: Bank of America's Automation Strategy
“Software ate the world” – we are all now massive consumers of software – from Snap to Uber to TikTok. Bank of America has transformed rapidly – 8 (of 10) Billion transactions are digital. The world is changing right in front of us. A bit about the past, some about today and more about how to take advantage of the future. Get inspired today to take advantage of the data & AI fueled future.
Salahuddin Khawaja
Managing Director of Automation / Global Risk, Bank of America
Salah Khawaja is the Managing Director of Automation / Global Risk at Bank of America. He’s responsible for overseeing the transition from a highly manual testing program to a fully automated process, resulting in reduced operational and compliance risk for the Bank. Prior to joining Bank of America, he spent 15 years at Deloitte and JP Morgan running large scale technology transformation projects across various disciplines. Salah has a Bachelors in Computer Science from the Lahore University of Management Sciences (LUMS) and Masters in Telecommunication and Network Management from Syracuse University. He is certified as an Agile Scrum Master and Project Management Professional.
How the Department of the Air Force is Transitioning to Self-Service Analytics in the Cloud
The Department of the Air Force is rapidly modernizing and working diligently to leverage advanced analytics across the enterprise to ensure mission success. This has driven the Chief Data Officer to launch the VAULT program: A cloud-native, enterprise, shared-service that will empower users more effectively operationalize data for a myriad of analytics needs. The design of the platform focuses on self-service, leveraging cloud-native, best-of-breed technology components tightly integrated to provide robust functionality and automation, expediting time-to-insight for users around the world. Eileen Vidrine will share her experience transitioning to a modern cloud data platform. What did their process and timeline look like? What surprises did they encounter along the way? How did they get organizational buy-in? We’ll chat through the details of how one of the world’s largest organizations tackled this move to the cloud that every enterprise is undertaking.
Eileen M. Vidrine.
Air Force Chief Data Officer, U.S. Air Force
Eileen M. Vidrine is the Air Force Chief Data Officer, Headquarters, United States Air Force, Washington, D.C. She develops and implements strategies for enterprise data management, analytics, and digital transformation to optimize performance and drive innovation in and across all missions and operations. Ms. Vidrine began her government career in 1986 as an enlisted member of the U.S. Army and was commissioned in 1987 through the U.S. Army Officer Candidate School Program as a Transportation Officer. Later in her Army career, she was selected and integrated into the U.S. Army Acquisition Corps. She began her civilian career as a senior faculty member at the Joint Military Intelligence College and led the college’s technology transformation as the first Director for the Center for Educational Technologies. From 2006 to 2012, Ms. Vidrine served in various positions of leadership at the Office of the Director of National Intelligence (DNI) culminating as the Chief of Staff for the Assistant DNI for Human Capital. In 2012, she was selected to serve as the DoD Intelligence Community Enterprise Architect assigned to the Office of the Under Secretary of Defense for Intelligence, Human Capital Management Office (HCMO), and in 2014 assumed HCMO chief of staff responsibilities. In 2016, Ms. Vidrine was selected as a White House Leadership Fellow, supporting the Office of Management and Budget and the Office of Personnel Management. After her fellowship she was detailed to the Executive Office of the President Office of Administration. Ms. Vidrine was appointed to the Senior Executive Service in June 2018.
Collaborative Engineering: It Takes a Village to Raise a Dataset
Designing certain types of Data Wrangling Flows require experience and knowledge. By implementing a collaborative infrastructure that allows for developers to share existing flows and managed assets, the develop cycle time needed for projects is drastically decreased in addition to an increase in new data products. This presentation will identify key elements for establishing a collaborative infrastructure, such as having a defined data hierarchy and inclusion of supporting data visualization products.
This presentation will identify key elements for establishing a collaborative infrastructure, such as having a defined data hierarchy and inclusion of supporting data visualization products.
Randy Santiano
Associate Technical Consultant, Eli Lilly
Randy Santiano has 20+ years of working within the Life Science and Pharmaceutical Industry. His experiences range from laboratory sciences as a Chemist to developing software applications as a Technical Specialist. He enjoys exploring new and emerging technologies and prefers to collaborate with diverse groups to solve problems. Randy currently works at Eli Lilly as an Associate Consultant within the Clinical Design, Delivery, and Analytics (CDDA) organization.
Why Data is the Ultimate Weapon in Fighting Infectious Diseases
Due to the COVID-19 global pandemic, the Wrangle Summit and many other events, are being held virtually. This virus has turned our world upside down and has impacted every single person in some form. There are major challenges with protecting public health and safety during a virus outbreak, but there are also significant issues related to data collection and analysis that makes things even more challenging.
During this session you will hear from three experts who are weaponizing data to fight infectious diseases. Genomics England, Infectious Diseases Data Observatory at the University of Oxford and the CDC are leveraging complex and massive amounts of data to help experts not only fight COVID-19 but other infectious diseases.
Ells Campbell
Computational Biologist, Centers for Disease Control and Prevention
Ells Campbell is a computational biologist at CDC. Residing in the Laboratory Branch of the Division of HIV/AIDS Prevention, he leads development of an outbreak response tool called MicrobeTrace. To support COVID-19 response, Ells assists epidemiologists around the country in cleaning, visualizing, and exploring contact network data collected during contact tracing interviews.
Kalynn Kennon
Infectious Diseases Data Observatory, University of Oxford
Kalynn Kennon joined IDDO in August 2016. As Head of Data Engineering, she is responsible for the development of data standards, the robustness of the curation process, assuring the quality of the data, and the efficiency of the data curation team.
Andrew Coe
Data Wrangler, Genomics England
Andrew is a data solutions professional with over 10 years’ experience with data engineering, business intelligence, solution architecture and management. He has worked extensively in the healthcare sector and is currently focused on data pipeline engineering for the Covid-19 Genomic Research Environment.
The Emergence of the Modern Data Stack
Over the past two years, cloud data warehouses have taken the analytics industry by storm. The rise of these architectures has also led to the emergence of a variety of new technologies purpose-built for these environments focused on data integration, transformation, quality monitoring and analytics. Businesses are not only leveraging cloud data warehouses as production reporting environments but also as exploratory sandboxes for a variety of diverse data landing in the cloud. Data that needs to be explored, structured, cleaned and monitored before analytics use.
Join executive leaders from Monte Carlo, Fishtown Analytics (makers of dbt) and Sigma Computing for a panel discussion focused on what has led to the formulation of the new Modern Data Stack and the different components that comprise it.
Tristan Handy
Founder and CEO, Fishtown Analytics
Tristan Handy is the CEO and Founder of Fishtown Analytics, a Philadelphia startup pioneering the practice of modern analytics engineering. Fishtown’s product, dbt, is used by over 3,000 companies to organize, catalog, and distill knowledge from the data in their data warehouses, including companies like JetBlue, HubSpot, GitLab, and the ACLU.
Tristan has been working in data for two decades in both in-house and consulting roles with both large enterprises and small startups.
Barr Moses
Co-Founder & CEO, Monte Carlo
Barr Moses is CEO & Co-Founder of Monte Carlo, a data reliability company backed by Accel, GGV, Redpoint, and other top Silicon Valley investors. Previously, she was VP Customer Operations at customer success company Gainsight, where she helped scale the company 10x in revenue and among other functions, built the data/analytics team. Prior to that, she was a management consultant at Bain & Company and a research assistant at the Statistics Department at Stanford. She also served in the Israeli Air Force as a commander of an intelligence data analyst unit. Barr graduated from Stanford with a B.Sc. in Mathematical and Computational Science.
Rob Woollen
CTO & Co-founder, Sigma Computing
Rob Woollen is the Chief Technology Officer and Co-founder at Sigma Computing where he empowers business leaders and domain experts to ask any question of their data without writing a single line of SQL. He has more than two decades of experience building distributed and cloud systems. Prior to founding Sigma, Woollen was the Entrepreneur-in-Residence at Sutter Hill Ventures and he worked at Salesforce.com for more than six, serving in several roles including, Chief Technology Officer for the platform and Work.com. Previously, he was at BEA Systems and HP, and is the inventor on numerous patents. Woollen earned a Bachelor of Science degree in Computer Science from Princeton University.
Adam Wilson
CEO, Trifacta (Moderator)
Adam is CEO of Trifacta and has over 25 years of experience in leadership roles focused on data integration and analytics. Under his leadership, Trifacta has become the global leader in data wrangling, serving thousands of companies worldwide. Prior to Trifacta, Adam was GM of Informatica’s ILM division. He also served as SVP of Product Management for Informatica’s flagship data integration products. Adam co-founded Zimba, an analytics company. He holds an MBA from the Kellogg School of Management and an engineering degree from Northwestern University.
Why Data Cloud? Why Now?
The Data Cloud has emerged as a transformative way for organizations to share and collaborate on data and increase the value it can deliver to the business. What is driving the adoption of the Data Cloud now? What sets it apart from an architecture, cost and performance perspective over other platforms? How will the Data Cloud platforms evolve in the future with changing workloads and applications?
To address these questions and more, this session will feature a lively discussion between two of the world’s most renowned experts in the space of data management – Snowflake’s co-founder and President of Products, Benoit Dageville and Trifacta co-founder and Chief Strategy Officer, Joe Hellerstein.
Benoit Dageville
Co-Founder, President of Products, Snowflake
Benoit co-founded Snowflake and currently serves as President of the Product division. Benoit is a leading expert in parallel execution and self-tuning database systems. Prior to founding Snowflake, Benoit was with Oracle for over 10 years as a lead architect for parallel execution in Oracle RAC and a key architect in the SQL Manageability group. Prior to Oracle, Benoit worked at Bull Information Systems. He helped define the architecture and lead database performance efforts for Bull’s parallel systems. Benoit has a PhD in Computer Science with a focus in Parallel Database Systems and is a named inventor on more than 80 patents.
Joe Hellerstein
Co-Founder & Chief Strategy Officer, Trifacta | Professor at University of California, Berkeley
Joe is Trifacta’s Chief Strategy Officer, Co-founder and Jim Gray Chair of Computer Science at UC Berkeley. His career in research and industry has focused on data-centric systems and the way they drive computing. Fortune Magazine included him in their list of 50 smartest people in technology , and MIT’s Technology Review magazine included his work on their TR10 list of the 10 technologies “most likely to change our world”.
Enriching Account Information with Trifacta and Dun & Bradstreet on the AWS Data Exchange
Enriching customer and prospect data is the key to driving business decisions, strategies and insights. That’s why we are excited to announce a collaboration between Triacta, Dun & Bradstreet and the AWS Data Exchange. In this session, we will present an example of creating a list from CRM data to send invitations to prospects for an Executive roundtable event. We will walk through the process to access a Dun & Bradstreet subscription on the AWS Data Exchange, moving the data to an Amazon S3 bucket and finally how to use Trifacta to enrich your CRM data to identify prospects utilizing the D-U-N-S Number.
ViJay Balasubramaniam
Director, Partner Solutions Architect, Trifacta
Vijay Balasubramaniam leverages his expertise in data management to help partners and customers be successful in large-scale analytics initiatives. He has over 18 years of experience helping large organizations manage their data sets and produce insights. He specializes in best-in-class data preparation workflows and developing end-to-end solutions on the AWS platform. Outside of work, he enjoys biking, tennis, music and spending time with family.
Ryan Rolf
Channel Program Leader, Sr. Business Development Manager, AWS Data Exchange
Ryan Rolf is the Channel Program Leader and Sr. Business Development Manager on AWS Data Exchange at Amazon Web Services (AWS). In this role, Ryan drives strategic collaborations and programs for AWS Data Exchange, which is a service that makes it easy to find, subscribe to, and use third-party data in the cloud. Prior to AWS, Ryan has held multiple other startup technology leadership roles.
Streamlining Pipelines for Marketing Analytics Panel Discussion
Pinpointing exactly where the biggest opportunities lie can save you time, resources and of course money. How do you pinpoint those opportunities as a Marketing professional? With data. Lots and lots of data. Marketing analytics can cover a wide range of responsibilities. Whether you are focused on digital marketing, event marketing or tracking campaigns for internal reporting, modern cloud data pipelines are key to helping to tell the story and identifying those opportunities.
In this panel discussion we’ll have three Marketing leaders who are focused on solving a diversity of problems within their organizations. Zack Pike, VP Data Strategy & Marketing Analytics at Callahan, and his team are building custom pipelines for their customers to create front-end data analysis to inform client strategy. Brad Parkel, Director of Marketing at Singlewire Software, is onboarding trade show data and tracking campaigns to drive sales to faster opportunities. James Wilcox, Manage Partner at PlusUp, leverages a cloud data warehouse to automate social media analytics for their clients.
Zack Pike
CIO, Callahan
Zack Pike is the CIO at Callahan and is an analytics mastermind, scouring mounds of information to uncover key insights that keep our clients ahead of the curve. His expertise in designing margin-generating strategic opportunities applies to companies in many different verticals. Previously, Zack was director at Alight Analytics, where he led a cross-functional consulting team providing marketing strategy and analytics to B2B and B2C executives. When he’s not digging into big data for clients, Zack moonlights as a youth b-ball and t-ball coach, analyzing plays for that winning edge.
James Wilcox
Co-Founder & Managing Partner, PlusUp
James started PlusUp in 2016 with a business partner after managing more than $50 million on paid social advertising for numerous Fortune 500 clients including Target, Allstate, AT&T and more. Over the past five years, he has led a team that has managed over $250MM+ in media spend while also developing PlusUp analytics practice to help automated all reporting and analysis for their clients.
Brad Parkel
Director of Marketing, Singlewire Software
Brad Parkel is Director of Marketing at Singlewire Software and has led marketing efforts with the company since its founding in 2009. Experienced and accomplished in all facets of B2B marketing; from video, graphics and multimedia production, to data analysis, wrangling, and email communications, Brad aligns business goals and objectives with tactical marketing campaigns and execution. Brad attended the University of Wisconsin-Platteville, and graduated from the University of Iowa with a BA in Communication Studies-Radio/TV/Film emphasis. Brad volunteers his time advising local public schools on business, marketing and IT curriculum.
Phil Lacorte
VP Revenue Operations, Trifacta (Moderator)
Phil Lacorteheads revenue operations at Trifacta where using data is a must to find our biggest opportunities to increase productivity. He is no stranger to marketing analytics using it as key leading indicators and creating pipeline throughout his career including Xactly, Informatica, and Host Analytics (now Planful).
Building a Modern Hub and Spoke Data Engineering Platform
As organizations move to the cloud, how can a small data engineering team best fulfill the demands of increasingly data-hungry business partners? In this session you will learn how Trifacta transitioned our analytics pipelines to a modern hub and spoke model. We explain how our data platform ensures data availability and integrity while simultaneously empowering downstream consumers to explore and prepare data to answer their specific questions. Learn best practices (and pitfalls to avoid!) and see how Trifacta’s adoption of the hub and spoke model has led to increased innovation.
Connor Carreras
Director SaaS Adoption & Enablement, Trifacta
Connor Carreras leads SaaS adoption and enablement efforts at Trifacta, where she leverages Trifacta’s centralized data warehouse for insights about customer behavior. She can write SQL and code (but finds both to be a pain), and as such, avoids calling herself a data engineer. Connor previously worked at Informatica and Microsoft.
César Jardim
Senior Product Manager, Trifacta
César Jardim leads strategic initiatives around Trifacta’s central data hub. Previously he led our operationalization and orchestration initiatives. Before joining Trifacta he was a product manager in the content management space.
Supply Chains to Data Pipelines: Modern Retail Analytics
Amway and Spar are two of the most recognizable retail brands in the world. Spar has over 13,000 stores across 48 countries. Similarly, Amway boasts more than 3 million independent consultants who sell its catalog of more than 450 personal care, household, nutrition, and cleaning products in more than 100 countries. When dealing with data on this scale, there are endless challenges with product hierarchies, language and currency conversions and merging with third party data such as Nielsen. Both companies are innovating their data pipelines with modern cloud technologies to automate their analytics and drive faster time to value.
In this session, Bekkie Brown, Supervisor – Data Engineering & Analytics Technology at Amway and Dharshini Manoharan Bhuvaneswari, Business Data Analyst at SPAR, will discuss how they are leading the efforts in their respective companies to drive better business decisions with less data challenges.
Bekkie Brown
Supervisor of Data Engineering & Analytics Technology, Amway
Bekkie Brown, CSPO Supervisor – Data Engineering & Analytics Technology Bekkie Brown is currently the Supervisor of Data Engineering & Analytics Technology at Amway. She has been with Amway for 14 years and held a number of roles both within IT and the Business. Bekkie has led the team through the implementation of Google Cloud while promoting self-service for their Analytics Community. This is the first self-service initiative for transforming raw data into curated business friendly tables that has been executed outside of IT. Aside from leading the Data Engineers, she also leads a team of Operations Analysts who are supporting the platform, production pipelines and Analysts.
Dharshini Manoharan Bhuvaneswari
Business Data Analyst, SPAR International
Dharshini currently works with SPAR International, Netherlands as a Business Data Analyst and is collaborating with Partners across the globe. She works in end to end data project starting from data gathering, wrangling to data analysis. She has a Masters’ degree in Data Science from Monash University, Melbourne, Australia. She started her career at Capgemini, India as an Analyst where she worked with world’s leading retail clients to bridge the gap between strategy, business and numerical facts. After moving to Australia, she joined Honeywell as a Data Scientist( Intern). At Honeywell, she was involved in a groundbreaking Machine learning project (Cognitive Building Project) followed by which she continued to work in ML Projects at Monash University (Buildings and Property division).
The Cloud Has No Walls
The traditional method in which data moved from a raw state into something of organizational value typically involved lobbing data and requests over a figurative “wall” that separated business and IT. This process would lead to lengthy back-and-forth processes between different constituencies, misconstrued requests and an overall feeling of frustration for everyone. With the advent of the cloud, organizations have a chance to remove these walls and create a single environment where stakeholders with differing technical and data skillsets can collaborate and get their work done. Whether you like to code or use drag-and-drop interfaces, the cloud has brought us the opportunity to bring these different sides together, empowering everyone to collaborate and get their work done faster.
Join Trifacta’s co-founders Joe Hellerstein and Jeffrey Heer as they discuss how the world of data cleaning, preparation and pipelines has evolved over the past few years with the advent of the cloud and the new opportunities it brings us to improve how teams work with data.
Joe Hellerstein
Co-Founder & Chief Strategy Officer, Trifacta | Professor at University of California, Berkeley
Joe is Trifacta’s Chief Strategy Officer, Co-founder and Jim Gray Chair of Computer Science at UC Berkeley. His career in research and industry has focused on data-centric systems and the way they drive computing. Fortune Magazine included him in their list of 50 smartest people in technology , and MIT’s Technology Review magazine included his work on their TR10 list of the 10 technologies “most likely to change our world”.
Jeff Heer
Co-Founder & CXO, Trifacta | Computer Science Professor at University of Washington
Jeff is Trifacta’s Chief Experience Officer, Co-founder and a Professor of Computer Science at the University of Washington, where he directs the Interactive Data Lab. Jeff’s passion is the design of novel user interfaces for exploring, managing and communicating data. The data visualization tools developed by his lab (D3.js, Protovis, Prefuse) are used by thousands of data enthusiasts around the world. In 2009, Jeff was named to MIT Technology Review’s list of “Top Innovators under 35”.
Trifacta Community Welcome
Session details coming soon.
Paul Staelin
Chief Customer Officer, Trifacta
Paul is Trifacta’s Chief Customer Officer and is responsible for ensuring that Trifacta’s customers drive value with its solutions. Prior to joining Trifacta, Paul was co-founder and Chief Customer Officer at Birst, the leading SaaS BI Platform. He also developed, launched and built Siebel’s Sales Analytics product family, the industry’s leading Sales Analytics solution at the time. Paul holds an MBA from Stanford, a master’s from MIT and a bachelor’s from Yale.
Building Self Service Culture
An overview of our digital transformation experience – using Trifacta as our foundational self-service data engineering tool. Use case supported.
Kevin Schaefer
Senior Data Engineer – Global Data and Analytics, Amway Corporation
Kevin is part of a core team of data engineers/scientists/analysts who have been tasked to design and implement digital transformation at Amway. Supported by nearly 20 years of experience in data analysis, visualization, and architecture, his current engineering role involves building capability infrastructure and data solutions on Google Cloud Platform.