NEW Tool:

Use generative AI to learn more about data.world

Product Launch:

data.world has officially leveled up its integration with Snowflake’s new data quality capabilities

Upcoming Digital Event

Learn how WR Berkley & Singlestone Consulting supported this distributed model with modern data practices and a data catalog built on a knowledge graph.

View all webinars

Does your data have a 'born on' date?

Clock Icon 37 minutes
Sparkle

About this episode

Where does this data come from? Who created it? How has it been used? Like the origins of the universe, there can be quite a mystery surrounding the genesis of your company’s datasets. Understanding data provenance is the first step in answering those critical questions. 

Join Tim, Juan, and Professor Deborah McGuinness of Rensselaer Polytechnic Institute, renowned AI scientist and pioneer in provenance research to discuss data provenance and why it matters to you. 

This episode features
  • The origin story and evolution of data provenance 

  • Provenance standards every data person should know

  • Which fictional character has the best origin story

Key takeaways
  • Focus on use cases: provenance is no different from other data initiatives

  • Be thoughtful about what provenance information you keep, what impact it has, and who has ownership or responsibility for it

  • Combine domain knowledge with provenance, and go a step beyond lineage

Mentioned in this episode

Provenance: “Who, what, when, where why. Not just one of those. All of those.”

Special guests

Avatar of Deborah McGuinness
Deborah McGuinness Professor & Chair, Rensselaer Polytechnic Institute
chat with archie icon