Data Provenance

Author
Ben Jaffe and Katie Malone
Published
Mon 04 Sep 2017
Episode Link
https://soundcloud.com/linear-digressions/data-provenance

Software engineers are used to versioning code so they can go back later and restore a past state of the system. For data scientists who want to reconstruct past models, though, keeping the modeling code is not enough: they also need a saved version of the data that trained the model. Keeping track of datasets has plenty of other benefits too, so in this episode we'll talk about data lineage or data provenance.
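As a rough illustration of tying a model to the data that made it, here is a minimal sketch that fingerprints a dataset and stores the fingerprint next to a code version. It is not taken from the episode: the function names, the record fields, the `clicks.csv` label, and the `abc1234` commit string are all hypothetical, and a real setup would likely use a dedicated data versioning tool.

```python
import hashlib
import json
from datetime import datetime, timezone


def fingerprint(data: bytes) -> str:
    """Return a SHA-256 hex digest identifying this exact dataset content."""
    return hashlib.sha256(data).hexdigest()


def provenance_record(data: bytes, source: str, code_version: str) -> dict:
    """Build a small record tying a model run to the data and code behind it."""
    return {
        "data_sha256": fingerprint(data),
        "data_source": source,         # hypothetical label, e.g. a file name
        "code_version": code_version,  # hypothetical, e.g. a git commit hash
        "recorded_at": datetime.now(timezone.utc).isoformat(),
    }


if __name__ == "__main__":
    # Toy in-memory dataset standing in for a real training file.
    training_data = b"user_id,clicks\n1,10\n2,7\n"
    record = provenance_record(training_data, "clicks.csv", "abc1234")
    print(json.dumps(record, indent=2))
```

If the stored digest no longer matches the digest of the file on disk, the data has changed since the model was trained, which is exactly the situation that makes a past model hard to reconstruct.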
