A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].
Aadyot Bhatnagar, is a Senior Research Engineer at Salesforce, and co-creator of Merlion an open source framework for applying machine learning on time series data. Merlion supports a wide range of …
Maarten Grootendorst, is a data scientist at IKNL, and more importantly, he’s the author of two open source libraries that I’ve come to love: BERTopic (topic modeling with transformers and c-TF-IDF) …
Hamza Tahir and Adam Probst are co-creators of ZenML, an extensible open source framework for building reproducible pipelines. We discuss the current state of ZenML, the many use cases that ZenML has…
Dr. Omri Allouche is Head of Research at Gong, a company that uses advances in NLP and speech models to identify and highlight risks and opportunities during customer interactions.
Download a FREE co…
Danny Bickson and Amir Alush are the creators of fastdup, a very impressive free tool for surfacing duplicates, anomalies, and leakage in visual data. In line with its name, it’s fast: fastdup is wri…
Mark Chen is a Research Scientist at OpenAI and part of the team behind DALL·E 2, a new AI system that can create realistic images and art based on natural language descriptions.
Download a FREE copy…
Jules Damji is lead developer advocate, and Richard Liaw is an engineering manager at Anyscale, the startup founded by the creators of Ray, the open source project that makes it simple to scale any c…
Rick Lamers is co-Founder and CEO at Orchest, the startup behind an open source project that enables data scientists to create, manage, and execute complex end-to-end data pipelines.
Download the FRE…
Devin Petersohn is CTO and co-founder of Ponder, and the creator of Modin, a fast, scalable, drop-in replacement for the popular Pandas library.
Download the FREE Report: State of Workflow Orchestra…
Nick Schrock is founder and Elementl, the startup behind Dagster, a popular open source, data orchestration platform. We discussed recent trends in data engineering and infrastructure, and Dagster’s …
Edmon Begoli, leads the AI Systems R&D section at Oak Ridge National Laboratory (ORNL), where he is also a distinguished member of the ORNL research staff. Our conversation centered on his upcoming …
Haytham Abuelfutuh is co-founder and CTO of Union, a startup founded by the team behind Flyte, a popular open source project originated by Lyft. Flyte is a workflow automation platform used for many …
This week’s guest is Hilary Mason, co-founder of Hidden Door, a startup that uses AI and machine learning to help create and power role-playing games (RPG).
Download a FREE copy of our recent NLP Ind…
Oren Razon is CEO and co-founder of Superwise, a startup that builds tools to streamline observability for machine learning models. This episode provides a comprehensive overview of tools and best pr…
Jeremiah Lowin is co-founder and CEO of Prefect, the company behind the popular open source data workflow orchestration system with the same name. We discussed the major design changes in Prefect 2.0…
Sebastian Raschka is lead author of a new book from Packt entitled “Machine Learning with PyTorch and Scikit-Learn”. He is also an Assistant Professor of Statistics at the University of Wisconsin (M…
This week’s guests are Ade Fajemisin (Postdoctoral Researcher) and Donato Maragno (PhD Student) of the University of Amsterdam. They were co-authors of a recent paper (“Optimization with Constraint L…
This week’s guests are Barret Zoph and Liam Fedus, research scientists at Google Brain. Our conversation centered around Large Language Models (LLM), specifically recent work by Barret, Liam, and the…
Olivia Liao is Senior Director of Data Science at Stitch Fix, a company that uses data science and expert stylists to deliver personalization at scale. We discuss how they blend data science and doma…
Jack Clark is co-director of the AI Index Steering Committee. In this episode we discuss key findings of the fifth edition of the AI Index. The report uses multiple metrics (benchmarks, publications,…