1. EachPod

The Data Exchange with Ben Lorica - Podcast

The Data Exchange with Ben Lorica

A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].

Technology Business News Ai Tech News News
Update frequency
every 7 days
Average duration
41 minutes
Episodes
302
Years Active
2019 - 2025
Share to:
2023 Opportunities and Trends: Data, Machine Learning, and AI

2023 Opportunities and Trends: Data, Machine Learning, and AI

Jenn Webb, special correspondent and managing editor at Gradient Flow, recently organized a mini-panel to discuss themes and trends for 2023. The panel consisted of myself and Mikio Braun. More infor…

01:05:31  |   Thu 12 Jan 2023
Exploring DALL·E 2

Exploring DALL·E 2

Given the growing interest in Generative AI, we revisit a conversation with Mark Chen, Research Scientist at OpenAI and part of the team behind DALL·E 2, a new AI system that can create realistic ima…

00:37:40  |   Thu 05 Jan 2023
Data Science at Shopify and Stitch Fix

Data Science at Shopify and Stitch Fix

On this special end of the year episode, we revisit conversations with two data science leaders in the e-commerce space:

  1. Wendy Foster, Director, Engineering & Data Science at Shopify.
  2. Olivia Liao, Seni…
00:37:25  |   Thu 29 Dec 2022
Building a data management system for unstructured data

Building a data management system for unstructured data

Shayan Mohanty is the CEO of Watchful, a modern and interactive solution that places the control of data labeling back in the hands of data scientists, machine learning practitioners, and subject mat…

00:36:32  |   Thu 22 Dec 2022
A Cloud Native Vector Database Management System

A Cloud Native Vector Database Management System

Frank Liu is Director of Operations & ML Architect at Zilliz, the company behind Milvus,  an open source vector database. We discuss their recent VLDB paper (“A Cloud Native Vector Database Managemen…

00:48:50  |   Thu 15 Dec 2022
What’s Next for Machine Learning in Time Series

What’s Next for Machine Learning in Time Series

Ira Cohen is co-founder, Chief Data Scientist at Anodot, a startup that uses time series tools to monitor  business data in real time, so organizations can proactively resolve revenue, cost, and cust…

00:38:08  |   Thu 08 Dec 2022
Efficient Methods for Natural Language Processing

Efficient Methods for Natural Language Processing

Roy Schwartz is Professor of Natural Language Processing at The Hebrew University of Jerusalem. We discussed a recent survey paper that Roy co-wrote that presented a broad overview of existing method…

00:45:40  |   Thu 01 Dec 2022
Responsible and Trustworthy AI

Responsible and Trustworthy AI

On this Thanksgiving holiday weekend in the U.S., we revisit a Twitter Spaces conversation I had with

00:30:01  |   Wed 23 Nov 2022
Building a premier industrial AI research and product group

Building a premier industrial AI research and product group

Hung Bui is the CEO of VinAI, a premier Artificial Intelligence research-based company developing world-class products and services. Hung assembled the VinAI team just over three years ago and they a…

00:37:50  |   Thu 17 Nov 2022
An open source, production grade vector search engine

An open source, production grade vector search engine

Bob van Luijt, is CEO of SeMI Technologies, the company behind the popular vector search engine Weaviate.   Bob describes their key features and core components, popular use cases, and he also provid…

00:35:14  |   Thu 10 Nov 2022
A comprehensive suite of open source tools for time series modeling

A comprehensive suite of open source tools for time series modeling

Federico Garza and Max Mergenthaler Canseco are both CTOs and co-founders of Nixtla, a startup building developer-friendly software that helps data scientists deploy predictive pipelines.

Subscribe to…

00:35:11  |   Thu 03 Nov 2022
Building Safe and Reliable AI applications

Building Safe and Reliable AI applications

Christopher Nguyen is CEO and cofounder of Aitomatic, a startup that uses a knowledge-first approach to build and deploy machine learning solutions, with a focus on industrial applications (manufactu…

00:30:39  |   Thu 27 Oct 2022
A new storage engine for vectors

A new storage engine for vectors

Ram Sriharsha is VP of Engineering and R&D at Pinecone, a startup that offers a fully managed vector database (not just an index). We discuss Pinecone’s new proprietary storage engine, which was firs…

00:41:58  |   Thu 20 Oct 2022
Project Lightspeed: Next-generation Spark Streaming

Project Lightspeed: Next-generation Spark Streaming

Karthik Ramasamy, is the Head of Streaming at Databricks. He has extensive experience in streaming, having led teams at Twitter (Apache Heron), Splunk, and Streamlio (Apache Pulsar).

Subscribe to the …

00:41:43  |   Thu 13 Oct 2022
The Unreasonable Effectiveness of Speech Data

The Unreasonable Effectiveness of Speech Data

Piotr Żelasko is Head of Research at Meaning, a startup building an AI platform using speech technologies. He has years of experience in speech technologies, both as a researcher and as a software en…

00:35:00  |   Thu 06 Oct 2022
Machine Learning Integrity

Machine Learning Integrity

Yaron Singer is the CEO of Robust Intelligence, a company building tools to help manage and mitigate risks associated with machine learning models and applications.

Download a FREE copy of our recent…

00:44:33  |   Thu 29 Sep 2022
Synthetic data technologies can enable more capable and ethical AI

Synthetic data technologies can enable more capable and ethical AI

Yashar Behzadi is the CEO & Founder of Synthesis AI, a startup that uses synthetic data technologies to enable teams building AI applications, as well as gaming and metaverse applications.

Download a …

00:39:04  |   Thu 22 Sep 2022
Confidential Computing for Machine Learning

Confidential Computing for Machine Learning

Sadegh Riazi is CEO and co-founder of CipherMode Labs, a startup building tools that enable data and machine learning teams to build and deploy models directly on encrypted data. CipherMode’s new ope…

00:36:20  |   Thu 15 Sep 2022
Applied NLP Research at Primer

Applied NLP Research at Primer

John Bohannon is a Senior Director of Data Science and Head of Research at Primer AI, an end-to-end machine intelligence solution for textual data. We discussed their process of translating ML resear…

00:41:50  |   Thu 08 Sep 2022
Using SQL to Retrieve Data from APIs and Web Services

Using SQL to Retrieve Data from APIs and Web Services

Jon Udell is community lead for Steampipe, an open-source tool that populates a database table with data retrieved from APIs. They use Postgres, which means that data is easy to explore and retrieve …

00:31:09  |   Thu 01 Sep 2022
Disclaimer: The podcast and artwork embedded on this page are the property of Ben Lorica. This content is not affiliated with or endorsed by eachpod.com.