Mechanical Dreams - Podcast

An automatically generated podcast about machine learning and natural language processing. The two fictional hosts talk about papers that I want to learn more about on my way to work.

It's not good, but it's useful.

Science Natural Sciences
Update frequency: every 2 days
Average duration: 10 minutes
Episodes: 80
Years active: 2024 - 2025
The Zamba2 Suite

00:13:03  |   Fri 29 Nov 2024
Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training

I slightly tweaked the personality of the hosts.

00:10:28  |   Mon 25 Nov 2024
Understanding WSD Learning Rates

00:09:10  |   Mon 18 Nov 2024
Toward Understanding Why Adam Converges Faster Than SGD for Transformers

New generation algorithm! Should make the episodes longer, more detailed, and more coherent.

00:07:29  |   Sat 16 Nov 2024
The Road Less Scheduled

00:08:53  |   Fri 01 Nov 2024
Learning-Rate-Free Learning by D-Adaptation

00:04:37  |   Thu 31 Oct 2024
Scaling FP8 Training to Trillion Token LLMs

00:09:54  |   Wed 30 Oct 2024
A Survey on Model MoErging

00:08:58  |   Mon 28 Oct 2024
Liquid Time-constant Networks

00:08:11  |   Sun 27 Oct 2024
A Spectral Condition for Feature Learning

00:16:52  |   Fri 25 Oct 2024
Don't decay the learning rate

A classic paper about learning rates.

00:07:07  |   Thu 24 Oct 2024
OLMoE

professor norris: Welcome back to Mechanical Dreams, the podcast where we delve into the exciting world of machine learning and natural language processing. I'm Professor Norris, and as always, I'm j…

00:06:59  |   Wed 23 Oct 2024
An Empirical Model of Large Batch Training

First attempt to automatically generate a podcast from a paper. This one is way too short, but it's a start.

00:11:32  |   Wed 23 Oct 2024
Disclaimer: The podcast and artwork embedded on this page are the property of Mechanical Dirk. This content is not affiliated with or endorsed by eachpod.com.