Video Generation Improvement via Human Preference Alignment

Author
Neural Intelligence Network
Published
Wed 09 Apr 2025
Episode Link
https://podcasters.spotify.com/pod/show/neuralintelpod/episodes/Video-Generation-Improvement-via-Human-Preference-Alignment-e312vci

Despite recent progress, video generation still struggles with problems such as motion instability and weak prompt alignment. To address this, the study explores incorporating human preferences into advanced flow-based video generation models. The authors introduce a large new dataset of human-annotated video preferences covering visual quality, motion quality, and text alignment. They also develop a multi-dimensional reward model to quantify these preferences and propose three alignment algorithms for flow-based models. Of the three, a modified Direct Preference Optimization method proves most effective at aligning generated video with human expectations.
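For listeners unfamiliar with Direct Preference Optimization, the sketch below shows the standard DPO loss for a single preference pair. It is not the modified flow-based variant the study proposes, whose details are not given in this description. The function name, argument names, and the default `beta` value are illustrative assumptions.

```python
import math


def dpo_loss(policy_logp_chosen, policy_logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    Inputs are log-probabilities of the preferred and dispreferred samples
    under the model being trained (policy) and a frozen reference model.
    All names and the beta default are assumptions for illustration.
    """
    # Implicit reward margin: how much more the policy favors the chosen
    # sample over the rejected one, relative to the reference model.
    margin = ((policy_logp_chosen - ref_logp_chosen)
              - (policy_logp_rejected - ref_logp_rejected))
    # Loss is -log(sigmoid(beta * margin)), computed as a numerically
    # stable softplus of the negated, scaled margin.
    x = -beta * margin
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


# Policy identical to the reference: the margin is zero.
print(dpo_loss(-1.0, -1.0, -1.0, -1.0))
# Policy favors the human-preferred sample more than the reference does.
print(dpo_loss(-1.0, -3.0, -2.0, -2.0))
```

The loss falls as the policy raises the likelihood of the human-preferred sample relative to the reference model, which is the sense in which training on preference pairs pulls generation toward human judgments.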