1. EachPod

Optimized Web Crawling

Author
[email protected] (Ben Jaffe and Katie Malone)
Published
Sun 28 Oct 2018
Episode Link
https://soundcloud.com/linear-digressions/optimized-web-crawling

Got a fun optimization problem for you this week! It’s a two-for-one: how do you optimize the web crawling logic of an operation like Google search so that the results are, on average, as up-to-date as possible, and how do you optimize your solution of choice so that it’s maintainable by software engineers in a huge distributed system? We’re following an excellent post from the Unofficial Google Data Science blog going through this problem.

Relevant links: http://www.unofficialgoogledatascience.com/2018/07/by-bill-richoux-critical-decisions-are.html

Share to: