The Computing Series

The Problem

Given a user and a context — what they are watching, what they have liked, what similar users have engaged with — return a ranked list of N items predicted to maximise engagement. The system must serve recommendations within 100ms for millions of concurrent users. The underlying models must be trained on petabytes of interaction data. Item embeddings must be updated as new items are added and old items become stale.

Read in the book →