Advanced Reinforcement Learning Interview Questions #1

This post was originally published on Substack.

You’re in a Machine Learning Engineer interview at Anthropic, and the interviewer drops this on you: “In Supervised Learning, we assume data is IID (Independent and Identically Distributed).

Advanced Reinforcement Learning Interview Questions #1 - The Stationarity Trap