When Deepmind's AlphaGo elevated Deep Reinforcement Learning (RL) to the pinnacle of AI research, I became obsessed with mastering it. I created a simple 2D game based in a randomised environment, and set a tight deadline to train a Deep RL agent to navigate it skillfully. Unfortunately, I soon hit a wall and the clock ran out. I saved it as a repository on Github and a few years later, Large Language Models (LLMs) became mainstream and helped me crack it. Here's what happened.