Debugging Challenges
Building complex models can lead to unexpected debugging challenges, as Boris shares his experience with the difficulties of using certain encoding techniques. He highlights the struggle of understanding why a model's loss decreased rapidly, revealing the nuances of information leakage in transformer architectures. As models grow in size, the complexity of training increases significantly, turning what seemed like a straightforward task into a daunting endeavor.In this clip
From this podcast

Gradient Dissent - A Machine Learning Podcast
Boris Dayma — The Story Behind DALL·E mini, the Viral Phenomenon
Related Questions