Efficient Training Methods

Phil discusses the potential of training fully pruned language models efficiently, unlocking access to larger models in a fraction of the time. Lukas raises questions about the challenges of training sparse models and the need to discover the right sparse patterns for optimal performance.