Decoupling Directions

Dora enhances Lora by decoupling the magnitude and directional components of weight matrices, allowing for greater flexibility during model training. This method not only improves performance with fewer parameters but also demonstrates that it can achieve better results at a lower rank, making it a valuable addition to hyperparameter tuning techniques. The findings suggest that even with reduced complexity, Dora outperforms traditional Lora approaches.