On the Surprising Effectiveness of Large Learning Rates under Standard Width Scaling

Authors: Haas, Moritz; Bordt, Sebastian; von Luxburg, Ulrike; Vankadara, Leena Chennuru
Tübingen author(s):
Haas, Moritz
Bordt, Sebastian
von Luxburg, Ulrike
Vankadara, Leena C.
Issue date: 2025-05-28
Publisher: arXiv
Language: English
Full text: https://doi.org/10.48550/arXiv.2505.22491
DDC classification: 004 - Data processing and computer science
Document type: Preprint