REX: Revisiting Budgeted Training with an Improved Schedule • Paper • 2107.04197 • Published Jul 9, 2021
Cautious Optimizers: Improving Training with One Line of Code • Paper • 2411.16085 • Published Nov 25, 2024
CAME: Confidence-guided Adaptive Memory Efficient Optimization • Paper • 2307.02047 • Published Jul 5, 2023
Arcee's MergeKit: A Toolkit for Merging Large Language Models • Paper • 2403.13257 • Published Mar 20, 2024
SDXL-Lightning: Progressive Adversarial Diffusion Distillation • Paper • 2402.13929 • Published Feb 21, 2024