olivia-fsm / sophia
This project is forked from liuhong99/sophia
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
License: MIT License