michaelhla / dot.opt Goto Github PK
View Code? Open in Web Editor NEWExperimentation on distributed CPU training and performance comparisons against single V100. Results high coordination cost makes CPU training impractical regardless of number of nodes
License: MIT License