wjc404 / gemm_avx2 Goto Github PK
View Code? Open in Web Editor NEWFast avx2/fma3 dgemm and sgemm subroutines for medium to large matrices(>2000*2000) on haswell/skylake/zen processors, with performances comparable to MKL.
License: GNU General Public License v3.0