Commits
Fast performing edges for FP32 GEMM of RVV. (by ChipKerchner, 20 days ago)
Add bool types for C. (by ChipKerchner, 20 days ago)
Add K-unrolling to M = 8. Other small changes. (by ChipKerchner, 19 days ago)
Unroll K for N less than or equal to 4. (by ChipKerchner, 19 days ago)
Common unroll code. (by ChipKerchner, 18 days ago)
Preserve K. (by ChipKerchner, 18 days ago)
Better K. (by ChipKerchner, 16 days ago)
Global optimizations. (by ChipKerchner, 16 days ago)
Use mf2 instead of m1. (by ChipKerchner, 15 days ago)
Simplier loops. (by ChipKerchner, 15 days ago)
More global optimzation and clean up. (by ChipKerchner, 14 days ago)
Merge remote-tracking branch 'origin/develop' into fasterRVVEdges (by ChipKerchner, 13 days ago)
Avoid greater than 4 segment load and store penalties by using 2. Fix mf2 length. (by ChipKerchner, 13 days ago)
Only initialize unused variables to prevent GCC warnings. (by ChipKerchner, 12 days ago)
Fix typo. (by ChipKerchner, 10 days ago)
Fix another typo. (by ChipKerchner, 8 days ago)
Convert 2X LMUL1 instructions to 1X LMUL2. Improved FP64 GEMM edges - up to more than 3X faster. (by ChipKerchner, 2 days ago)
Remove shadow variable. (by ChipKerchner, 1 day ago)