Around 2'000 SIMD kernels for mixed-precision BLAS-like numerics โ dot products, batched GEMMs, distances, geospatial, ColBERT MaxSim, and mesh alignment โ from Float6 to Float118, leveraging RISC-V, Intel AMX, Arm SME, and WebAssembly Relaxed SIMD, in 7 languages and 5 MB.
No comments yet. Log in to reply on the Fediverse. Comments will appear here.