Logic for Matrix Multiplication in Java

Researchers baked 3x inference speedups directly into LLM weights — without speculative decoding

Researchers from the University of Maryland, Lawrence Livermore, Columbia and TogetherAI have developed a training technique that triples LLM inference speed without auxiliary models or infrastructure ...

GitHub

tritonBLAS: A Lightweight Triton-based General Matrix Multiplication (GEMM) Library

This project is intended for research purposes only. Use it at your own risk and discretion. Triton is a language and compiler for writing highly efficient ML primitives, one of the most common ...

GitHub

tanvistack/matrix-calculator-without-numpy

The program uses basic Python programming concepts to perform matrix operations without any built-in libraries. Matrices are stored using nested lists where each inner list represents one row of the ...
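
As a minimal sketch of that row-per-inner-list layout, here it is in Java (the language named in the page title) with a textbook triple-loop product. The listed repository is written in Python, so this is not its code; the class name `MatrixMultiply` and the method `multiply` are hypothetical names chosen for illustration, and the sketch assumes rectangular matrices (all rows the same length).

```java
import java.util.Arrays;

class MatrixMultiply {
    // Multiplies an n x m matrix a by an m x p matrix b.
    // Each inner array holds one row, mirroring the nested-list layout.
    static int[][] multiply(int[][] a, int[][] b) {
        int n = a.length;
        int m = b.length;
        if (n == 0 || m == 0 || a[0].length != m) {
            throw new IllegalArgumentException("columns of a must equal rows of b");
        }
        int p = b[0].length;
        int[][] c = new int[n][p]; // Java zero-initializes int arrays
        for (int i = 0; i < n; i++) {          // row of a
            for (int j = 0; j < p; j++) {      // column of b
                for (int k = 0; k < m; k++) {  // shared dimension
                    c[i][j] += a[i][k] * b[k][j];
                }
            }
        }
        return c;
    }

    public static void main(String[] args) {
        int[][] a = {{1, 2}, {3, 4}};
        int[][] b = {{5, 6}, {7, 8}};
        // prints [[19, 22], [43, 50]]
        System.out.println(Arrays.deepToString(multiply(a, b)));
    }
}
```

The shape check runs first because the product is defined only when the number of columns in the left matrix equals the number of rows in the right one.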

IEEE

GEMMHeal: An Efficient Self-repairing Architecture for Matrix Multiplication

Abstract: In modern machine learning models like Transformers, matrix multiplication dominates most computation. Specific hardware often uses large-scale PE arrays, such as systolic arrays, to ...

IEEE

ComplexMM: Efficient and Generic Homomorphic Matrix Multiplication via Complexification and BSGS Alignment

Abstract: Fully homomorphic encryption (FHE) enables computation directly over encrypted data without decryption, offering a promising approach to privacy-preserving outsourcing computation in cloud ...
