Lists (4)
Sort Name ascending (A-Z)
Stars
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Efficient triton implementation of Native Sparse Attention.
A Python tool to visualize + enforce dependencies, using modular architecture 🌎 Open source 🐍 Installable via pip 🔧 Able to be adopted incrementally - ⚡ Implemented with no runtime impact ♾️ Intero…
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.
Krita is a free and open source cross-platform application that offers an end-to-end solution for creating digital art files from scratch built on the KDE and Qt frameworks.
A Primer on Memory Consistency and Cache Coherence (Second Edition) 翻译计划
A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
Prune is a constraint logic programming language with branching heuristic.
This repository is responsible for the LLVM-related parts of Jeandle.
Jeandle is a Just-in-Time compiler for Java. It is built on OpenJDK and leverages the LLVM compiler infrastructure to generate machine code, aiming to provide powerful compilation optimizations and…
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
Exploring the scalable matrix extension of the Apple M4 processor
Community maintained hardware plugin for vLLM on Ascend