Neural Magic
neuralmagic / nm-vllm (forked from vllm-project/vllm): A high-throughput and memory-efficient inference and serving engine for LLMs
Sparsity-aware deep learning inference runtime for CPUs
Refine high-quality datasets and visual AI models
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
ML model optimization product to accelerate inference
Top-level directory for documentation and general content
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes