Stars
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a fast serving framework for large language models and vision language models.
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)