-
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedSep 23, 2025 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedSep 16, 2025 -
akshare Public
Forked from akfamily/akshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Python MIT License UpdatedAug 30, 2025 -
-
llm-action Public
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
-
nano-vllm Public
Forked from GeeeekExplorer/nano-vllmNano vLLM
Python MIT License UpdatedJul 24, 2025 -
llm-awq Public
Forked from mit-han-lab/llm-awq[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Python MIT License UpdatedJul 16, 2025 -
-
ai-system Public
LLM/MLOps/LLMOps
-
llm-compressor Public
Forked from vllm-project/llm-compressorTransformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Python Apache License 2.0 UpdatedMay 16, 2025 -
QQQ Public
Forked from HandH1998/QQQQQQ is an innovative and hardware-optimized W4A8 quantization solution for LLMs.
Python UpdatedApr 22, 2025 -
cleanrl Public
Forked from vwxyzjn/cleanrlHigh-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Python Other UpdatedApr 8, 2025 -
GPTQModel Public
Forked from ModelCloud/GPTQModelProduction ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.
-
LVEval Public
Forked from infinigence/LVEvalRepository of LV-Eval Benchmark
Python MIT License UpdatedJan 17, 2025 -
perf_analyzer Public
Forked from triton-inference-server/perf_analyzer -
lightllm Public
Forked from ModelTC/LightLLMLightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Python Apache License 2.0 UpdatedJan 16, 2025 -
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedJan 8, 2025 -
WarrenBuffettLetter Public
Forked from fenwii/WarrenBuffettLetter巴菲特致股东的信1957-2024,巴菲特先生的信一直是价值投资的经典范本,超越时间,超越周期,站在资本与金融的顶峰,他和他的战友芒格先生事价值投资的坚定实践者。Warren E. Buffett's Letter,To the Stockholders of Berkshire Hathaway Inc
MIT License UpdatedJan 6, 2025 -
tensorrtllm_backend Public
Forked from triton-inference-server/tensorrtllm_backendThe Triton TensorRT-LLM Backend
Python Apache License 2.0 UpdatedDec 11, 2024 -
qllm-eval Public
Forked from thu-nics/qllm-evalCode Repository of Evaluating Quantized Large Language Models
Python MIT License UpdatedSep 8, 2024 -
ceval Public
Forked from hkust-nlp/cevalOfficial github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
-
CMMLU Public
Forked from haonan-li/CMMLUCMMLU: Measuring massive multitask language understanding in Chinese
-
unify-easy-llm Public
unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。
-
smoothquant Public
Forked from mit-han-lab/smoothquant[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Python MIT License UpdatedJul 12, 2024 -
-
-
Awesome-LLMOps Public
Forked from tensorchord/Awesome-LLMOpsAn awesome & curated list of best LLMOps tools for developers
-
blog Public
Forked from huggingface/blogPublic repo for HF blog posts
-
ColossalAI Public
Forked from hpcaitech/ColossalAIMaking large AI models cheaper, faster and more accessible
-
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.