Skip to content
View nolaurence's full-sized avatar
  • Alibaba
  • Hangzhou

Block or report nolaurence

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
3 stars written in C++
Clear filter

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。

C++ 3,898 399 Updated Sep 10, 2025

侯捷C++课程PPT及代码,动手学起来

C++ 1,464 493 Updated Dec 12, 2019

To keep the code that about the algorithms

C++ 48 24 Updated Feb 5, 2021