-
conv_op_optimization Public
Forked from Qwesh157/conv_op_optimizationThis project is about convolution operator optimization on GPU, include GEMM based (Implicit GEMM) convolution.
-
llama.cpp Public
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
-
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
-
MatmulTutorial Public
Forked from KnowingNothing/MatmulTutorialA Easy-to-understand TensorOp Matmul Tutorial
-
stable-diffusion.cpp Public
Forked from leejet/stable-diffusion.cppStable Diffusion in pure C/C++
-
cs_books Public
Forked from AzatAI/cs_booksComputer science books Recommended by AzatAI. (Education ONLY)
-
-
oneflow Public
Forked from Oneflow-Inc/oneflowOneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
-
how-to-optim-algorithm-in-cuda Public
Forked from BBuf/how-to-optim-algorithm-in-cudahow to optimize some algorithm in cuda.
Cuda UpdatedJul 16, 2025 -
openCNN Public
Forked from UDC-GAC/openCNNA Winograd Minimal Filter Implementation in CUDA
Cuda Apache License 2.0 UpdatedJul 15, 2025 -
ggml Public
Forked from ggml-org/ggmlTensor library for machine learning
-
leetcode-doocs Public
Forked from doocs/leetcode😏 LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer(第 2 版)》、《程序员面试金典(第 6 版)》题解
Java Creative Commons Attribution Share Alike 4.0 International UpdatedJun 22, 2025 -
pmpp4 Public
Forked from tugot17/pmppComplete solutions to the Programming Massively Parallel Processors Edition 4
Jupyter Notebook MIT License UpdatedJun 18, 2025 -
twitterxdownload Public
Forked from ezshine/twitterxdownloada powerful twitter video downloader and twitter marketing tool.
JavaScript GNU Affero General Public License v3.0 UpdatedJun 13, 2025 -
text-behind-image Public
Forked from RexanWONG/text-behind-imagehttps://textbehindimage.rexanwong.xyz - create text behind image designs easily
TypeScript GNU Affero General Public License v3.0 UpdatedJun 11, 2025 -
WebToEpub Public
Forked from dteviot/WebToEpubA simple Chrome (and Firefox) Extension that converts Web Novels (and other web pages) into an EPUB.
JavaScript Other UpdatedMay 5, 2025 -
JavaScript30 Public
Forked from wesbos/JavaScript3030 Day Vanilla JS Challenge
HTML UpdatedApr 25, 2025 -
-
-
DeepGEMM Public
Forked from deepseek-ai/DeepGEMMDeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda MIT License UpdatedMar 13, 2025 -
twitter-web-exporter Public
Forked from prinsss/twitter-web-exporterExport tweets, bookmarks, lists and much more from Twitter(X) web app. (推文/书签/收藏/列表导出工具)
TypeScript MIT License UpdatedMar 12, 2025 -
-
mastra Public
Forked from mastra-ai/mastrathe TypeScript AI agent framework
TypeScript Other UpdatedFeb 23, 2025 -
twillot Public
Forked from twillot-app/twillotTwitter bookmark manager extension
TypeScript UpdatedJan 21, 2025 -
sgemm.cu Public
Forked from salykova/sgemm.cuSGEMM that beats cuBLAS
Cuda MIT License UpdatedJan 15, 2025 -
cuda-tensorcore-hgemm Public
Forked from nicolaswilde/cuda-tensorcore-hgemmCuda UpdatedDec 26, 2024 -
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 13, 2024 -
ComfyUI-HunyuanVideoWrapper Public
Forked from kijai/ComfyUI-HunyuanVideoWrapperPython UpdatedDec 7, 2024 -
sdxpy Public
Forked from gvwilson/sdxpySoftware Design by Example: a tool-based introduction with Python
Python Other UpdatedDec 3, 2024 -
FasterTransformer Public
Forked from NVIDIA/FasterTransformerTransformer related optimization, including BERT, GPT
C++ Apache License 2.0 UpdatedDec 2, 2024