Skip to content
View idning's full-sized avatar

Organizations

@pytorch

Block or report idning

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

procedural reasoning datasets

Python 497 51 Updated Mar 14, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,962 495 Updated Mar 14, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,790 856 Updated Mar 14, 2025

Minimal hackable GRPO implementation

Python 175 24 Updated Jan 31, 2025

Implementation of papers in 100 lines of code.

Python 1,449 154 Updated Dec 2, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,630 550 Updated Mar 14, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,173 1,421 Updated Mar 10, 2025

Instead of running one environment at a time or one per thread, run everything in batch using numpy on a single core.

Jupyter Notebook 5 2 Updated Feb 19, 2018

Fully open reproduction of DeepSeek-R1

Python 22,791 2,050 Updated Mar 14, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,098 130 Updated Mar 14, 2025

A PyTorch native library for large model training

Python 3,451 312 Updated Mar 14, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,269 379 Updated Mar 14, 2025

FlagGems is an operator library for large language models implemented in Triton Language.

Python 452 73 Updated Mar 13, 2025

PyTorch native quantization and sparsity for training and inference

Python 1,901 229 Updated Mar 14, 2025

Development repository for the Triton language and compiler

MLIR 14,858 1,862 Updated Mar 14, 2025

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

Cuda 327 49 Updated Jan 2, 2025

TORCH_LOGS parser for PT2

Rust 33 14 Updated Mar 12, 2025

A very simple shared memory dict implementation

Python 165 24 Updated Aug 26, 2022

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,744 203 Updated Mar 8, 2024

Seamless operability between C++11 and Python

C++ 16,310 2,144 Updated Mar 11, 2025

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 9,016 1,101 Updated Oct 9, 2024

Denoising Diffusion Probabilistic Models

Python 4,206 401 Updated Aug 29, 2023

A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

Jupyter Notebook 713 61 Updated May 7, 2024

An open source implementation of CLIP.

Python 11,213 1,061 Updated Mar 1, 2025

PyTorch Implementation of OpenAI's Image GPT

Python 255 34 Updated Oct 3, 2023

Large Language Model-enhanced Recommender System Papers

649 52 Updated Feb 14, 2025

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,325 431 Updated Apr 24, 2023

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

Python 2,192 505 Updated Jan 25, 2019

Inference code for Llama models

Python 57,854 9,715 Updated Jan 26, 2025

FUSE implementation for overlayfs

C 556 86 Updated Dec 2, 2024
Next