-
nano-vllm Public
Forked from GeeeekExplorer/nano-vllm
Nano vLLM
Python MIT License Updated Aug 31, 2025 -
MoonTV Public
Forked from samqin123/MoonTV
An out-of-the-box, cross-platform film and TV aggregation player site
TypeScript MIT License Updated Jul 31, 2025 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python Apache License 2.0 Updated Jan 17, 2025 -
gemini-teacher Public
Forked from nishuzumi/gemini-teacher
English pronunciation correction teacher built with Gemini
Python Updated Dec 16, 2024 -
llm_interview_note Public
Forked from wdndev/llm_interview_note
Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
HTML Updated Oct 22, 2024 -
rime-ice Public
Forked from iDvel/rime-ice
Rime configuration: 雾凇拼音 (Rime Ice) | a long-term maintained Simplified Chinese dictionary
Lua GNU General Public License v3.0 Updated Aug 24, 2024 -
allRank Public
Forked from allegro/allRank
allRank is a framework for training learning-to-rank neural models based on PyTorch.
Python Apache License 2.0 Updated Aug 6, 2024 -
MedicalGPT Public
Forked from shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large models, implementing incremental pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
Python Apache License 2.0 Updated May 6, 2024 -
llama Public
Forked from meta-llama/llama
Inference code for Llama models
Python Other Updated Apr 30, 2024 -
nmt_data_tools Public
Forked from jiaohuix/nmt_data_tools
Machine translation data processing tools
Perl Updated Apr 29, 2024 -
tianshou Public
Forked from thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Python MIT License Updated Feb 7, 2024 -
cleanrl Public
Forked from vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Python Other Updated Jan 10, 2024 -
PokemonRedExperiments Public
Forked from PWhiddy/PokemonRedExperiments
Playing Pokemon Red with Reinforcement Learning
Jupyter Notebook MIT License Updated Dec 15, 2023 -
stable-diffusion Public
Forked from CompVis/stable-diffusion
A latent text-to-image diffusion model
Jupyter Notebook Other Updated Sep 5, 2023 -
baby-llama2-chinese Public
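Forked from DLLXW/baby-llama2-chinese
A repository for pre-training from scratch plus SFT of a small-parameter Chinese LLaMa2; a single 24G GPU is enough to produce a chat-llama2 with basic Chinese question-answering ability.
Python MIT License Updated Aug 31, 2023 -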
sentencepiece Public
Forked from google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
C++ Apache License 2.0 Updated Jul 1, 2023 -
how-to-optim-algorithm-in-cuda Public
Forked from BBuf/how-to-optim-algorithm-in-cuda
How to optimize some algorithms in CUDA.
Cuda Updated Jun 18, 2023 -
Deep-RL-Notes Public
Forked from harryzhangOG/Deep-RL-Notes
A collection of comprehensive notes on Deep Reinforcement Learning, customized for UC Berkeley's CS 285 (prev. CS 294-112)
TeX Updated Apr 2, 2023 -
Pytorch-Template-1 Public
Forked from YannLeo/Pytorch-Template
A template for PyTorch training and testing
Python Updated Mar 14, 2023 -
denoising-diffusion-pytorch Public
Forked from lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Python MIT License Updated Mar 13, 2023 -
annotated_deep_learning_paper_implementations Public
Forked from labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cycleg…
Jupyter Notebook MIT License Updated Feb 28, 2023 -
es_dfm Public
Forked from ThyrixYang/es_dfm
Code for our AAAI 2021 paper "Capturing Delayed Feedback in Conversion Rate Prediction via Elapsed-Time Sampling"
Python Other Updated Feb 21, 2023 -
the-art-of-command-line Public
Forked from jlevy/the-art-of-command-line
Master the command line, in one page
Updated Dec 30, 2022 -
blog-example Public
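Forked from my-dlq/blog-example
Example files from the blog, including knowledge examples for Kubernetes, Jenkins, Go, Java, SpringBoot, SpringCloud, and more; the overall body of knowledge will be explained step by step alongside the blog.
Java Updated Nov 6, 2022 -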
simple-simcse Public
Forked from hppRC/simple-simcse
A simple implementation of SimCSE
Python MIT License Updated Oct 31, 2022 -
pytorch-template Public template
Forked from SunQpark/pytorch-template
Simple project base template for PyTorch deep learning projects. Features clean implementation of DDP training and Hydra config.
Python MIT License Updated Oct 25, 2022 -
toyML Public
Forked from yizhiru/toyML
Toy Machine Learning Package
Python Apache License 2.0 Updated Oct 19, 2022 -
gradnorm_tf Public
Forked from vpetren/gradnorm_tf
TensorFlow implementation of GradNorm
Python Updated Sep 23, 2022 -
rnn-time-to-event Public
Forked from Manelmc/rnn-time-to-event
An approximation of Recurrent Neural Networks to predict the Time to an Event
Jupyter Notebook MIT License Updated Sep 23, 2022