Stars
A reading list of up-to-date papers on NLP for Social Good.
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Efficient Deep Learning Systems course materials (HSE, YSDA)
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Autograd to GPT-2 completely from scratch
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
Verified algorithms in Lean, implemented and proved by AIs
A deep dive into embeddings starting from fundamentals
Formally Verified Arguments of Knowledge in Lean
This is the repository for MorphScore, a tokenizer evaluation framework for morphological alignment.
A tool to calculate dataset scaling needed to achieve cross-lingual training data equity.
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
Free MLOps course from DataTalks.Club
DSPy: The framework for programming—not prompting—language models
This repo contains the source code for the TheoremMarketplace contract, as well as a couple of Python scripts for easily interacting with it.
A Lean4 script for robustly verifying submitted proofs of theorems and implementations of functions
Minimal reproduction of DeepSeek R1-Zero
Minimal and annotated implementations of key ideas from modern deep learning research.
From the Tensor to Stable Diffusion, a rough outline for a 1 week course.