Stars
Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
SWE-bench: Can Language Models Resolve Real-world Github Issues?
This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"
A Datacenter Scale Distributed Inference Serving Framework
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
My learning notes/codes for ML SYS.
Sky-T1: Train your own O1 preview model within $450
Democratizing Reinforcement Learning for LLMs
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Maps that show time instead of space
Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/
Production-grade client-side tracing, profiling, and analysis for complex software systems.
Model Context Protocol Servers
Probabilistic time series modeling in Python
About Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008
Drag-and-drop preview for glTF 2.0 models in WebGL using three.js.
Examples Repository for CreativeEditor SDK
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep learning training and inference on a Flink cluster.
The original sources of MS-DOS 1.25, 2.0, and 4.0 for reference purposes
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Tips and tricks for working with Large Language Models like OpenAI's GPT-4.
The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents
TypeScript-first schema validation with static type inference