Starred repositories
💫 Toolkit to help you get started with Spec-Driven Development
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
CLI tool for configuring and monitoring Claude Code
A benchmark for LLMs on complicated tasks in the terminal
Kortix – build, manage and train AI Agents. Fully Open Source.
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
The absolute trainer to light up AI agents.
A ChatGPT Canvas/Claude Artifacts-style interface for Claude Code running in E2B sandboxes. Build, create, and code anything with AI-powered development in secure, isolated environments.
Embeddable library or single binary for indexing and searching 1B vectors
A one-stop repository for generative AI research updates, interview resources, notebooks and much more!
MemU is an open-source memory framework for AI companions
A collection of Jupyter notebooks designed to provide a comprehensive guide to various AI tools and technologies
Lightweight and portable LLM sandbox runtime (code interpreter) Python library.
MemOS (Preview) | Intelligence Begins with Memory
MemoryOS is designed to provide a memory operating system for personalized AI agents.
Model Context Protocol Servers
Environments for LLM Reinforcement Learning
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
SkyRL: A Modular Full-stack RL Library for LLMs
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
slime is an LLM post-training framework for RL Scaling.
Democratizing Reinforcement Learning for LLMs
An open-source AI agent that brings the power of Gemini directly into your terminal.
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for re…
TradingAgents: Multi-Agents LLM Financial Trading Framework