Starred repositories
🚀 The fast, Pythonic way to build MCP servers and clients
Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
A model-driven approach to building AI agents in just a few lines of code.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
High-performance runtime for multi-agent systems. Build, run and manage secure multi-agent systems in your cloud.
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Open source AI terminal and SSH Client for EC2, Database and Kubernetes.
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
A lightweight, powerful framework for multi-agent workflows
This Guidance demonstrates how to streamline access to numerous large language models (LLMs) through a unified, industry-standard API gateway based on OpenAI API standards
AWS MCP Servers — helping you get the most out of AWS, wherever you use MCP.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
A bridge between Streamable HTTP and stdio MCP transports
No fortress, purely open ground. OpenManus is Coming.
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…
FlashMLA: Efficient Multi-head Latent Attention Kernels
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.
verl: Volcano Engine Reinforcement Learning for LLMs
Witness the aha moment of VLM with less than $3.