Stars
The #1 open-source SWE-bench Verified implementation
DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coo…
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
Qwen3-Coder is the code version of Qwen3, the large language model series developed by the Qwen team, Alibaba Cloud.
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
A repo listing papers related to LLM-based agents
FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
Automate browser-based workflows with LLMs and Computer Vision
Virtual whiteboard for sketching hand-drawn-style diagrams
Measuring Massive Multitask Language Understanding | ICLR 2021
Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
LLM training code for Databricks foundation models
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LM
Best practice for training LLaMA models in Megatron-LM
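A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, covering base models, domain-specific fine-tuning and applications, datasets, and tutorials.
Large Scale Chinese Corpus for NLP
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pretraining and instruction fine-tuning datasets
A manually refined Chinese dialogue dataset and a piece of ChatGLM fine-tuning code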
Reference models for Intel(R) Gaudi(R) AI Accelerator
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
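Use ChatGPT to summarize arXiv papers. Accelerates the full research workflow, using ChatGPT for full-paper summarization, professional translation, polishing, reviewing, and review responses.
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.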