Skip to content
View binzh93's full-sized avatar

Block or report binzh93

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The #1 open-source SWE-bench Verified implementation

Python 828 154 Updated Jun 9, 2025

DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coo…

JavaScript 2,709 370 Updated Sep 29, 2025

SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis

Python 104 7 Updated Jun 3, 2025

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 13,778 952 Updated Jul 31, 2025

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,739 200 Updated Sep 29, 2025

A repo lists papers related to LLM based agent

Python 2,018 120 Updated Jul 12, 2025

An LLM-based Web Navigating Agent (KDD'24)

Python 889 77 Updated Sep 27, 2024

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 25,943 6,656 Updated Sep 29, 2025

Automate browser-based workflows with LLMs and Computer Vision

Python 14,493 1,234 Updated Sep 29, 2025

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 107,725 10,954 Updated Sep 29, 2025

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,503 113 Updated May 28, 2023

Deepfakes Software For All

Python 54,533 13,416 Updated Sep 18, 2025

Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)

Python 162 25 Updated May 27, 2024

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 27,471 2,485 Updated Jun 27, 2025

LLM training code for Databricks foundation models

Python 4,327 578 Updated Sep 29, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,359 195 Updated Sep 28, 2025

Best practice for training LLaMA models in Megatron-LM

Python 661 57 Updated Jan 2, 2024

收集和梳理垂直领域的开源模型、数据集及评测基准。

2,543 200 Updated Dec 26, 2023

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

21,305 2,030 Updated May 19, 2025

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,783 1,564 Updated Sep 8, 2025

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

Python 3,059 231 Updated Apr 14, 2024

人工精调的中文对话数据集和一段chatglm的微调代码

Jupyter Notebook 1,192 97 Updated May 3, 2025

llama inference for tencentpretrain

Python 99 11 Updated Jun 8, 2023

Reference models for Intel(R) Gaudi(R) AI Accelerator

Python 166 90 Updated Sep 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 59,540 7,291 Updated Sep 27, 2025

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 19,052 1,950 Updated Apr 4, 2024
Python 308 22 Updated Apr 6, 2023

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,559 586 Updated Oct 24, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,988 281 Updated Sep 26, 2025
Next