🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 150,487 30,562 Updated Sep 30, 2025

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 46,673 9,033 Updated Sep 30, 2025

modelscope / facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,493 889 Updated Jun 6, 2025

apple / ml-mobileclip

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,241 100 Updated Sep 22, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 22,111 1,701 Updated Jun 3, 2025

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,101 2,102 Updated Dec 25, 2024

CircleRadon / TokenPacker

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025

Python 269 9 Updated May 26, 2025

allenai / mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 941 39 Updated Mar 19, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

34,468 1,873 Updated Aug 1, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 37,907 4,102 Updated Jul 6, 2025

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,160 1,292 Updated May 23, 2024

LLaVA-VL / LLaVA-NeXT

Python 4,277 406 Updated Sep 14, 2025

HyperGAI / HPT

HPT - Open Multimodal LLMs from HyperGAI

Python 315 22 Updated Jun 6, 2024

mbzuai-oryx / LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 841 61 Updated Aug 5, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 28,998 3,470 Updated Jan 26, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,126 386 Updated Sep 29, 2025

thunlp / LLaVA-UHD

LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer

Python 387 20 Updated Apr 20, 2025

pkunlp-icler / FastV

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 491 19 Updated Jan 4, 2025

openai / transformer-debugger

Python 4,095 244 Updated Jun 4, 2024

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 1,044 77 Updated Nov 18, 2024

QuivrHQ / quivr

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 38,479 3,675 Updated Jul 9, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 59,584 7,295 Updated Sep 30, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 44,773 7,619 Updated Dec 9, 2024

JimmyHHua / CppPrimer_Learning

C++ Primer 5th 学习过程记录（详细的笔记和课后练习解答）

C++ 3 Updated Sep 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jimmy Hua JimmyHHua

Achievements