Skip to content
View JimmyHHua's full-sized avatar
🎯
Focusing
🎯
Focusing
  • 深圳, China

Block or report JimmyHHua

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 24,824 1,734 Updated Sep 28, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 49,577 5,164 Updated Sep 30, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 13,475 1,026 Updated Sep 28, 2025

Train transformer language models with reinforcement learning.

Python 15,708 2,214 Updated Sep 30, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,571 298 Updated Aug 6, 2025

[NeurIPS2024] Cross-video Identity Correlating for Person Re-identification Pre-training

Python 92 4 Updated Jun 20, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 150,487 30,562 Updated Sep 30, 2025

Ultralytics YOLO 🚀

Python 46,673 9,033 Updated Sep 30, 2025

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,493 889 Updated Jun 6, 2025

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,241 100 Updated Sep 22, 2025

Official inference framework for 1-bit LLMs

Python 22,111 1,701 Updated Jun 3, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,101 2,102 Updated Dec 25, 2024

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025

Python 269 9 Updated May 26, 2025

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 941 39 Updated Mar 19, 2025

LLM101n: Let's build a Storyteller

34,468 1,873 Updated Aug 1, 2024

A generative speech model for daily dialogue.

Python 37,907 4,102 Updated Jul 6, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,160 1,292 Updated May 23, 2024
Python 4,277 406 Updated Sep 14, 2025

HPT - Open Multimodal LLMs from HyperGAI

Python 315 22 Updated Jun 6, 2024

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 841 61 Updated Aug 5, 2025

The official Meta Llama 3 GitHub site

Python 28,998 3,470 Updated Jan 26, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,126 386 Updated Sep 29, 2025

LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer

Python 387 20 Updated Apr 20, 2025

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 491 19 Updated Jan 4, 2025

A family of lightweight multimodal models.

Python 1,044 77 Updated Nov 18, 2024

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 38,479 3,675 Updated Jul 9, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 59,584 7,295 Updated Sep 30, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 44,773 7,619 Updated Dec 9, 2024

C++ Primer 5th 学习过程记录(详细的笔记和课后练习解答)

C++ 3 Updated Sep 10, 2022
Next