Skip to content
View Orca-bit's full-sized avatar
  • Chengdu, China
  • 08:45 (UTC +08:00)

Block or report Orca-bit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

一些经典的CTR算法的复现; LR, FM, FFM, AFM, DeepFM, xDeepFM, PNN, DCN, DCNv2, DIFM, AutoInt, FiBiNet,AFN,ONN,DIN, DIEN ... (pytorch, tf2.0)

Jupyter Notebook 269 50 Updated May 9, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 59,186 10,429 Updated Sep 30, 2025

Learn GPU Programming in Mojo🔥 by Solving Puzzles

Mojo 144 171 Updated Sep 27, 2025

Package management made easy

Rust 5,333 353 Updated Sep 26, 2025

JAX-like Neural Network Training Library in Python with CPU/GPU Acceleration via Mojo and MAX

Python 286 9 Updated Sep 8, 2025

Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

Python 150 31 Updated Sep 28, 2025

Backward compatible ML compute opset inspired by HLO/MHLO

MLIR 542 155 Updated Sep 23, 2025

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 34,644 15,138 Updated Sep 30, 2025

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,826 743 Updated Sep 29, 2025

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,646 639 Updated Sep 29, 2025

MLX: An array framework for Apple silicon

C++ 22,346 1,341 Updated Sep 29, 2025

Fast ML inference & training for ONNX models in Rust

Rust 1,591 162 Updated Sep 26, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 153,290 13,281 Updated Sep 29, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 18,490 3,048 Updated Sep 30, 2025

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 896 170 Updated Dec 30, 2024

Intel® Extension for TensorFlow*

C++ 347 45 Updated Mar 18, 2025

A personal experimental C++ Syntax 2 -> Syntax 1 compiler

C++ 5,808 265 Updated Sep 9, 2025
Go 16 1 Updated Dec 20, 2021

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,365 765 Updated Sep 29, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 33,568 3,188 Updated Sep 30, 2025

Enabling PyTorch on XLA Devices (e.g. Google TPU)

Python 2,688 565 Updated Sep 29, 2025

Development repository for the Triton language and compiler

MLIR 17,050 2,275 Updated Sep 30, 2025

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,654 319 Updated Oct 19, 2024

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 3,563 654 Updated Sep 30, 2025

An implementation of a deep learning recommendation model (DLRM)

Python 3,970 866 Updated Sep 2, 2025

mal - Make a Lisp

Assembly 10,473 2,646 Updated Sep 4, 2025

LLM training in simple, raw C/CUDA

Cuda 27,700 3,196 Updated Jun 26, 2025

Zerocopy makes zero-cost memory manipulation effortless. We write `unsafe` so you don’t have to.

Rust 2,036 124 Updated Sep 14, 2025
Next