Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
A High-Efficiency System of Large Language Model Based Search Agents
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
Official PyTorch implementation of the paper "Dataset Distillation via the Wasserstein Metric" (ICCV 2025).
Dynamic Attention Mask (DAM) generates adaptive sparse attention masks per layer and head for Transformer models, enabling long-context inference with lower compute and memory overhead, without fine-tuning.
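As a hedged illustration of the general idea only (not DAM's actual algorithm or API), a per-head, per-query sparse mask can be built by keeping the top-k attention logits; the function name, `keep_ratio` knob, and tensor shapes below are all hypothetical.

```python
import torch

def topk_sparse_mask(scores: torch.Tensor, keep_ratio: float) -> torch.Tensor:
    # scores: (heads, q_len, k_len) attention logits for one layer.
    k_len = scores.size(-1)
    k = max(1, int(k_len * keep_ratio))
    topk = scores.topk(k, dim=-1).indices             # indices of kept keys per query
    mask = torch.zeros_like(scores, dtype=torch.bool)
    mask.scatter_(-1, topk, True)                     # mark kept key positions
    return mask

logits = torch.randn(8, 16, 1024)                     # 8 heads, 16 queries, 1024 keys
mask = topk_sparse_mask(logits, keep_ratio=0.1)       # keep 10% of keys per query
attn = logits.masked_fill(~mask, float("-inf")).softmax(dim=-1)
```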
MOCA-Net: a neural architecture combining sparse mixture-of-experts (MoE), external memory, and budget-aware computation, built for efficient sequence modeling. Includes integration with the real Stanford SST-2 dataset, O(L) complexity, and 96.40% accuracy.
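For readers unfamiliar with sparse MoE layers, the toy sketch below shows top-1 routing in plain PyTorch; it is purely illustrative and does not reproduce MOCA-Net's actual routing, memory, or budget mechanisms.

```python
import torch
import torch.nn as nn

class Top1MoE(nn.Module):
    """Toy sparse mixture-of-experts layer with top-1 routing (illustrative only)."""
    def __init__(self, dim: int, num_experts: int = 4):
        super().__init__()
        self.router = nn.Linear(dim, num_experts)
        self.experts = nn.ModuleList(nn.Linear(dim, dim) for _ in range(num_experts))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, dim); each token is dispatched to exactly one expert.
        gates = self.router(x).softmax(dim=-1)         # (tokens, num_experts)
        expert_idx = gates.argmax(dim=-1)              # chosen expert per token
        out = torch.zeros_like(x)
        for i, expert in enumerate(self.experts):
            sel = expert_idx == i
            if sel.any():
                out[sel] = expert(x[sel]) * gates[sel, i].unsqueeze(-1)
        return out

moe = Top1MoE(dim=64)
y = moe(torch.randn(10, 64))  # 10 tokens through the sparse layer
```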
QuantLab-8bit is a reproducible benchmark of 8-bit quantization on compact vision backbones. It includes FP32 baselines, post-training quantization (dynamic and static), quantization-aware training (QAT), ONNX exports, parity checks, ONNX Runtime CPU latency measurements, and visual diagnostics.
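As a minimal sketch of the FP32-vs-int8 parity-check idea (using PyTorch's built-in dynamic PTQ, not necessarily QuantLab-8bit's pipeline; MobileNetV2 is an assumed stand-in backbone):

```python
import torch
from torchvision.models import mobilenet_v2

# FP32 baseline backbone (placeholder for one of the compact vision backbones).
fp32_model = mobilenet_v2(weights=None).eval()

# Dynamic post-training quantization: Linear-layer weights become int8,
# activations are quantized on the fly at inference time (conv layers stay FP32).
int8_model = torch.ao.quantization.quantize_dynamic(
    fp32_model, {torch.nn.Linear}, dtype=torch.qint8
)

# Parity check on a dummy batch: compare FP32 and int8 logits.
x = torch.randn(1, 3, 224, 224)
with torch.no_grad():
    diff = (fp32_model(x) - int8_model(x)).abs().max()
print(f"max |fp32 - int8| logit difference: {diff:.4f}")
```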
Code for the paper "Automated Design for Hardware-aware Graph Neural Networks on Edge Devices".