Skip to content
View Xunzhuo's full-sized avatar
🎲
Exploring AI Networks
🎲
Exploring AI Networks

Block or report Xunzhuo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Xunzhuo/README.md

Profile View Counter Linkedln Zhihu Badge Gmail Badge Wechat Badge

Bit is exploring the frontier technologies of combination of networking and LLM at Tencent Kubernetes Engine team. He is currently working on everthing around AI Infrastructure. Previously, he has been involved in NLP research at UESTC NLP Lab.

Bit is leading the development of vLLM Semantic Router, an intelligent auto reasoning router for Efficient LLM Inference on Mixture-of-Models, saving tons of cost by advanced routing algorithm.

As a CNCF Ambassador and Linux Foundation LFAPAC, Bit serves on the Envoy Gateway Steering Committee. He also maintains multiple projects including Envoy AI Gateway, vLLM AIBrix, Istio, Kiali, Aeraki-Mesh, and Merbridge, as well as the approver of Higress and MOSN. Additionally, Bit contributes as a Kubernetes Gateway API and Kubernetes Ingress2Gateway reviewer and member of Kubernetes.

Pinned Loading

  1. vllm-project/semantic-router vllm-project/semantic-router Public

    Intelligent Mixture-of-Models Router for Efficient LLM Inference

    Python 745 63

  2. envoyproxy/gateway envoyproxy/gateway Public

    Manages Envoy Proxy as a Standalone or Kubernetes-based Application Gateway

    Go 2.1k 543

  3. envoyproxy/ai-gateway envoyproxy/ai-gateway Public

    Manages Unified Access to Generative AI Services built on Envoy Gateway

    Go 1k 95

  4. vllm-project/aibrix vllm-project/aibrix Public

    Cost-efficient and pluggable Infrastructure components for GenAI inference

    Go 4.2k 451