czhu15

Follow

Bob Zhu czhu15

Follow

Intel

Achievements

Achievements

Popular repositories Loading

helloworld helloworld Public

A demo repo

M4
Paddle Paddle Public

Forked from PaddlePaddle/Paddle

PArallel Distributed Deep LEarning （PaddlePaddle核心框架，高性能单机、分布式训练和跨平台部署）

C++
vllm-fork vllm-fork Public

Forked from HabanaAI/vllm-fork

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
neural-compressor neural-compressor Public

Forked from intel/neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python
vllm-omni vllm-omni Public

Forked from vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

Python
vllm-gaudi vllm-gaudi Public

Forked from vllm-project/vllm-gaudi

Community maintained hardware plugin for vLLM on Intel Gaudi

Python