Meng Li's Homepage
Meng Li's Homepage
Home
News
Talks
Publications
Projects
Students
Contact
Light
Dark
Automatic
English
中文 (简体)
1
PrivCirNet: Efficient Private Inference via Block Circulant Transformation
Tianshi Xu
,
Lemeng Wu
,
Runsheng Wang
,
Meng Li
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference
Shuzhang Zhong
,
Ling Liang
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
FlexHE: A flexible Kernel Generation Framework for Homomorphic Encryption-Based Private Inference
Jiangrui Yu
,
Wenxuan Zeng
,
Tianshi Xu
,
Renze Chen
,
Yun (Eric) Liang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline
Qingyu Guo
,
Jiayong Wan
,
Songqiang Xu
,
Meng Li
,
Yuan Wang
MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
Zebin Yang
,
Renze Chen
,
Taiqiang Wu
,
Ngai Wong
,
Yun (Eric) Liang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
OSCA: End-to-end Serial Stochastic Computing Neural Acceleration with Fine-grained Scaling and Piecewise Activation
Yixuan Hu
,
Yikang Jia
,
Meng Li
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization
Tianshi Xu
,
Shuzhang Zhong
,
Wenxuan Zeng
,
Runsheng Wang
,
Meng Li
ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
Shuzhang Zhong
,
Zebin Yang
,
Ruihao Gong
,
Runsheng Wang
,
Ru Huang
,
Meng Li
FastQuery: Communication-efficient Embedding Table Query for Private LLMs inference
Chenqi Lin
,
Tianshi Xu
,
Zebin Yang
,
Meng Li
,
Runsheng Wang
,
Ru Huang
ASCEND: Accurate yet Efficient End-to-End Stochastic Computing Acceleration of Vision Transformer
Tong Xie
,
Yixuan Hu
,
Renjie Wei
,
Meng Li
,
Runsheng Wang
,
Yuan Wang
,
Ru Huang
«
»
Cite
×