Meng Li's Homepage
Meng Li's Homepage
Home
News
Talks
Publications
Projects
Students
Contact
Light
Dark
Automatic
1
Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function Approximator
Xincheng Feng
,
Guodong Shen
,
Jianhao Hu
,
Meng Li
,
Ngai Wong
ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction
Renze Chen
,
Zhuofeng Wang
,
Beiquan Cao
,
Tong Wu
,
Size Zheng
,
Xiuhong Li
,
Xuechao Wei
,
Shengen Yan
,
Meng Li
,
Yun Liang
PrivCirNet: Efficient Private Inference via Block Circulant Transformation
Tianshi Xu
,
Lemeng Wu
,
Runsheng Wang
,
Meng Li
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference
Shuzhang Zhong
,
Ling Liang
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
FlexHE: A flexible Kernel Generation Framework for Homomorphic Encryption-Based Private Inference
Jiangrui Yu
,
Wenxuan Zeng
,
Tianshi Xu
,
Renze Chen
,
Yun (Eric) Liang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline
Qingyu Guo
,
Jiayong Wan
,
Songqiang Xu
,
Meng Li
,
Yuan Wang
MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
Zebin Yang
,
Renze Chen
,
Taiqiang Wu
,
Ngai Wong
,
Yun (Eric) Liang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
OSCA: End-to-end Serial Stochastic Computing Neural Acceleration with Fine-grained Scaling and Piecewise Activation
Yixuan Hu
,
Yikang Jia
,
Meng Li
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization
Tianshi Xu
,
Shuzhang Zhong
,
Wenxuan Zeng
,
Runsheng Wang
,
Meng Li
ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
Shuzhang Zhong
,
Zebin Yang
,
Ruihao Gong
,
Runsheng Wang
,
Ru Huang
,
Meng Li
»
Cite
×
var dimensionValue = 'SOME_DIMENSION_VALUE'; ga('set', 'dimension1', dimensionValue);