Meng Li's Homepage
Meng Li's Homepage
Home
News
Talks
Publications
Projects
Students
Contact
Light
Dark
Automatic
Hardware Acc
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference
Shuzhang Zhong
,
Ling Liang
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
FlexHE: A flexible Kernel Generation Framework for Homomorphic Encryption-Based Private Inference
Jiangrui Yu
,
Wenxuan Zeng
,
Tianshi Xu
,
Renze Chen
,
Yun (Eric) Liang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline
Qingyu Guo
,
Jiayong Wan
,
Songqiang Xu
,
Meng Li
,
Yuan Wang
MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
Zebin Yang
,
Renze Chen
,
Taiqiang Wu
,
Ngai Wong
,
Yun (Eric) Liang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
OSCA: End-to-end Serial Stochastic Computing Neural Acceleration with Fine-grained Scaling and Piecewise Activation
Yixuan Hu
,
Yikang Jia
,
Meng Li
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization
Tianshi Xu
,
Shuzhang Zhong
,
Wenxuan Zeng
,
Runsheng Wang
,
Meng Li
ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
Shuzhang Zhong
,
Zebin Yang
,
Ruihao Gong
,
Runsheng Wang
,
Ru Huang
,
Meng Li
CASCADE: A Framework for CNN Accelerator Synthesis with Concatenation and Refreshing Dataflow
Qingyu Guo
,
Haoyang Luo
,
Meng Li
,
Xiyuan Tang
,
Yuan Wang
Alchemist: A Unified Accelerator Architecture for Cross-Scheme Fully Homomorphic Encryption
Jianan Mu
,
Husheng Han
,
Shangyi Shi
,
Jing Ye
,
Zizhen Liu
,
Shengwen Liang
,
Meng Li
,
Mingzhe Zhang
,
Song Bian
,
Xing Hu
,
Huaiwei Li
,
Xiaowei Li
FastQuery: Communication-efficient Embedding Table Query for Private LLMs inference
Chenqi Lin
,
Tianshi Xu
,
Zebin Yang
,
Meng Li
,
Runsheng Wang
,
Ru Huang
»
Cite
×
var dimensionValue = 'SOME_DIMENSION_VALUE'; ga('set', 'dimension1', dimensionValue);