Meng Li's Homepage
Meng Li's Homepage
Home
News
Talks
Publications
Projects
Students
Contact
Light
Dark
Automatic
English
中文 (简体)
Efficient AI
SCALES: Boost Binary Neural Network for Image Super-Resolution with Efficient Scalings
Renjie Wei
,
Zechun Liu
,
Yuchen Fan
,
Runsheng Wang
,
Ru Huang
,
Meng Li
Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function Approximator
Xincheng Feng
,
Guodong Shen
,
Jianhao Hu
,
Meng Li
,
Ngai Wong
ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction
Renze Chen
,
Zhuofeng Wang
,
Beiquan Cao
,
Tong Wu
,
Size Zheng
,
Xiuhong Li
,
Xuechao Wei
,
Shengen Yan
,
Meng Li
,
Yun Liang
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference
Shuzhang Zhong
,
Ling Liang
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline
Qingyu Guo
,
Jiayong Wan
,
Songqiang Xu
,
Meng Li
,
Yuan Wang
MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
Zebin Yang
,
Renze Chen
,
Taiqiang Wu
,
Ngai Wong
,
Yun (Eric) Liang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
OSCA: End-to-end Serial Stochastic Computing Neural Acceleration with Fine-grained Scaling and Piecewise Activation
Yixuan Hu
,
Yikang Jia
,
Meng Li
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
Shuzhang Zhong
,
Zebin Yang
,
Ruihao Gong
,
Runsheng Wang
,
Ru Huang
,
Meng Li
CASCADE: A Framework for CNN Accelerator Synthesis with Concatenation and Refreshing Dataflow
Qingyu Guo
,
Haoyang Luo
,
Meng Li
,
Xiyuan Tang
,
Yuan Wang
ASCEND: Accurate yet Efficient End-to-End Stochastic Computing Acceleration of Vision Transformer
Tong Xie
,
Yixuan Hu
,
Renjie Wei
,
Meng Li
,
Runsheng Wang
,
Yuan Wang
,
Ru Huang
«
»
Cite
×