Meng Li's Homepage
Efficient AI
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference
Shuzhang Zhong, Yanfan Sun, Ling Liang, Runsheng Wang, Ru Huang, Meng Li
ReaLM: Reliable and Efficient Large Language Model Inference with Statistical Algorithm-Based Fault Tolerance
Tong Xie, Jiawang Zhao, Zishen Wan, Zuodong Zhang, Yuan Wang, Runsheng Wang, Ru Huang, Meng Li
SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding
Linye Wei, Shuzhang Zhong, Songqiang Xu, Runsheng Wang, Ru Huang, Meng Li
UniCAIM: A Unified CAM/CIM Architecture with Static-Dynamic KV Cache Pruning for Efficient Long-Context LLM Inference
Weikai Xu, Wenxuan Zeng, Qianqian Huang, Meng Li, Ru Huang
Compact Non-Volatile Lookup Table Architecture based on Ferroelectric FET Array through In-Situ Combinatorial One-Hot Encoding for Reconfigurable Computing
Weikai Xu, Meng Li, Qianqian Huang, Ru Huang
FLASH: An Efficient Hardware Accelerator Leveraging Approximate and Sparse FFT for Homomorphic Encryption
Tengyu Zhang, Yufei Xue, Ling Liang, Zhen Gu, Yuan Wang, Runsheng Wang, Ru Huang, Meng Li
LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design
Renjie Wei, Songqiang Xu, Linfeng Zhong, Zebin Yang, Qingyu Guo, Yuan Wang, Runsheng Wang, Meng Li
SCALES: Boost Binary Neural Network for Image Super-Resolution with Efficient Scalings
Renjie Wei, Zechun Liu, Yuchen Fan, Runsheng Wang, Ru Huang, Meng Li
Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function Approximator
Xincheng Feng, Guodong Shen, Jianhao Hu, Meng Li, Ngai Wong
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference
Shuzhang Zhong, Ling Liang, Yuan Wang, Runsheng Wang, Ru Huang, Meng Li