Meng Li's Homepage
Efficient AI
Orchestrating Dual-Boundaries: An Arithmetic Intensity Inspired Acceleration Framework for Diffusion Language Models
Linye Wei, Wenjue Chen, Pingzhi Tang, Xiaotian Guo, Le Ye, Runsheng Wang, Meng Li
S2CIM: A Secure-Computation and Secure-Storage Compute-in-Memory Architecture with Circuit-Algorithm Co-Design for Efficient and Trustworthy Edge Inference
Hanyong Shao, Zhiyuan Ning, Runteng Zhu, Wenpu Luo, Xiaolei Wang, Xunzhao Yin, Meng Li, Kechao Tang, Ru Huang
EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval
Zebin Yang, Sunjian Zheng, Tong Xie, Tianshi Xu, Bo Yu, Fan Wang, Jie Tang, Shaoshan Liu, Meng Li
H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference
Zizhuo Fu, Xiaotian Guo, Wenxuan Zeng, Shuzhang Zhong, Yadong Zhang, Peiyu Chen, Runsheng Wang, Le Ye, Meng Li
HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing
Haochen Huang, Shuzhang Zhong, Zhe Zhang, Shuangchen Li, Dimin Niu, Hongzhong Zheng, Runsheng Wang, Meng Li
No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering
Linye Wei, Jiajun Tang, Fan Fei, Boxin Shi, Runsheng Wang, Meng Li
SpecMamba: Accelerating Mamba Inference on FPGA with Speculative Decoding
Linfeng Zhong, Songqiang Xu, Huifeng Wen, Tong Xie, Qingyu Guo, Yuan Wang, Meng Li
A 28nm 534.6TOPS/W Mixed-Precision Edge Accelerator for Embodied AI Using Stochastic Computing
Tengyu Zhang, Tong Xie, Haoyang Luo, Yixuan Hu, Yaoyu Tao, Xiyuan Tang, Yuan Wang, Runsheng Wang, Meng Li
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference
Shuzhang Zhong, Yanfan Sun, Ling Liang, Runsheng Wang, Ru Huang, Meng Li
ReaLM: Reliable and Efficient Large Language Model Inference with Statistical Algorithm-Based Fault Tolerance
Tong Xie, Jiawang Zhao, Zishen Wan, Zuodong Zhang, Yuan Wang, Runsheng Wang, Ru Huang, Meng Li