Meng Li's Homepage
Meng Li's Homepage
Home
News
Talks
Publications
Projects
Students
Contact
Light
Dark
Automatic
English
中文 (简体)
Efficient AI
Attention Sink Forges Native MoE in Attention Layers: Sink-Aware Training to Address Head Collapse
Zizhuo Fu
,
Wenxuan Zeng
,
Runsheng Wang
,
Meng Li
HyPER: Bridging Exploration and Exploitation for Scalable LLM Reasoning with Hypothesis Path Expansion and Reduction
Shengxuan Qiu
,
Haochen Huang
,
Shuzhang Zhong
,
Pengfei Zuo
,
Meng Li
TEAM: Temporal–Spatial Consistency Guided Expert Activation for MoE Diffusion Language Model Acceleration
Linye Wei
,
Zixiang Luo
,
Pingzhi Tang
,
Meng Li
Breaking the Reward Barrier: Accelerating Tree-of-Thought Reasoning via Speculative Exploration
Shuzhang Zhong
,
Haochen Huang
,
Shengxuan Qiu
,
Pengfei Zuo
,
Runsheng Wang
,
Meng Li
CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems
Tong Xie
,
Yijiahao Qi
,
Jinqi Wen
,
Zishen Wan
,
Yanchi Dong
,
Zihao Wang
,
Shaofei Cai
,
Yitao Liang
,
Tianyu Jia
,
Yuan Wang
,
Runsheng Wang
,
Meng Li
DRIFT: Harnessing Inherent Fault Tolerance for Efficient and Reliable Diffusion Model Inference
Jinqi Wen
,
Tong Xie
,
Runsheng Wang
,
Meng Li
DySL-VLA: Efficient Vision-Language-Action Model Inference via Dynamic-Static Layer-Skipping for Robot Manipulation
Zebin Yang
,
Yijiahao Qi
,
Tong Xie
,
Bo Yu
,
Shaoshan Liu
,
Meng Li
EdgeSC: Universal Stochastic Computing Architecture for Efficient Edge Detection
Xincheng Feng
,
Wenyong Zhou
,
Taiqiang Wu
,
Zhengwu Liu
,
Meng Li
,
Ngai Wong
KEEP: A KV-Cache-Centric Memory Management System for Efficient Embodied Planning
Zebin Yang
,
Tong Xie
,
Baotong Lu
,
Shaoshan Liu
,
Bo Yu
,
Meng Li
NASiC: 3D NAND-based CAM-Selected Multibit CIM Architecture for Efficient On-Device Mixture-of-Experts LLM Inference
Weikai Xu
,
Meng Li
,
Shuzhang Zhong
,
Tianyang Luo
,
Dongxue Zhao
,
Ling Liang
,
Zongwei Wang
,
Qianqian Huang
,
Yimao Cai
,
Ru Huang
»
Cite
×