Meng Li

Assistant Professor

Institute for Artificial Intelligence

Biography

I am currently a tenure-track assistant professor jointly affiliated with the Institute for Artificial Intelligence and the School of Integrated Circuits at Peking University. Before joining Peking University, I was a staff research scientist and tech lead on the Meta On-Device AI team, where I focused on researching and productizing efficient AI algorithms and hardware for next-generation AR/VR devices. I received my Ph.D. degree from the Department of Electrical and Computer Engineering at the University of Texas at Austin under the supervision of Prof. David Z. Pan, and my bachelor's degree from Peking University under the supervision of Prof. Ru Huang and Prof. Runsheng Wang.

My research interests focus on efficient and secure multi-modality AI acceleration algorithms and hardware.

I am always looking for creative and self-motivated students and postdocs who are interested in co-designing future AI acceleration algorithms and systems for efficiency and privacy. Please contact me via email with the subject line “Prospective Student from [Your Institute]” and attach your CV. (I have finished Ph.D. student recruiting for 2024. If you are interested in applying for 2025, contact me early.)

Interests

Efficient and Secure Multi-Modality Artificial Intelligence
Algorithm/Hardware Co-Design/Co-Optimization

Education

PhD in Computer Engineering, 2018

University of Texas at Austin, Austin, TX, USA

MS in Computer Engineering, 2015

University of Texas at Austin, Austin, TX, USA

BS in Microelectronics, 2013

Peking University, Beijing, China

Research Focus

Efficient AI Algorithm

Multi-Modal AI

AI/HW Co-Design

News

Team SEC Won 1st Place in the DAC System Design Contest

Team SEC, formed by graduate students Linye Wei, Wenxuan Zeng, and Jiangrui Yu and undergraduate students Zizhuo Fu, Ziyu Tang, and Tianjian Yang, has won 1st Place in the DAC System Design Contest. In the contest, Team SEC enabled efficient diffusion models on tiny CPU devices through hardware-aware algorithm optimization.

Last updated on Jun 30, 2025

One Paper Accepted by USENIX Security'2025

One paper on privacy-preserving Transformer inference has been accepted by USENIX Security'2025 as a regular paper. The title of the paper is “Breaking the Layer Barrier: Remodeling Private Transformer Inference with Hybrid CKKS and MPC”.

Last updated on Jun 18, 2025

Team SEC Won 1st Place in the AICAS Grand Challenge on LLM Hardware System Design

Team SEC, formed by PhD students Qingyu Guo, Linfeng Zhong, Songqiang Xu, and Renjie Wei, has won 1st Place in the AICAS Grand Challenge on LLM System Design. Prof. Yuan Wang and I are the supervisors of the team. In the contest, Team SEC successfully deployed a Qwen-0.5B LLM on a lightweight KV260 FPGA and achieved a decoding throughput of 35 tokens/s, 4 times higher than that of the second-place team.

Last updated on Jun 18, 2025

One Collaboration Paper Accepted by IEEE TIFS'2025

One collaboration paper, “Swift: Fast Secure Neural Network Inference with Fully Homomorphic Encryption”, has been accepted by IEEE TIFS'2025.

Last updated on Jun 18, 2025

Four Papers Accepted by DAC'2025

Four papers on efficient LLMs have been accepted by DAC'2025 as regular papers.

Last updated on Jun 18, 2025

Experience

Tenure-Track Assistant Professor

Peking University

Jul 2022 – Present Beijing

Institute for Artificial Intelligence

Staff Research Scientist

Recent Talks

Efficient Private Transformer Inference through Network/Protocol Co-optimization

Discuss the recent NeurIPS'2024 paper from our lab on joint network/protocol optimization to enable efficient private Transformer inference.

Oct 19, 2024 11:00 AM — 12:00 PM

Efficient Private Inference through Network/Protocol Co-optimization

Discuss the recent works from our lab on joint network/protocol optimization to enable more efficient privacy-preserving AI.

Jan 24, 2024 11:00 AM — 12:00 PM

Neural Acceleration with Full Stack Optimization

Discuss the recent works from our lab on full stack optimization to enable accurate yet efficient AI acceleration for multi-modal applications.

Jan 20, 2024 11:00 AM — 12:00 PM

Efficient Multi-Modal AI Acceleration

Discuss the recent works from our lab on full stack optimization to enable accurate yet efficient AI acceleration for multi-modal applications.

Jun 7, 2023 11:00 AM — 12:00 PM

Efficient Audio-Visual Understanding on AR Devices

Discuss full-stack techniques, including training, optimization, and hardware, to enable efficient audio-visual understanding on AR devices.

Nov 5, 2021 11:00 AM — 12:00 PM

Recent Publications

More detailed publication lists available through Google Scholar

Tianshi Xu, Wenjie Lu, Jiangrui Yu, Chenqi Lin, Yi Chen, Runsheng Wang, Meng Li (2025). Breaking the Layer Barrier: Remodeling Private Transformer Inference with Hybrid CKKS and MPC. In USENIX Security Symposium 2025.

Yu Fu, Yu Tong, Yijing Ning, Tianshi Xu, Meng Li, Jingqiang Lin, Dengguo Feng (2025). Swift: Fast Secure Neural Network Inference with Fully Homomorphic Encryption. In IEEE Transactions on Information Forensics and Security (TIFS) 2025.

Shuzhang Zhong, Yanfan Sun, Ling Liang, Runsheng Wang, Ru Huang, Meng Li (2025). HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference. In Design Automation Conference (DAC) 2025.

Tong Xie, Jiawang Zhao, Zishen Wan, Zuodong Zhang, Yuan Wang, Runsheng Wang, Ru Huang, Meng Li (2025). ReaLM: Reliable and Efficient Large Language Model Inference with Statistical Algorithm-Based Fault Tolerance. In Design Automation Conference (DAC) 2025.

Linye Wei, Shuzhang Zhong, Songqiang Xu, Runsheng Wang, Ru Huang, Meng Li (2025). SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding. In Design Automation Conference (DAC) 2025.

Weikai Xu, Wenxuan Zeng, Qianqian Huang, Meng Li, Ru Huang (2025). UniCAIM: A Unified CAM/CIM Architecture with Static-Dynamic KV Cache Pruning for Efficient Long-Context LLM Inference. In Design Automation Conference (DAC) 2025.

Tengyu Zhang, Yufei Xue, Ling Liang, Zhen Gu, Yuan Wang, Runsheng Wang, Ru Huang, Meng Li (2025). FLASH: An Efficient Hardware Accelerator Leveraging Approximate and Sparse FFT for Homomorphic Encryption. In Design, Automation and Test in Europe Conference and Exhibition (DATE) 2025.

Weikai Xu, Meng Li, Qianqian Huang, Ru Huang (2025). Compact Non-Volatile Lookup Table Architecture based on Ferroelectric FET Array through In-Situ Combinatorial One-Hot Encoding for Reconfigurable Computing. In Design, Automation and Test in Europe Conference and Exhibition (DATE) 2025.

Renjie Wei, Songqiang Xu, Linfeng Zhong, Zebin Yang, Qingyu Guo, Yuan Wang, Runsheng Wang, Meng Li (2025). LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design. In Design, Automation and Test in Europe Conference and Exhibition (DATE) 2025.

Renjie Wei, Zechun Liu, Yuchen Fan, Runsheng Wang, Ru Huang, Meng Li (2025). SCALES: Boost Binary Neural Network for Image Super-Resolution with Efficient Scalings. In Design, Automation and Test in Europe Conference and Exhibition (DATE) 2025.

Xincheng Feng, Guodong Shen, Jianhao Hu, Meng Li, Ngai Wong (2025). Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function Approximator. In Asia and South Pacific Design Automation Conference (ASP-DAC) 2025.
