李萌

助理教授、研究员、博雅青年学者

Biography

李萌于2022年加入北京大学集成电路学院和人工智能研究院，任助理教授，博士生导师，博雅青年学者。加入北京大学前，他曾任职于美国Facebook公司的虚拟现实增强现实实验室，作为技术主管主导虚拟现实和增强现实设备中的人工智能加速算法和系统研究。他于2018年和2013年分别在美国德州大学奥斯汀分校和北京大学获得博士和学士学位。

他的研究兴趣集中于高效、安全的多模态人工智能加速算法和芯片，旨在通过算法到芯片的跨层次协同设计和优化，为人工智能构建高能效、高可靠、高安全的算力基础。他的研究获得了科技部重点研发课题、国自然重大项目课题、国自然重大研究计划培养项目等一些列国家级项目支持。

他在国际顶级会议、期刊发表文章90余篇，引用7000余次，获得最佳论文2次。此外，他还获得了DAC系统设计竞赛第一名、AICAS大模型系统设计竞赛第一名、CCF集成电路Early Career Award、欧洲设计自动化协会最佳博士论文、ACM学生科研竞赛总决赛第一名、美国德州大学奥斯汀分校 Margarida Jacome 杰出论文奖、ASPDAC博士科研论坛最佳海报报告奖、ACM/SIGDA博士科研竞赛金牌以及半导体安全领域顶会IEEE HOST和集成电路设计自动化领域顶会ACM GLSVLSI最佳论文奖。

实验室常年招收对人工智能算法和芯片感兴趣的本科生、硕士生、博士生和博士后。对于感兴趣学生，欢迎给我发送邮件，邮件主题为"Prospective Student from [Your Institute]"，同时在邮件中插入你的简历、成绩单或其他材料。

兴趣爱好

高效、安全多模态人工智能加速算法和芯片
算法/芯片协同设计

教育经历

博士，计算机工程, 2018
德克萨斯州州立大学奥斯汀分校，美国
硕士，计算机工程, 2015
德克萨斯州州立大学奥斯汀分校，美国
学士，微电子学, 2013
北京大学，中国

Research Focus

Efficient AI Algorithm

Multi-Modal AI

AI/HW Co-Design

News

傅子酌同学获得ACM/SIGDA学生科研竞赛第一名

最近更新于 11月 6, 2025

One Paper Accepted by Usenix Security'2025

最近更新于 6月 30, 2025

Team SEC won the 1st Place in AICAS Grand Challenge on LLM Hardware System Design

最近更新于 6月 30, 2025

One Collaboration Paper Accepted in IEEE TIFS'2025

最近更新于 6月 30, 2025

Four Papers Accepted by DAC'2025

最近更新于 6月 30, 2025

查看全部

Experience

Tenure-Track Assistant Professor

Peking University

7月 2022 – 现在 Beijing

Institute of Artificial Intelligence

Staff Research Scientist

Recent Talks

Efficient Private Transformer Inference through Network/Protocol Co-optimization

Discuss the recent NeurIPs'2024 paper from our lab on joint network/protocol optimization to enable efficient private Transformer inference.

10月 19, 2024 11:00 AM — 12:00 PM

Efficient Private Inference through Network/Protocol Co-optimization

Discuss the recent works from our lab on joint network/protocol optimization to enable more efficient privacy-preserving AI.

1月 24, 2024 11:00 AM — 12:00 PM

Neural Acceleration with Full Stack Optimization

Discuss the recent works from our lab on full stack optimization to enable accurate yet efficient AI acceleration for multi-modal applications.

1月 20, 2024 11:00 AM — 12:00 PM

Efficient Multi-Modal AI Acceleration

Discuss the recent works from our lab on full stack optimization to enable accurate yet efficient AI acceleration for multi-modal applications.

6月 7, 2023 11:00 AM — 12:00 PM

Efficient Audio-Visual Understanding on AR Devices

Discuss full stack techniques from training, optimization, hardware, etc perspectives to enable efficient audio-visual understanding on AR devices

11月 5, 2021 11:00 AM — 12:00 PM

查看全部演讲

Recent Publications

More detailed publication lists available through Google Scholar

Quickly discover relevant content by filtering publications.

Tianshi Xu, Wenjie Lu, Jiangrui Yu, Chenqi Lin, Yi Chen, Runsheng Wang, 李萌 (2025). Breaking the Layer Barrier: Remodeling Private Transformer Inference with Hybrid CKKS and MPC. In Usenix Security Symposium 2025.

Shuzhang Zhong, Yanfan Sun, Ling Liang, Runsheng Wang, Ru Huang, 李萌 (2025). HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference. In Design Automation Conference (DAC) 2025.

Tong Xie, Jiawang Zhao, Zishen Wan, Zuodong Zhang, Yuan Wang, Runsheng Wang, Ru Huang, 李萌 (2025). ReaLM: Reliable and Efficient Large Language Model Inference with Statistical Algorithm-Based Fault Tolerance. In Design Automation Conference (DAC) 2025.

Linye Wei, Shuzhang Zhong, Songqiang Xu, Runsheng Wang, Ru Huang, 李萌 (2025). SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding. In Design Automation Conference (DAC) 2025.

Weikai Xu, Wenxuan Zeng, Qianqian Huang, 李萌, Ru Huang (2025). UniCAIM: A Unified CAM/CIM Architecture with Static-Dynamic KV Cache Pruning for Efficient Long-Context LLM Inference. In Design Automation Conference (DAC) 2025.

Weikai Xu, 李萌, Qianqian Huang, Ru Huang (2025). Compact Non-Volatile Lookup Table Architecture based on Ferroelectric FET Array through In-Situ Combinatorial One-Hot Encoding for Reconfigurable Computing. In Design, Automation and Test in Europe Conference and Exhibition (DATE) 2025.

Tengyu Zhang, Yufei Xue, Ling Liang, Zhen Gu, Yuan Wang, Runsheng Wang, Ru Huang, 李萌 (2025). FLASH: An Efficient Hardware Accelerator Leveraging Approximate and Sparse FFT for Homomorphic Encryption. In Design, Automation and Test in Europe Conference and Exhibition (DATE) 2025.

Renjie Wei, Songqiang Xu, Linfeng Zhong, Zebin Yang, Qingyu Guo, Yuan Wang, Runsheng Wang, 李萌 (2025). LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design. In Design, Automation and Test in Europe Conference and Exhibition (DATE) 2025.

Renjie Wei, Zechun Liu, Yuchen Fan, Runsheng Wang, Ru Huang, 李萌 (2025). SCALES: Boost Binary Neural Network for Image Super-Resolution with Efficient Scalings. In Design, Automation and Test in Europe Conference and Exhibition (DATE) 2025.

Xincheng Feng, Guodong Shen, Jianhao Hu, 李萌, Ngai Wong (2025). Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function Approximator. In Asia and South Pacific Design Automation Conference (ASP-DAC) 2025.

PDF

查看全部出版物

Projects

HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference

Based on the KTransformers project, develop an adaptive scheduling framework that leverage the heterogeneous computation capability of CPU and GPU for efficient mixture-of-expert LLM inference.

AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference

Develop an adaptive scheduling framework for efficient mixture-of-expert LLM inference on edge devices.

MPCViT: Searching for Accurate and Efficient MPC-friendly Vision Transformer with Heterogeneous Attention

Develop MPCViT that leverages neural architecture search to search MPC-friendly vision transformers.

NASViT: Neural Architecture Search for Efficient Vision Transformer with Gradient Conflict-Aware Supernet Training

Propose gradient conflict-aware training to improve supernet-based NAS and develop a family of optimized hybrid CNN/ViT networks that achieve state-of-the-art performance Pareto.

AlphaNet: Improved Training of Supernet with Alpha-Divergence

Develop AlphaNet to improve the supernet-based NAS with a more generalized alpha-divergence-based knowledge distillation and achieve state-of-the-art performance Pareto.

AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling

Develop AttentiveNAS that focuses on improving the sampling strategy for supernet-based NAS to achieve state-of-the-art performance Pareto.

Accomplishments

AICAS Grand Challenge on LLM Hardware System Design 1st Place — IEEE Circuit & System Society, 4月 2025
Young Teachers’ Teaching Skills Competition 1st Place Prize — Peking University, 12月 2024
CCF-Ant Group Research Award on Hardware/Software Co-Design — China Computer Federation (CCF), 8月 2024
CCF Integrated Circuits Early Career Award — China Computer Federation (CCF), 7月 2024
Secretflow Outstanding Industry-Academic Cooperation Contribution Award — Ant Group, 5月 2024
CCF-Ant Group Research Award on Privacy Computing — China Computer Federation (CCF), 8月 2023
Margarida Jacome Outstanding Dissertation Prize — University of Texas at Austin, 5月 2019
Outstanding Dissertations Award — European Design and Automation Association (EDAA), 5月 2019
Nominee of ACM Doctoral Dissertation Award — University of Texas at Austin, 3月 2019
First Place, Student Research Competition Grand Final (Graduate Category) — Aossication of Computing Machinery (ACM), 6月 2018
Best Paper Award — ACM Great Lake Symposium on VLSI (GLSVLSI), 3月 2018
Best Poster (Presentation) Award — ASPDAC Student Research Forum, Aossication of Computing Machinery (ACM) SIGDA, 2月 2018
Gold Medal — ICCAD Student Research Competition, Aossication of Computing Machinery (ACM) SIGDA, 11月 2017
Best Paper Award — IEEE International Symposium on Hardware Oriented Security and Trust (HOST), 7月 2017
Cockrell School Graduate Student Fellowship — University of Texas at Austin, 9月 2013
Yang Fuqing and Wang Yangyuan Academician Scholarship — Peking University, 9月 2011
Li Yanhong Baidu Scholarship — Peking University, 9月 2010

Contact

meng[dot]li[at]pku[dot]edu[dot]cn
No.5, Yiheyuan Road, Haidian District, Beijing 100871

李萌

助理教授、研究员、博雅青年学者

人工智能研究院

集成电路学院

北京大学

Biography

Research Focus

News

Experience

Recent Talks

Recent Publications

Projects

Accomplish­ments

Contact

Accomplishments