Meng Li (李萌): Personal Homepage
Efficient AI
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference
Shuzhang Zhong, Yanfan Sun, Ling Liang, Runsheng Wang, Ru Huang, 李萌
ReaLM: Reliable and Efficient Large Language Model Inference with Statistical Algorithm-Based Fault Tolerance
Tong Xie, Jiawang Zhao, Zishen Wan, Zuodong Zhang, Yuan Wang, Runsheng Wang, Ru Huang, 李萌
SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding
Linye Wei, Shuzhang Zhong, Songqiang Xu, Runsheng Wang, Ru Huang, 李萌
UniCAIM: A Unified CAM/CIM Architecture with Static-Dynamic KV Cache Pruning for Efficient Long-Context LLM Inference
Weikai Xu, Wenxuan Zeng, Qianqian Huang, 李萌, Ru Huang
Compact Non-Volatile Lookup Table Architecture based on Ferroelectric FET Array through In-Situ Combinatorial One-Hot Encoding for Reconfigurable Computing
Weikai Xu, 李萌, Qianqian Huang, Ru Huang
FLASH: An Efficient Hardware Accelerator Leveraging Approximate and Sparse FFT for Homomorphic Encryption
Tengyu Zhang, Yufei Xue, Ling Liang, Zhen Gu, Yuan Wang, Runsheng Wang, Ru Huang, 李萌
LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design
Renjie Wei, Songqiang Xu, Linfeng Zhong, Zebin Yang, Qingyu Guo, Yuan Wang, Runsheng Wang, 李萌
SCALES: Boost Binary Neural Network for Image Super-Resolution with Efficient Scalings
Renjie Wei, Zechun Liu, Yuchen Fan, Runsheng Wang, Ru Huang, 李萌
Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function Approximator
Xincheng Feng, Guodong Shen, Jianhao Hu, 李萌, Ngai Wong
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference
Shuzhang Zhong, Ling Liang, Yuan Wang, Runsheng Wang, Ru Huang, 李萌