李萌的个人主页
李萌的个人主页
首页
新闻
最新讲座
论文发表
开源项目
学生
联系方式
浅色
深色
自动
中文 (简体)
English
Hardware Acc
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference
Shuzhang Zhong
,
Yanfan Sun
,
Ling Liang
,
Runsheng Wang
,
Ru Huang
,
李萌
ReaLM: Reliable and Efficient Large Language Model Inference with Statistical Algorithm-Based Fault Tolerance
Tong Xie
,
Jiawang Zhao
,
Zishen Wan
,
Zuodong Zhang
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
,
李萌
SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding
Linye Wei
,
Shuzhang Zhong
,
Songqiang Xu
,
Runsheng Wang
,
Ru Huang
,
李萌
UniCAIM: A Unified CAM/CIM Architecture with Static-Dynamic KV Cache Pruning for Efficient Long-Context LLM Inference
Weikai Xu
,
Wenxuan Zeng
,
Qianqian Huang
,
李萌
,
Ru Huang
Compact Non-Volatile Lookup Table Architecture based on Ferroelectric FET Array through In-Situ Combinatorial One-Hot Encoding for Reconfigurable Computing
Weikai Xu
,
李萌
,
Qianqian Huang
,
Ru Huang
FLASH: An Efficient Hardware Accelerator Leveraging Approximate and Sparse FFT for Homomorphic Encryption
Tengyu Zhang
,
Yufei Xue
,
Ling Liang
,
Zhen Gu
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
,
李萌
LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design
Renjie Wei
,
Songqiang Xu
,
Linfeng Zhong
,
Zebin Yang
,
Qingyu Guo
,
Yuan Wang
,
Runsheng Wang
,
李萌
SCALES: Boost Binary Neural Network for Image Super-Resolution with Efficient Scalings
Renjie Wei
,
Zechun Liu
,
Yuchen Fan
,
Runsheng Wang
,
Ru Huang
,
李萌
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference
Shuzhang Zhong
,
Ling Liang
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
,
李萌
FlexHE: A flexible Kernel Generation Framework for Homomorphic Encryption-Based Private Inference
Jiangrui Yu
,
Wenxuan Zeng
,
Tianshi Xu
,
Renze Chen
,
Yun (Eric) Liang
,
Runsheng Wang
,
Ru Huang
,
李萌
»
引用
×
var dimensionValue = 'SOME_DIMENSION_VALUE'; ga('set', 'dimension1', dimensionValue);