Software Engineering
Lou Wenqi (娄文启)
Specially Appointed Associate Research Fellow
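Lou Wenqi is a Specially Appointed Associate Research Fellow and master's supervisor at the School of Software Engineering, University of Science and Technology of China (USTC). Lou received a bachelor's degree from the School of Computer Science, Northwestern Polytechnical University, in June 2018 and a Ph.D. in Computer Architecture from USTC in December 2023, advised by Prof. Zhou Xuehai and Prof. Wang Chao. Research interests include intelligent accelerator architecture, FPGA accelerator design, and software-hardware co-optimization, with the aim of easing the deployment burden of deep learning models from both the algorithm and hardware sides. In recent years, Lou has published nearly 20 papers in computer architecture, in top journals and conferences including IEEE TCAD, TC, DAC, FPGA, and DATE.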
Email: louwenqi@ustc.edu.cn
Address: Room 508, Shaojun Building (绍钧楼), Ruoshui Road Campus, Suzhou Institute for Advanced Research, USTC
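Main research directions:
• FPGA accelerator design: designing FPGA accelerators for convolutional neural networks (CNNs), vision Transformers (ViTs), and large language models (LLMs) to improve performance and efficiency
• Neural architecture and accelerator co-search: jointly searching neural network architectures and hardware accelerators to optimize performance on a specific hardware platform
• Model quantization and pruning for FPGA/GPU inference: reducing model complexity and size through quantization and pruning tailored for efficient inference on FPGA and GPU architectures
• AI-enabled hardware design: applying artificial intelligence to the hardware design flow, including automated design-space exploration, optimization, and verification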
Awards:
Intel China Scholarship, 2022
USTC Gusu First-Class Scholarship, 2021
Publications:
1. Fu Wei, Lou Wenqi, Tang Cheng, Wen Hongbing, Qin Yunji, Gong Lei, Wang Chao, Zhou Xuehai. “UniCoS: A Unified Neural and Accelerator Co-Search Framework for CNNs and ViTs”. ACM/IEEE Design Automation Conference (DAC), San Francisco, Jun. 22-25, 2025. (CCF-A conference; top conference in chip design; corresponding author)
2. Lou Wenqi, Gong Lei, Wang Chao, Qian Jiaming, Wang Xuan, Li Changlong, Zhou Xuehai. “Unleashing Network/Accelerator Co-Exploration Potential on FPGAs: A Deeper Joint Search”. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2024, 43(10): 3041-3054. (CCF-A journal)
3. Lou Wenqi, Qin Yunji, Wang Xuan, Gong Lei, Wang Chao, Zhou Xuehai. “FlexBCM: Hybrid Block-Circulant Neural Network and Accelerator Co-Search on FPGAs”. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2024, 43(11): 3852-3863. (CCF-A journal)
4. Lou Wenqi, Qian Jiaming, Gong Lei, Wang Xuan, Wang Chao, Zhou Xuehai. “NAF: Deeper Network/Accelerator Co-Exploration for Customizing CNNs on FPGA”. Proceedings of the 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE), IEEE, 2023. (CCF-B conference; top conference in EDA)
5. Lou Wenqi, Gong Lei, Wang Chao, Du Zidong, Zhou Xuehai. “OctCNN: A High Throughput FPGA Accelerator for CNNs Using Octave Convolution Algorithm”. IEEE Transactions on Computers (TC), 2022, 71(8): 1847-1859. (CCF-A journal)
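6. Lou Wenqi, Wang Chao, Gong Lei, Zhou Xuehai. “一种神经网络指令集扩展与代码映射机制” (in Chinese; title translates roughly as “A Neural Network Instruction Set Extension and Code Mapping Mechanism”). Journal of Software (软件学报), 2020, 31(10): 3074-3086. (CCF Chinese T1 journal)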
7. Tang Cheng, Lou Wenqi, Cheng Qianyu, Tuo Jiayi, Fu Wei, Jiang Tianhao, Wang Chao, Zhou Xuehai. “Spectral Enhanced Tuning: A Plug-and-Play Framework for Dehazing Models with Frequency Decoding and Fusion”. IEEE International Conference on Multimedia & Expo (ICME), Nantes, Jun. 30 to Jul. 4, 2025. (CCF-B conference; corresponding author)
8. Qin Yunji, Lou Wenqi, Wang Chao, Gong Lei, Zhou Xuehai. “Enhancing Long Sequence Input Processing in FPGA-Based Transformer Accelerators through Attention Fusion”. Proceedings of the 2024 ACM Great Lakes Symposium on VLSI, 2024. (CCF-C conference; important conference in VLSI; corresponding author)
9. Dong Jiale, Lou Wenqi, Zheng Zhendong, Qin Yunji, Gong Lei, Wang Chao, Zhou Xuehai. “UbiMoE: A Ubiquitous Mixture-of-Experts Vision Transformer Accelerator With Hybrid Computation Pattern on FPGA”. 2025 IEEE International Symposium on Circuits and Systems (ISCAS), 2025. (Oral; flagship conference in circuits and systems; corresponding author)
10. Wang Xuan, Gong Lei, Cao Jing, Lou Wenqi, Wang Weiya, Wang Chao, Zhou Xuehai. “hAP: A Spatial-von Neumann Heterogeneous Automata Processor with Optimized Resource and IO Overhead on FPGA”. Proceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA), 2023. (CCF-B conference; top conference in the FPGA field)