Publications (Google Scholar, DBLP)


Conference

2024

[C25] Flagger: Cooperative Acceleration for Large-Scale Cross-Silo Federated Learning Aggregation
Xiurui Pan, Yuda An, Shengwen Liang, Bo Mao, Mingzhe Zhang, Qiao Li, Myoungsoo Jung, Jie Zhang. 51st ACM/IEEE Annual International Symposium on Computer Architecture. ISCA 2024

[C24] Alchemist: A Unified Accelerator Architecture for Cross-Scheme Fully Homomorphic Encryption
Jianan Mu, Husheng Han, Shangyi Shi, Jing Ye, Zizhen Liu, Shengwen Liang, Meng Li, Mingzhe Zhang, Song Bian, Xing Hu, Huawei Li, Xiaowei Li. Proceedings of the 61st Annual Design Automation Conference. DAC 2024

2023

[C23] Poseidon: Practical Homomorphic Encryption Accelerator
Yinghao Yang, Huaizhi Zhang, Shengyu Fan, Hang Lu, Mingzhe Zhang, Xiaowei Li. The 29th IEEE International Symposium on High-Performance Computer Architecture. HPCA 2023

[C22] TensorFHE: Achieving Practical Computation on Encrypted Data Using GPGPU
Shengyu Fan, Zhiwei Wang, Weizhi Xu, Rui Hou, Dan Meng, Mingzhe Zhang. The 29th IEEE International Symposium on High-Performance Computer Architecture. HPCA 2023

2022

[C21] Enhancing GPU Performance via Neighboring Directory Table Based Inter-TLB Sharing
Yajuan Du, Mingyang Liu, Yuqi Yang, Mingzhe Zhang and Xulong Tang. The 40th IEEE International Conference on Computer Design. ICCD 2022

2021

[C20] Distilling Bit-level Sparsity Parallelism for General Purpose Deep Learning Acceleration
Hang Lu, Liang Chang, Chenglong Li, Zixuan Zhu, Shengjian Lu, Yanhuan Liu, Mingzhe Zhang. The 54th Annual IEEE/ACM International Symposium on Microarchitecture. MICRO 2021

[C19] BitX: Empower Versatile Inference with Hardware Runtime Pruning
Hongyan Li, Hang Lu, Jiawen Huang, Wenxu Wang, Mingzhe Zhang, Wei Chen, Liang Chang and Xiaowei Li. 50th International Conference on Parallel Processing. ICPP 2021

[C18] CoPIM: A Concurrency-aware PIM Workload Offloading Architecture for Graph Applications
Liang Yan, Mingzhe Zhang, Rujia Wang, Xiaoming Chen, Xingqi Zou, Xiaoyang Lu, Yinhe Han, Xian-He Sun. 2021 IEEE/ACM International Symposium on Low Power Electronics and Design. ISLPED 2021

[C17] Streamline Ring ORAM Accesses through Spatial and Temporal Optimization
Dingyuan Cao, Mingzhe Zhang, Hang Lu, Xiaochun Ye, Dongrui Fan, Yuezhi Che, Rujia Wang. The 27th IEEE International Symposium on High-Performance Computer Architecture. HPCA 2021.

2019

[C16] Self-adaptive Address Mapping Mechanism for Access Pattern Awareness on DRAM
Chundian Li, Mingzhe Zhang, Zhiwei Xu, Xianhe Sun. 17th IEEE International Symposium on Parallel and Distributed Processing with Applications. ISPA 2019.

[C15] When Deep Learning Meets the Edge: Auto-Masking Deep Neural Networks for Efficient Machine Learning on Edge Devices
Ning Lin, Hang Lu, Jingliang Gao, Mingzhe Zhang, Xiaowei Li. 37th IEEE International Conference on Computer Design. ICCD 2019.

[C14] Balancing Performance and Energy Efficiency of ONoC by Using Adaptive Bandwidth
Mingzhe Zhang, Lunkai Zhang, Frederic T. Chong, Zhiyong Liu. 37th IEEE International Conference on Computer Design. ICCD 2019.

[C13] FindeR: Accelerating FM-Index-based Exact Pattern Matching in Genomic Sequences through ReRAM technology
Farzaneh Zokaee, Mingzhe Zhang, Lei Jiang. 28th International Conference on Parallel Architectures and Compilation. PACT 2019.

[C12] C-MAP: Improving the Effectiveness of Mapping Method for CGRA by Reducing NoC Congestion
Shuqian An, Mingzhe Zhang, Xiaochun Ye, Da Wang, Hao Zhang, Dongrui Fan, Zhimin Tang. 21st IEEE International Conference on High Performance Computing and Communications. HPCC 2019.

[C11] Magma: A Monolithic 3D Vertical Heterogeneous ReRAM-based Main Memory Architecture
Farzaneh Zokaee, Mingzhe Zhang, Xiaochun Ye, Dongrui Fan, Lei Jiang. 2019 Proceedings of the 56th Annual Design Automation Conference. DAC 2019.

2018

[C10] Mmalloc: A Dynamic Memory Management on Many-core Coprocessor for the Acceleration of Storage-intensive Bioinformatics Application
Zihao Wang, Mingzhe Zhang, Jingrong Zhang, Rui Yan, Xiaohua Wan, Zhiyong Liu, Fa Zhang, Xuefeng Cui. 2018 IEEE International Conference on Bioinformatics and Biomedicine. BIBM 2018.

2017

[C09] Quick-and-Dirty: Improving Performance of MLC PCM by Using Temporary Short Writes
Mingzhe Zhang, Lunkai Zhang, Lei Jiang, Frederic T Chong, Zhiyong Liu. 35th IEEE International Conference on Computer Design. ICCD 2017.

[C08] Balancing performance and lifetime of MLC PCM by using a region retention monitor
Mingzhe Zhang, Lunkai Zhang, Lei Jiang, Zhiyong Liu, Frederic T Chong. 2017 IEEE International Symposium on High Performance Computer Architecture. HPCA 2017.

2016

[C07] COMRANCE: A rapid method for Network-on-Chip design space exploration
Mingzhe Zhang, Yangguang Shi, Fa Zhang, Zhiyong Liu. 2016 The 7th International Green and Sustainable Computing Conference. IGSC 2016.

2014

[C06] SpongeDirectory: Flexible sparse directories utilizing multi-level memristors
Lunkai Zhang, Dmitri Strukov, Hebatallah Saadeldeen, Dongrui Fan, Mingzhe Zhang, Diana Franklin. 2014 The 23rd International Conference on Parallel Architecture and Compilation Techniques. PACT 2014.

2013

[C05] SimICT: A fast and flexible framework for performance and power evaluation of large-scale architecture
Xiaochun Ye, Dongrui Fan, Ninghui Sun, Shibin Tang, Mingzhe Zhang, Hao Zhang. Proceedings of the 2013 International Symposium on Low Power Electronics and Design. ISLPED 2013.

[C04] Spontaneous reload cache: Mimicking a larger cache with minimal hardware requirement
Lunkai Zhang, Mingzhe Zhang, Lingjun Fan, Da Wang, Paolo Ienne. 2013 IEEE 8th International Conference on Networking, Architecture and Storage. NAS 2013.

[C03] Energy-Performance Modeling and Optimization of Parallel Computing in On-Chip Networks
Shuai Zhang, Zhiyong Liu, Dongrui Fan, Fonglong Song, Mingzhe Zhang. 2013 12th IEEE International Symposium on Parallel and Distributed Processing with Applications. ISPA 2013.

[C02] A Path-Adaptive Opto-electronic Hybrid NoC for Chip Multi-processor
Mingzhe Zhang, Da Wang, Xiaochun Ye, Liqiang He, Dongrui Fan, Zhiyong Liu. 2013 12th IEEE International Symposium on Parallel and Distributed Processing with Applications. ISPA 2013.

2012

[C01] Self-Correction Trace Model: A Full-System Simulator for Optical Network-on-Chip
Mingzhe Zhang, Liqiang He, Dongrui Fan. 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum. IPDPSW 2012.

Journals & Transactions

2022

[J07] VNet: a versatile network to train real-time semantic segmentation models on a single GPU
Wenxing Li, Ning Lin, Mingzhe Zhang, Hang Lu, Xiaoming Chen, Xiaowei Li. Science China Information Sciences, Vol. 64, Issue 3, pp. 1-2.

[J06] Accelerating Graph Processing with Lightweight Learning-Based Data Reordering
Mo Zou, Mingzhe Zhang, Rujia Wang, Xian-He Sun, Xiaochun Ye, Dongrui Fan, Zhimin Tang. IEEE Computer Architecture Letters, Vol. 21, Issue 1, pp. 5-8.

[J05] Application-Oriented Data Migration to Accelerate In-Memory Database on Hybrid Memory
Wenze Zhao, Yajuan Du, Mingzhe Zhang, Mingyang Liu, Kailun Jin, Rachata Ausavarungnirun. Micromachines, Vol. 13, Issue 1, pp. 52-60.

2020

[J04] Architecting Effectual Computation for Machine Learning Accelerators
Hang Lu, Mingzhe Zhang, Yinhe Han, Huawei Li, Li Xiaowei. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), Vol. 39, Issue 10, pp. 2654-2667.

2019

[J03] A Survey on Architecture Research of Non-Volatile Memory based on Dynamical Trade-off (in chinese)
Mingzhe Zhang, Fa Zhang, Zhiyong Liu. Journal of Computer Research and Development, Vol. 56, Issue 4, pp. 677-691.

[J02] Quick-and-Dirty: An Architecture for High-Performance Temporary Short Writes in MLC PCM
Mingzhe Zhang, Lunkai Zhang, Lei Jiang, Frederic T Chong, Zhiyong Liu. IEEE Transactions on Computers (TC), Vol. 68, Issue 9, pp. 1365-1375.

2015

[J01] FreeRider: Non-local adaptive network-on-chip routing with packet-carried propagation of congestion information
Shaoli Liu, Tianshi Chen, Ling Li, Xi Li, Mingzhe Zhang, Chao Wang, Haibo Meng, Xuehai Zhou, Yunji Chen. IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 26, Issue 8, pp. 2272-2285.