Summary ________________________________

I am recruiting self-motivated postdoctoral researchers in the areas of HPC, machine learning, and/or data analytics. Details can be found here.

Software Products Developed ______________________

Key Projects _____________________________

  • DOE ASCR Data Reduction, 2021-2024, co-PI: Automatic Generation of Algorithms for High-Speed Reliable Lossy Compression.

  • NSF CSSI ROCCI, 2021-2024, PI: Elements: ROCCI: Integrated Cyberinfrastructure for In Situ Lossy Compression Optimization Based on Post Hoc Analysis Requirements, $320K.

  • DOE ASCR SDR (Early Career Award), 2021-2026, PI: Scalable Dynamic Scientific Data Reduction: The overarching goal of this ASCR research project is to develop a scalable data reduction (SDR) framework that will be smart enough to dynamically generate the best-qualified data reduction solution that meets user requirements for various scientific applications, $2.5M.

  • NSF CDS&E HyLoC, 2020-2023, PI: Objective-driven Adaptive Hybrid Lossy Compression Framework for Extreme-Scale Scientific Application, $300K.

  • ECP VeloC/SZ, 2020-2023, technical leader, key designer/developer: The VeloC/SZ project focuses on ensuring high reliability for long-running exascale simulations and reducing the data while keeping important scientific outcomes intact.

  • ECP ExaSky, 2016-2023, key developer of compressor: Our work in this project explores a set of highly efficient compression techniques for cosmology simulations.

  • ECP CODAR, 2016-2023, key developer of compression assessment module: Our work in this collaboration project aims at exploring the characteristics of the data regarding data compression as well as lossy compression quality of different compressors for different scientific applications/datasets. For details, please read my papers published in IJHPCA17 and IJHPCA19.

  • DOE Catalog Project, 2015-2019, key developer: In-depth characterization and analysis of the errors, failures, and faults in large-scale (or exascale) supercomputing environments. For details, please read my papers published in CCGrid17, TPDS18, DSN19, etc.

  • ECP VeloC, 2016-2019: Our work in this collaboration project aims at providing a very flexible checkpoint/restart interface for scientific users, by integrating the advantages of two successful multi-level checkpoint libraries, FTI and SCR. Another objective is to enable the checkpoint/restart operations to adapt to different features of I/O environments (such as burst buffer) or file systems.

  • ECP EZ, 2016-2019, technical leader: The EZ project aims to develop an effective, efficient, generic lossy compressor that significantly reduces the volume of scientific data.

  • NSF ALETHEIA, 2016-2021, co-PI: A framework for automatic detection/correction of corruptions in extreme-scale scientific executions. For details, please read my papers published in SC19 and more.

  • PARIS, 2013-2016, key developer: Data-knowledge based Extreme-Scale Resilience: This project, referred to as PARIS, explores fundamental properties of numerical science applications to improve the resilience of extreme-scale executions and to provide efficient solutions to system failures and silent data corruptions (SDCs). For details, please read my papers published in SC14, IPDPS14, IPDPS15, and TPDS16.

  • AMFT Project, 2012-2015

  • Predicting Idleness in Data Centers, key developer, 2012-2013: This project, funded by a Google Research Award, aims to model and predict workload/hostload for Google data centers in order to improve system performance. Please read my papers published in CLUSTER12, SC12, and SC13.

  • Cloud@HOME, 2012-2013: This project is funded by the French national science foundation (ANR) and targets running complex services over unreliable (Internet) resources while maximizing resource utilization and Quality of Service (QoS). My contribution is optimizing and stabilizing a best-suited queuing policy and a virtual resource allocation scheme. The implemented prototype leverages the ParallelColt matrix-computation library and the resource isolation technology of XEN 4.0. Please read my papers in IEEE TC, IEEE IWQoS2013, HiPC13, and CLOUD2013 for details.

  • Desktop Cloud / Self-organizing Cloud, 2010-2011: This project is supported by Hong Kong RGC grant HKU 7179/09E and HKU Basic Research grant (Grant No. 10401460), and also in part by Hong Kong UGC Special Equipment Grant (SEG HKU09). My contribution is developing a set of core optimization algorithms - optimal resource allocation with fully distributed resource discovery protocols. Please read my TPDS2012, JPDC2012, ICPP2011, UCC2011 papers for details.

  • CNGrid, 2007-2011: This is a key national project under the High-Tech R&D Program (China-863 program) in China. I was mainly in charge of the construction and development of HKU-Grid Point, one of the key Grid points along with nine others. The research contribution includes two papers, published in ICPP2010 and the Journal of Huazhong University of Science and Technology 2011, respectively.

  • SemREX, 2006-2008: This project is funded by the China-973 Project of the National Basic Research and Development Plan. My major contribution is co-designing and co-developing the relationship-searching engine and coauthoring a paper that received the best paper award at IEEE ICDS2008.

  • CGSV (ChinaGrid SuperVision, sponsored by HP), 2005-2006, designer and developer: ChinaGrid SuperVision (CGSV) is sponsored by HP Inc. It is a key piece of monitoring software that provides real-time monitoring support for ChinaGrid. My major contribution is taking part in designing its overall architecture, developing the graphical user interface and Archive module independently, and developing the Registry and Windows Sensor cooperatively. Please read my PAKDD2007 and APCC2007 papers for more information.

  • GPE4CGSP (sponsored by Intel), 2006, developer: The goal of this project is to integrate two well-known grid platforms: ChinaGrid/CGSP and UNICORE/GPE. My major contribution is analyzing the code of GPE and developing a middleware to integrate the Information Center of CGSP and that of GPE, supported by a GUI as well. Please read my AINAW2007 paper for more information.

  • CGSP (ChinaGrid Supported Platform), 2005-2006, developer: CGSP, the biggest grid project in China, is sponsored by the Ministry of Education. It is designed and developed by about forty developers from twelve top-ranking universities in China. My major contribution is building a GUI to display its key information, such as jobs, applications, and services, and providing web service interfaces with Geo-Information System (GIS) support and security support. More information can be found in my LNCS CDVE2007 paper.

  • CoGIS, 2004-2005: My major contribution is integrating it with a distributed monitoring system (namely the GlobalWatch system), installing/administering various Grid software packages such as Globus and GridFTP, and improving a Dynamic Replica Transmission platform. Detailed information can be found in my patent No. 200610125570.9.

  • GlobalWatch (a distributed monitoring platform), 2004-2005, designer and developer: GlobalWatch is a distributed monitoring system used to monitor grid platforms. My contribution is developing the server and client software with another developer. We developed the server with Servlet technology and the sensor (Web Service) with WebService Application Server (WAS). Please read my APSCC2007 paper for details.

Technical Background _____________________

  • Programming Languages: Java, C, C++, Fortran, Python, bash, MPI, OpenMP, Prolog, JavaScript, JSP

  • High Layer Technology: GIS (ArcGIS, OpenMap) / GUI

  • Tools: automake, Makefile, ant, XML, XSL, HTML, SQL, DocBook, LaTeX

  • Platform: J2EE

  • OS: Unix and Linux (Kernel and Administration)

  • DBMS: MySQL, Oracle

  • Network Communication: Socket, RMI, Web Service (Axis, Muse, globus), Matlab Web Server, etc.

  • Others: OpenPBS, Design Pattern, UML, Visio, VMWare, XEN

Publications ______________________________

    -----2024-----

  1. Haotian Xu, Zhaorui Zhang, Sheng Di, Benben Liu, Jiannong Cao, "FedFa: A Fully Asynchronous Training Paradigm for Federated Learning", International Joint Conferences on Artificial Intelligence (IJCAI24), 2024.
  2. Grant Wilkins, Sheng Di, Jon Calhoun, Zilinghan Li, Kibaek Kim, Robert Underwood, Richard Mortier and Franck Cappello, "FedSZ: Leveraging Floating-Point Lossy Compression for Federated Learning Communications", 44th IEEE International Conference on Distributed Computing Systems (IEEE ICDCS2024)
  3. Milan Shah, Xiaodong Yu, Sheng Di, Michela Becchi, Franck Cappello, "A Portable, Fast, DCT-based Compressor for AI Accelerators", in International Symposium on High-Performance Parallel and Distributed Computing (ACM HPDC2024), 2024.
  4. Shihui Song, Yafan Huang, Peng Jiang, Xiaodong Yu, Weijian Zheng, Sheng Di, Qinglei Cao, Yunhe Feng, Zhen Xie, Franck Cappello, "CereSZ: Enabling and Scaling Error-bounded Lossy Compression on Cerebras CS-2", in International Symposium on High-Performance Parallel and Distributed Computing (ACM HPDC2024), 2024.
  5. Md Hasanur Rahman, Sheng Di, Guanpeng Li, Franck Cappello, "A Generic and Efficient Framework for Estimating Lossy Compressibility of Scientific Data", in Proceedings of the 35th International Conference on Massive Storage Systems and Technology (IEEE MSST2024), 2024.
  6. Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Zhaorui Zhang, Jinyang Liu, Xiaoyi Lu, Ken Raffenetti, Hui Zhou, Kai Zhao, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur, "Optimizing Collective Communications with Error-bounded Lossy Compression for GPU Clusters", Principles and Practice of Parallel Programming (PPoPP2024), 2024. [poster]
  7. Zizhe Jian, Sheng Di, Jinyang Liu, Kai Zhao, Xin Liang, Haiying Xu, Robert Underwood, Jiajun Huang, Shixun Wu, Zizhong Chen, Franck Cappello, "CliZ: Optimizing Lossy Compression for Climate Datasets with Adaptive Fine-tuned Data Prediction", in Proceedings of the 38th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS2024), 2024.
  8. Di Zhang, Monish Soundar Raj, Bing Xie, Sheng Di, Dong Dai, "Cross-System Analysis of Job Characterization and Scheduling in Large-Scale Computing Clusters", in Proceedings of the 38th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS2024), 2024.
  9. Md Hasanur Rahman, Sheng Di, Shengjian Guo, Xiaoyi Lu, Guanpeng Li, Franck Cappello, "DRUTO: Upper-Bounding Silent Data Corruption Vulnerability in GPU Applications", in Proceedings of the 38th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS2024), 2024.
  10. Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Zhaorui Zhang, Jinyang Liu, Xiaoyi Lu, Ken Raffenetti, Hui Zhou, Kai Zhao, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur, "An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression", in Proceedings of the 38th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS2024)
  11. Mingze Xia, Sheng Di, Franck Cappello, Pu Jiao, Kai Zhao, Jinyang Liu, Xuan Wu, Xin Liang, and Hanqi Guo, "Preserving Topological Feature with Sign-of-Determinant Predicates in Lossy Compression: A Case Study of Vector Field Critical Points", Proceedings of the 40th IEEE International Conference on Data Engineering (IEEE ICDE2024), Utrecht, Netherlands, May 13 - 16, 2024.
  12. Jinyang Liu, Sheng Di, Kai Zhao, Xin Liang, Sian Jin, Zizhe Jian, Jiajun Huang, Shixun Wu, Zizhong Chen, Franck Cappello, "High-performance Effective Scientific Error-bounded Lossy Compression with Auto-tuned Multi-component Interpolation", in ACM Special Interest Group on Management of Data (SIGMOD2024), 2024.
  13. Sian Jin, Sheng Di, Frederic Vivien, Daoce Wang, Yves Robert, Dingwen Tao, Franck Cappello, "Concealing Compression-accelerated I/O for HPC Applications through In Situ Task Scheduling", in EuroSys2024, 2024.

    -----2023-----

  14. Arham Khan, Sheng Di, Kai Zhao, Jinyang Liu, Kyle Chard, Ian Foster, Franck Cappello, "SECRE: Surrogate-based Error-controlled Lossy Compression Ratio Estimation Framework", in 30th edition of the IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC2023), 2023.
  15. Pu Jiao, Sheng Di, Jinyang Liu, Xin Liang, Franck Cappello, "Characterization and Detection of Artifacts for Error-controlled Lossy Compressors", in 30th edition of the IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC2023), 2023.
  16. Robert Underwood, Chun Hong Yoon, Ali M. Gok, Sheng Di, Franck Cappello, "ROIBIN-SZ: Fast and Science-Preserving Compression for Serial Crystallography", Journal of Synchrotron Radiation News (SRN), 2023.
  17. Robert Underwood, Julie Bessac, David Krasowska, Jon C Calhoun, Sheng Di, Franck Cappello, "Black-box statistical prediction of lossy compression ratios for scientific data", International Journal of High Performance Computing Applications (IJHPCA2023), 2023, Volume 37, Issue 3-4.
  18. Arkaprabha Ganguli, Robert Underwood, Julie Bessac, David Krasowska, Jon Calhoun, Sheng Di, Franck Cappello, "A Lightweight, Effective Compressibility Estimation Method for Error-bounded Lossy Compression", in IEEE International Conference on Cluster Computing (IEEE CLUSTER2023), 2023.
  19. Arham Khan, Sheng Di, Kai Zhao, Jinyang Liu, Kyle Chard, Ian Foster, Franck Cappello, "An Efficient and Accurate Compression Ratio Estimation Model for SZx", in IEEE International Conference on Cluster Computing (IEEE CLUSTER2023), 2023. [poster]
  20. Yafan Huang, Sheng Di, Xiaodong Yu, Guanpeng Li, Franck Cappello, "cuSZp: An Ultra-fast GPU Error-bounded Lossy Compression Framework with Optimized End-to-End Performance", in IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2023), 2023.
  21. Daoce Wang, Jesus Pulido, Jiannan Tian, Sian Jin, Houjun Tang, Jean Sexton, Sheng Di, Kai Zhao, Bo Fang, Zarija Lukic, Franck Cappello, James Ahrens, Dingwen Tao, "AMRIC: A Novel In Situ Lossy Compression Framework for Efficient I/O in Adaptive Mesh Refinement Applications", IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2023), 2023.
  22. Jinyang Liu, Sheng Di, Sian Jin, Kai Zhao, Xin Liang, Zizhong Chen, Franck Cappello, "Scientific Error-bounded Lossy Compression with Super-resolution Neural Networks", in conjunction with IEEE International Conference on Big Data (IEEE BigData23), 2023.
  23. Jiajun Huang, Jinyang Liu, Sheng Di, Yujia Zhai, Shixun Wu, Kai Zhao, Zizhong Chen, Yanfei Guo, Franck Cappello, "Exploring Wavelet Transform Usages for Error-bounded Scientific Data Compression", International Workshop on Big Data Reduction (IEEE IWBDR23) in conjunction with IEEE International Conference on Big Data (IEEE BigData23), 2023.
  24. Jiajun Huang, Sheng Di, Xiaodong Yu, Zizhong Chen, Franck Cappello, Yanfei Guo, and Rajeev Thakur, "Accelerating Collective Communications with Lossy Compression on GPU", in IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2023), 2023. [poster] (1st place of ACM SRC award -- Graduates)
  25. Avinash Kethineedi, Jon C. Calhoun, Robert Underwood, Sheng Di, Franck Cappello, "ROI Preservation in Streaming Lossy Compression", IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2023), 2023. [poster]
  26. Robert R. Underwood, Sheng Di, Sian Jin, Md Hasanur Rahman, Arham Khan, Franck Cappello, "LibPressio-Predict: Flexible and Fast Infrastructure for Inferring Compression Performance", in Proceedings of the 9th International Workshop on Data Reduction for Big Scientific Data (DRBSD-9), in conjunction with IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2023), 2023.
  27. Boyuan Zhang, Sheng Di, Xiaodong Yu, Martin Swany, Dingwen Tao, Franck Cappello, "GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs", in International Conference on Supercomputing (ACM ICS2023), 2023.
  28. Milan Shah, Xiaodong Yu, Sheng Di, Michela Becchi, and Franck Cappello, "Lightweight Huffman Coding for Efficient GPU Compression", in International Conference on Supercomputing (ACM ICS2023), 2023.
  29. Jinyang Liu, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen, Franck Cappello, "FAZ: A flexible auto-tuned modular error-bounded compression framework for scientific data", in International Conference on Supercomputing (ACM ICS2023), 2023.(best paper nominated)
  30. Yuanjian Liu, Sheng Di, Kyle Chard, Ian Foster, Franck Cappello, "Optimizing Scientific Data Transfer on Globus with Error-bounded Lossy Compression", in 43rd IEEE International Conference on Distributed Computing Systems (IEEE ICDCS2023), 2023.
  31. Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Yunhe Feng, Xin Liang, Dingwen Tao, Franck Cappello, "FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Computing Applications on GPUs", in 32nd International Symposium on High-Performance Parallel and Distributed Computing (ACM HPDC2023), 2023.
  32. Khalid Ayedh Alharthi, Arshad Jhumka, Sheng Di, Lin Gui, Franck Cappello, Simon McIntosh-Smith, "Time Machine: Generative Real-Time Model For Failure (and Lead Time) Prediction in HPC Systems", in IEEE International Conference on Dependable Systems and Networks (IEEE DSN2023), 2023.
  33. Yafan Huang, Kai Zhao, Sheng Di, Guanpeng Li, Maxim Dmitriev, Thierry-Laurent D. Tonellot and Franck Cappello, "Towards Improving Reverse Time Migration Performance by High-speed Lossy Compression," in 23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (ACM CCGrid2023), 2023.
  34. Milan Shah, Xiaodong Yu, Sheng Di, Danylo Lykov, Yuri Alexeev, Michela Becchi, Franck Cappello, "GPU-Accelerated Error-Bounded Compression Framework for Quantum Circuit Simulations", in Proceedings of the 37th IEEE International Parallel and Distributed Processing Symposium (IPDPS2023), St. Petersburg, Florida, USA, May 15-19, 2023.
  35. Pu Jiao, Sheng Di, Hanqi Guo, Kai Zhao, Jiannan Tian, Dingwen Tao, Xin Liang, Franck Cappello, "Toward Quantity-of-Interest Preserving Lossy Compression for Scientific Data", in 49th International Conference on Very Large Data Bases (VLDB 2023), 2023, Canada.
  36. Md Hasanur Rahman, Sheng Di, Kai Zhao, Robert Underwood, Guanpeng Li, Franck Cappello, "A Feature-Driven Fixed-Ratio Lossy Compression Framework for Real-World Scientific Datasets", in Proceedings of the 39th IEEE International Conference on Data Engineering (ICDE2023), 2023.

    -----2022-----

  37. Xin Liang, Sheng Di, Franck Cappello, Mukund Raj, Chunhui Liu, Kenji Ono, Zizhong Chen, Tom Peterka, and Hanqi Guo, "Toward Feature-Preserving Vector Field Compression", IEEE Transactions on Visualization and Computer Graphics (IEEE TVCG), 2022.
  38. Zhaoyuan Su, Sheng Di, Ali Murat Gok, Yue Cheng, Franck Cappello, "Understanding Impact of Lossy Compression on Derivative-related Metrics in Scientific Datasets", in Proceedings of the 8th International Workshop on Data Reduction for Big Scientific Data (DRBSD-8), in conjunction with IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2022), 2022.
  39. Robert Underwood, Julie Bessac, Sheng Di, Franck Cappello, "Understanding the Effects of Modern Compressors on the Community Earth Science Model", in Proceedings of the 8th International Workshop on Data Reduction for Big Scientific Data (DRBSD-8), in conjunction with IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2022), 2022. (best paper award)
  40. Maxim Dmitriev, Thierry Tonellot, Hussain Salim, Sheng Di, "Error-bounded lossy compression in Reverse Time Migration", Sixth EAGE High Performance Computing Workshop (EAGE22), 2022.
  41. Griffin Dube, Jiannan Tian, Sheng Di, Dingwen Tao, Jon C. Calhoun, Franck Cappello, "Efficient Error-Bounded Lossy Compression for CPU Architectures", 30th International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (IEEE MASCOTS 2022), Nice, France, 2022.
  42. Xin Liang, Kai Zhao, Sheng Di, Sihuan Li, Robert Underwood, Ali M. Gok, Jiannan Tian, Junjing Deng, Jon C. Calhoun, Dingwen Tao, Zizhong Chen, Franck Cappello, "SZ3: A Modular Framework for Composing Prediction-Based Error-Bounded Lossy Compressors", IEEE Transactions on Big Data (IEEE TBD), 2022.
  43. Yuanjian Liu, Sheng Di, Kai Zhao, Sian Jin, Cheng Wang, Kyle Chard, Dingwen Tao, Ian Foster, Franck Cappello, "Optimizing Error-Bounded Lossy Compression for Scientific Data with Diverse Constraints", in IEEE Transactions on Parallel and Distributed Systems (TPDS), 2022.
  44. Ruiwen Shan, Sheng Di, Jon C. Calhoun, Franck Cappello, "Exploring Light-weight Cryptography for Efficient and Secure Lossy Data Compression", in IEEE CLUSTER2022, 2022.
  45. Yafan Huang, Shengjian Guo, Sheng Di, Guanpeng Li, Franck Cappello, "Mitigating Silent Data Corruptions in HPC Applications across Multiple Program Inputs", IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2022), 2022. (best paper finalist)
  46. Sian Jin, Dingwen Tao, Houjun Tang, Sheng Di, Suren Byna, Zarija Lukic, Franck Cappello, "Accelerating Parallel Write via Deeply Integrating Predictive Lossy Compression with HDF5", IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2022), 2022.
  47. Jinyang Liu, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen, Franck Cappello, "Dynamic Quality Metric Oriented Error Bounded Lossy Compression for Scientific Datasets", IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2022), 2022.
  48. Milan Shah, Xiaodong Yu, Sheng Di, Franck Cappello, Michela Becchi, "Compressing Quantum Circuit Simulation Tensor Data", IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2022), 2022. [poster] (2nd place of ACM SRC award -- Graduates)
  49. David Krasowska, Robert Underwood, Julie Bessac, Jon Calhoun, Sheng Di, Franck Cappello, "Statistical Prediction of Lossy Compression Ratios for 3D Scientific Data", IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2022), 2022. [poster] (1st place of ACM SRC award -- Undergraduates)
  50. Jiannan Tian, Dingwen Tao, Sheng Di, Franck Cappello, "Spline-interpolation based Error-bounded Lossy Compression for Scientific Data on GPUs", IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2022), 2022. [poster]
  51. Ian Foster, Mark Ainsworth, Julie Bessac, Franck Cappello, Jong Choi, Sheng Di, et al., "Online data analysis and reduction: An important Co-design motif for extreme-scale computers", in The International Journal of High Performance Computing Applications (IJHPCA), 2022.
  52. Khalid Ayedh Alharthi, Arshad Jhumka, Sheng Di, Franck Cappello, "Clairvoyant: A Log-Based Transformer-Decoder for Failure Prediction in Large-Scale Systems", International Conference on Supercomputing (ACM ICS2022), 2022.
  53. Xiaodong Yu, Sheng Di, Kai Zhao, Jiannan Tian, Dingwen Tao, Xin Liang, Franck Cappello, "Ultra-fast Error-bounded Lossy Compression for Scientific Dataset", 31st International Symposium on High-Performance Parallel and Distributed Computing (ACM HPDC2022), 2022.
  54. Kai Zhao, Sheng Di, Danny Perez, Xin Liang, Zizhong Chen, Franck Cappello, "MDZ: An Efficient Error-bounded Lossy Compressor for Molecular Dynamics", in Proceedings of the 38th IEEE International Conference on Data Engineering (ICDE2022), Virtual Event, May 9-12, 2022.
  55. Sian Jin, Sheng Di, Jiannan Tian, Suren Byna, Dingwen Tao, and Franck Cappello, "Significantly Improving Prediction-Based Lossy Compression Via Ratio-Quality Modeling", in Proceedings of the 38th IEEE International Conference on Data Engineering (ICDE2022), Virtual Event, May 9-12, 2022.
  56. Cody Rivera, Sheng Di, Xiaodong Yu, Jiannan Tian, Dingwen Tao, and Franck Cappello, "Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs", in Proceedings of the 36th IEEE International Parallel and Distributed Processing Symposium (IPDPS2022), Lyon, France, May 30-June 3, 2022.
  57. Yafan Huang, Shengjian Guo, Sheng Di, Guanpeng Li, Franck Cappello, "Hardening Selective Protection across Multiple Program Inputs for HPC Applications", Principles and Practice of Parallel Programming (PPoPP2022), 2022. [poster]
  58. Franck Cappello, Sheng Di, and Robert Underwood, "Improving lossy compression for climate datasets with SZ3", EGU General Assembly 2022, Vienna, Austria, 23-27 May 2022, EGU22-9741, https://doi.org/10.5194/egusphere-egu22-9741, 2022.
  59. Julie Bessac, David Krasowska, Robert Underwood, Sheng Di, Jon C. Calhoun, and Franck Cappello, "Exploring Lossy Compressibility through Statistical Correlations of Geophysical Datasets", EGU General Assembly 2022, Vienna, Austria, 23-27 May 2022, EGU22-9948, https://doi.org/10.5194/egusphere-egu22-9948, 2022.
  60. Robert Underwood, Sheng Di, and Franck Cappello, "Understanding the effects of Modern Lossless and Lossy Compressors on the Community Earth Science Model", EGU General Assembly 2022, Vienna, Austria, 23-27 May 2022, EGU22-10774, https://doi.org/10.5194/egusphere-egu22-10774, 2022.
  61. Xavier Yepes-Arbos, Sheng Di, Kim Serradell, Franck Cappello, and Mario C. Acosta, "Exploring the SZ lossy compressor use for the XIOS I/O server", EGU General Assembly 2022, Vienna, Austria, 23-27 May 2022, EGU22-9153, https://doi.org/10.5194/egusphere-egu22-9153, 2022.
  62. Robert Underwood, Jon C. Calhoun, Sheng Di, Amy Apon, Franck Cappello, "OptZConfig: Efficient Parallel Optimization of Lossy Compression Configuration", in IEEE Transactions on Parallel and Distributed Systems (TPDS), 2022.

    -----2021-----

  63. Jinyang Liu, Sihuan Li, Sheng Di, Xin Liang, Kai Zhao, Dingwen Tao, Zizhong Chen, and Franck Cappello, "Improving Lossy Compression for SZ by Exploring the Best-Fit Lossless Compression Techniques", International Workshop on Big Data Reduction (IEEE IWBDR21) in conjunction with IEEE International Conference on Big Data (IEEE BigData21), 2021.
  64. Yuanjian Liu, Sheng Di, Kai Zhao, Sian Jin, Cheng Wang, Kyle Chard, Dingwen Tao, Ian Foster, Franck Cappello, "Understanding Effectiveness of Multi-error-bounded Lossy Compression for Preserving Ranges of Interest in Scientific Analysis", in Proceedings of the 7th International Workshop on Data Reduction for Big Scientific Data (DRBSD-7), in conjunction with IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2021), 2021.
  65. David Krasowska, Julie Bessac, Robert Underwood, Jon C. Calhoun, Sheng Di, Franck Cappello, "Exploring Lossy Compressibility through Statistical Correlations of Scientific Datasets", in Proceedings of the 7th International Workshop on Data Reduction for Big Scientific Data (DRBSD-7), in conjunction with IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2021), 2021.
  66. Robert Underwood, Victoriana Malvoso, Jon C. Calhoun, Sheng Di, Franck Cappello, "Productive and Performant Generic Lossy Data Compression with LibPressio", in Proceedings of the 7th International Workshop on Data Reduction for Big Scientific Data (DRBSD-7), in conjunction with IEEE/ACM The International Conference for High Performance computing, Networking, Storage and Analysis (IEEE/ACM SC2021), 2021.
  67. Yuanjian Liu, Sheng Di, Kai Zhao, Sian Jin, Cheng Wang, Kyle Chard, Dingwen Tao, Ian Foster, Franck Cappello, "Optimizing Multi-Range based Error-Bounded Lossy Compression for Scientific Datasets", in 28th IEEE International Conference on High Performance Computing, Data and Analytics (HiPC2021), India, 2021 (short paper).
  68. Ruiwen Shan, Sheng Di, Jon C. Calhoun, Franck Cappello, "Towards Combining Error-bounded Lossy Compression and Cryptography for Scientific Data", in IEEE High Performance Extreme Computing (IEEE HPEC2021), 2021.
  69. Xiaodong Yu, Sheng Di, Ali Murat Gok, Dingwen Tao, Franck Cappello, "cuZ-Checker: A GPU-Based Ultra-Fast Assessment System for Lossy Compressions", in IEEE CLUSTER2021, 2021.
  70. Jinyang Liu, Sheng Di, Kai Zhao, Sian Jin, Dingwen Tao, Xin Liang, Zizhong Chen, Franck Cappello, "Exploring Autoencoder-Based Error-Bounded Compression for Scientific Data", in IEEE CLUSTER2021, 2021.
  71. Jiannan Tian, Sheng Di, Xiaodong Yu, Cody Rivera, Kai Zhao, Sian Jin, Yunhe Feng, Xin Liang, Dingwen Tao, Franck Cappello, "Optimizing Error-Bounded Lossy Compression for Scientific Data on GPUs", in IEEE Cluster2021, 2021.
  72. Hongyuan Liu, Bogdan Nicolae, Sheng Di, Franck Cappello, Adwait Jog, "Accelerating DNN Architecture Search at Scale using Selective Weight Transfer", in IEEE CLUSTER2021, 2021.
  73. Sihuan Li, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen, and Franck Cappello, "Resilient Error-bounded Lossy compressor for Data Transfer", in the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2021), St. Louis, Missouri, USA, Nov 14 - 19, 2021.
  74. Ian Foster, Mark Ainsworth, Julie Bessac, Franck Cappello, Jong Choi, Sheng Di, et al. "Online data analysis and reduction: An important Co-design motif for extreme-scale computers", in The International Journal of High Performance Computing Applications (IJHPCA), 2021.
  75. Kai Zhao, Sheng Di, Maxim Dmitriev, Thierry-Laurent D. Tonellot, Zizhong Chen, and Franck Cappello, "Optimizing Error-Bounded Lossy Compression for Scientific Data by Dynamic Spline Interpolation", Proceedings of the 37th IEEE International Conference on Data Engineering (ICDE2021), Chania, Crete, Greece, Apr 19 - 22, 2021.
  76. Khalid Ayedh Alharthi, Arshad Jhumka, Sheng Di, Franck Cappello, Edward Chuah, "Sentiment Analysis based Error Detection for Large-Scale Systems", IEEE/IFIP 51st International Conference on Dependable Systems and Networks (IEEE DSN2021), 2021.
  77. Jiannan Tian, Cody Rivera, Sheng Di, Jieyang Chen, Xin Liang, Dingwen Tao, and Franck Cappello, "Revisiting Huffman Coding: Toward Extreme Performance on Modern GPU Architectures", Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS2021), Portland, Oregon, May 17-21, 2021.

    -----2020-----

  79. Kai Zhao, Sheng Di, Xin Liang, Sihuan Li, Dingwen Tao, Julie Bessac, Zizhong Chen, Franck Cappello, "SDRBench: Scientific Data Reduction Benchmark for Lossy Compressors", International Workshop on Big Data Reduction (IEEE IWBDR20) in conjunction with IEEE International Conference on Big Data (IEEE BigData20), 2020.

  80. Kai Zhao, Sheng Di, Sihuan Li, Xin Liang, Yujia Zhai, Jieyang Chen, Kaiming Ouyang, Franck Cappello and Zizhong Chen, "FT-CNN: Algorithm-Based Fault Tolerance for Convolutional Neural Networks", IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS) Special Section on Parallel and Distributed Computing Techniques for AI, ML and DL (TPDS-SS-AI 2020), 2020.

  81. Jiannan Tian, Sheng Di, Kai Zhao, Cody Rivera, Megan Hickman, Robert Underwood, Sian Jin, Xin Liang, Jon Calhoun, Dingwen Tao, and Franck Cappello, "cuSZ: An Efficient GPU Based Error-Bounded Lossy Compression Framework for Scientific Data", Proceedings of the 29th International Conference on Parallel Architectures and Compilation Techniques (PACT'20), Atlanta, GA, USA, October 3 - 7, 2020.

  82. Sihuan Li, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen and Franck Cappello, "Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression", in IEEE CLUSTER2020, 2020.

  83. Franck Cappello, Sheng Di, Ali M. Gok, "Fulfilling the Promises of Lossy Compression for Scientific Applications", Smoky Mountain Computational Science and Engineering Conference (SMC2020), USA, Aug. 25-27, 2020.

  84. Hao Fan, Song Wu, Xinyu Zhao, Zhenjiang Xie, Sheng Di, Jiang Xiao, Chen Yu, Hai Jin, "Accelerating Parallel Applications in Cloud Platforms via Adaptive Time-Slice Control", IEEE Transactions on Computers (IEEE TC), 2020.

  85. Kai Zhao, Sheng Di, Xin Liang, Sihuan Li, Dingwen Tao, Zizhong Chen, Franck Cappello, "Significantly Improving Lossy Compression for HPC Datasets with Second-Order Prediction and Parameter Optimization", 29th International Symposium on High-Performance Parallel and Distributed Computing (ACM HPDC20), 2020.

  86. Xiangyu Zou, Tao Lu, Wen Xia, Xuan Wang, Weizhe Zhang, Haijun Zhang, Sheng Di, Dingwen Tao, and Franck Cappello, "Performance Optimization for Relative-Error-Bounded Lossy Compression on Scientific Data", IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2020.

  87. Xin Liang, Hanqi Guo, Sheng Di, Franck Cappello, Mukund Raj, Chunhui Liu, Kenji Ono, Zizhong Chen and Tom Peterka, "Towards Feature Preserving 2D and 3D Vector Field Compression", in the 13th IEEE Pacific Visualization Symposium (IEEE PacificVis2020), Tianjin, China, Apr 14-17, 2020.

  88. Jiannan Tian, Sheng Di, Chengming Zhang, Xin Liang, Sian Jin, Dazhao Cheng, Dingwen Tao, and Franck Cappello, "waveSZ: A Hardware-Algorithm Co-Design of Efficient Lossy Compression for Scientific Data", Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (ACM PPoPP2020), San Diego, California, USA, February 22-26, 2020.

  89. Robert Underwood, Sheng Di, Jon Calhoun, Franck Cappello, "FRaZ: A Generic High-Fidelity Fixed-Ratio Lossy Compression Framework for Scientific Floating-point Data", in Proceedings of the 34th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS2020), New Orleans, LA, May 18-22, 2020.


    -----2019-----
  91. Tasmia Reza, Kristopher Keipert, Sheng Di, Xin Liang, Jon C. Calhoun, Franck Cappello, "Analyzing the Performance and Accuracy of Lossy Checkpointing on Sub-iteration of NWChem", in Proceedings of the 5th International Workshop on Data Reduction for Big Scientific Data (DRBSD-5), in conjunction with IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2019).

  92. Franck Cappello, Sheng Di, Sihuan Li, Xin Liang, Ali M. Gok, Dingwen Tao, Chun Hong Yoon, Xin-Chuan Wu, Yuri Alexeev, Frederic T. Chong, "Use cases of lossy compression for floating-point data in scientific datasets", in The International Journal of High Performance Computing Applications (IJHPCA), 2019.

  93. Xin Liang, Sheng Di, Dingwen Tao, Sihuan Li, Bogdan Nicolae, Zizhong Chen, Franck Cappello, "Improving Performance of Data Dumping with Lossy Compression for Scientific Simulation", in IEEE CLUSTER2019, 2019.

  94. Xin-Chuan Wu, Sheng Di, Emma Maitreyee Dasgupta, Franck Cappello, Yuri Alexeev, Hal Finkel, Frederic T. Chong, "Full State Quantum Circuit Simulation by Using Data Compression", in IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2019), 2019.

  95. Xin Liang, Sheng Di, Sihuan Li, Dingwen Tao, Bogdan Nicolae, Zizhong Chen, Franck Cappello, "Significantly Improving Lossy Compression Quality based on An Optimized Hybrid Prediction Model", in IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2019), 2019.

  96. Sihuan Li, Hongbo Li, Xin Liang, Jieyang Chen, Elizabeth Giem, Kaiming Ouyang, Kai Zhao, Sheng Di, Franck Cappello, and Zizhong Chen, "FT-iSort: Efficient Fault Tolerance for Introsort", in IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2019), 2019.

  97. Sian Jin, Sheng Di, Xin Liang, Jiannan Tian, Dingwen Tao, Franck Cappello, "DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression", Proceedings of the 28th ACM International Symposium on High-Performance Parallel and Distributed Computing (ACM HPDC19), Phoenix, AZ, USA, June 24 - 28, 2019.

  98. Sheng Di, Hanqi Guo, Eric Pershey, Marc Snir, Franck Cappello, "Characterizing and Understanding HPC Job Failures over The 2K-day Life of IBM BlueGene/Q System", IEEE/IFIP 49th International Conference on Dependable Systems and Networks (IEEE DSN19), Portland, USA, 2019.

  99. XiangYu Zou, Tao Lu, Sheng Di, Dingwen Tao, Wen Xia, Xuan Wang, Weizhe Zhang, Qing Liao, "Accelerating Lossy Compression on HPC datasets via Partitioning Computation for Parallel Processing", in The 21st IEEE International Conference on High Performance Computing and Communications (IEEE HPCC19), 2019.

  100. XiangYu Zou, Tao Lu, Wen Xia, Xuan Wang, Weizhe Zhang, Sheng Di, Dingwen Tao, Franck Cappello, "Accelerating Relative-error Bounded Lossy Compression for HPC datasets with Precomputation-Based Mechanisms", in Proceedings of the 35th International Conference on Massive Storage Systems and Technology (IEEE MSST19), 2019.


    -----2018-----
  101. Dingwen Tao, Sheng Di, Xin Liang, Zizhong Chen, Franck Cappello, "Optimizing Lossy Compression Rate-Distortion from Automatic Online Selection between SZ and ZFP", in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2019.

  102. Xin Liang, Sheng Di, Dingwen Tao, Sihuan Li, Shaomeng Li, Hanqi Guo, Zizhong Chen, Franck Cappello, "Error-Controlled Lossy Compression Optimized for High Compression Ratios of Scientific Datasets", in IEEE Bigdata2018, 2018.

  103. Sihuan Li, Sheng Di, Xin Liang, Zizhong Chen, Franck Cappello, "Optimizing Lossy Compression with Adjacent Snapshots for N-body Simulation", in IEEE Bigdata2018, 2018.

  104. Xin Liang, Sheng Di, Dingwen Tao, Sihuan Li, Zizhong Chen, Franck Cappello, "Improving In-situ Lossy Compression with Spatio-Temporal Decimation based on SZ Model", in Proceedings of the 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4), in conjunction with IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2018).

  105. Xin-Chuan Wu, Sheng Di, Franck Cappello, Hal Finkel, Yuri Alexeev, Frederic T. Chong, "Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression", The 3rd International Workshop on Post-Moore Era Supercomputing (PME) in conjunction with IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2018).

  106. Xin-Chuan Wu, Sheng Di, Franck Cappello, Hal Finkel, Yuri Alexeev, Frederic T. Chong, "Amplitude-Aware Lossy Compression for Quantum Circuit Simulation", in Proceedings of the 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4), in conjunction with IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2018).

  107. Sihuan Li, Sheng Di, Xin Liang, Zizhong Chen, Franck Cappello, "Improving Error-bounded Compression for Cosmological Simulation", in IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2018). [poster]

  108. Xin-Chuan Wu, Sheng Di, Franck Cappello, Hal Finkel, Yuri Alexeev, Frederic T. Chong, "Full State Quantum Circuits Simulation by Using Data Compression", in IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2018). [poster]

  109. Wenbin He, Hanqi Guo, Tom Peterka, Sheng Di, Franck Cappello, Han-Wei Shen, "Parallel Partial Reduction for Large-Scale Data Analysis and Visualization," in The 8th IEEE Symposium on Large Data Analysis and Visualization (IEEE LDAV) in conjunction with IEEE VIS 2018, Berlin, Germany, October 21, 2018. (best paper nominated)

  110. Sheng Di, Hanqi Guo, Rinku Gupta, Eric Pershey, Marc Snir, Franck Cappello, "Exploring Properties and Correlations of Fatal Events in a Large-Scale HPC System," in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2018.

  111. Dingwen Tao, Sheng Di, Xin Liang, Zizhong Chen, Franck Cappello, "Fixed-PSNR Lossy Compression for Scientific Data", in IEEE CLUSTER 2018.

  112. Xin Liang, Sheng Di, Dingwen Tao, Zizhong Chen, Franck Cappello, "Efficient Transformation Scheme for Lossy Data Compression with Point-wise Relative Error Bound", in IEEE CLUSTER 2018. (best paper)

  113. Ali Murat Gok, Sheng Di, Yuri Alexeev, Dingwen Tao, Vladimir Mironov, Franck Cappello, "PaSTRI: Error-bounded Lossy Compression for Two-Electron Integrals in Quantum Chemistry", in IEEE CLUSTER 2018, 2018. (best paper)

  114. Sheng Di, Dingwen Tao, Xin Liang, Franck Cappello, "Efficient Lossy Compression for Scientific Data based on Pointwise Relative Error Bound", in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2018.

  115. Hanqi Guo, Sheng Di, Rinku Gupta, Tom Peterka, Franck Cappello, "La VALSE: Scalable Visual Analysis of Logs for Fault Characterization on Supercomputers", in EG Symposium on Parallel Graphics and Visualization (EGPGV2018), 2018.

  116. Dingwen Tao, Sheng Di, Xin Liang, Zizhong Chen and Franck Cappello, "Improving performance of iterative methods by lossy checkpointing", in 27th ACM Symposium on High-Performance Parallel and Distributed Computing (ACM HPDC2018), 2018.

  117. Jong Youl Choi, Choong-Seock Chang, Julien Dominski, Scott Klasky, Gabriele Merlo, Eric Suchyta, Mark Ainsworth, Bryce Allen, Franck Cappello, Michael Churchill, Philip Davis, Sheng Di, Greg Eisenhauer, Stephane Ethier, Ian Foster, Berk Geveci, Hanqi Guo, Kevin Huck, Frank Jenko, Mark Kim, James Kress, Seung-Hoe Ku, Qing Liu, Jeremy Logan, Allen Malony, Kshitij Mehta, Kenneth Moreland, Todd Munson, Manish Parashar, Tom Peterka, Norbert Podhorszki, Dave Pugmire, Ozan Tugluk, Ruonan Wang, Ben Whitney, Matthew Wolf, and Chad Wood, "Coupling Exascale Multiphysics Applications: Methods and Lessons Learned", in Proceedings of IEEE International Conference on eScience 2018, Amsterdam, Netherlands, October 29 - November 1, 2018.

  118. Xinhou Wang, Kezhi Wang, Song Wu, Sheng Di, Hai Jin, Kun Yang, Shumao Ou, "Dynamic Resource Scheduling in Mobile Edge Cloud with Cloud Radio Access Network", in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2018.


    -----2017-----
  120. Dingwen Tao, Sheng Di, Zizhong Chen, and Franck Cappello, "In-Depth Exploration of Single-Snapshot Lossy Compression Techniques for N-Body Simulations", Proceedings of the 2017 IEEE International Conference on Big Data (BigData2017), Boston, MA, USA, December 11 - 14, 2017.

  121. Omer Subasi, Sheng Di, Leonardo Bautista-Gomez, Prasanna Balaprakash, Osman Unsal, Jesus Labarta, Adrian Cristal, Sriram Krishnamoorthy, Franck Cappello, "Exploring The Capabilities of Support Vector Machines in Detecting Silent Data Corruptions", in Journal of Sustainable Computing, Informatics and Systems (SUSCOM), 2017.

  122. Omer Subasi, Sheng Di, Leonardo Bautista-Gomez, Prasanna Balaprakash, Osman Unsal, Jesus Labarta, Adrian Cristal, Franck Cappello, "MACORD: Online Adaptive Learning Framework for Silent Error Detection", in International Workshop of Fault Tolerant Systems (FTS17), in conjunction with the IEEE International Conference on Cluster Computing (Cluster 2017), 2017.

  123. Sheng Di, Dingwen Tao, Franck Cappello, "An Efficient Approach to Lossy Compression with Point-wise Relative Error Bound", Proceedings of the 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2) in conjunction with IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2017), 2017.

  124. Ali Murat Gok, Dingwen Tao, Sheng Di, Vladimir Mironov, Yuri Alexeev, Franck Cappello, "PaSTRI: A Novel Data Compression Algorithm for Two-Electron Integrals in Quantum Chemistry", in IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE/ACM SC2017). [poster]

  125. Dingwen Tao, Sheng Di, Hanqi Guo, Zizhong Chen, and Franck Cappello, "Z-checker: A Framework for Assessing Lossy Compression of Scientific Data", in The International Journal of High Performance Computing Applications (IJHPCA), 2017.

  126. Sheng Di, Franck Cappello, "Optimization of Error-Bounded Lossy Compression for Hard-to-Compress HPC Data," in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2017.

  127. Eduardo Berrocal, Leonardo Bautista-Gomez, Sheng Di, Zhiling Lan, and Franck Cappello, "Toward General Software Level Silent Data Corruption Detection for Parallel Applications," in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2017.

  128. Franck Cappello, Rinku Gupta, Sheng Di, Emil Constantinescu, Thomas Peterka, and Stefan M. Wild, "Understanding and improving the trust in results of numerical simulations and scientific data analytics", in 10th Workshop on Resilience in High Performance Computing (Resilience) in Clusters, Clouds and Grids, in conjunction with 23rd International European Conference on Parallel and Distributed Computing (Euro-Par), 2017.

  129. Ian T. Foster, Mark Ainsworth, Bryce Allen, Julie Bessac, Franck Cappello, Jong Youl Choi, Emil M. Constantinescu, Philip E. Davis, Sheng Di, et al., "Computing Just What You Need: Online Data Analysis and Reduction at Extreme Scales", in 23rd International European Conference on Parallel and Distributed Computing (Euro-Par 2017), 2017. pp. 3-19.

  130. Dingwen Tao, Sheng Di, Zizhong Chen, and Franck Cappello, "Exploration of Pattern-Matching Techniques for Lossy Compression on Cosmology Simulation Data Sets", Proceedings of the 1st International Workshop on Data Reduction for Big Scientific Data (DRBSD-1) in conjunction with ISC2017, Frankfurt, Germany, June 22, 2017.

  131. Dingwen Tao, Sheng Di, Hanqi Guo, Zizhong Chen, Franck Cappello, "Towards Efficient Error-controlled Lossy Compression for Scientific Data", in Greater Chicago Area Systems Research Workshop (GCASR2017), 2017.

  132. Sheng Di, Rinku Gupta, Eric Pershey, Marc Snir, Franck Cappello, "LogAider: A tool for mining potential correlations in HPC Log Events," in 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ACM CCGrid2017), Spain, 2017.

  133. Xinhou Wang, Song Wu, Kezhi Wang, Sheng Di, Hai Jin, Kun Yang and Shumao Ou, "Maximizing the Profit of Cloud Broker with Priority Aware Pricing", in 23rd IEEE International Conference on Parallel and Distributed Systems (ICPADS17), 2017.

  134. Dingwen Tao, Sheng Di, Franck Cappello, "Significantly Improving Lossy Compression for Scientific Data Sets Based on Multidimensional Prediction and Error-Controlled Quantization," in International Parallel and Distributed Processing Symposium (IEEE/ACM IPDPS 2017), Orlando, Florida, 2017.

  135. Xuanhua Shi, Junling Liang, Xuan Luo, Peng Zhao, Sheng Di, Bingsheng He, Hai Jin, "Frog: Asynchronous Graph Processing on GPU with Hybrid Coloring Model," in IEEE Transactions on Knowledge and Data Engineering (IEEE TKDE), 2017.


    -----2016-----
  137. Eduardo Berrocal, Leonardo Bautista Gomez, Sheng Di, Zhiling Lan and Franck Cappello, "Exploring Partial Replication to Improve Lightweight Silent Data Corruption Detection for HPC Applications," 22nd International European Conference on Parallel and Distributed Computing (Europar16), 2016.

  138. Song Wu, Yihong Wang, Wei Luo, Sheng Di, Haibao Chen, Xiaolin Xu, Hai Jin, and Ran Zheng, "ACStor: Optimizing Access Performance of Virtual Disk Images in Clouds," in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2016.

  139. Sheng Di, Yves Robert, Frederic Vivien, and Franck Cappello, "Toward an Optimal Online Checkpoint Solution under a Two-Level HPC Checkpoint Model," in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2016.

  140. Omer Subasi, Sheng Di, Leonardo Bautista-Gomez, Prasanna Balaprakash, Osman Unsal, Jesus Labarta, Adrian Cristal and Franck Cappello, "Spatial Support Vector Regression to Detect Silent Errors in the Exascale Era," in IEEE/ACM Cluster, Cloud and Grid Computing (ACM CCGrid 2016), 2016.

  141. Sheng Di, Franck Cappello, "Adaptive-Impact Driven Detection of Silent Data Corruption for HPC Applications," in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2016. Errata: In the paper, there is a typo in Algorithm 1 (line 5 and line 7): Gamma={PM_j - varepsilon_j<theta * r } should be Gamma={PM_j | varepsilon_j<theta * r }. The corrected version can be downloaded from here.

  142. Sheng Di, Franck Cappello, "Fast Error-bounded Lossy HPC Data Compression with SZ," in International Parallel and Distributed Processing Symposium (IEEE/ACM IPDPS 2016), 2016.

  143. Song Wu, Zhenjiang Xie, Haibao Chen, Sheng Di, Xinyu Zhao, Hai Jin, "Dynamic Acceleration of Parallel Applications in Cloud Platforms by Adaptive Time-Slice Control," in International Parallel and Distributed Processing Symposium (IEEE/ACM IPDPS 2016), 2016.

  144. Xinhou Wang, Kezhi Wang, Song Wu, Sheng Di, Kun Yang, and Hai Jin, "Dynamic Resource Scheduling in Cloud Radio Access Network with Mobile Cloud Computing," in International Symposium on Quality of Service (IEEE/ACM IWQoS2016), Beijing, China, 2016.


    -----2015-----
  146. Eduardo Berrocal, Leonardo Bautista-Gomez, Sheng Di, Zhiling Lan, Franck Cappello, "Lightweight Silent Data Corruption Detection Based on Runtime Data Analysis for HPC Applications," in 24th International ACM Symposium on High Performance Parallel and Distributed Computing (ACM HPDC2015), 2015, short paper.

  147. Sheng Di, Eduardo Berrocal, and Franck Cappello, "An Efficient Silent Data Corruption Detection Method with Error-feedback Control and Even Sampling for HPC Applications," in 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ACM CCGrid2015), 2015.

  148. Xuanhua Shi, Junling Liang, Sheng Di, Bingsheng He, Hai Jin, Lu Lu, Zhixiang Wang, Xuan Luo, Jianlong Zhong, "Optimization of Asynchronous Graph Processing on GPU with Hybrid Coloring Model," in 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (ACM PPoPP), 2015. [poster].


    -----2014-----
  150. Sheng Di, Franck Cappello, "GloudSim: Google Trace based Cloud Simulator with Virtual Machines," in Journal of Software: Practice and Experience (Wiley SPE), 2014.

  151. Sheng Di, Eduardo Berrocal, Leonardo Bautista-Gomez, Katherine Heisey, Rinku Gupta, Franck Cappello, "Towards Effective Detection of Silent Data Corruptions for HPC Applications," in IEEE/ACM Proc. of International Conference of SuperComputing (IEEE/ACM SC2014), New Orleans, 2014. [poster]

  152. Song Wu, Haibao Chen, Sheng Di, Bingbing Zhou, Zhenjiang Xie, Hai Jin, Xuanhua Shi, "Synchronization-Aware Scheduling for Virtual Clusters in Cloud," in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2014.

  153. Xuanhua Shi, Haohong Lin, Hai Jin, Bing Bing Zhou, Zuoning Yin, Sheng Di and Song Wu, "GIRAFFE: A Scalable Distributed Coordination Service for Large-scale Systems," in IEEE Proc. of 16th International Conference on Cluster Computing (IEEE CLUSTER2014), Madrid, Spain, 2014. (best paper nominated)

  154. Sheng Di, Leonardo Bautista-Gomez, Franck Cappello, "Optimization of Multi-level Checkpoint Model with Uncertain Execution Scales," in IEEE/ACM Proc. of International Conference of SuperComputing (IEEE/ACM SC2014), New Orleans, 2014.

  155. Hai Jin, Xinhou Wang, Song Wu, Sheng Di, Xuanhua Shi, "Towards Optimized Fine-Grained Pricing of IaaS Platform", in IEEE Transactions on Cloud Computing (IEEE TCC), 2014.

  156. Sheng Di, Derrick Kondo, and Cho-Li Wang, "Optimization of Composite Cloud Service Processing with Virtual Machines," in IEEE Transactions on Computers (IEEE TC), 2014.

  157. Haibao Chen, Song Wu, Zhenjiang Xie, Sheng Di, Bingbing Zhou, Xuanhua Shi, Hai Jin, "Communication-Driven Scheduling for Virtual Clusters in Cloud," in 23rd International ACM Symposium on High Performance Parallel and Distributed Computing (HPDC2014), 2014, short paper.

  158. Sheng Di, Derrick Kondo, and Franck Cappello, "Characterizing and Modeling Cloud Applications/Jobs on a Google Data Center," in Journal of Supercomputing (Springer JS), 2014.

  159. Sheng Di, Mohamed S. Bouguerra, Leonardo Bautista-Gomez, Franck Cappello, "Optimization of Multi-level Checkpoint Model for Large-scale HPC Applications," in International Parallel and Distributed Processing Symposium (IEEE/ACM IPDPS 2014), 2014.


    -----2013-----
  161. Sheng Di, Cho-Li Wang, Franck Cappello, "Adaptive Algorithm for Minimizing Cloud Task Length with Load Prediction Errors," in IEEE Transactions on Cloud Computing (IEEE TCC Special Issue), 2013.

  162. Sheng Di, Derrick Kondo, and Walfredo Cirne, "Google Hostload Prediction based on Bayesian Model with Optimized Feature Combination," in Journal of Parallel & Distributed Computing (Elsevier JPDC), 2013.

  163. Sheng Di, Yves Robert, Frederic Vivien, Derrick Kondo, Cho-Li Wang, Franck Cappello, "Optimization of Cloud Task Processing with Checkpoint-Restart Mechanism", in IEEE/ACM Proc. of International Conference of SuperComputing (IEEE/ACM SC2013), Denver, CO, US, 2013.

  164. Sheng Di, Cho-Li Wang, "Minimization of Cloud Task Execution Length with Workload Prediction Errors," in International Conference on High Performance Computing (IEEE/ACM HiPC 2013), 2013.

  165. Sheng Di, Derrick Kondo, Franck Cappello, "Characterizing Cloud Applications on a Google Data Center," in Proc. of 42nd International Conference on Parallel Processing (IEEE ICPP2013), 2013.

  166. Sheng Di, Derrick Kondo, Cho-Li Wang, "Optimization and Stabilization of Composite Service Processing in a Cloud System," in International Symposium on Quality of Service (IEEE/ACM IWQoS2013), Montreal, Canada, 2013.

  167. Sheng Di, Cho-Li Wang, and Derrick Kondo, "Towards Payment-Bound Analysis in Cloud Systems with Task-Prediction Errors," in International Conference on Cloud Computing (IEEE CLOUD2013), Santa Clara Marriott, CA, USA, 2013.

  168. Sheng Di, Cho-Li Wang, and Ling Chen, "Ex-post Efficient Resource Allocation for Self-organizing Cloud," in Journal of Computers and Electrical Engineering (CEE), Elsevier, 2013.


    -----2012-----
  170. Sheng Di and Cho-Li Wang, "Error-tolerant Resource Allocation and Payment Minimization for Cloud System," IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS Special Issue), 2012.

  171. Sheng Di, Derrick Kondo, Walfredo Cirne, "Host Load Prediction in a Google Compute Cloud with a Bayesian Model," in IEEE/ACM Proc. of International Conference of SuperComputing (IEEE/ACM SC2012), Utah, US, 2012.

  172. Sheng Di, Derrick Kondo, Walfredo Cirne, "Characterization and Prediction of Host Load in a Google Data Center," in IEEE Proc. of 14th International Conference on Cluster Computing (IEEE CLUSTER2012), Peking, China, 2012.

  173. Sheng Di and Cho-Li Wang, "Dynamic Optimization of Multi-Attribute Resource Allocation in Self-Organizing Clouds," IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), 2012.


    -----2011-----
  175. Sheng Di and Cho-Li Wang, "Decentralized Proactive Resource Allocation for Maximizing Throughput of P2P Grid", Journal of Parallel & Distributed Computing (JPDC), 2011, doi: 10.1016/j.jpdc.2011.10.010.

  176. Sheng Di, Cho-Li Wang, Luwei Cheng, Ling Chen, "Socially Optimal Win-win Resource Allocation for Self-Organizing Cloud", in IEEE International Conference on Cloud and Service Computing (IEEE CSC2011), 2011.

  177. Sheng Di, Cho-Li Wang, Weida Zhang, Luwei Cheng, "Probabilistic Best-fit Multi-dimensional Range Query in Self-Organizing Cloud", in Proc. of 40th International Conference on Parallel Processing (IEEE ICPP2011), 2011, pp. 763-772.

  178. Zheming Xu, Sheng Di, Luwei Cheng, Weida Zhang and Cho-Li Wang, "WAVNet: Wide-Area Network Virtualization for Elastic Cloud Computing", in Proc. of 40th International Conference on Parallel Processing (IEEE ICPP2011), 2011, pp. 285-294. (best paper nominated)

  179. Luwei Cheng, Cho-Li Wang, Sheng Di, "Defeating Network Jitter for Virtual Machines", in IEEE/ACM International Conference on Utility and Cloud Computing (IEEE/ACM UCC2011), 2011. (best student paper award)



    -----2010-----
  181. Haoyu Hu, Yinfeng Wang, Cho-Li Wang, Sheng Di, "Sensible Cloud: Cloud based Large-scale Social Intelligence Reasoning", CNGridAnnual2010, Beijing, 2010, (in Chinese).

  182. Yinfeng Wang, Hao Liu, Sheng Di and Haoyu Hu, "A Parallel Index Mechanism for Large Scale High Dimensional Data", Journal of Huazhong University of Science and Technology (Nature Science Edition), Jun, 2011, 39(1), (in Chinese).

  183. Sheng Di, Cho-Li Wang, "Dual-Phase Just-in-Time Workflow Scheduling in P2P Grid Systems," in Proc. of 39th International Conference on Parallel Processing (IEEE ICPP2010), 2010, pp. 238-247.

  184. Sheng Di and Cho-Li Wang, "Conflict-minimizing Dynamic Load Balancing for P2P Desktop Grid", in the 11th IEEE/ACM International Conference on Grid Computing (IEEE/ACM Grid2010), Brussels, Belgium, Oct 24-29, 2010, pp. 137-144.


    -----2006~2009-----
  186. Sheng Di, Cho-Li Wang, Dexter H. Hu, "Gossip-based Dynamic Load Balancing in Self-organized Desktop Grid", Proc. of 10th HPCAsia (27th APAN), 2009.

  187. Ling Chen, Sheng Di, "RSR-CGSF: A Robust Semantic Resource Based Cooperative Grid Service Framework", 1st International Conference on Information Engineering and Computer Science (IEEE ICIECS 2009), 2009.

  188. Sheng Di, Cho-Li Wang, "Task Scheduling based on Dynamic Critical Task Estimation in P2P Grid Workflow", CNGridAnnual2009, Beijing, 2009, (in Chinese).

  189. Ling Chen, Hai Jin, Sheng Di, "A Semantic Double-Buffer Based Approach to Enhance Semantic Web Search", in International Conference on the Digital Society (IEEE ICDS2008), pp. 111-116. (best paper award)

  190. Sheng Di, Hai Jin, Shengli Li, Ling Chen, Li Qi, Chengwei Wang, "Ontology Based Grid Information Interoperation," 21st International Conference on Advanced Information Networking and Applications Workshops (IEEE AINAW2007), 2007, pp. 91-96.

  191. Sheng Di, Hai Jin, Shengli Li, "A Flexible Two-Level Mechanism in Querying and Presenting Large-scale Historical Monitoring Data", 13th Asia-Pacific Conference on Communications (IEEE APCC2007), 2007.

  192. Hai Jin, Chuanjiang Yi, Sheng Di, "A Composite-Service Authorization Prediction Platform for Grid Environment", Cooperative Design, Visualization, and Engineering (LNCS CDVE2007), 2007, pp. 217-22,

  193. Sheng Di, Hai Jin, Shengli Li, Jing Tie, Ling Chen, "Efficient Time Series Data Classification and Compression for Distributed Monitoring", the 2007 International Workshop on High Performance Data Mining and Applications (HPDMA2007, in conjunction with PAKDD2007), pp. 389-400.

  194. Sheng Di, Hai Jin, Shengli Li, Ling Chen, Chengwei Wang, "GlobalWatch: A Distributed Service Grid Monitoring Platform with high flexibility and usability", Proc. of 1st IEEE Asia-Pacific Services Computing Conference (APSCC2006), 2006, pp. 440-446.

  195. Hai Jin, Pingpeng Yuan, Li Huang, Feng Mao, Sheng Di, Sheng Sun, Shilun Yuan, Changqin Li, Yanxia Li, Qin Shi: Patent: "Grid Data transmission Platform with high QoS and Multi-replica", NO. 200610125570.9

Awards Received __________________________

  • 2021, R&D 100 award (in recognition of leading SZ: A Lossy Compression Framework for Scientific Data)

  • 2021, DOE Early Career Research Program Award

  • 2019, IEEE Distinguished Research and Development Award (Chicago Section)

  • 2019, R&D 100 award (in recognition of participating in SCR: Scalable Checkpoint/Restart Framework)

  • 2018, IEEE Distinguished Mentoring Award (Chicago Section), in recognition of mentoring as a scientist in the area of data compression and software development

  • 2018, Overall best paper in IEEE CLUSTER (and one more best paper in Data&Storage Track)

  • 2018, Best paper nominated (honorable mention) in LDAV 2018 Symposium

  • 2016, Outstanding Contribution in Reviewing Award for JPDC journal

  • 2014, Best paper nominated in International Conf. on Cluster Computing (IEEE CLUSTER2014)

  • 2011, Best paper nominated in 40th International Conf. on Parallel Processing (IEEE ICPP2011)

  • 2011, Best student paper award in IEEE/ACM UCC2011

  • 2008, Best paper award in International Conference on the Digital Society (ICDS)

  • 2006, Full tuition fee scholarship, 2005-2006, Huazhong University of Science and Technology

  • 2006, Triple-A student, 2005-2006, Huazhong University of Science and Technology

  • 2005, First class scholarship, 2004-2005, Huazhong University of Science and Technology (top 10%)

  • 2005, Full tuition fee scholarship, 2004-2005, Huazhong University of Science and Technology

  • 2005, First class scholarship, 2004-2005, Huazhong University of Science and Technology

  • 2004, University excellent leader, 2003-2004, South-Central University for Nationalities

  • 2003, Triple-A student, 2002-2003, South-Central University for Nationalities

  • 2003, Second class scholarship, 2002-2003, South-Central University for Nationalities (top 3%)

  • 2002, First class scholarship, 2001-2002, South-Central University for Nationalities (top 1%)

  • 2001, Triple-A student, 2000-2001, South-Central University for Nationalities

  • 2001, First class scholarship, 2000-2001, South-Central University for Nationalities (top 1%)

  • 1999, First class prize, High-school Mathematics Competition, Province Award, Shandong Province, China

  • 1998, Second class prize, High-school Mathematics Competition, National Award, China

Research Interests _________________________

  • Lossy Data Compression for Extreme-scale Scientific Datasets

  • Fault-Tolerance in Cloud Computing and HPC

  • Optimization of Resource Allocation (both theoretical analysis and practical implementation)

  • Cloud Computing, Peer-to-Peer, and Grid Computing

  • Performance Evaluation (with Google trace)

More Research Topics I Studied _____________

  • Grid Monitoring System

  • Grid Information Service

  • Grid Security / Access Control

  • Process Migration

  • Semantic Webs

  • Data Analysis/Compression

  • Flexible Graphical User Interface

  • Web 2.0

Journal Editorial/Review Board Member_______

  • Editorial Board Member, Frontiers in High Performance Computing Journal.

  • Review Board Member, Transactions on Parallel and Distributed Systems (TPDS)

Local Organizing & Committee Member _______

  1. Program Committee Member: IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (SC2022), 2022.

  2. Program Co-chair: 3rd International Workshop on Big Data Reduction (IWBDR2022), in conjunction with IEEE Bigdata conference, 2022.

  3. Program Chair: International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-8), in conjunction with SC2022.

  4. Program Committee Member: 51st International Conference on Parallel Processing (ICPP2022), 2022.

  5. Review Committee Board Member: Early Career Research Program (ECRP), 2022.

  6. Program Committee Member: 36th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2022), 2022.

  7. Program Co-chair: 2nd International Workshop on Big Data Reduction (IWBDR2021), in conjunction with IEEE Bigdata conference, 2021.

  8. Program Committee Member: IEEE Special Section on Parallel and Distributed Computing Techniques for AI, ML, and DL (IEEE TPDS-SS-AI 2021), 2021.

  9. Program Committee Member: IEEE 2021 International Conference on Machine Learning and Applications (IEEE ICMLA2021), 2021.

  10. Program Committee Member: IEEE International Conference on Big Data (IEEE Bigdata2021), 2021.

  11. Program Committee Member: 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS2021), 2021.

  12. Program Committee Member: 18th IEEE Workshop on Silicon Errors in Logic -- System Effects (SELSE2021), 2021.

  13. Program Co-chair: 1st International Workshop on Big Data Reduction (IWBDR2020), in conjunction with IEEE Bigdata conference, 2020.

  14. Program Committee Member: IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS) Special Section on Parallel and Distributed Computing Techniques for AI, ML and DL (IEEE TPDS-SS-AI 2020).

  15. Program Committee Member: International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-5), in conjunction with SC2020.

  16. Program Committee Member: IEEE International Conference on High Performance Computing, Data, and Analytics (IEEE HiPC2020), 2020.

  17. Program Committee Member [Data, Storage, and Visualization track and poster section]: IEEE International Conference on Cluster Computing (IEEE CLUSTER-2020), 2020

  18. Program Committee Member: Asia-Pacific Services Computing Conference (APSCC 2019), 2019.

  19. Program Committee Member: IEEE International Conf. on SmartData (SmartData 2019), 2019.

  20. Program Committee Member: IEEE Congress on BigData (IEEE BigData Congress 2019), 2019.

  21. Program Committee Member: Asia-Pacific Services Computing Conference (APSCC2018), 2018

  22. Track Chair/Program Committee Member: IEEE International Congress on BigData (IEEE BigData Congress 2018), San Francisco, CA, 2018.

  23. Program Committee Member [poster session]: IEEE/ACM The International Conference for High Performance Computing, Networking, Storage and Analysis (SC2018), 2018.

  24. Program Committee Member: 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS2018), 2018.

  25. Program Committee Member: 18th IEEE/ACM International Symposium of Cluster, Cloud and Grid Computing (CCGrid2017), 2017.

  26. Program Committee Member: 3rd Workshop of Fault Tolerant Systems (FTS17), held in conjunction with IEEE CLUSTER2017, 2017.

  27. Program Co-chair: 2nd Workshop of Fault Tolerant Systems (FTS16), held in conjunction with IEEE CLUSTER2016, Sept.16th, 2016, Taipei.

  28. Program Committee Member: 16th IEEE/ACM International Symposium of Cluster, Cloud and Grid Computing (CCGrid2015), 2015.

  29. Program Committee Member: 7th International Conference on Cloud Computing Technology and Science, (CloudCom2015), 2015.

  30. Program Committee Member: IEEE International Conference on Big Data Intelligence and Computing (DataCom2015), 2015.

  31. Program Committee Member: The 4th IEEE International Workshop on Cloud Computing Interclouds, Multiclouds, Federations, and Interoperability (Intercloud'15), 2015.

  32. Organizing Chair: Postdoc-Ph.D-Student Meeting at 2nd JLESC workshop, Nov.24-26, 2014.

  33. Program Committee Member: 6th International Conference on Cloud Computing Technology and Science, Singapore (CloudCom2014), 15-18 Dec, 2014. (System track and Ph.D consortium)

  34. Program Committee Member: 5th International Conference on Scalable Information Systems (Infoscale2014), Seoul, South Korea, 2014.

  35. Program Committee Member: Asia-Pacific Services Computing Conference (APSCC2014), 2014

  36. Program Committee Member: International Workshop on Mobile Internet Big Data, 2014

  37. Program Committee Member: IEEE International Workshop on Advanced Technologies of Cloud Computing, IWATCC14, 2014.

  38. Program Committee Member: IEEE International Conference on Services Computing (SCC2014), 2014.

  39. Program Committee Member: IEEE 3rd International Workshop on Cloud Computing Interclouds, Multiclouds, Federations, and Interoperability (IEEE Intercloud2014), 2014.

  40. Program Committee Member: The 5th IEEE International Conference on Cloud Computing Technology and Science (CloudCom2013), 2013.

  41. Program Committee Member: The 27th IEEE International Conference on Advanced Information Networking and Applications (AINA2013), 2013.

  42. Program Committee Member: The 4th IEEE International Conference on Cloud Computing Technology and Science (CloudCom2012), 2012.

  43. Program Committee Member: IEEE Asia Pacific Cloud Computing Conference (APCloud2012), 2012

  44. Organizing Committee Member: PRAGMA Conference11, 2011

  45. Organizing Committee Member: 6th Workshop of OMII-Europe and CNGrid, 2008

Invited External Reviewer_______

  • Reviewer: IEEE Grid Computing (GRID07), 2007

  • Reviewer: IEEE Transactions on Computers (ToC), 2008

  • Reviewer: Journal of Parallel and Distributed Computing (JPDC), 2008

  • Reviewer: IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid09)

  • Reviewer: High Performance Computing Asia (HPCAsia09), 2009

  • Reviewer: IEEE International Conference on Cluster Computing (IEEE Cluster09), 2009

  • Reviewer: International Conference on Parallel and Distributed Systems (ICPADS09), 2009

  • Reviewer: IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid10)

  • Reviewer: IEEE 4th International Conference on Cloud Computing (Cloud10), 2010

  • Reviewer: Journal of Computer Science and Technology (JCST), 2010

  • Reviewer: Heterogeneity in Computing Workshop (HCW10) in conjunction with IPDPS10

  • Reviewer: CNGrid Annual Conference 2009/2010/2011

  • Reviewer: IEEE/ACM International Parallel & Distributed Processing Symposium (IPDPS11), 2011

  • Reviewer: IEEE International Conference on Parallel Processing (ICPP11), 2011

  • Reviewer: Heterogeneity in Computing Workshop (HCW11) in conjunction with IPDPS11

  • Reviewer: International Conference on Services Computing (SCC11), 2011

  • Reviewer: Cloud Computing (CloudCom11), 2011

  • Reviewer: International Journal of Computational Science and Engineering, 2012

  • Reviewer: Cloud Computing (CloudCom12), 2012

  • Reviewer: IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid13)

  • Reviewer: International Journal of Automated Software Engineering (ASE13), 2013

  • Reviewer: International Journal of Peer-to-Peer Networking and Applications (PPNA), 2013

  • Reviewer: International Conference on Networking and Grid Cloud Computing (ICNGCC-2013)

  • Reviewer: International Journal of Future Generation Computer Systems (FGCS-2013)

  • Reviewer: KSII Transactions on Internet and Information Systems (TIIS), 2013

  • Reviewer: Journal of Zhejiang University, 2013

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), 2013

  • Reviewer: Cloud Computing (CloudCom13), 2013

  • Reviewer: Journal of Frontiers of Computer Science, 2013.

  • Reviewer: IEEE Transactions on Cloud Computing (TCC), 2013.

  • Reviewer: The 27th IEEE International Conference on Advanced Information Networking and Applications (AINA13), 2013.

  • Reviewer: IEEE/ACM International Parallel & Distributed Processing Symposium (IPDPS14), 2014

  • Reviewer: IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid14)

  • Reviewer: Journal of Computer Science and Technology (JCST), 2014.

  • Reviewer: IEEE Transactions on Cloud Computing (TCC), 2014.

  • Reviewer: International Journal of Future Generation Computer Systems (FGCS), 2014

  • Reviewer: IEEE/ACM Proc. of 26th International Conference of SuperComputing (SC14), 2014

  • Reviewer: IEEE International Conference on Cloud Computing and Science (CloudCom14), 2014

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), 2014

  • Reviewer: IEEE/ACM International Parallel & Distributed Processing Symposium (IPDPS15), 2015

  • Reviewer: IEEE International Workshop on Cloud Computing Interclouds, Multiclouds, Federations, and Interoperability (Intercloud 2015), 2015

  • Reviewer: Elsevier Journal of Systems and Software (JSS), 2015

  • Reviewer: International Journal of Future Generation Computer Systems (FGCS), 2015

  • Reviewer: International ACM Symposium on High Performance Parallel and Distributed Computing (HPDC15), 2015

  • Reviewer: IEEE Systems Journal (SJ), 2015.

  • Reviewer: Journal of Software: Practice and Experience (SPE), 2015.

  • Reviewer: Journal of Parallel and Distributed Computing (JPDC), 2015.

  • Reviewer: The Computer Journal, 2015.

  • Reviewer: International Conference on Cluster Computing (IEEE CLUSTER15), 2015.

  • Reviewer: The 12th Annual IFIP International Conference on Network and Parallel Computing (NPC15), 2015.

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), 2015.

  • Reviewer: IEEE Transactions on Cloud Computing (TCC), 2015.

  • Reviewer: Journal of Mathematical Problems in Engineering (MPE2015), 2015.

  • Reviewer: International Conference on Cloud Computing and Big Data (CCBD2015), 2015.

  • Reviewer: IEEE Transactions on Services Computing (TSC), 2015.

  • Reviewer: IEEE International Conference on Big Data Intelligence and Computing (DataCom'15), 2015.

  • Reviewer: Journal of Knowledge based Systems (KBS), 2015.

  • Reviewer: International Parallel and Distributed Processing Symposium (IPDPS'16), 2016.

  • Reviewer: IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid'16), 2016.

  • Reviewer: International ACM Symposium on High Performance Parallel and Distributed Computing (HPDC'16), 2016.

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016.

  • Reviewer: ACM International Conference on Supercomputing (ICS'16), 2016.

  • Reviewer: IEEE Transactions on Cloud Computing (TCC), 2016.

  • Reviewer: Elsevier Journal of Parallel Computing (PARCO), 2016.

  • Reviewer: IEEE Transactions on Communications (TOC), 2016.

  • Reviewer: Journal of Parallel and Distributed Computing (JPDC), 2016.

  • Reviewer: IIS. Journal of Information Science and Engineering, 2016.

  • Reviewer: International Parallel and Distributed Processing Symposium (IPDPS'17), 2017.

  • Reviewer: IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid'17), 2017.

  • Reviewer: IEEE International Conference on Cluster Computing (IEEE Cluster17), 2017.

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), 2017.

  • Reviewer: The International Journal of High Performance Computing Applications (IJHPCA), 2017.

  • Reviewer: 3rd Workshop of Fault Tolerant Systems (FTS17), 2017.

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), 2018.

  • Reviewer: Elsevier Computer Physics Communications (CPC), 2018.

  • Reviewer: IEEE International Congress on Bigdata, 2018.

  • Reviewer: International Conference on Parallel Processing (ICPP), 2018.

  • Reviewer: Future Generation Computer Systems (FGCS), 2018.

  • Reviewer: Journal of Concurrency and Computation: Practice and Experience (CCPE), 2018.

  • Reviewer: IEEE Cluster, 2018. [poster]

  • Reviewer: LNCS Asia-Pacific Services Computing Conference (APSCC), 2018.

  • Reviewer: Journal of Supercomputing, 2018.

  • Reviewer: ACM Transactions on Parallel Computing (TOPC), 2019

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), 2019.

  • Reviewer: IEEE Congress on BigData, 2019.

  • Reviewer: ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC19), 2019.

  • Reviewer: IEEE International Conference on Smart Data (SmartData-2019), 2019.

  • Reviewer: International Conference on Parallel Processing (ICPP), 2019.

  • Reviewer: SIAM Scientific Computing Journal, 2019

  • Reviewer: IEEE International Parallel and Distributed Processing Symposium (IPDPS'19), 2019

  • Reviewer: ACM Journal of Computing Surveys (CSUR), 2019

  • Reviewer: IEEE SmartDataService Conference, 2020

  • Reviewer: International Journal of Electrical Power and Energy Systems (IJEPES), 2020

  • Reviewer: Springer Peer-to-Peer Networking and Applications (PPNA2020), 2020

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), regular track, 2020.

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), special section on Parallel and Distributed Computing Techniques for AI, ML and DL, 2020.

  • Reviewer: IEEE International Conference on Cluster Computing (IEEE CLUSTER-2020), 2020.

  • Reviewer: International Conference on High Performance Computing (IEEE/ACM HiPC 2020).

  • Reviewer: ACM Computing Surveys (CSUR), 2020.

  • Reviewer: IEEE Transactions on Smart Grid, 2020.

  • Reviewer: Journal of Mathematical Problems in Engineering (MPE), 2020.

  • Reviewer: Journal of Information Sciences, 2021.

  • Reviewer: IEEE Transactions on Cloud Computing (TCC), 2021.

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), 2021.

  • Reviewer: The International Journal of High Performance Computing Applications (IJHPCA), 2021.

  • Reviewer: IEEE Special Section on Parallel and Distributed Computing Techniques for AI, ML, and DL (IEEE TPDS-SS-AI 2021), 2021.

  • Reviewer: IEEE 2021 International Conference on Machine Learning and Applications (IEEE ICMLA2021), 2021.

  • Reviewer: Journal of Computational Science (JOCSCI), 2022.

  • Reviewer: Special Issue on Parallel Programming Models and Systems Software for High-End Computing of the Journal: Concurrency and Computation: Practice and Experience (CPE), 2022.

  • Reviewer: IEEE Transactions on Parallel and Distributed Systems (TPDS), 2022.

  • Reviewer: Knowledge-Based Systems (KNOSYS), 2022.

Activities (Invited Talks)____________________

  • 2022, Aug. 30th, Invited talk at DOE Computer Graphics Forum (DOECGF 2022). Title: Scalable Dynamic Scientific Data Reduction, Virtual meeting.

  • 2022, Sept. 18th, Invited talk at 'Compression session' of 14th Joint Laboratory for Extreme-Scale Computing (JLESC) workshop, NCSA, Champaign, USA.

  • 2022, April 15th, Poster presentation at ECP annual meeting, Virtual meeting.

  • 2022, Jan 24, Invited talk at 'Breakout session on data reduction for ECP Applications' section in ECP annual meeting, Virtual meeting.

  • 2022, Jan 24th, White paper presentation at ASCR Workshop on the Management and Storage of Scientific Data, Virtual meeting.

  • 2021, Dec. 16th, Invited Talk at 'Compression session' of 13th Joint Laboratory for Extreme-Scale Computing (JLESC) workshop, Virtual meeting.

  • 2021, April, Invited Talk at 'Lossy data reduction for ECP applications' session in ECP annual meeting, Virtual meeting.

  • 2021, April, Invited Talk at 'ECP Community BOF: Tools for Data-driven Analysis and Improvement of HPC Scientific Software Development', Virtual meeting.

  • 2021, Feb., Invited Talk at Joint Laboratory for Extreme-Scale Computing (JLESC) workshop, Virtual meeting.

  • 2019, Nov. Tutorial Speaker in Compression for Scientific Data at Supercomputing (SC19), 2019, Denver, USA.

  • 2019, Oct, Invited Talk at Illinois Institute of Technology (IIT), Chicago, USA.

  • 2019, Oct, Invited Talk at Wayne State University (WSU), Detroit, USA.

  • 2019, April, Invited Talk at Joint Laboratory for Extreme-Scale Computing (JLESC) workshop, Knoxville, Tennessee, USA.

  • 2019, March, Invited Talk about Z-checker at ECP CODAR all-hands meeting in ORNL, USA.

  • 2019, March, Invited Talk about SZ at ECP ExaSky all-hands meeting in Santa Fe, USA.

  • 2018, Nov. Tutorial Speaker in Compression for Scientific Data at Supercomputing (SC18), 2018, Denver, USA.

  • 2018, June 18-20, Invited Talk at The 13th scheduling for large scale systems workshop, Berkeley, CA.

  • 2017, Nov. Tutorial Speaker in Compression for Scientific Data at Supercomputing (SC17), 2017, Denver, USA.

  • 2017, July, Invited Talk at Joint Laboratory for Extreme-Scale Computing (JLESC) workshop, Champaign, USA.

  • 2016, Dec, Invited Talk at Joint Laboratory for Extreme-Scale Computing (JLESC) workshop, Kobe, Japan.

  • 2016, Nov, Invited Talk at Youth workshop, Kobe, Japan.

  • 2016, Sept. 15th, Invited Talk at Fault Tolerant System workshop, in conjunction with IEEE CLUSTER conference, Taipei, Taiwan.

  • 2016, June, Invited Talk at Huazhong University of Science and Technology (HUST), Wuhan, China.

  • 2016, May 5th, Invited Talk at Hubei University of Technology, Wuhan, China.

  • 2016, March, Invited Poster Presentation at Los Alamos National Lab (LANL), USA.

  • 2014, Nov. 25th, Invited Talk at Joint Laboratory for Extreme-Scale Computing (JLESC) Workshop, Champaign, USA.

  • 2014, May 8th, Invited Talk at Argonne National Laboratory, Lemont, USA.

  • 2014, April 2nd, Invited Talk at University of California - Merced (UC-Merced), Merced, USA.

  • 2014, Feb. 11th, Invited Talk at Huazhong University of Science and Technology (HUST), Wuhan, China.

  • 2014, Jan. 24th, Invited Talk at Shenzhen Institutes of Advanced Technology (SIAT), Shenzhen, China

  • 2014, Jan. 23rd, Invited Talk at The University of Hong Kong, Hong Kong, Hong Kong, China

  • 2013, Nov. 25-27, Invited Talk at the 10th Workshop of the INRIA-Illinois Joint Laboratory on Petascale Computing, UIUC, USA.

  • 2013, June 12-14, Invited Talk at the 9th Workshop of the INRIA-Illinois Joint Laboratory on Petascale Computing, Lyon, France.

  • 2012, Nov. 19-22, Invited Talk at Google (Mountain View, California), USA.

  • 2011, Aug. 22-23, Final-check Report for HKU-Grid Point project, on behalf of System Research Group of The University of Hong Kong, Beijing (Peking), China.

  • 2011, Jan. 12-13, Invited Talk for reporting the development progress for HKU-Grid Point project, on behalf of System Research Group of The University of Hong Kong, Beijing (Peking), China.

  • 2010, July 29-Aug. 1, Invited Talk for reporting the development progress for HKU-Grid, on behalf of System Research Group of The University of Hong Kong, at Xilinhot, Inner Mongolia, China.

  • 2008, Dec. 18-20, Invited Talk for reporting the development progress for HKU-Grid, on behalf of System Research Group of The University of Hong Kong, at Shanghai, China.

  • 2008, July. 24-25, Invited Talk for reporting the development progress for HKU-Grid Point project, on behalf of System Research Group of The University of Hong Kong, at Wuxi, Jiangsu, China.

  • 2008, Jun. 22-25, Invited Talk for reporting the development progress for HKU-Grid Point project, on behalf of System Research Group of The University of Hong Kong, at Beijing (Peking), China.

  • 2007, Oct., Organization & Coordination of the demonstration for SRG Group (Open-day), HKU.

  • 2007, July 24-25, Invited Talk for reporting the development progress for HKU-Grid Point project, on behalf of System Research Group of The University of Hong Kong, at Beijing (Peking), China.

Mentored Students and Post-doc Researchers____________________

  • Alexandra Poulos (Ph.D student from Clemson University, USA)

  • Ning Yan (Ph.D student from Georgia State University, USA)

  • Darren Ng (Ph.D student from University of California, Merced, USA)

  • Zizhe Jian (Ph.D student from University of California, Riverside, USA)

  • Tripti Agarwal (Ph.D student from University of Utah, USA)

  • Tri Nguyen (Ph.D student from NC State University, USA)

  • Grant Wilkins (Ph.D student from Clemson University, USA)

  • Di Zhang (Ph.D student from The University of North Carolina at Charlotte, USA)

  • Boyuan Zhang (Ph.D student from Indiana University, USA)

  • David Krasowska (Ph.D student from Clemson University, USA)

  • Milan Shah (Ph.D student from NC State University, USA)

  • Pu Jiao (Ph.D student from Missouri S&T, USA)

  • Zhaoyuan (Alex) Su (Ph.D student from GMU, USA)

  • Arham Khan (Ph.D student from University of Chicago, USA)

  • Ali M. Gok (Ph.D student from NU, USA); Postdoc at Argonne; Present: Cerebras Systems, USA.

  • Xiaodong Yu (Ph.D student from Virginia Tech, USA); Postdoc at Argonne.

  • Jiannan Tian (Ph.D student from WSU, USA).

  • Robert Underwood (Ph.D student from Clemson U., USA).

  • Kai Zhao (Ph.D student from UCR, USA).

  • Sian Jin (Ph.D student from WSU, USA).

  • Jinyang Liu (Ph.D student from UCR, USA).

  • Yuanjian Liu (Ph.D student from U. Chicago, USA).

  • Yafan Huang (Ph.D student from U. Iowa, USA)

  • Md Hasanur Rahman (Ph.D student from U. Iowa, USA)

  • Eduardo Berrocal (Ph.D student from IIT, USA); Present: Senior Software Engineer at Intel Inc., USA.

  • Omer Subasi (Ph.D student from BSC, Spain); Present: Computer Scientist at PNNL, USA.

  • Dingwen Tao (Ph.D student from UCR, USA); Present: Assistant Computer Scientist at WSU, USA.

  • Xin Liang (Ph.D student from UCR, USA); Present: Assistant Computer Scientist at Missouri S&T, USA.

  • Sihuan Li (Ph.D student from UCR, USA); Present: Engineer at Facebook Inc., USA.

  • Tasimia Reza (Ph.D student from Clemson U., USA).

  • Ruiwen Shan (Ph.D student from Clemson U., USA).

  • David Krasowska (Ph.D student from Clemson U., USA).

  • Cody Rivera (Bachelor student from UA, USA).

  • Hengzhi Chen (Bachelor student from USC, USA).

  • Hongyuan Liu (Ph.D student from W&M, USA).

  • Khalid Alharthi (Ph.D student from University of Warwick, UK).

  • Hao Liu (Bachelor student from HKU, China); Present: Engineer at Google Inc., USA.

  • Haibao Chen (Ph.D student from HUST, China).

  • Xinhou Wang (Ph.D student from HUST, China).

  • Xinchuan Wu (Ph.D student from U. Chicago); Present: Engineer at Intel Inc., USA.

  • Zheming Xu (Master student from HKU, China).