Perspectives on High-Performance Computing in a Big Data World. Fox, G., C. In Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing (HPDC '19), pages 145-145, 2019. Association for Computing Machinery (ACM).
Perspectives on High-Performance Computing in a Big Data World [pdf]Paper  abstract   bibtex   
High-Performance Computing (HPC) and Cyberinfrastructure have played a leadership role in computational science even since the start of the NSF computing centers program. Thirty years ago parallel computing was a centerpiece of computer science research. Naively Big Data surely requires HPC to be processed, and transformational Big Data technology such as Hadoop and Spark exploit parallelism to success. Nevertheless, the HPC community does not appear to be thriving as a leader in Data Science while parallel computing is no longer a centerpiece. Some reasons for this are the dominant presence of Industry in technology futures and the universal fascination with Artificial Intelligence and Machine Learning. Maybe the pendulum will swing back a bit, but I expect the "AI first" philosophy to dominate in the foreseeable future. Thus I describe a future where HPC thrives in collaboration with Industry and AI. In particular, I discuss the promise of MLforHPC (AI for systems) and HPCforML (systems for AI).

Downloads: 0