SC20 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Toward On-Demand I/O Forwarding in HPC Platforms

Workshop:Fifth International Parallel Data Systems Workshop

Authors: Jean Luca Bez (Federal University of Rio Grande do Sul, Brazil; Barcelona Supercomputing Center (BSC)); Francieli Zanon Boito (University of Bordeaux Computer Science Research Laboratory; French Institute for Research in Computer Science and Automation (INRIA), French National Center for Scientific Research (CNRS), Bordeaux-INP); Alberto Miranda and Ramon Nou (Barcelona Supercomputing Center (BSC)); Toni Cortes (Polytechnic University of Catalonia, Barcelona Supercomputing Center (BSC)); and Philippe Olivier Alexandre Navaux (Federal University of Rio Grande do Sul, Brazil)

Abstract: I/O forwarding is an established and widely-adopted technique in HPC to reduce contention and improve I/O performance in the access to shared storage infrastructure. On such machines, this layer is often physically deployed on dedicated nodes, and their connection to the clients is static. Furthermore, the increasingly heterogeneous workloads entering HPC installations stress the I/O stack, requiring tuning and reconfiguration based on the applications' characteristics. Nonetheless, it is not always feasible in a production system to explore the potential benefits of this layer under different configurations without impacting clients. In this paper, we investigate the effects of I/O forwarding on performance by considering the application's I/O access patterns and system characteristics. We aim to explore when forwarding is the best choice for an application, how many I/O nodes it would benefit from, and whether not using forwarding at all might be the correct decision. To gather performance metrics, explore, and understand the impact of forwarding I/O requests of different access patterns, we implemented FORGE, a lightweight I/O forwarding layer in user-space. Using FORGE, we evaluated the optimal forwarding configurations for several access patterns on MareNostrum 4 (Spain) and Santos Dumont (Brazil) supercomputers. Our results demonstrate that shifting the focus from a static system-wide deployment to an on-demand reconfigurable I/O forwarding layer dictated by application demands can improve I/O performance on future machines.


Back to Fifth International Parallel Data Systems Workshop Archive Listing

Back to Full Workshop Archive Listing