Designing MPI Library with On-Demand Paging (ODP) of InfiniBand: Challenges and Benefits

MZ Li and K Hamidouche and XY Lu and H Subramoni and J Zhang and DK Panda, SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 433-443 (2016).

Existing InfiniBand drivers require the communication buffers to be pinned in physical memory during communication. Most runtimes leave these buffers pinned until the end of the run. Such situation limits the swappable memory space for applications. To address these concerns, Mellanox has recently introduced the On-Demand Paging (ODP) feature for InfiniBand. With ODP, communication buffers are paged in when they are needed by the HCA and paged out when the OS needs to swap them. This paper presents a thorough analysis on ODP and studies its performance characteristics. With these studies, we propose novel designs of ODP- aware MPI communication protocols. To the best of our knowledge, this is the first work to study and analyze the ODP feature and design an ODP- aware MPI library. Performance evaluations with applications show that ODP-aware designs can reduce the size of pin-down buffers by 11X without performance degradation compared with the pindown scheme.

Return to Publications page