EconPapers    
Economics at your fingertips  
 

Debugging Latent Synchronization Errors in MPI-3 One-Sided Communication

Roger Kowalewski () and Karl Fürlinger
Additional contact information
Roger Kowalewski: Ludwig-Maximilians-Universität München
Karl Fürlinger: Ludwig-Maximilians-Universität München

A chapter in Tools for High Performance Computing 2016, 2017, pp 83-96 from Springer

Abstract: Abstract The Message Passing Interface (MPI-3) provides a one-sided communication interface, also known as MPI Remote Memory Access (RMA), which enables one process to specify all required communication parameters for both the sending and receiving side. While this communication interface enables superior performance potential developers have to deal with a complex memory consistency model. Proper synchronization of asynchronous remote memory accesses to shared data structures is a challenging task. More importantly, it is difficult to pinpoint such synchronization bugs as they do not necessarily manifest in an error or occur for example only after porting the application to a different HPC environment. We introduce a debugging tool to support the detection of latent synchronization bugs. Based on the semantic flexibility of the MPI-3 specification we dynamically modify executions of improperly synchronized MPI remote memory accesses to force a manifestation of an error. An experimental evaluation with small applications and the usage in a library which heavily relies on MPI RMA reveal that this approach can uncover synchronization bugs which would otherwise likely go unnoticed.

Date: 2017
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-319-56702-0_5

Ordering information: This item can be ordered from
http://www.springer.com/9783319567020

DOI: 10.1007/978-3-319-56702-0_5

Access Statistics for this chapter

More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2026-06-01
Handle: RePEc:spr:sprchp:978-3-319-56702-0_5