Parallel programming with C++ and MPI -- same executable possible?

Question

I would like to create a program that would work with both MPI and without at run time and I have reached a point where I think that is not possible.

For example, an MPI program might look like this:

#include <mpi.h>

int main(int argc, char* argv[])
{
  MPI_Init(&argc, &argv);

...

  MPI_Finalize ();
  return 0;
}

But in order to allow users who don't have MPI installed to compile this, I would have to wrap it around #if's:

// Somewhere here, pull in a .h file which would set HAVE_MPI to 0 or 1

#if HAVE_MPI
#include <mpi.h>
#endif

int main(int argc, char* argv[])
{
#if HAVE_MPI
  MPI_Init(&argc, &argv);
#endif
...
#if HAVE_MPI
  MPI_Finalize ();
#endif
  return 0;
}

Until now, I think I'm correct? If this program compiled to a.out with HAVE_MPI set to 1, it would be run as: mpirun -np 4 ./a.out or mpirun -np 1 ./a.out. It could never be run as ./a.out, because even if it isn't run within mpirun, it would call the MPI_* functions.

I guess to achieve what I would like -- an executable that could be run with and without mpirun is not possible. And the only option is to create two separate executables -- one with HAVE_MPI set to 0 and another set to 1. Is this correct or am I missing something about how C++ programs using MPI are implemented?

As a side-note, I don't think this is true with shared memory parallelization (i.e., with OpenMP). The same executable should work with a single or multiple threads.

Is my understanding correct?

To pick up on your side note - yes the same executable should run on 1 or multiple threads. But don't be surprised to find that on 1 thread the executable is slower than the equivalent non-OpenMP code. Using OpenMP imposes some parallel overhead, part of which is imposed on all codes and does not vary with the number of threads used. — High Performance Mark
– High Performance Mark, Commented Aug 19, 2022 at 9:34
after reading the question I am not sure if you know that #ifdef cannot help to have both options in one executable or it that is the question. #ifdef is evaluated by the preprocessor before actual compilation starts — 463035818_is_not_an_ai
– 463035818_is_not_an_ai, Commented Aug 19, 2022 at 9:40
Can you check the command line arguments and determine if the program was run via mpirun? You could only call MPI functions when the arguments are correct. For example: No arguments = don't use MPI. As for compilation, you could supply MPI headers with your project, so that everyone can compile it. The license seems to allow that. — VLL
– VLL, Commented Aug 19, 2022 at 9:50
Most MPI implementations allow a program to be run in singleton mode: simply run a.out, and MPI_Init() will success and consider this is a single task job. To be perfectly clear, you will not need mpirun a.out but you will at least need libmpi.so and its dependencies. — Gilles Gouaillardet
– Gilles Gouaillardet, Commented Aug 19, 2022 at 10:04
you can always write int main() { if (MPI_enabled) { ...code using mpi...} else { ...code not using mpi...} but i guess thats not what you want either ;) — 463035818_is_not_an_ai
– 463035818_is_not_an_ai, Commented Aug 19, 2022 at 10:33

Jérôme Richard · Accepted Answer · 2022-08-19 11:09:02Z

2

A compiled MPI program needs MPI libraries at runtime in addition to the mpirun call (not required by all MPI implementations for 1 process nor in all cases). Thus, to run MPI function only in some cases at runtime without having a dependency to MPI, the potion of the code using MPI needs to be dynamically loaded. This can be done by putting MPI-related functions in shared library components and loading them at runtime regarding your needs.

The same thing is also true for OpenMP: an OpenMP code requires OpenMP compiler flags like -fopenmp creating a dependency to an OpenMP implementation required at runtime. That being said, the common GOMP (GCC) runtime or the IOMP runtime (ICC/Clang) are often installed on most Linux computing machines by default (especially the former since GCC is generally the default compiler and is bundled with GOMP).

Using shared library help you to easily switch between two parallel implementations: MPI and OpenMP. That being said, it means the initialization, finalization and communication collectives must be wrapped in an external library compiled separately of your program. The library functions can be loaded dynamically at startup time or manually at runtime using dlopen (see this related post).

As pointed out in the comments, using preprocessor directives are parsed at compile time and this is why you need to rebuild your program twice when using them. This is not required with shared libraries (though all diverging parts to be executed at runtime needs to be compiled separately).

answered Aug 19, 2022 at 11:09

Jérôme Richard

53.4k6 gold badges48 silver badges77 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Ray Over a year ago

Ah! Thank you for this! I only had a vague idea about dynamically loaded libraries, but I certainly would not have figured out that it was the solution to my problem! Thank you for posting this answer and mentioning dlopen! I think I've seen programs do what I had asked; just no idea how they were written. Thank you!!

Collectives™ on Stack Overflow

Parallel programming with C++ and MPI -- same executable possible?

1 Answer 1

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related