I have a Python application that needs to load the same large array (~4 GB) and apply an embarrassingly parallel function to chunks of this array. The array starts off saved to disk.
I typically run this application on a cluster with, say, 10 nodes, each of which has 8 compute cores and about 32 GB of RAM.
The easiest approach (which doesn't work) is to launch 80 MPI processes with mpi4py (e.g. `mpiexec -n 80`). It doesn't work because each MPI process loads its own copy of the 4 GB array, which exhausts the 32 GB of RAM on each node and raises a MemoryError.
An alternative is to have rank 0 alone load the 4 GB array and farm out chunks of it to the rest of the MPI ranks -- but this approach is slow because of network bandwidth.
The best approach would be for exactly one process on each node to load the 4 GB array and expose it as shared memory (through multiprocessing?) to the remaining 7 cores on that node.
How can I achieve this? How can I make MPI aware of the nodes and coordinate it with multiprocessing?