Grid partitioning error.
I have compiled the latest version of the code from github and was trying to run the DPW tutorial in parallel. During the partitioning of grid the following error crops up.
Kindly help me solve this problem.
I have also encountered this error when trying to partition a very large grid.
"Comm buff: 130.80MB of 50.00MB"
The machine I am running on has 96GB of RAM, so I doubt it is a machine memory issue.
Line 117 of ../Common/include/option_structure.hpp has
const unsigned int MAX_MPI_BUFFER = 52428800; /*!< \brief Buffer size for parallel simulations (50MB). */
I increased this value to 150MB and the partitioning process gets a little farther (to domain 3 of 8), but then still errors out:
[CFD-LINUX:22185] *** An error occurred in MPI_Bsend
[CFD-LINUX:22185] *** on communicator MPI_COMM_WORLD
[CFD-LINUX:22185] *** MPI_ERR_BUFFER: invalid buffer pointer
[CFD-LINUX:22185] *** MPI_ERRORS_ARE_FATAL (your MPI job will now abort)
After googling "MPI_ERR_BUFFER: invalid buffer pointer", it seems the buffer was still not big enough. So I boosted the MAX_MPI_BUFFER to 1.5 GB, and now my 22M element grid will partition successfully.
SU2 developers: what should this buffer be set to? The maximum grid size we expect? For the record, this 22M element grid is 1.8 GB on the disk in *.su2 format.
Thanks for your replies Austin. I will try this out on my machine.
|All times are GMT -4. The time now is 13:31.|