Openfoam solvers and snappyHexMesh stopped working in parallel after update
Hi everyone,
I recently updated to Fedora 33, and no suddenly I get this error whenever I run snappy or an openFoam solver in parallel: Quote:
I have no idea what this means, or why it is happening. simpleFoam and snappyHexMesh produce the same error! Things still work in serial for some reason. I have tried updating, restarting, running various tutorial files, and keep getting the same behavior. I am not even 100% sure this is an installation issue. I removed the OF binary with dnf, and reinstalled it. Could this be a bug? If it is, it probably can't be reproduced off my computer. Any help would be much appreciated. Thanks |
I should clarify what I mean by "suddenly". The programs start ok, and then crash after a short time. For example, snappyHexMesh gets to Feature refinement iteration 2 before it throws the error, which is very strange.
|
This is so insanely frustrating! I tried on a smaller casefile to see what would happen, and get a slightly different IO error!
Code:
[4] --> FOAM FATAL IO ERROR: Here, snappy got further along than before, I think because the mesh is smaller. Its as if something is getting overloaded, causing the crash, and then it sends out and unrelated error from whatever part of the code was currently running. It gives different behavior every time I run it! I just ran it again and snappy made it all the way through, again, and it crashes during castellated, then again and it crashes during the snap! What could possibly cause this?! |
Could be mpi-related.
|
I had that thought, and did the brainless thing of just reinstalling it - no change. I am not an expert on MPI, so if anyone has an ideas of how to diagnose, let me know!
Thanks |
I am having the same issue but on Fedora 34
Quote:
|
Thanks for the info. I ended up tearing everything down and switching to CentOS to appease IT. Hopefully it is helpful to other Fedora people out there...
|
Seems like openmpi in F33 has bugs
Quote:
|
Quote:
|
An update
Quote:
Edit: Seems like a F33 issue, works flawlessly inside F32 docker I created. |
All times are GMT -4. The time now is 22:21. |