Error while running CFX in parallel configuration
Hi everyone,
I am currently trying to run CFX (v16) in a parallel configuration using the SLURM workload manager. I used the runCFX.sh script, which is as follows:
|
We have had problems getting Platform MPI Distributed Parallel to run with SLURM, and got exactly the same error as you. Ansys support won't help, since they don't support it...
If I remember correctly it was solved by either using Intel MPI Distributed Parallel and/or unsetting the SLURM_GTIDS environment variable. |
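For anyone who wants to see where the workaround might sit in a batch script, here is a minimal sketch, not a tested production script. The node counts, the definition file name `model.def`, and the helper `build_par_dist` are hypothetical, and the `cfx5solve` options shown (`-def`, `-par-dist`, `-start-method`) should be checked against `cfx5solve -help` for your installed version. The solver launch is guarded so the script does nothing harmful outside a SLURM job.

```shell
#!/bin/bash
#SBATCH --nodes=2
#SBATCH --ntasks-per-node=4

# Workaround from this thread: drop the variable before the solver starts.
unset SLURM_GTIDS

# Hypothetical helper: turn one hostname per line (stdin) into a
# comma-separated "host*N" list, N being the partitions per host.
build_par_dist() {
    awk -v n="$1" 'NF { printf "%s%s*%s", sep, $1, n; sep = "," } END { print "" }'
}

DEF_FILE="model.def"                          # hypothetical file name
TASKS_PER_NODE="${SLURM_NTASKS_PER_NODE:-4}"  # falls back to 4 if unset

if command -v scontrol >/dev/null 2>&1 \
   && command -v cfx5solve >/dev/null 2>&1 \
   && [ -n "${SLURM_JOB_NODELIST:-}" ]; then
    # Expand the compact SLURM node list into individual hostnames.
    HOSTS=$(scontrol show hostnames "$SLURM_JOB_NODELIST" | build_par_dist "$TASKS_PER_NODE")
    cfx5solve -def "$DEF_FILE" -par-dist "$HOSTS" \
        -start-method "Intel MPI Distributed Parallel"
else
    echo "Not inside a SLURM job with cfx5solve available; skipping solver launch." >&2
fi
```

The important part is only the order: `unset SLURM_GTIDS` has to run before the solver command, so that the MPI launcher started by the solver never sees the variable.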
Hello sir,
I confirm that unsetting the SLURM_GTIDS variable solved the problem. Thank you very much. |
Thank you, Lance,
We added 'unset SLURM_GTIDS' to the job script and the job runs now. Instead of 2, we now have 5 people on the planet who know of this workaround. :) |
Thanks so much!
I was having this same issue with CFX 17.0 and the unset SLURM_GTIDS command has fixed it. |
Thank you for this info.
In case someone is trying to run Abaqus in parallel on one of the XSEDE resources (e.g. SDSC Comet), adding 'unset SLURM_GTIDS' before the ABQ command will get rid of the MPI errors. Make sure you have parallel_mode=MPI in the ABQ command. Best, |
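If you would rather not change the environment of the whole job script, one alternative (a swapped-in technique, not what the posters above did) is to strip the variable for a single command with `env -u`. The wrapper name `run_without_gtids` is hypothetical; the sketch below only demonstrates the mechanism with a throwaway `sh -c` command in place of a real solver.

```shell
#!/bin/bash

# Hypothetical wrapper: run the given command with SLURM_GTIDS removed
# from its environment, leaving the calling shell untouched.
run_without_gtids() {
    env -u SLURM_GTIDS "$@"
}

# Demonstration with a stand-in command instead of a real solver call.
run_without_gtids sh -c 'echo "SLURM_GTIDS is ${SLURM_GTIDS-not set}"'
```

In a real job script you would put the actual solver command after `run_without_gtids`. A plain `unset SLURM_GTIDS` at the top of the script is simpler and is what everyone in this thread reports using.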
unset SLURM_GTIDS worked for me too :D
|
Also worked here. I added 'unset SLURM_GTIDS' to my job script, below the usual module load commands, and voilà. Thanks! |
Anyone know how to do this for an interactive session?
|
Alex, do you mean a CFX session on a GNU/Linux computer? Add the line "unset SLURM_GTIDS" to your ~/.bashrc or ~/.bash_profile.
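A small sketch of how that line could be added without duplicating it on repeated runs. The function name `ensure_unset_line` is hypothetical, and which startup file your interactive session actually reads depends on how the shell is started (bash reads `~/.bashrc` for interactive non-login shells and `~/.bash_profile` for login shells), so treat the file choice as an assumption to verify on your cluster.

```shell
#!/bin/bash

# Hypothetical helper: append 'unset SLURM_GTIDS' to a startup file
# only if that exact line is not already present.
ensure_unset_line() {
    rc_file="$1"
    line='unset SLURM_GTIDS'
    touch "$rc_file"                       # create the file if missing
    if ! grep -qxF "$line" "$rc_file"; then
        printf '%s\n' "$line" >> "$rc_file"
    fi
}

# Usage (uncomment to modify your own startup file):
# ensure_unset_line "$HOME/.bashrc"
```

For a one-off interactive session you can also just type `unset SLURM_GTIDS` at the prompt before starting CFX.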
|