CFD Online Logo CFD Online URL
www.cfd-online.com
[Sponsors]
Home > Forums > Software User Forums > OpenFOAM > OpenFOAM Running, Solving & CFD

Fail to run openfoam in multi-nodes

Register Blogs Community New Posts Updated Threads Search

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
Old   October 13, 2018, 15:36
Default Fail to run openfoam in multi-nodes
  #1
Member
 
Peng Liang
Join Date: Mar 2014
Posts: 59
Rep Power: 12
tjliang is on a distinguished road
Dear Foamers,

I am going to run my 3D simualtion on a cluster. I can run on each node the simulation without problem. But when I want to allocate the simulation into different nodes at the same time using the following pbs script, it turns out to run only in a single node. My openfoam version is 2.4.x.

#!/bin/bash
#PBS -j oe
#PBS -l walltime=10:00:00
#PBS -N 3D-Arc-Welding
#PBS -l nodes=2: ppn=8
#PBS -m abe
#PBS -q default

. ~/.bashrc

cd $PBS_O_WORKDIR

mpirun --mca btl ^openib -np 16 ArgonPlasmaMultiRegionSimpleFoam -parallel>run.\
log

For this cluster this is a sample code similar to my problem, but I don't know how to apply that to my case, itis as follows:

#!/bin/bash
#PBS -l walltime=01:00:00
#PBS -l nice=19
#PBS -q default
#PBS -l nodes=6: ppn=16

mpiexec -n 16 -host n001 hostname : -n 16 -host n002 hostname : -n 16 -host n003 hostname : -n 16 -host n004 hostname : -n 16 -host n005 hostname : -n 16 -host n006 hostname | sort


echo "end"


Can anyone give me suggestions to correct it? Thank you.


Bests,

Peng
tjliang is offline   Reply With Quote

Old   October 13, 2018, 16:18
Default
  #2
Senior Member
 
Join Date: Nov 2010
Location: USA
Posts: 1,232
Rep Power: 24
me3840 is on a distinguished road
If I recall correctly running
--mca btl ^openib
would tell OpenMPI to not use infiniband interfaces, isn't that the opposite of what you want? I would always run --mca btl ^tcp previously, but perhaps something has changed?

The code you have should run fine, can you please post the logfile from the cluster? If it only runs on a single node and you allocate all the MPI ranks to that node, OpenMPI should abort or give you a warning, so I'm not sure how it would run that way.
me3840 is offline   Reply With Quote

Old   October 13, 2018, 16:58
Default
  #3
Member
 
Peng Liang
Join Date: Mar 2014
Posts: 59
Rep Power: 12
tjliang is on a distinguished road
Quote:
Originally Posted by me3840 View Post
If I recall correctly running
--mca btl ^openib
would tell OpenMPI to not use infiniband interfaces, isn't that the opposite of what you want? I would always run --mca btl ^tcp previously, but perhaps something has changed?

The code you have should run fine, can you please post the logfile from the cluster? If it only runs on a single node and you allocate all the MPI ranks to that node, OpenMPI should abort or give you a warning, so I'm not sure how it would run that way.
Thank you for the quick reply, I used your --mca btl ^tcp script, but get the error:

did not launch the job. This error was first reported for process
rank 0; it may have occurred for other processes as well.

NOTE: A common cause for this error is misspelling a mpirun command
line parameter option (remember that mpirun interprets the first
unrecognized command line token as the executable).

Node: n001
Executable: ArgonPlasmaMultiRegionSimpleFoam
--------------------------------------------------------------------------
16 total processes failed to start


And when I use my own script, I actually didn't get any errors if I take the line

". $WM_PROJECT_DIR/etc/bashrc" away and it runs as if no problems happened. But actually only in single node.

But if I add that line, I got the error:

"mpirun was unable to find the specified executable file, and therefore
did not launch the job. This error was first reported for process
rank 0; it may have occurred for other processes as well.

NOTE: A common cause for this error is misspelling a mpirun command
line parameter option (remember that mpirun interprets the first
unrecognized command line token as the executable).

Node: n002
Executable: ArgonPlasmaMultiRegionSimpleFoam
--------------------------------------------------------------------------
16 total processes failed to start"
tjliang is offline   Reply With Quote

Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Frequently Asked Questions about Installing OpenFOAM wyldckat OpenFOAM Installation 3 November 14, 2023 11:58
Some questions about a multi region case run in parallel zfaraday OpenFOAM Running, Solving & CFD 5 February 23, 2017 10:25
New OpenFOAM Forum Structure jola OpenFOAM 2 October 19, 2011 06:55
Cant run in parallel on two nodes using OpenMPI CHristofer Main CFD Forum 0 October 26, 2007 09:54
Minimum number of nodes to run CFX in parallel Rui CFX 3 April 11, 2005 20:46


All times are GMT -4. The time now is 17:38.