|
[Sponsors] |
December 16, 2008, 05:27 |
Parallel Fluent Error in Batch Mode
|
#1 |
Guest
Posts: n/a
|
I am experiencing an error message when trying to run a fluent job in parallel via a PBS batch system on a Unix cluster.
The case will load in interactive mode, but when I try to launch it in batch mode something goes wrong and it gives an error before loading the grid. Here is the error from the fluent output file: Multicore processors detected. Processor affinity set! Reading "Case1FullStack.cas"... MPI Application rank 0 killed before MPI_Finalize() with signal 9 node 999999 retrying on zero socket read..... node 999999 retrying on zero socket read..... 999999 (../../src/mpsystem.c@1123): mpt_read: failed: errno = 11 999999: mpt_read: error: read failed trying to read 4 bytes: Resource temporarily unavailable Here is the batch file: #PBS -N parallel_fluent #PBS -l walltime=1:00:00 #PBS -l nodes=1pn=4 #PBS -l software=fluent:fluentpar+4 #PBS -j oe #PBS -m ae #PBS -S /bin/csh set echo on hostname module load fluent cd $PBS_O_WORKDIR cat $PBS_NODEFILE | sort > pnodes set ncpus=`cat pnodes | wc -l` fluent 3ddp -t$ncpus -pinfiniband.ofed -cnf=pnodes -g < Case1Fullgrdck.input And the input file: file/read-case Case1FullStack.cas grid/check solve/initialize/initialize-flow file/write-data Case1FullStack.dat exit yes Any input on what I am doing wrong would be greatly appreciated. Thanks Justin |
|
December 18, 2008, 22:45 |
Re: Parallel Fluent Error in Batch Mode
|
#2 |
Guest
Posts: n/a
|
Nevermind. I figured it out. It was using too much RAM for a single compute node, so I switched to one core per node on multiple nodes.
|
|
October 27, 2011, 16:13 |
|
#3 | |
New Member
Choon Hwai Yap
Join Date: Oct 2011
Posts: 4
Rep Power: 15 |
Quote:
Hi Justin, could you elaborate how you solved this problem, I just met with the same thing. The funny thing is that the simulation ran for 12 time steps before meeting this problem. Thanks! |
||
November 28, 2016, 11:50 |
|
#4 |
New Member
Zaka Muhammad
Join Date: Jan 2012
Posts: 6
Rep Power: 14 |
Hi, guys did you figure out? I am having the same problem. I guess it is also related to memory.
MPI Application rank 4 killed before MPI_Finalize() with signal 9 Node 12: Process 22312: Received signal SIGTERM. Node 13: Process 22313: Received signal SIGTERM. Node 14: Process 22314: Received signal SIGTERM. Node 8: Process 22308: Received signal SIGTERM. Node 5: Process 22305: Received signal SIGTERM. Node 2: Process 22302: Received signal SIGTERM. Node 11: Process 22311: Received signal SIGTERM. ===============Message from the Cortex Process================================ Fatal error in one of the compute processes. |
|
Thread Tools | Search this Thread |
Display Modes | |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
DPM model in parallel batch mode | Prashanth | FLUENT | 2 | March 6, 2009 08:54 |
Star-CD batch mode error | Karl | Siemens | 11 | June 12, 2008 00:11 |
fluent stops batch mode on error | Ralf Schmidt | FLUENT | 1 | March 1, 2008 11:35 |
CFX5Post error in batch mode with Perl | windhair | CFX | 2 | March 20, 2007 06:41 |
[OpenFOAM] ParaView batch mode error | michael | ParaView | 0 | August 3, 2006 04:04 |