CFD Online Logo CFD Online URL
www.cfd-online.com
[Sponsors]
Home > Forums > Software User Forums > ANSYS > FLUENT

problem of running parallel Fluent on linux cluster

Register Blogs Community New Posts Updated Threads Search

 
 
LinkBack Thread Tools Search this Thread Display Modes
Prev Previous Post   Next Post Next
Old   July 22, 2009, 16:33
Default problem of running parallel Fluent on linux cluster
  #1
Member
 
Ivan
Join Date: May 2009
Posts: 85
Rep Power: 17
ivanbuz is on a distinguished road
the case runs fine if I require several processors on the SAME node, but if the processors are on different nodes, I have the "Connection refused" problem.

I search online and see that some people have the similar problem, but I can not find a solution to this specific problem. the output from Fluent and the submission script are attached below.

Thanks in advance!


OUTPUT FROM FLUENT
-----------------------------------------
/opt/hpc/Fluent.Inc/fluent6.3.26/bin/fluent -r6.3.26 -pib -cnf=/var/spool/PBS/aux//3666504.cmgr01 -g 2ddp -t6 -i test2.jou
/opt/hpc/Fluent.Inc/fluent6.3.26/cortex/lnamd64/cortex.3.7.3 -f fluent -g -i test2.jou (fluent "2ddp -pib -host -r6.3.26 -t6 -mpi=hp -cnf=/var/spool/PBS/aux//3666504.cmgr01 -path/opt/hpc/Fluent.Inc")
Loading "/opt/hpc/Fluent.Inc/fluent6.3.26/lib/fluent.dmp.114-64"
Done.
/opt/hpc/Fluent.Inc/fluent6.3.26/bin/fluent -r6.3.26 2ddp -pib -host -t6 -mpi=hp -cnf=/var/spool/PBS/aux//3666504.cmgr01 -path/opt/hpc/Fluent.Inc -cx scw-029.i:59263:37434
Starting /opt/hpc/Fluent.Inc/fluent6.3.26/lnamd64/2ddp_host/fluent.6.3.26 host -cx scw-029.i:59263:37434 "(list (rpsetvar (QUOTE parallel/function) "fluent 2ddp -node -r6.3.26 -t6 -pib -mpi=hp -cnf=/var/spool/PBS/aux//3666504.cmgr01 ") (rpsetvar (QUOTE parallel/rhost) "") (rpsetvar (QUOTE parallel/ruser) "") (rpsetvar (QUOTE parallel/nprocs_string) "6") (rpsetvar (QUOTE parallel/auto-spawn?) #t) (rpsetvar (QUOTE parallel/trace-level) 0) (rpsetvar (QUOTE parallel/remote-shell) 0) (rpsetvar (QUOTE parallel/path) "/opt/hpc/Fluent.Inc") (rpsetvar (QUOTE parallel/hostsfile) "/var/spool/PBS/aux//3666504.cmgr01") )"
Welcome to Fluent 6.3.26
Copyright 2006 Fluent Inc.
All Rights Reserved
Loading "/opt/hpc/Fluent.Inc/fluent6.3.26/lib/flprim.dmp.1119-64"
Done.

Host spawning Node 0 on machine "scw-029" (unix).
/opt/hpc/Fluent.Inc/fluent6.3.26/bin/fluent -r6.3.26 2ddp -node -t6 -pib -mpi=hp -cnf=/var/spool/PBS/aux//3666504.cmgr01 -mport 192.168.2.129:192.168.2.129:34193:0
Starting /opt/hpc/Fluent.Inc/fluent6.3.26/multiport/mpi/lnamd64/hp/bin/mpirun -prot -vapi -e MPI_HASIC_VAPI=1 -e MPI_USE_MALLOPT_SBRK_PROTECTION=1 -e MPI_USE_MALLOPT_AVOID_MMAP=1 -f /tmp/fluent-appfile.32087
192.168.2.135: Connection refused
mpirun: Warning one more more remote shell commands exited with non-zero status, which may indicate a remote access problem.





SUBMISSION SCRIPT
-----------------------------------------
#!/bin/sh
#PBS -j oe
#PBS -l nodes=2:ppn=3
#PBS -q main
#PBS -l walltime=00:10:00
cd ${PBS_O_WORKDIR}
cat ${PBS_NODEFILE}
#Set variables for script
# What version of the solver to use
FLUENTSOLVER=2ddp
#HOW MANY CPUS- note that you'll still need to update the $PBS -l nodes line
CPUCOUNT=6
#Which input journal file to use to give fluent?
#INPUT=${PBS_O_WORKDIR}/${PBS_JOBNAME}
INPUT=test2.jou
#Where do we want to put output at?
OUTPUT=${PBS_O_WORKDIR}/${PBS_JOBID}.out

# Run Fluent with:
# -pib use Infiniband parallel
# -cnf=$PBS_NODEFILE get the list of machines PBS is running on from the server
# -t$CPUCOUNT use $CPUCOUNT CPUs total
# -g no graphics, batch mode
# -i read the file in $INPUT
# > $OUTPUT 2>&1 Redirect program output to a file in your home directory.
fluent $FLUENTSOLVER -t$CPUCOUNT -pib cnf=$PBS_NODEFILE -g -i $INPUT > $OUTPUT 2>&1

Last edited by ivanbuz; July 22, 2009 at 16:35. Reason: display error
ivanbuz is offline   Reply With Quote

 

Tags
cluster, fluent, parallel


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
parallel fluent runs being killed at partitioing Ben Aga FLUENT 3 June 8, 2012 10:40
Running on Distibuted Memory linux itanium cluster Josh FLUENT 0 January 29, 2007 00:18
running multiple Fluent parallel jobs Michael Bo Hansen FLUENT 8 June 7, 2006 08:52
Fluent Parallel for Linux? Rajil Saraswat FLUENT 0 June 11, 2003 10:53
Time problem in parallel fluent MZB FLUENT 1 May 13, 2003 02:37


All times are GMT -4. The time now is 03:05.