CFD Online Logo CFD Online URL
www.cfd-online.com
[Sponsors]
Home > Forums > CFX

Help! Running parallel mpich2

Register Blogs Members List Search Today's Posts Mark Forums Read

Reply
 
LinkBack Thread Tools Display Modes
Old   March 2, 2010, 12:06
Default Help! Running parallel mpich2
  #1
New Member
 
Join Date: Jun 2009
Posts: 11
Rep Power: 8
jpcfd is on a distinguished road
Hi all,

Im trying to run a parallel job using a local network consisting in two quadcores linked with a normal swich. The net seems to be right (both computers see each other) and rsh runs normally ( i can do the tipical remote probe). i also intall in both machines the mpich2 service and register the same user in both computers. Also ive shut down the firewall to avoid problems.

The problem is that it works all well but when the solver shows solver in the output screen it gets stoped and exit with code 0 responding to a command from the master node:

"Command on host returned with code 0" is the message.
at first i obtain code 255 too but now i only get code 0.

Can any one help me? i have read the parallel documentation i dont know were is the fail.

Thanks in advance for reading this and hope someone could help me.

Javier.
jpcfd is offline   Reply With Quote

Old   March 2, 2010, 18:00
Default
  #2
Super Moderator
 
Glenn Horrocks
Join Date: Mar 2009
Location: Sydney, Australia
Posts: 10,826
Rep Power: 85
ghorrocks has a spectacular aura aboutghorrocks has a spectacular aura aboutghorrocks has a spectacular aura about
Step 1 is to determine whether the problem is your simulation, the parallel setup or distributed parallel setup.

Does the simulation run OK serial? Does it run OK local parallel?
ghorrocks is offline   Reply With Quote

Old   March 2, 2010, 18:34
Default
  #3
New Member
 
Join Date: Jun 2009
Posts: 11
Rep Power: 8
jpcfd is on a distinguished road
Thanks Glenn,

The problem arise when i use distributed setup. The model runs in serial and also in local parallel. I have the problem whtn i try to run working with two separates machines linked by a swich. I did the following:

0 be sure that the net is working and both computers can work
1 turn off firewalls
1 install mpich2 services in both
2 activate the services with the same log and pass
3 run the simulation.
4. i obtain error code 0 when the solver start.

Im forgeting something?

I will be pleasure of any help.

Thanks.
jpcfd is offline   Reply With Quote

Old   March 3, 2010, 06:52
Default
  #4
Super Moderator
 
Glenn Horrocks
Join Date: Mar 2009
Location: Sydney, Australia
Posts: 10,826
Rep Power: 85
ghorrocks has a spectacular aura aboutghorrocks has a spectacular aura aboutghorrocks has a spectacular aura about
What OS are you using? Do the other parallel options work (eg HP MPI, PVM)?
ghorrocks is offline   Reply With Quote

Old   March 4, 2010, 08:49
Default
  #5
New Member
 
Join Date: Jun 2009
Posts: 11
Rep Power: 8
jpcfd is on a distinguished road
Hi,

Im using XP64. I try with MPI and it doesnt work aswell.

Thanks.
jpcfd is offline   Reply With Quote

Old   March 5, 2010, 08:31
Default
  #6
Member
 
SanS
Join Date: Mar 2009
Posts: 42
Rep Power: 8
sans is on a distinguished road
Hi, This wont solve your problem but just try switching your master node and slave. See if you get the same error.
sans is offline   Reply With Quote

Old   March 6, 2010, 10:48
Default
  #7
New Member
 
martin
Join Date: Mar 2010
Posts: 2
Rep Power: 0
tvt_mvt is on a distinguished road
try to use a differnet partition mode, e.g. user defined direction
martin

Quote:
Originally Posted by jpcfd View Post
Hi all,

Im trying to run a parallel job using a local network consisting in two quadcores linked with a normal swich. The net seems to be right (both computers see each other) and rsh runs normally ( i can do the tipical remote probe). i also intall in both machines the mpich2 service and register the same user in both computers. Also ive shut down the firewall to avoid problems.

The problem is that it works all well but when the solver shows solver in the output screen it gets stoped and exit with code 0 responding to a command from the master node:

"Command on host returned with code 0" is the message.
at first i obtain code 255 too but now i only get code 0.

Can any one help me? i have read the parallel documentation i dont know were is the fail.

Thanks in advance for reading this and hope someone could help me.

Javier.
tvt_mvt is offline   Reply With Quote

Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
running OpenFoam in parallel vishwa OpenFOAM 22 Yesterday 08:53
Running dieselFoam in parallel. Palminchi OpenFOAM 0 February 17, 2010 05:00
Statically Compiling OpenFOAM Issues herzfeldd OpenFOAM Installation 21 January 6, 2009 10:38
Kubuntu uses dash breaks All scripts in tutorials platopus OpenFOAM Bugs 8 April 15, 2008 07:52
running multiple Fluent parallel jobs Michael Bo Hansen FLUENT 8 June 7, 2006 08:52


All times are GMT -4. The time now is 15:20.