Distributed parallel problem
Hello,
I'm trying to set up a distributed parallel calculation (ANSYS 10, Windows 2000 Advanced Server). I installed rsh and MPICH for Windows according to the manual, p. 51 onwards.

I have one host (#110) and 3 slaves (#111, #112 and #113). A calculation on 2 machines (110 and 111) works. I can't get the other machines to work: the combinations 110-112 and 110-113 both fail.

Below is the last part of the output of a failing run. What could be the problem?

Best regards,
Maurice

Parallel run: Received message from slave
-----------------------------------------
Slave partition : 2
Slave routine : ErrAction
Master location : RCVBUF,MSGTAG=1052
Message label : 001100279
Message follows below - :

+--------------------------------------------------------------------+
| ERROR #001100279 has occurred in subroutine ErrAction.             |
| Message:                                                           |
| Stopped in routine RedSht                                          |
|                                                                    |
+--------------------------------------------------------------------+
Re: Distributed parallel problem
OK, I solved it (well, CFX Technical Support did). Two of the machines had Service Pack 1 installed and the other two did not. It's crucial that the software is identical on all 4 machines.

Best regards,
Maurice
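For anyone hitting the same error, here is a minimal sketch of how you could spot such a mismatch before starting a run. It assumes you have already collected a version string for each machine by hand; the `inventory` values and the function names are made up for illustration and are not produced by ANSYS or CFX.

```python
from collections import defaultdict


def group_hosts_by_version(inventory):
    """Group host names by the version string recorded for each host."""
    groups = defaultdict(list)
    for host, version in inventory.items():
        groups[version].append(host)
    # Sort host names so the report is stable between runs.
    return {version: sorted(hosts) for version, hosts in groups.items()}


def is_consistent(inventory):
    """Return True when every host reports the same version string."""
    return len(set(inventory.values())) <= 1


# Hypothetical, hand-collected values: which machines actually had the
# service pack is an assumption made only for this example.
inventory = {
    "110": "10.0 SP1",
    "111": "10.0 SP1",
    "112": "10.0",
    "113": "10.0",
}

groups = group_hosts_by_version(inventory)

if not is_consistent(inventory):
    print("Software differs between machines:")
    for version, hosts in sorted(groups.items()):
        print(f"  {version}: {', '.join(hosts)}")
```

If more than one group is printed, bring all machines to the same version before retrying the parallel run.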
Re: Distributed parallel problem
Thank you very much. We really appreciate your suggestion. Regards.