|
[Sponsors] |
![]() |
![]() |
#1 |
Member
Daniel Harlacher
Join Date: Mar 2009
Location: Davis, CA, United States
Posts: 60
Rep Power: 18 ![]() |
Hi,
I am having some trouble to get OpenFOAM to run over two or more PCs. The problem is that I get the error: bash: orted: command not found and I already figured out, what the problem is: My .bashrc is not sourced in the non-interactive bash shell when I use "mpirun". The problem is, that I don't know how to fix the problem. I tried every hint I could find for that problem, that did not require root privileges. I don't know if there is an easy fix for that, but I would assume so, because in Ubuntu I just edited the .bashrc and it worked instantly. If you need any further Information please ask. Thank you for your help. -harly |
|
![]() |
![]() |
![]() |
![]() |
#2 |
Member
Dennis Kingsley
Join Date: Mar 2009
Location: USA
Posts: 45
Rep Power: 18 ![]() |
you need to check your .bashrc and see if you are outputting something to the console. If you are mpirun will stop sourcing the .bashrc on Fedora machines.
We solved this at my last place of employment by checking if the login was interactive in the .bashrc script. If it was not interactive we did not output to the console and that solved the mpirun issue. Unfortunatly I don't remember the syntax. |
|
![]() |
![]() |
![]() |
![]() |
#3 |
Member
Daniel Harlacher
Join Date: Mar 2009
Location: Davis, CA, United States
Posts: 60
Rep Power: 18 ![]() |
Hi,
I tried the following: if [ -z "$PS1" ]; then PATH=$PATH:/home2/harlacher/OpenFOAM/ fi just to check if it would work. But "ssh client printenv |grep PATH" does not return "/home2/harlacher/OpenFOAM/" Any ideas what else I could do ? -harly |
|
![]() |
![]() |
![]() |
![]() |
#4 |
New Member
vijayakrishnan
Join Date: Mar 2009
Posts: 5
Rep Power: 18 ![]() |
I am running OF 1.5 in paralled on two machines kidam and soorya. I get the following error :
Can somebody advice the reason for this [soorya:05105] *** An error occurred in MPI_Waitall [soorya:05105] *** on communicator MPI_COMM_WORLD [soorya:05105] *** MPI_ERR_TRUNCATE: message truncated [soorya:05105] *** MPI_ERRORS_ARE_FATAL (goodbye) [soorya:05104] *** An error occurred in MPI_Waitall [soorya:05104] *** on communicator MPI_COMM_WORLD [soorya:05104] *** MPI_ERR_TRUNCATE: message truncated [soorya:05104] *** MPI_ERRORS_ARE_FATAL (goodbye) mpirun noticed that job rank 2 with PID 4565 on node kidambiHP219 exited on signal 15 (Terminated). |
|
![]() |
![]() |
![]() |
![]() |
#5 |
Assistant Moderator
Bernhard Gschaider
Join Date: Mar 2009
Posts: 4,225
Rep Power: 52 ![]() ![]() |
Could you be a bit more specific WHEN this error appears (before/after the first timestep, didn't even see the OF-Logo .... )
The two most popular problems with MPI are: - too small MPI_BUFFER_SIZE - different OpenMPI-versions on the machines (I suspect that in your case: your "master" machine uses the OF-OpenMPI, the "slave" uses another - system installed - OpenMPI) Berhard
__________________
Note: I don't use "Friend"-feature on this forum out of principle. Ah. And by the way: I'm not on Facebook either. So don't be offended if I don't accept your invitation/friend request |
|
![]() |
![]() |
![]() |
![]() |
#6 |
New Member
vijayakrishnan
Join Date: Mar 2009
Posts: 5
Rep Power: 18 ![]() |
this problem happens after the first time step
. Also both machines used the same version of OpenMPI.. the one supplied with OF |
|
![]() |
![]() |
![]() |
![]() |
#7 |
Assistant Moderator
Bernhard Gschaider
Join Date: Mar 2009
Posts: 4,225
Rep Power: 52 ![]() ![]() |
Hi!
Which solver? That might be an indication. For instance: if it is dieselFoam then a search for MPI_ERR_TRUNCATE might have gotten you two very interesting result. One of them would lead to the question "How do I install these 1.5.x-Bugfixes" Bernhard
__________________
Note: I don't use "Friend"-feature on this forum out of principle. Ah. And by the way: I'm not on Facebook either. So don't be offended if I don't accept your invitation/friend request |
|
![]() |
![]() |
![]() |
![]() |
#8 |
New Member
vijayakrishnan
Join Date: Mar 2009
Posts: 5
Rep Power: 18 ![]() |
the solver is sonicTurbFoam.
I am attaching the log for reference: Exec : sonicTurbFoam -parallel Date : Feb 19 2009 Time : 12:31:39 Host : soorya PID : 5104 Case : /home/openfoam15/OpenFOAM/vijay-1.5/run/vayumach2clus nProcs : 4 Slaves : 3 ( soorya.5105 kidambiHP219.4565 kidambiHP219.4566 ) Pstream initialized with: floatTransfer : 1 nProcsSimpleSum : 0 commsType : nonBlocking // * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * // Create time Create mesh for time = 0 Reading thermophysical properties Selecting thermodynamics package hThermo<puremixture<consttransport<speciethermo<hc onstthermo<perfectgas>>>>> 1 additional process aborted (not shown) ---vijay |
|
![]() |
![]() |
![]() |
![]() |
#9 |
New Member
vijayakrishnan
Join Date: Mar 2009
Posts: 5
Rep Power: 18 ![]() |
Also the is the error messages I get:
Thu Feb 19 12:31:36 IST 2009 nohup: appending output to `nohup.out' [soorya:05105] *** An error occurred in MPI_Waitall [soorya:05105] *** on communicator MPI_COMM_WORLD [soorya:05105] *** MPI_ERR_TRUNCATE: message truncated [soorya:05105] *** MPI_ERRORS_ARE_FATAL (goodbye) [soorya:05104] *** An error occurred in MPI_Waitall [soorya:05104] *** on communicator MPI_COMM_WORLD [soorya:05104] *** MPI_ERR_TRUNCATE: message truncated [soorya:05104] *** MPI_ERRORS_ARE_FATAL (goodbye) mpirun noticed that job rank 2 with PID 4565 on node kidambiHP219 exited on signal 15 (Terminated). Command exited with non-zero status 1 0.02user 0.01system 0:06.14elapsed 0%CPU (0avgtext+0avgdata 0maxresident)k 4752inputs+16outputs (27major+2408minor)pagefaults 0swaps |
|
![]() |
![]() |
![]() |
![]() |
#10 |
New Member
Dominic Spreitz
Join Date: Mar 2009
Location: Lucern, Switzerland
Posts: 10
Rep Power: 18 ![]() |
Harly,
thanks for your post. It was the last piece in my puzzle. Dominic |
|
![]() |
![]() |
![]() |
![]() |
#11 |
Member
Daniel Harlacher
Join Date: Mar 2009
Location: Davis, CA, United States
Posts: 60
Rep Power: 18 ![]() |
Would you mind sharing what you managed to do with the information ?
Because I was not able to get my setup running (I blame fedora 9) because with ubuntu it worked out of the box. - harly |
|
![]() |
![]() |
![]() |
![]() |
#12 |
New Member
Dominic Spreitz
Join Date: Mar 2009
Location: Lucern, Switzerland
Posts: 10
Rep Power: 18 ![]() |
Sorry it took me so long to reply. I put
if [ -z "$PS1" ]; thenin my .bashrc. That way all the required OF environment variables are correctly loaded, when MPI logs in over SSH. At least that's my understanding of the concept with the inter- and non-interactive shells. Anybody correct me, if I am wrong here. Hope this helps. |
|
![]() |
![]() |
![]() |
Thread Tools | Search this Thread |
Display Modes | |
|
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Problem with openmpi | mighelone | OpenFOAM Running, Solving & CFD | 42 | July 18, 2012 15:26 |
Problems using local openmpi | stephan | OpenFOAM Installation | 1 | December 5, 2007 19:01 |
OpenMPI performance | vega | OpenFOAM Running, Solving & CFD | 13 | November 27, 2007 02:28 |
running CFX 11.0 on Fedora 7 : display problems | Eric Joubarne | CFX | 1 | October 3, 2007 17:51 |
OpenFOAM 14 with OpenMPI 12 | fhy | OpenFOAM Installation | 0 | July 12, 2007 19:12 |