CFD Online Logo CFD Online URL
www.cfd-online.com
[Sponsors]
Home > Forums > Software User Forums > OpenFOAM > OpenFOAM Installation

Please help cannot start lamboot

Register Blogs Community New Posts Updated Threads Search

 
 
LinkBack Thread Tools Search this Thread Display Modes
Prev Previous Post   Next Post Next
Old   March 15, 2005, 22:47
Default Hi, I am trying to get para
  #1
Senior Member
 
Pei-Ying Hsieh
Join Date: Mar 2009
Posts: 317
Rep Power: 18
hsieh is on a distinguished road
Hi,

I am trying to get parallel computing going and are running into problem. It will be appreciate if anyone here can help me.

1. I got nfs running. Process1 mounted to process0.
2. I got passwordless ssh working. I can type:
ssh -v phsieh@192.168.254.43 and log in to the remote computer without entering a password.

But, I cannot get lamboot -v ... to start (in the file machines contains 2 nodes).

Here is the error message:
------------------------
[phsieh@brian3 interFoam]$ lamboot -v /home/phsieh/OpenFOAM/phsieh-1.1/run/tutorials/interFoam/damBreakFine/system/mac hines

LAM 7.1.1 - Indiana University

n-1<4730> ssi:boot:base:linear: booting n0 (brian3.hsieh.com)
n-1<4730> ssi:boot:base:linear: booting n1 (kevin3.hsieh.com)
ERROR: LAM/MPI unexpectedly received the following on stderr:
connect to address 192.168.254.32: Connection refused
connect to address 192.168.254.32: Connection refused
trying normal rsh (/usr/bin/rsh)
kevin3.hsieh.com: Connection refused
-----------------------------------------------------------------------------
LAM failed to execute a process on the remote node "kevin3.hsieh.com".
LAM was not trying to invoke any LAM-specific commands yet -- we were
simply trying to determine what shell was being used on the remote
host.

LAM tried to use the remote agent command "rsh"
to invoke "echo $SHELL" on the remote node.

*** PLEASE READ THIS ENTIRE MESSAGE, FOLLOW ITS SUGGESTIONS, AND
*** CONSULT THE "BOOTING LAM" SECTION OF THE LAM/MPI FAQ
*** (http://www.lam-mpi.org/faq/) BEFORE POSTING TO THE LAM/MPI USER'S
*** MAILING LIST.

This usually indicates an authentication problem with the remote
agent, some other configuration type of error in your .cshrc or
.profile file, or you were unable to executable a command on the
remote node for some other reason. The following is a list of items
that you should check on the remote node:

- You have an account and can login to the remote machine
- Incorrect permissions on your home directory (should
probably be 0755)
- Incorrect permissions on your $HOME/.rhosts file (if you are
using rsh -- they should probably be 0644)
- You have an entry in the remote $HOME/.rhosts file (if you
are using rsh) for the machine and username that you are
running from
- Your .cshrc/.profile must not print anything out to the
standard error
- Your .cshrc/.profile should set a correct TERM type
- Your .cshrc/.profile should set the SHELL environment
variable to your default shell

Try invoking the following command at the unix command line:

rsh kevin3.hsieh.com -n 'echo $SHELL'

You will need to configure your local setup such that you will *not*
be prompted for a password to invoke this command on the remote node.
No output should be printed from the remote node before the output of
the command is displayed.

When you can get this command to execute successfully by hand, LAM
will probably be able to function properly.
-----------------------------------------------------------------------------
n-1<4730> ssi:boot:base:linear: Failed to boot n1 (kevin3.hsieh.com)
n-1<4730> ssi:boot:base:linear: aborted!
n-1<4735> ssi:boot:base:linear: booting n0 (brian3.hsieh.com)
n-1<4735> ssi:boot:base:linear: booting n1 (kevin3.hsieh.com)
ERROR: LAM/MPI unexpectedly received the following on stderr:
connect to address 192.168.254.32: Connection refused
connect to address 192.168.254.32: Connection refused
trying normal rsh (/usr/bin/rsh)
kevin3.hsieh.com: Connection refused
-----------------------------------------------------------------------------
LAM failed to execute a process on the remote node "kevin3.hsieh.com".
LAM was not trying to invoke any LAM-specific commands yet -- we were
simply trying to determine what shell was being used on the remote
host.

LAM tried to use the remote agent command "rsh"
to invoke "echo $SHELL" on the remote node.

*** PLEASE READ THIS ENTIRE MESSAGE, FOLLOW ITS SUGGESTIONS, AND
*** CONSULT THE "BOOTING LAM" SECTION OF THE LAM/MPI FAQ
*** (http://www.lam-mpi.org/faq/) BEFORE POSTING TO THE LAM/MPI USER'S
*** MAILING LIST.

This usually indicates an authentication problem with the remote
agent, some other configuration type of error in your .cshrc or
.profile file, or you were unable to executable a command on the
remote node for some other reason. The following is a list of items
that you should check on the remote node:

- You have an account and can login to the remote machine
- Incorrect permissions on your home directory (should
probably be 0755)
- Incorrect permissions on your $HOME/.rhosts file (if you are
using rsh -- they should probably be 0644)
- You have an entry in the remote $HOME/.rhosts file (if you
are using rsh) for the machine and username that you are
running from
- Your .cshrc/.profile must not print anything out to the
standard error
- Your .cshrc/.profile should set a correct TERM type
- Your .cshrc/.profile should set the SHELL environment
variable to your default shell

Try invoking the following command at the unix command line:

rsh kevin3.hsieh.com -n 'echo $SHELL'

You will need to configure your local setup such that you will *not*
be prompted for a password to invoke this command on the remote node.
No output should be printed from the remote node before the output of
the command is displayed.

When you can get this command to execute successfully by hand, LAM
will probably be able to function properly.
-----------------------------------------------------------------------------
n-1<4735> ssi:boot:base:linear: Failed to boot n1 (kevin3.hsieh.com)
n-1<4735> ssi:boot:base:linear: aborted!
lamboot did NOT complete successfully
[phsieh@brian3 interFoam]$

pei
hsieh is offline   Reply With Quote

 


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Lamboot and ssh dmoroian OpenFOAM Running, Solving & CFD 1 November 1, 2006 05:53
Lamboot and mpirun r2d2 OpenFOAM Running, Solving & CFD 2 January 10, 2006 11:31
Lamboot trouble r2d2 OpenFOAM Installation 4 October 17, 2005 04:27
how to start cfd abul basat FLUENT 2 August 1, 2005 03:51
Cannot start using CFX-5.7.1 Atit CFX 8 February 16, 2005 06:40


All times are GMT -4. The time now is 04:39.