Hi Chris,
Many thanks for the feedback and my apologies for not having tested this properly back then :(. The fix for you should be to simply run the respective commands in the new step #3: http://openfoamwiki.net/index.php/In...ian#Debian_6.0 By following the new commands, they will write the necessary custom variables into the "etc/prefs.sh" file, therefore making it the standard settings, therefore making it properly read in the remote shells. And don't forget to run them from the base OpenFOAM folder, namely "$HOME/OpenFOAM". The previous problem is now described here (if you're curious about it ;)): http://www.openfoam.org/mantisbt/view.php?id=231#c1846 - comment #1846 Best regards, Bruno |
prefs.sh
3 Attachment(s)
Hi Bruno,
Thanks for suggesting that fix. I have two machines that use the new instructions. They both have OpenFOAM-2.1.1/etc/prefs.sh. One is a 64 bit machine and the other is a 32 bit machine. The two machines are referred to as 192.168.1.20 and 192.168.1.21. The .20 machine is the master and the .21 is the slave. Both have two cores. I can't seem to get this work: Code:
foamJob -p -s Test-parallel in the laminar directory, I created: laminar/node0 laminar/node1 laminar/node2 laminar/node3 I modified the damBreak example in the following way: 1. I edited decomposeParDict to request distributed computation. I am attaching that file. I see sometimes after 'root' some documentation says nroots, I noticed you don't have that in yours, but I tried both. I removed a '3' that was previously after roots 2. I copied alpha1.org to alpha1 3. setFields 4. blockMesh 5. decomposePar 6. created a machines file. I'll attach that, too. 7. Each node? directory got a copy of the modified damBreak flle set. 8. I temporarily renamed machines to machines.bak and the following works: Code:
foamJob -p -s Test-parallel Code:
$ foamJob -p -s Test-parallel When I rename machines.bak to machines, foamJob picks it up and I get the following error: Code:
$ foamJob -p -s Test-parallel a. I open a root terminal on the slave machine via Gnome, and type 'which Test-parallel', and it points to the correct one. I think the environment is there. b. The node2 and node3 directories were created from the mount command. They actually reside on the slave machine (.21), but I made them part of the master file system to more accurately model your setup. The settings for the mount command were (rw,sync,no_subtree_check) c.I did have to make changes to etc/bashrc that went beyond what the directions recommends for end users. I need to remove where root was being inserted into the path. I replaced it with my username. d. I googled the problem and it appears the solution is to go back to openmpi/mpirun 1.4.3. (mantisbt 296) I think I am misapplying that patch, however. I don't see why I should be getting that error when I introduce a machines file. Any ideas? Your thoughts would be appreciated. Thanks, Chris Attachment 18000 Attachment 18001 Attachment 18002 |
Hi Chris,
:eek: these are a lot of tests and files... I can't figure out from your post if you tested using the two machines with your own user and without using the distributed folders method. I don't have much time to go into details, so I'll try to summarize the suggestions:
Best regards, Bruno |
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
|
Hi Chris,
A quick reply:
Bruno |
Hi Bruno,
A quick response! Quote:
Quote:
Quote:
Thanks, Chris |
Hi Bruno,
I tried your ideas. Your notes are very useful. There were a couple of links that were talking about dealing with non-interactive shells, and that was one my problems. For a default Debian installation, I had to go into my .bashrc and comment out the line that exits if it detects a non-interactive shell. Also, I uninstalled the SYSTEMOPENMPI software, and brought in OpenMPI 1.6.3. That is their latest release and they are claiming application binary interface compatibility with 1.5.3. I went into the OpenFOAM file OpenFOAM-2.1.1/etc/config/settings.sh and changed 1.5.3 to 1.6.3 to make sure I included all of OpenFOAM's requirements for the builds, notably GridEngine. (i.e. I used ./Allwmake) I am hoping GridEngine is supported in 1.6.3 as I noticed it is not working in OpenMPI 1.5.3. I setup a hosts file for the two machines: 192.168.1.25 debian5 192.168.1.26 debian6 I did not try a heterogeneous calculation, so I did everything on two nearly identical 64 bit machines. Finally, I copied the folder structure you outline in "Running OpenFOAM in parallel with different locations for each process" ...and it works. Your debian wiki was instrumental in getting this to work. I propose the following changes: 1. Update the .bashrc file to properly handle non-interactive logins, unless you have a more clever solution. 2. Build openMPI 1.6.3 since the default Debian installation does not have GridEngine support 3. setup a hosts file If you comfortable with these updates, I will be happy to make them. Best Regards, Chris |
Hi Chris,
Good to learn that you've managed to get things working! As for changing the wiki page, points 1 and 2 are OK for that Debian page and feel free to go ahead and do the modifications! Point 3 should be part of a page that is yet to be created, about running OpenFOAM in parallel. If you're willing to kick-start such a page, you can create a tips-n-tricks page: http://openfoamwiki.net/index.php/Main_TipsAndTricks - later on it can be moved to a more global section, when more contributions start coming in! Best regards, Bruno |
The correction in Line 75 of the file
OpenFOAM/ThirdParty-2.1.x/ParaView-3.12.0/Qt/Core/pqServerManagerSelectionModel.h worked for me! Thanks Martin_! |
All times are GMT -4. The time now is 03:58. |