Running macros in parallel in batch mode
Hi,
I'm trying to run a simulation with a macro, with 4 cores on a local machine, in batch mode using the following:
    starccm+ -np 4 -batch file.sim file.java
and I get the following error:
    Warning : CPU Timer not running
    Warning : Wall Timer not running
    Server Error: There are no regions to solve on
It works fine using a single core in serial mode (starccm+ -batch file.sim file.java). Any help would be appreciated. Thanks. |
Start the server and then start the java file with "a client session".
I had the same problem... never found another way. |
I'm running starccm in an optimization loop. Starting it without batch mode requires me to end the starccm process after the max iteration has been reached in order for the optimization to proceed to the next run.
Is there an option to end the client and exit starccm? Thanks. |
Quote:
    starccm+ -np 4 -batch file.java -collab -load file.sim
This has worked on our Linux machine since V3.something.
CU, Matthias |
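For the optimization-loop case raised earlier in the thread, the command above could be driven from a small script. The following is only a sketch under stated assumptions: it assumes that a -batch run exits on its own once the macro finishes (as the thread implies) and that a failed run returns a non-zero exit code. The helper names `build_command` and `run_designs` are hypothetical; the only starccm+ flags used are the ones quoted in this thread.

```python
import subprocess
from typing import Callable, Sequence


def build_command(macro: str, sim: str, np: int) -> list[str]:
    """Assemble the batch command quoted in the thread as an argument list."""
    return ["starccm+", "-np", str(np), "-batch", macro, "-collab", "-load", sim]


def run_designs(
    cases: Sequence[tuple[str, str]],
    np: int,
    runner: Callable[[list[str]], int] = subprocess.call,
) -> list[int]:
    """Run one batch job per (macro, sim) pair, strictly one after another.

    `runner` blocks until the process exits and returns its exit code, so
    the next design only starts once the previous run has finished.
    """
    exit_codes = []
    for macro, sim in cases:
        code = runner(build_command(macro, sim, np))
        exit_codes.append(code)
        if code != 0:
            # Assumption: non-zero means the run failed, so stop the loop
            # instead of continuing the optimization on a bad result.
            break
    return exit_codes
```

A call such as `run_designs([("file.java", "file.sim")], np=3)` would launch the run and return only after starccm+ has exited, so no manual shutdown of the process should be needed between optimization steps.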
Hi Matthias, that partly works. All the cores are now available for use (100% CPU usage), but only one core is being used most of the time.
Any ideas? Thanks. |
Just to shed light on what I've been seeing with the following:
    starccm+ -np 4 -batch file.java -collab -load file.sim
I've attached an iteration time plot. The one on the left is with a single core, roughly 1.2 s/iteration, while the plot on the right is with the above option, running 4 cores: 0.4 s for the fastest iteration and 28 s for the slowest, with the entire parallel run taking longer than in the serial case.
Any ideas on why this cycling is happening, and a way to control it? Thanks. |
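To put the figures quoted above in perspective, here is a rough back-of-the-envelope sketch. It assumes, purely for illustration, that every parallel iteration takes either the fastest time (0.4 s) or the slowest time (28 s); the real distribution in the plot is not known. Under that assumption it computes the share of slow iterations at which the parallel run stops being faster than the 1.2 s/iteration serial run. The function name is hypothetical.

```python
def break_even_slow_fraction(fast_s: float, slow_s: float, serial_s: float) -> float:
    """Fraction of slow iterations at which the parallel mean equals serial.

    Solves fast_s * (1 - f) + slow_s * f = serial_s for f.
    """
    return (serial_s - fast_s) / (slow_s - fast_s)


# Figures quoted in the post: 0.4 s fastest, 28 s slowest, 1.2 s serial.
f = break_even_slow_fraction(0.4, 28.0, 1.2)
print(f"break-even share of slow iterations: {f:.1%}")
```

Under this two-level assumption, roughly 3% of iterations hitting the 28 s stall would already be enough to cancel the parallel speed-up, which is consistent with the parallel run ending up slower overall.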
What are your hardware configuration, mesh size and MPI version?
|
Hi Vinicius,
It's an i7-920 with 24 GB RAM, running Windows XP64 SP2. The mesh size is 150,000 cells, and the MPI is MPICH2 v1.03. Thanks. |
Maybe you are using too many cores for this small mesh, so the parallelization overhead is larger than the calculation time itself. It's not a strict rule, but I usually add one core for every 250,000 cells.
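As a quick illustration of this rule of thumb, here is a minimal sketch. It reads the rule as "one core per started block of 250,000 cells, never fewer than one", which is only one possible interpretation of the post; the function name and the rounding choice are assumptions.

```python
import math


def suggested_cores(cells: int, cells_per_core: int = 250_000) -> int:
    """Rule-of-thumb core count: one core per started block of cells."""
    # Assumption: round up, and always use at least one core.
    return max(1, math.ceil(cells / cells_per_core))


# The mesh discussed in this thread has 150,000 cells.
print(suggested_cores(150_000))
```

By this reading, the 150,000-cell mesh would be run on a single core, so a 4-core run would not be expected to pay off much.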
|
It seems like MPICH2 uses 1 extra process, so if I request 4 cores, it runs 5 processes and it has trouble distributing 5 processes over 4 cores. It runs well with 3 cores.
So the solution would be to use 1 core less than the total available. Issue has been resolved. Thanks to all who replied. |
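The workaround above (request one core fewer than the machine has) can be sketched as a small helper. This is only an illustration based on the observation reported in this post for that particular MPICH2 setup; the function name and the `reserved` parameter are hypothetical. Note that `os.cpu_count()` reports logical CPUs, so on a machine with hyper-threading it may be better to pass the physical core count explicitly.

```python
import os
from typing import Optional


def usable_process_count(total_cores: Optional[int] = None, reserved: int = 1) -> int:
    """Number of processes to request, leaving `reserved` cores free."""
    if total_cores is None:
        # Logical CPU count; falls back to 1 if it cannot be determined.
        total_cores = os.cpu_count() or 1
    # Never request fewer than one process.
    return max(1, total_cores - reserved)


# The 4-core machine from this thread: request 3 processes for -np.
print(usable_process_count(4))
```

The result would then be passed as the value of -np in the batch command.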
Quote:
But it's sometimes faster with collab? |
Quote:
On Linux/Bash, the following alias makes every starccm+ call use this feature:
    alias starccm+="$(which starccm+) -collab"
Regards, Matthias |
I know... but why this difference?
|
Hey nomad,
By any chance, could you post the .java macro you're using? Thanks in advance. Regards |