CFD Online Logo CFD Online URL
www.cfd-online.com
[Sponsors]
Home > Forums > Software User Forums > Siemens > STAR-CCM+

128 Core Parallel Job - MPI Error

Register Blogs Members List Search Today's Posts Mark Forums Read

Like Tree1Likes
  • 1 Post By cwl

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
Old   October 13, 2020, 12:36
Default 128 Core Parallel Job - MPI Error
  #1
New Member
 
Seph
Join Date: Oct 2020
Posts: 1
Rep Power: 0
Seph_01 is on a distinguished road
Hi All,

I'm having issues getting a 4 x 32 core job off the ground. Once submitted I get an MPI Error - File Name Too Long

I feel there may be a file or variable that needs tweaking somewhere. I'm using the version of MPI shipped with StarCCM 2020 and running this all on Windows Server 2019.

If I run this job at 4 x 16 then it runs fine. So MPI itself in addition to networking between the nodes is all good.

Just wondering if anyone has come across this.

Thanks
Seph_01 is offline   Reply With Quote

Old   October 20, 2020, 07:09
Default
  #2
Senior Member
 
Sebastian Engel
Join Date: Jun 2011
Location: Germany
Posts: 566
Rep Power: 20
bluebase will become famous soon enough
There might be an issue with the plattform mpi on windows, since you are not the first one to report it:

Running on the cluster: machine file -> Filename too long???
Running on 4 node error MPI Errors[320798736]

The same is reported in this post:
Running Star on separate machines

However, this member explained he/she used intel mpi instead, which also comes with StarCCM. This avoided the issue.
Have a look into the manual, and look for the Command-Line Reference. There should be an option like "-mpi <driver>" where <driver> needs to be selected.

If this fixes your issue, please share it here.

Best,
Sebastian
bluebase is offline   Reply With Quote

Old   October 20, 2020, 11:55
Default
  #3
cwl
Senior Member
 
Chaotic Water
Join Date: Jul 2012
Location: Elgrin Fau
Posts: 435
Rep Power: 17
cwl is on a distinguished road
Quote:
Originally Posted by bluebase View Post
There might be an issue with the plattform mpi on windows, since you are not the first one to report it:

Running on the cluster: machine file -> Filename too long???
Running on 4 node error MPI Errors[320798736]

The same is reported in this post:
Running Star on separate machines

However, this member explained he/she used intel mpi instead, which also comes with StarCCM. This avoided the issue.
Have a look into the manual, and look for the Command-Line Reference. There should be an option like "-mpi <driver>" where <driver> needs to be selected.

If this fixes your issue, please share it here.

Best,
Sebastian
Yes, "this member" (me) has spent lots of time monitoring commands that Star runs and trying to figure out what was wrong. And it was Platform MPI.

Installing, setting up Intel MPI:
Code:
<installdir>\mpi\intel\<installdir>\win64\bin\mpivars.bat
<installdir>\mpi\intel\<installdir>\win64\bin\hydra_service.exe -install
<installdir>\mpi\intel\<installdir>\win64\bin\hydra_service.exe -start
<installdir>\mpi\intel\<installdir>\win64\bin\mpiexec -register
Running the last command will ask you for user/password.
If user is the same as current one - just hit enter when asked for
Code:
account (domain\user) [ComputerName\UserName]:
Checking:
Code:
<installdir>\mpi\intel\<installdir>\win64\bin>mpiexec -hosts <Computer1> -validate cmd
If you still see command promt (cmd) - type whoami and hit enter, it will show which machine you're logged in now.

First, try starting cmd one the local machine with its name instead of <Computer1>,
then - the name of another machine in the network.

To use Intel MPI when starting Star-CCM+ - right in the very bottom of the New (or Load Simulation) dialog where you see something like:
Code:
starccm+ -server -on Machine1:28,Machine2:28
correct it manually to:
Code:
starccm+ -mpi intel -server -on Machine1:28,Machine2:28
PS I'm actually very confused/disappointed - as that problem still persists 4 years after my first post. It cannot be that no one has ever encountered it never reported.
bluebase likes this.
cwl is offline   Reply With Quote

Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
[swak4Foam] swak4foam openfoam 7 installation problem Andrea23 OpenFOAM Community Contributions 1 February 17, 2020 18:11
Compile problem ivanyao OpenFOAM Running, Solving & CFD 1 October 12, 2012 09:31
Version 15 on Mac OS X gschaider OpenFOAM Installation 113 December 2, 2009 10:23
Problem with compile the setParabolicInlet ivanyao OpenFOAM Running, Solving & CFD 6 September 5, 2008 20:50
DecomposePar links against liblamso0 with OpenMPI jens_klostermann OpenFOAM Bugs 11 June 28, 2007 17:51


All times are GMT -4. The time now is 15:28.